Koksal, Asiye Tuba and Bozal, Ozge and Yurekli, Emre and Gezici, Gizem (2020) #Turki$hTweets: a benchmark dataset for Turkish text correction. In: Findings of the Association for Computational Linguistics, ACL 2020: EMNLP 2020, Virtual, Online
Full text not available from this repository. (Request a copy)
Official URL: http://dx.doi.org/10.18653/v1/2020.findings-emnlp.374
Abstract
#Turki$hTweets is a benchmark dataset for the task of correcting the user misspellings, with the purpose of introducing the first public Turkish dataset in this area. #Turki$hTweets provides correct/incorrect word annotations with a detailed misspelling category formulation based on the real user data. We evaluated four state-of-the-art approaches on our dataset to present a preliminary analysis for the sake of reproducibility.
Item Type: | Papers in Conference Proceedings |
---|---|
Divisions: | Faculty of Engineering and Natural Sciences |
Depositing User: | Gizem Gezici |
Date Deposited: | 09 Aug 2023 11:51 |
Last Modified: | 09 Aug 2023 11:51 |
URI: | https://research.sabanciuniv.edu/id/eprint/47027 |