#Turki$hTweets: a benchmark dataset for Turkish text correction

Koksal, Asiye Tuba and Bozal, Ozge and Yurekli, Emre and Gezici, Gizem (2020) #Turki$hTweets: a benchmark dataset for Turkish text correction. In: Findings of the Association for Computational Linguistics, ACL 2020: EMNLP 2020, Virtual, Online

#Turki$hTweets is a benchmark dataset for the task of correcting user misspellings, introduced as the first public Turkish dataset in this area. It provides correct/incorrect word annotations together with a detailed misspelling category formulation based on real user data. We evaluated four state-of-the-art approaches on the dataset to present a preliminary analysis for the sake of reproducibility.
Item Type: Papers in Conference Proceedings
Divisions: Faculty of Engineering and Natural Sciences
Depositing User: Gizem Gezici
Date Deposited: 09 Aug 2023 11:51
Last Modified: 09 Aug 2023 11:51
URI: https://research.sabanciuniv.edu/id/eprint/47027
