#Turki$hTweets: a benchmark dataset for Turkish text correction

Koksal, Asiye Tuba and Bozal, Ozge and Yurekli, Emre and Gezici, Gizem (2020) #Turki$hTweets: a benchmark dataset for Turkish text correction. In: Findings of the Association for Computational Linguistics, ACL 2020: EMNLP 2020, Virtual, Online

Official URL: http://dx.doi.org/10.18653/v1/2020.findings-emnlp.374

Abstract

#Turki$hTweets is a benchmark dataset for the task of correcting user misspellings, introduced as the first public Turkish dataset in this area. It provides correct/incorrect word annotations together with a detailed formulation of misspelling categories based on real user data. We evaluate four state-of-the-art approaches on the dataset to present a preliminary analysis for the sake of reproducibility.

Item Type:	Papers in Conference Proceedings
Divisions:	Faculty of Engineering and Natural Sciences
Depositing User:	Gizem Gezici
Date Deposited:	09 Aug 2023 11:51
Last Modified:	09 Aug 2023 11:51
URI:	https://research.sabanciuniv.edu/id/eprint/47027
