Şapcı, Ali Osman Berk and Taştan, Öznur and Yeniterzi, Reyyan (2020) Active learning for Turkish text classification [Türkçe metin sınıflandırması için aktif öğrenme]. In: 28th Signal Processing and Communications Applications Conference (SIU), Gaziantep, Turkey
Full text not available from this repository. (Request a copy)
Official URL: https://dx.doi.org/10.1109/SIU49456.2020.9302289
Abstract
Many natural language processing (NLP) applications are solved in a supervised learning framework, which often requires a sufficiently large amount of labeled data. However, the labeled data is scarce as the process of obtaining class labels is often costly and/or time-consuming and sometimes requires indepth expert knowledge. Thus, obtaining labeled sets is a bottleneck. On the other hand, unlabeled data is abundant and easy to access in a variety of domains. The active learning paradigm adresses this challenge by effectively selecting informative and/or representative instances from the unlabeled pool to be labeled. In this way, active learners aim to reduce the cost of label acquisition without sacrificing model performance. In this work, we apply conventional active learning techniques on various Turkish text classification tasks. Experiments demonstrate that active learning helps to attain good performances while reducing the required labeled data.
Item Type: | Papers in Conference Proceedings |
---|---|
Uncontrolled Keywords: | Active learning; natural language processing; supervised learning; text classification |
Divisions: | Faculty of Engineering and Natural Sciences |
Depositing User: | Öznur Taştan |
Date Deposited: | 09 Aug 2023 15:27 |
Last Modified: | 09 Aug 2023 15:27 |
URI: | https://research.sabanciuniv.edu/id/eprint/46985 |