Active learning for Turkish text classification [Türkçe metin sınıflandırması için aktif öğrenme]

Şapcı, Ali Osman Berk and Taştan, Öznur and Yeniterzi, Reyyan (2020) Active learning for Turkish text classification [Türkçe metin sınıflandırması için aktif öğrenme]. In: 28th Signal Processing and Communications Applications Conference (SIU), Gaziantep, Turkey

Full text not available from this repository. (Request a copy)

Abstract

Many natural language processing (NLP) applications are solved in a supervised learning framework, which often requires a sufficiently large amount of labeled data. However, the labeled data is scarce as the process of obtaining class labels is often costly and/or time-consuming and sometimes requires indepth expert knowledge. Thus, obtaining labeled sets is a bottleneck. On the other hand, unlabeled data is abundant and easy to access in a variety of domains. The active learning paradigm adresses this challenge by effectively selecting informative and/or representative instances from the unlabeled pool to be labeled. In this way, active learners aim to reduce the cost of label acquisition without sacrificing model performance. In this work, we apply conventional active learning techniques on various Turkish text classification tasks. Experiments demonstrate that active learning helps to attain good performances while reducing the required labeled data.
Item Type: Papers in Conference Proceedings
Uncontrolled Keywords: Active learning; natural language processing; supervised learning; text classification
Divisions: Faculty of Engineering and Natural Sciences
Depositing User: Öznur Taştan
Date Deposited: 09 Aug 2023 15:27
Last Modified: 09 Aug 2023 15:27
URI: https://research.sabanciuniv.edu/id/eprint/46985

Actions (login required)

View Item
View Item