Baloğlu, Emre Batuhan (2023) Reinforcement Learning For Text Classification: An Evaluation Of Policy-Gradient Methods With Various Topologies. [Thesis]
PDF
10569767.pdf
Download (1MB)
10569767.pdf
Download (1MB)
Official URL: https://risc01.sabanciuniv.edu/record=b3205818
Abstract
Usage of reinforcement learning (RL) in natural language processing (NLP) tasks has gained momentum in recent years. In this thesis, we present an improved approach to the task of text classification through the integration of various deep learning topologies such as transformers and large language models (LLMs) into the feature extraction process within a reinforcement learning framework. In this proposed method, the RL policies are trained to observe a portion of the text and determine whether to classify the text or to proceed to the next part of the document. The policies were optimized with the REINFORCE (Williams, 1992) algorithm utilizing a designed reward signal. The effectiveness of the proposed method was evaluated and compared against other state-of-the-art models on standard text classification benchmark datasets, demonstrating the superiority of the proposed approach in terms of efficiency while losing little performance in accuracy. The results indicate that the use of the LLMs in the feature extraction process, coupled with RL policies with designed reward signals, provides a promising avenue for the development of effective and efficient text classification models.
Item Type: | Thesis |
---|---|
Uncontrolled Keywords: | reinforcement learning, natural language processing, text classification. -- pekiştirmeli öğrenme, doğal dil işleme, metin sınıflandırma. |
Subjects: | T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK7800-8360 Electronics > TK7885-7895 Computer engineering. Computer hardware |
Divisions: | Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng. Faculty of Engineering and Natural Sciences |
Depositing User: | Dila Günay |
Date Deposited: | 22 Dec 2023 12:35 |
Last Modified: | 22 Dec 2023 12:35 |
URI: | https://research.sabanciuniv.edu/id/eprint/48877 |