Özdemir, Anıl and Yeniterzi, Reyyan (2020) SU-NLP at SemEval-2020 task 12: offensive language identification in Turkish tweets. In: International Workshop on Semantic Evaluation, Barcelona, Spain
This is the latest version of this item.
PDF
2020.semeval-1.288.pdf
Restricted to Registered users only
Download (167kB) | Request a copy
2020.semeval-1.288.pdf
Restricted to Registered users only
Download (167kB) | Request a copy
PDF
2020.semeval-1.288.pdf
Download (167kB)
2020.semeval-1.288.pdf
Download (167kB)
PDF
2020.semeval-1.288.pdf
Download (167kB)
2020.semeval-1.288.pdf
Download (167kB)
Abstract
This paper summarizes our group’s efforts in the offensive language identification shared task, which is organized as part of the International Workshop on Semantic Evaluation (Sem-Eval2020). Our final submission system is an ensemble of three different models, (1) CNN-LSTM, (2) BiLSTM-Attention and (3) BERT. Word embeddings, which were pre-trained on tweets, are used while training the first two models. BERTurk, which is the first BERT model for Turkish, is also explored. Our final submitted approach ranked as the second best model in the Turkish sub-task.
Item Type: | Papers in Conference Proceedings |
---|---|
Divisions: | Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng. Faculty of Engineering and Natural Sciences |
Depositing User: | Reyyan Yeniterzi |
Date Deposited: | 01 Sep 2021 03:03 |
Last Modified: | 08 Aug 2023 11:03 |
URI: | https://research.sabanciuniv.edu/id/eprint/41675 |
Available Versions of this Item
-
SU-NLP at SemEval-2020 task 12: offensive language identification in Turkish tweets. (deposited 19 Sep 2020 08:35)
- SU-NLP at SemEval-2020 task 12: offensive language identification in Turkish tweets. (deposited 01 Sep 2021 03:03) [Currently Displayed]