SU-NLP at SemEval-2020 task 12: offensive language identification in Turkish tweets

Özdemir, Anıl and Yeniterzi, Reyyan (2020) SU-NLP at SemEval-2020 task 12: offensive language identification in Turkish tweets. In: International Workshop on Semantic Evaluation, Barcelona, Spain

This is the latest version of this item.


Abstract

This paper summarizes our group’s efforts in the offensive language identification shared task, organized as part of the International Workshop on Semantic Evaluation (SemEval-2020). Our final submission is an ensemble of three models: (1) CNN-LSTM, (2) BiLSTM-Attention, and (3) BERT. The first two models are trained with word embeddings pre-trained on tweets. We also explore BERTurk, the first BERT model for Turkish. Our final submission ranked second in the Turkish sub-task.
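The abstract does not say how the three models' outputs are combined. As an illustration only, the sketch below shows one common way to ensemble binary classifiers: soft voting, which averages each model's predicted probability and applies a threshold. The function name `soft_vote`, the model keys, the 0.5 threshold, the `OFF`/`NOT` label strings, and the example probabilities are all assumptions made for this sketch, not details taken from the paper.

```python
from typing import Dict, List


def soft_vote(probs_by_model: Dict[str, List[float]],
              threshold: float = 0.5) -> List[str]:
    """Average per-tweet offensive probabilities across models, then threshold.

    Soft voting is an assumed combination rule chosen for illustration;
    the abstract does not specify the ensembling strategy.
    """
    models = list(probs_by_model.values())
    if not models:
        raise ValueError("at least one model is required")
    n_tweets = len(models[0])
    if any(len(p) != n_tweets for p in models):
        raise ValueError("all models must score the same number of tweets")

    labels = []
    for i in range(n_tweets):
        # Mean probability that tweet i is offensive, over all models.
        mean_prob = sum(p[i] for p in models) / len(models)
        labels.append("OFF" if mean_prob >= threshold else "NOT")
    return labels


# Placeholder probabilities for three tweets (not real model outputs).
example_probs = {
    "cnn_lstm": [0.9, 0.2, 0.6],
    "bilstm_attention": [0.8, 0.1, 0.3],
    "bert": [0.7, 0.4, 0.5],
}
predictions = soft_vote(example_probs)
```

Hard (majority) voting over each model's predicted label would be an equally plausible alternative; soft voting is used here only because it keeps the sketch short.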
Item Type: Papers in Conference Proceedings
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng.
Faculty of Engineering and Natural Sciences
Depositing User: Reyyan Yeniterzi
Date Deposited: 01 Sep 2021 03:03
Last Modified: 08 Aug 2023 11:03
URI: https://research.sabanciuniv.edu/id/eprint/41675
