SU-NLP at SemEval-2020 task 12: offensive language identification in Turkish tweets

Özdemir, Anıl and Yeniterzi, Reyyan (2020) SU-NLP at SemEval-2020 task 12: offensive language identification in Turkish tweets. In: International Workshop on Semantic Evaluation, Barcelona, Spain

This is the latest version of this item.

PDF
2020.semeval-1.288.pdf

Abstract

This paper summarizes our group’s efforts in the offensive language identification shared task, which was organized as part of the International Workshop on Semantic Evaluation (SemEval-2020). Our final submission is an ensemble of three different models: (1) CNN-LSTM, (2) BiLSTM-Attention and (3) BERT. Word embeddings pre-trained on tweets are used to train the first two models. BERTurk, the first BERT model for Turkish, is also explored. Our final submission ranked second in the Turkish sub-task.
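
The abstract does not state how the three models' outputs are combined. As a rough illustration only, the sketch below assumes hard majority voting over per-tweet labels, which is one common way to build such an ensemble and is not necessarily the authors' method. The function name `majority_vote`, the variable names, and the label strings `OFF` and `NOT` are hypothetical choices made for this example.

```python
from collections import Counter
from typing import Sequence


def majority_vote(predictions: Sequence[Sequence[str]]) -> list[str]:
    """Combine per-model label lists by hard majority vote.

    Assumption: each inner sequence holds one model's predicted label
    for every tweet, in the same tweet order across models.
    """
    if not predictions:
        raise ValueError("need predictions from at least one model")
    n_tweets = len(predictions[0])
    if any(len(p) != n_tweets for p in predictions):
        raise ValueError("all models must label the same number of tweets")
    # zip(*predictions) groups the votes cast for each tweet.
    return [Counter(votes).most_common(1)[0][0] for votes in zip(*predictions)]


# Hypothetical outputs of the three models for four tweets.
cnn_lstm = ["OFF", "NOT", "NOT", "OFF"]
bilstm_attention = ["OFF", "OFF", "NOT", "NOT"]
bert = ["NOT", "OFF", "NOT", "OFF"]

ensemble = majority_vote([cnn_lstm, bilstm_attention, bert])
print(ensemble)
```

With three voters and two labels a tie cannot occur, so no tie-breaking rule is needed in this setting.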

Item Type:	Papers in Conference Proceedings
Divisions:	Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng.; Faculty of Engineering and Natural Sciences
Depositing User:	Reyyan Yeniterzi
Date Deposited:	01 Sep 2021 03:03
Last Modified:	08 Aug 2023 11:03
URI:	https://research.sabanciuniv.edu/id/eprint/41675

Available Versions of this Item

- SU-NLP at SemEval-2020 task 12: offensive language identification in Turkish tweets. (deposited 19 Sep 2020 08:35)
- SU-NLP at SemEval-2020 task 12: offensive language identification in Turkish tweets. (deposited 01 Sep 2021 03:03) [Currently Displayed]
