Aslam, Kousar (2015) Comparative analysis of Active Learning strategies in Twitter domain. [Thesis]
PDF
KousarAslam_10100312.pdf
Download (4MB)
KousarAslam_10100312.pdf
Download (4MB)
Abstract
Since its launch in the year 2006, Twitter has been One of the most popular social media platforms where users are free to share opinions, ideas and feelings. Latest statistics tell us that nearly 350,000 tweets are being posted every minute On Twitter. Also twitter is the first place to track the response to any important incident or events in the world. For this reason, Twitter has attracted the researchers from many fields, including Sentiment Analysis which deals with opinion mining from text. Twitter data is rich in containing the sentiments but is inherent with the problem of being very informal and unstructured which makes it very difficult to convert this data information. Labeling this large amount of data build classifiers for supervised learning is next to impossible. So we make use of Active Learning which is a subfield of Machine Learning and concerns with the selection of most informative instances to train the classifiers thus saving labeling efforts. This thesis deals with the comparative analysis of selected Active learning sampling strategies with twitter domain. The results show Uncertainty Sampling beats Random Satnpling and (Query by Committee consistently An analysis of agreelllent levels among annotators for twitter data has also been presented.
Item Type: | Thesis |
---|---|
Uncontrolled Keywords: | Active learning. -- Sentiment analysis. -- Twitter. |
Subjects: | T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK7800-8360 Electronics > TK7885-7895 Computer engineering. Computer hardware |
Divisions: | Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng. Faculty of Engineering and Natural Sciences |
Depositing User: | IC-Cataloging |
Date Deposited: | 03 Apr 2018 14:52 |
Last Modified: | 26 Apr 2022 10:14 |
URI: | https://research.sabanciuniv.edu/id/eprint/34369 |