Cross collection aspect based opinion mining using topic models

Warning The system is temporarily closed to updates for reporting purpose.

Kaporo, Hemed Hamisi (2018) Cross collection aspect based opinion mining using topic models. [Thesis]

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader

Official URL: http://risc01.sabanciuniv.edu/record=b1817215 (Table of Contents)


Aspect based opinion mining is the automated science of identifying and extracting sentiments associated to individual aspects in a text document. Over the years this science has emerged to be a cornerstone for analysis of public opinion on consumer products and social-political events. The task is more fruitful and likewise more challenging when comparison of opinion on aspects of multiple entities is of essence. Different methods in literature have attempted to extract aspects in a single collection or collection by collection across multiple collection. These approaches do not appeal when number of collections is large and hence su er significant performance drawbacks. In this work we perform aspect based opinion mining across contrasting multiple collections, simultaneously. We utilize existing cross collection topic models to identify topics that prevail across multiple collections, we propose a topic refinement algorithm that successfully converts these topics into semantically coherent and visually identifiable aspects. We compare the quality of aspects extracted by our algorithm to topics returned by two cross collection topic models. Finally we evaluate the accuracy of sentiment scores when measured over features extracted by the two cross collection topic models. We conclude that with proposed improvements cross collection topic models outperform state of art approaches in aspect based sentiment analysis.

Item Type:Thesis
Uncontrolled Keywords:Cross collection topic modeling. -- Aspect based sentiment analysis. -- Text mining. -- Çapraz koleksiyon konu modellerini. -- Anlam temelli görüş madenciliği. -- Metin madenciliği.
Subjects:T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK7800-8360 Electronics > TK7885-7895 Computer engineering. Computer hardware
ID Code:36615
Deposited By:IC-Cataloging
Deposited On:08 Oct 2018 15:42
Last Modified:25 Mar 2019 17:30

Repository Staff Only: item control page