Text mining with rapidminer

Ertek, Gürdal and Tapucu, Dilek and Arın, İnanç (2012) Text mining with rapidminer. In: Hofmann, Markus and Klinkenberg, Ralf, (eds.) Use Cases with RapidMiner. John Wiley & Sons, USA. (Accepted/In Press)

Warning
There is a more recent version of this item available.
[thumbnail of Text Mining with RapidMiner] PDF (Text Mining with RapidMiner)
Chapter_03_v21_after_feedback.pdf
Restricted to Registered users only

Download (4MB) | Request a copy
Item Type: Book Section / Chapter
Additional Information: The goal of this chapter is to introduce the text mining capabilities of RAPIDMINER through a use case. The use case involves mining reviews for hotels at TripAdvisor.com, a popular web portal. We will be demonstrating basic text mining in RAPIDMINER using the text mining extension. We will present two different RAPIDMINER processes, namely Process01 and Process02, which respectively describe howtext mining can be combined with association mining and cluster modeling. While it is possible to construct each of these processes from scratch by inserting the appropriate operators into the process view, we will instead import these two processes readily from existing model files. Throughout the chapter, we will at times deliberately instruct the reader to take erroneous steps that result in undesired outcomes. We believe that this is a very realistic way of learning to use RAPIDMINER, since in practice, the modeling process frequently involves such steps that are later corrected.
Uncontrolled Keywords: Re-Mining, Association, Mining Results, Visualization, Data Envelopment, Analysis, Decision Trees
Subjects: T Technology > T Technology (General)
H Social Sciences > HD Industries. Land use. Labor > HD0028 Management. Industrial Management
T Technology > T Technology (General) > T055.4-60.8 Industrial engineering. Management engineering > T58.5 Information technology
Q Science > QA Mathematics > QA273-280 Probabilities. Mathematical statistics
H Social Sciences > HD Industries. Land use. Labor > HD0030.2 Electronic data processing. Information technology
Q Science > QA Mathematics > QA076 Computer software
H Social Sciences > HD Industries. Land use. Labor > HD2321-4730.9 Industry
Q Science > QA Mathematics > QA075 Electronic computers. Computer science
T Technology > T Technology (General) > T055.4-60.8 Industrial engineering. Management engineering > T58.6-58.62 Management information systems
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng.
Faculty of Engineering and Natural Sciences
Faculty of Engineering and Natural Sciences > Academic programs > Manufacturing Systems Eng.
Depositing User: Gürdal Ertek
Date Deposited: 01 Dec 2012 23:08
Last Modified: 26 Apr 2022 08:29
URI: https://research.sabanciuniv.edu/id/eprint/20997

Available Versions of this Item

Actions (login required)

View Item
View Item