title   
  

Text mining with rapidminer

Ertek, Gürdal and Tapucu, Dilek and Arın, İnanç (2012) Text mining with rapidminer. In: Hofmann, Markus and Klinkenberg, Ralf, (eds.) Use Cases with RapidMiner. John Wiley & Sons, USA. (Accepted/In Press)

WarningThere is a more recent version of this item available.

[img]PDF (Text Mining with RapidMiner) - Registered users only - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
3968Kb

Item Type:Book Section / Chapter
Additional Information:The goal of this chapter is to introduce the text mining capabilities of RAPIDMINER through a use case. The use case involves mining reviews for hotels at TripAdvisor.com, a popular web portal. We will be demonstrating basic text mining in RAPIDMINER using the text mining extension. We will present two different RAPIDMINER processes, namely Process01 and Process02, which respectively describe howtext mining can be combined with association mining and cluster modeling. While it is possible to construct each of these processes from scratch by inserting the appropriate operators into the process view, we will instead import these two processes readily from existing model files. Throughout the chapter, we will at times deliberately instruct the reader to take erroneous steps that result in undesired outcomes. We believe that this is a very realistic way of learning to use RAPIDMINER, since in practice, the modeling process frequently involves such steps that are later corrected.
Uncontrolled Keywords:Re-Mining, Association, Mining Results, Visualization, Data Envelopment, Analysis, Decision Trees
Subjects:T Technology > T Technology (General)
H Social Sciences > HD Industries. Land use. Labor > HD0028 Management. Industrial Management
T Technology > T Technology (General) > T055.4-60.8 Industrial engineering. Management engineering > T58.5 Information technology
Q Science > QA Mathematics > QA273-280 Probabilities. Mathematical statistics
H Social Sciences > HD Industries. Land use. Labor > HD0030.2 Electronic data processing. Information technology
Q Science > QA Mathematics > QA076 Computer software
H Social Sciences > HD Industries. Land use. Labor > HD2321-4730.9 Industry
Q Science > QA Mathematics > QA075 Electronic computers. Computer science
T Technology > T Technology (General) > T055.4-60.8 Industrial engineering. Management engineering > T58.6-58.62 Management information systems
ID Code:20997
Deposited By:Gürdal Ertek
Deposited On:01 Dec 2012 23:08
Last Modified:17 Jan 2014 16:27

Available Versions of this Item

Repository Staff Only: item control page