Text mining with RapidMiner
Ertek, Gürdal and Tapucu, Dilek and Arın, İnanç (2012) Text mining with RapidMiner. In: Hofmann, Markus and Klinkenberg, Ralf, (eds.) Use Cases with RapidMiner. John Wiley & Sons, USA. (Accepted/In Press)
Item Type: | Book Section / Chapter |
---|
Additional Information: | The goal of this chapter is to introduce the text mining capabilities of RAPIDMINER through a use case. The use case involves mining reviews for hotels at TripAdvisor.com, a popular web portal. We will demonstrate basic text mining in RAPIDMINER using the text mining extension. We will present two different RAPIDMINER processes, namely Process01 and Process02, which respectively describe how text mining can be combined with association mining and cluster modeling. While it is possible to construct each of these processes from scratch by inserting the appropriate operators into the process view, we will instead import these two processes readily from existing model files.
Throughout the chapter, we will at times deliberately instruct the reader to take erroneous steps that result in undesired outcomes. We believe that this is a very realistic way of learning to use RAPIDMINER, since in practice, the modeling process frequently involves such steps that are later corrected. |
---|
Uncontrolled Keywords: | Re-Mining, Association Mining Results, Visualization, Data Envelopment Analysis, Decision Trees |
---|
Subjects: | T Technology > T Technology (General); H Social Sciences > HD Industries. Land use. Labor > HD0028 Management. Industrial Management; T Technology > T Technology (General) > T055.4-60.8 Industrial engineering. Management engineering > T58.5 Information technology; Q Science > QA Mathematics > QA273-280 Probabilities. Mathematical statistics; H Social Sciences > HD Industries. Land use. Labor > HD0030.2 Electronic data processing. Information technology; Q Science > QA Mathematics > QA076 Computer software; H Social Sciences > HD Industries. Land use. Labor > HD2321-4730.9 Industry; Q Science > QA Mathematics > QA075 Electronic computers. Computer science; T Technology > T Technology (General) > T055.4-60.8 Industrial engineering. Management engineering > T58.6-58.62 Management information systems |
---|
ID Code: | 20997 |
---|
Deposited By: | Gürdal Ertek |
---|
Deposited On: | 01 Dec 2012 23:08 |
---|
Last Modified: | 31 Jul 2019 16:47 |
---|