Ertek, Gürdal and Tapucu, Dilek and Arın, İnanç (2012) Text mining with rapidminer. In: Hofmann, Markus and Klinkenberg, Ralf, (eds.) Use Cases with RapidMiner. John Wiley & Sons, USA. (Accepted/In Press)
There is a more recent version of this item available.
PDF (Text Mining with RapidMiner)
Chapter_03_v21_after_feedback.pdf
Item Type: | Book Section / Chapter |
---|---|
Additional Information: | The goal of this chapter is to introduce the text mining capabilities of RAPIDMINER through a use case. The use case involves mining reviews for hotels at TripAdvisor.com, a popular web portal. We will be demonstrating basic text mining in RAPIDMINER using the text mining extension. We will present two different RAPIDMINER processes, namely Process01 and Process02, which respectively describe how text mining can be combined with association mining and cluster modeling. While it is possible to construct each of these processes from scratch by inserting the appropriate operators into the process view, we will instead import these two processes readily from existing model files. Throughout the chapter, we will at times deliberately instruct the reader to take erroneous steps that result in undesired outcomes. We believe that this is a very realistic way of learning to use RAPIDMINER, since in practice, the modeling process frequently involves such steps that are later corrected. |
Uncontrolled Keywords: | Re-Mining, Association, Mining Results, Visualization, Data Envelopment Analysis, Decision Trees |
Subjects: | T Technology > T Technology (General); H Social Sciences > HD Industries. Land use. Labor > HD0028 Management. Industrial Management; T Technology > T Technology (General) > T055.4-60.8 Industrial engineering. Management engineering > T58.5 Information technology; Q Science > QA Mathematics > QA273-280 Probabilities. Mathematical statistics; H Social Sciences > HD Industries. Land use. Labor > HD0030.2 Electronic data processing. Information technology; Q Science > QA Mathematics > QA076 Computer software; H Social Sciences > HD Industries. Land use. Labor > HD2321-4730.9 Industry; Q Science > QA Mathematics > QA075 Electronic computers. Computer science; T Technology > T Technology (General) > T055.4-60.8 Industrial engineering. Management engineering > T58.6-58.62 Management information systems |
Divisions: | Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng.; Faculty of Engineering and Natural Sciences; Faculty of Engineering and Natural Sciences > Academic programs > Manufacturing Systems Eng. |
Depositing User: | Gürdal Ertek |
Date Deposited: | 01 Dec 2012 23:08 |
Last Modified: | 26 Apr 2022 08:29 |
URI: | https://research.sabanciuniv.edu/id/eprint/20997 |
Available Versions of this Item
- Text mining with rapidminer. (deposited 01 Dec 2012 23:08) [Currently Displayed]