I recently started exploring RapidMiner to do sentiment
analysis and text classification of social media data. So I am going to post
some tutorials on RapidMiner based on what I have learned so far on this tool.
In this post, I am writing on very basic thing – How to
read, write data and transform cases in RapidMiner.
RapidMiner is a free tool and can be downloaded from www.rapid-i.com
. Make sure you have Text Analytics
plugin of RapidMiner installed.
Below is the model, I have built in RapidMiner to read
and write text.
It includes 5 operators
- Read Excel
- Nominal to Text
- Process Documents
- Transform Cases
- Write Excel
Select the excel file from your system which you want to load.
Connect it with the “Nominal to Text” operator. This operator replaces all nominal attributes by corresponding string
attributes.
Then connect it further
with “Process Documents to Data”.
At the end connect “Write
Excel” operator. Select the excel file where you want to write the output. This
operator can be used to write data into Microsoft Excel spreadsheets. This
operator creates Excel files readable by Excel 95, 97, 2000, XP, 2003 and
newer.
In the process documents, I have added operator “Transform
Cases” as shown in below pic.
“Transforms case” operator transforms all characters in a
document to either lower case or upper case, respectively.
Select from the drop down whether you want to transform
all text to upper or lower case.
Run the Model.
You can see that the text have been transformed to lower
case and will be written in output excel file.
If you are looking for XML of this model, please leave your
email ID in comment box and I will share the XML with you.
I am looking for the XML. Kann you send me this one: markklein2001-2@yahoo.de
ReplyDelete