Monthly Archives: November 2011

A nice short article on memory in R

There is a nice short article on memory issue in R at http://www.matthewckeller.com/html/memory.html. If you use R to process large data, you might find it helpful. It introduces: – checking how much memory an object is taking; – the memory … Continue reading

Posted in R | Tagged | Leave a comment

Using Text Mining to Find Out What @RDataMining Tweets are About

This post shows an example on text mining of Twitter data with R packages twitteR, tm and wordcloud. Package twitteR provides access to Twitter data, tm provides functions for text mining, and wordcloud visualizes the result with a word cloud. … Continue reading

Posted in Data Mining, R | Tagged , , | 24 Comments

Help: stemming and stem completion with package tm in R

I came across a problem below when doing stemming and stem completion with package tm in R. Word “mining” was stemmed to “mine” with stemDocument(), and then completed to “miners”with stemCompletion(). However, I prefer to keep “mining” intact. For stemCompletion(), … Continue reading

Posted in Data Mining, R | Tagged , | 2 Comments