The Lansing Area R Users Group (laRUG) brings together R users in the Lansing area for discussions on data science (analysis and predictive modeling), big data, and all things R. This group provides a relaxed environment to exchange ideas and discuss R. Whether you are new to R (and programming), a statistician, or an advanced user, we are the group for you.
The May 2017 meetup focused on R packages for text mining and the types of analysis possible.
The tm and tidytext packages were discussed. Throughout the discussion, tidytext was seen as the cleanest way to work with data.frames. The RTextTools package is another option for text mining and sentiment analysis.
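To illustrate why tidytext fits so naturally with data.frames, here is a minimal sketch of the one-token-per-row workflow. The sample sentences and the names docs, tokens, and word_counts are made up for illustration and are not from the meetup; the sketch assumes the dplyr and tidytext packages are installed.

```r
library(dplyr)
library(tidytext)

# Hypothetical sample data: one row per document
docs <- tibble(
  doc_id = c(1, 2),
  text = c("R makes text mining approachable.",
           "Tidy text mining keeps one token per row.")
)

# unnest_tokens() splits the text column into one lowercase word per row,
# stripping punctuation along the way
tokens <- docs %>%
  unnest_tokens(word, text)

# Drop common stop words (the stop_words data set ships with tidytext),
# then count how often each remaining word appears
word_counts <- tokens %>%
  anti_join(stop_words, by = "word") %>%
  count(word, sort = TRUE)

print(word_counts)
```

Because the result is an ordinary data frame, the usual dplyr and ggplot2 verbs can be applied to it directly for further filtering, joining, or plotting.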
tidytext: Text Mining using 'dplyr', 'ggplot2', and Other Tidy Tools (CRAN link)
tm: Text Mining Package (CRAN link)
RTextTools: Automatic Text Classification via Supervised Learning (no longer developed; CRAN link)

tidytext Tutorials
The following tidytext vignettes and tutorials are an excellent resource for learning how to clean and analyze text, and they were the basis for our discussion.
tidytext vignettes
tidytext tutorials