A one-hour introduction to text mining in R for the summer research internship at CESTA.
The lesson builds up to a sentiment analysis of a small corpus, currently the eight most recent US State of the Union addresses.
Participants need to install two pieces of free software:
- The R language and interpreter.
- RStudio, an integrated development environment (IDE) for R.
The lesson also depends on several R packages, such as tm, which we will install as we go.
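As a rough illustration of what installing a package "as we go" can look like, here is a minimal sketch. The helper name install_if_missing and the CRAN mirror URL are assumptions made for this example, not part of the lesson itself, and tm is used only because it is the one package named above.

```r
# Hypothetical helper (not part of the lesson): install a package from CRAN
# only when it is not already available in the current library paths.
install_if_missing <- function(pkg, repos = "https://cloud.r-project.org") {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    install.packages(pkg, repos = repos)
  }
  # Report whether the package can now be loaded.
  requireNamespace(pkg, quietly = TRUE)
}

# Only attempt the install from an interactive session such as the RStudio console.
if (interactive()) {
  install_if_missing("tm")
  library(tm)  # attach tm so its functions can be called directly
}
```

Checking with requireNamespace first avoids downloading a package again every time the script is run; calling install.packages("tm") directly at the prompt would work just as well for a one-off install.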
Once you have installed both R and RStudio, open RStudio.
If you want to test your installation, type the following at the > prompt:
print('hello, world')
If it works, you should see the following:
> print('hello, world')
[1] "hello, world"
Students who are interested in learning more about R may wish to check out some of these books: