Computational Text Analysis Workshop Materials
HTML R
Permalink
Failed to load latest commit information.
Bonus-Scripts
Data
Results character plotting with blackalicious Jul 31, 2015
img
.gitignore
0-Intro.html
1-Pre-processing.Rmd
1-Pre-processing.html refreshing Aug 16, 2015
2-discriminating-words.Rmd
2-discriminating-words.html
3-sentiment.Rmd
3-sentiment.html editing date Aug 8, 2015
4-clustering.Rmd
4-clustering.html
5-mallet.Rmd wd Nov 23, 2015
5-mallet.html knitting, credits Aug 7, 2015
6-Literary.Rmd wd Nov 23, 2015
A-syllabus.md
B-Tech-Requirements.md
C-Rstudio.Rmd renaming Aug 16, 2015
C-Rstudio.html renaming Aug 16, 2015
D-Annotated-Bibliography.md
E-Credits.md renaming Aug 16, 2015
README.md

README.md

Computational Text Analysis Workshop

About

This workshop was originally prepared for the 2015 Digital Humanities @ Berkeley Summer Institute. It has since been taught elsewhere.

This course introduces students to modern quantitative text analysis techniques, with the ultimate goal of providing the skills necessary to apply the methods in their own research. We will use the open source programming language R. Demonstration corpora are provided.

Topics Covered

  • Acquiring and Preprocessing texts
  • Discriminating Words
  • Dictionary Methods and Sentiment Analysis
  • The Vector Space Model and the Geometry of Text (Multi-dimensional Scaling, Most Similar Texts, Clustering)
  • Topic Models
  • Quantifying Style: Grammar, Alliteration, and other Poetic Concerns

See the entire syllabus here.

Requirements

This workship will be using the R programming language. See the software requirements here.

Students are strongly encouraged to complete this brief tutorial to learn the basic syntax of the R programming language.

Contact

Rochelle Terman: rterman@gmail.com