This repository contains the materials for the first part of the Introduction to Data Science course offered by the University of Rosario in June, 2018. This portion of the course covers an introduction to the foundations of data science, text analysis, working with social media data and some unsupervised models. Most sections contain a theoretical overview of the topic and coding examples in R.
Logistical information about the course can be found here:
Much of this course is a combination of courses I have helped teach in the past; all of the materials here are free for anyone to use. I want to particularly thank Pablo Barbera, Ken Benoit, and Matt Salganik for sharing their course materials or making them publicly avaliable.
Intro to Text as Data (slides)
Introduction to Using Text as Data (R)
Unsupervised Models of Text (R)
Dickens Austen PCA example (R)