NYU Shortcourse -- "Data Science and Social Science" materials


New York University Shortcourse: Data Science and Social Science

Co-sponsored by

January 20-22, 2016


(with materials prepared by Alex Hanna)

Teaching Assistants

  • Denis Stukal
  • Kevin Munger
  • Peter Crosta
  • Varun D N


This is a three-day short course covering key topics at the intersection of Data Science and Social Science. Each day is structured as a series of modules that combine instruction on data science methods with implementation using real data in R. The course covers an introduction to the R programming and statistical language, modeling and visualization, automated textual analysis, social network analysis, and web scraping & APIs.

Setup and Preparation

You will need to bring a laptop to all sessions of the workshop. You will need R and RStudio installed. Follow the instructions here to install both.

Instructions for using course materials on GitHub

You have three options for downloading the course material found on this page:

  1. You can download the materials by clicking on each link.

  2. You can "clone" the repository using the button labelled "Clone in Desktop", found on the right side of your browser window as you view this repository. If you do not have a git client installed on your system, you will need to get one here and make sure that git itself is installed. This is the preferred option, since you can refresh your clone as new content is pushed to the course repository. (New material will be pushed at least once per day while the course takes place.)

  3. Most simply, you can choose the button on the right marked "Download zip" which will download the entire repository as a zip file.

You can also subscribe to the repository if you have a GitHub account, which will send you updates each time new changes are pushed to the repository.
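For those who prefer the command line to the "Clone in Desktop" button, the clone-then-refresh workflow from option 2 might look roughly like the sketch below. The function names `get_materials` and `refresh_materials`, the folder name `course-materials`, and the URL in the usage comment are all hypothetical placeholders, not part of the course materials; copy the real clone URL from the repository page.

```shell
# Hypothetical helper: clone the course repository once.
# $1 = clone URL copied from the repository page, $2 = local folder name of your choice.
get_materials() {
  git clone "$1" "$2"
}

# Hypothetical helper: update an existing clone with whatever has been pushed since.
# $1 = the local folder created by get_materials.
refresh_materials() {
  git -C "$1" pull
}

# Usage (placeholder URL, shown for illustration only):
#   get_materials "https://github.com/USER/REPOSITORY.git" course-materials
#   refresh_materials course-materials
```

Running the refresh step at the start of each session should pick up the material pushed during the course. If you have edited course files in place, `git pull` may report conflicts, so keeping your own work in a separate folder is the simpler choice.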


| Day    | Time        | Topic                           | Instructor |
|--------|-------------|---------------------------------|------------|
| Jan 20 | 09:00-12:00 | Intro to R and Data Munging     | Dan        |
| Jan 20 | 13:30-16:30 | Data Modeling and Visualization | Dan        |
| Jan 21 | 09:00-12:00 | Automated Textual Analysis      | Pablo      |
| Jan 21 | 13:30-16:30 | Social Network Analysis         | Pablo      |
| Jan 22 | 09:00-12:00 | Web scraping & APIs             | Pablo      |
| Jan 22 | 13:30-16:30 | Research Practicum              | Dan        |