Skip to content

PS239T/spring_2021

Repository files navigation

Computational Tools and Techniques for Social Science

Jupyter binder: https://mybinder.org/v2/gh/PS239T/spring_2021/64a6e3c52d40c287cd032432d8c9a5be3df624bf

This course will help social science graduate students to think computationally and develop proficiency with computational tools and techniques, necessary to conduct research in computational social science. Mastering these tools and techniques not only enables students to collect, wrangle, analyze, and interpret data with less pain and more fun, but it also let students to work on research projects that would previously seem impossible.

Learning Objectives

  • The goal of this course is to help students become a more efficient and innovative researcher by leveraging the power of automation.

  • The course is currently divided into two main subjects (fundamentals and applications) and six main sessions.

Part I Fundamentals

  • In the first section, students learn best practices in data and code management using Git and Bash.

  • In the second, students learn how to wrangle, model, and visualize data easier and faster.

  • In the third, students learn how to use functions to automate repeated things and develop their own data tools (e.g., R packages, Shiny apps).

Part II Applications

  • In the fourth, students learn how to collect and parse semi-structured data at scale (e.g., scraping websites, PDF files, and social media posts).

  • In the fifth, students learn how to analyze high-dimensional data (e.g., text) using machine learning.

  • In the final, students learn how to access, query, and manage big data using SQL.

Time and location

  • Meeting dates: 01/19/2021 - 05/07/2021

  • Lecture: Monday 9-11 AM (Zoom)

  • Section: Friday 8-10 AM (Zoom)

Textbook

The online book version of the course materials by Jae Yeon Kim is currently work in progress.

Contact

Please create issues on the course GitHub repository, if you have questions, comments, or suggestions.

Instructor(s): Jae Yeon Kim (jaeyeonkim@berkeley.edu) and Nick Kuipers (nkuipers@berkeley.edu)

Special thanks

This course is a remix version of the course originally developed by Rochelle Terman then revised by Rachel Bernhard. Other teaching materials draw from the workshops Jae Yeon Kim has created for D-Lab and Data Science Discovery Program at UC Berkeley.

This work is licensed under a Creative Commons Attribution 4.0 International License.