Tutorial on extracting data via APIs and webscraping
Jupyter Notebook HTML
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
1_APIs
2_HTML_CSS
3_Beautiful_Soup
Bonus_Materials
.gitignore
0_Intro.html
A_Schedule
B_Tech-Requirements.md
LICENSE
README.md
Selenium.ipynb

README.md

Binder

Extracting Data from the Internet in Python

This workshop will cover how to extract data from the web using Python. We'll be covering both APIs and webscraping.

Topics Covered

  • How the web works
  • Accessing databases via RESTful APIs
  • HTML / CSS
  • Manipulating a webpage with Google DevTools
  • Webscraping with Beautiful Soup
  • Scraping javascript-heavy sites and interactive sites with Selenium

Requirements

This workshop will be using the Python programming language. See the software requirements here.

We will assume a basic knowledge of Python. If you've taken the D-Lab's Python Intensive, that should be sufficient.

Please note that materials are still in development, and will be changing.

Contact

Rochelle Terman: rterman@gmail.com