ucla-data-science-center/scrape-interwebz

 
 

Extracting Data from the Internet in Python

This workshop covers how to extract data from the web using Python, through both APIs and web scraping.

Topics Covered

  • How the web works
  • Accessing databases via RESTful APIs
  • HTML / CSS
  • Inspecting and manipulating a webpage with Chrome DevTools
  • Web scraping with Beautiful Soup
  • Scraping JavaScript-heavy and interactive sites with Selenium
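
As a rough illustration of the two approaches in the list above, the sketch below reads titles from a JSON API response and collects links from an HTML page. It is a minimal sketch, not workshop material: the JSON body, the page markup, and the names `titles_from_api`, `links_from_html`, and `LinkCollector` are all hypothetical. To stay runnable without extra installs it uses the standard library's `html.parser` in place of Beautiful Soup, and inline strings in place of real HTTP requests.

```python
import json
from html.parser import HTMLParser

# Hypothetical API response body; a real one would arrive over HTTP.
API_RESPONSE = '{"results": [{"title": "First article"}, {"title": "Second article"}]}'

# Hypothetical page markup; a real page would be downloaded first.
PAGE_HTML = """
<html><body>
  <h1>Example page</h1>
  <a href="/one">One</a>
  <a href="/two">Two</a>
</body></html>
"""


class LinkCollector(HTMLParser):
    """Collects the href attribute of every <a> tag it encounters."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href is not None:
                self.links.append(href)


def titles_from_api(body):
    """API route: the data is already structured, so decode and index it."""
    return [item["title"] for item in json.loads(body)["results"]]


def links_from_html(markup):
    """Scraping route: walk the markup and pull out the parts of interest."""
    collector = LinkCollector()
    collector.feed(markup)
    return collector.links


print(titles_from_api(API_RESPONSE))
print(links_from_html(PAGE_HTML))
```

The contrast is the point of the sketch: an API hands back structured data that only needs decoding, while scraping has to recover structure from markup written for display.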

Requirements

This workshop will be using the Python programming language. See the software requirements here.

We will assume a basic knowledge of Python. If you've taken the D-Lab's Python Intensive, that should be sufficient.

Please note that the materials are still in development and subject to change.

Contact

Rochelle Terman: rterman@gmail.com

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
