The repository contains examples of the following two common libraries used for Web Scraping:
- BeautifulSoup
- beautifulsoup.ipynb retrieves, saves and outputs all "book" div's from a static website.
- Selenium with Python
- selenium.ipynb retrieves, saves and outputs a table generated with Angular.js in browser mode.
- selenium_headless.ipynb retrieves, saves and outputs a table generated with Angular.js in headless mode.
Note: For the Selenium examples the chromedriver can be installed here, and has to be the same version as your Google Chrome/Chromium distribution.