Skip to content
Script that converts website url and targeted elements into a readable csv table, customizable for any website
Python HTML
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
__pycache__
images-wiki
manual-scrapes
README.md
scraping_template.py
simple.html
template_blog_table.csv
template_cpu_table.csv
template_scrapes.py

README.md

Webscraping Template

A Python script that converts website url and targeted elements into a readable csv table, customizable for any website. The template blog/gpu csv files should have an identical output as the manual blog/gpu csv files in the manual-scrapes folder. simple.html is a short HTML file you can practice webscraping on.

Make sure to install BeautifulSoup with pip install bs4 and requests with pip install requests

See Wiki for a more thorough walkthrough.

Beautiful Soup Documentation (bs4):

https://www.crummy.com/software/BeautifulSoup/bs4/doc/

Useful videos for introduction to web scraping (Code in manual-scrapes are based off/inspired by these):

Web Scraping with BeautifulSoup and Requests | Corey Schafer: https://www.youtube.com/watch?v=ng2o98k983k

Web Scraping with Python and BeautifulSoup | Data Science Dojo: https://www.youtube.com/watch?v=XQgXKtPSzUI

You can’t perform that action at this time.