MariyaSha/WebScraping

WebScraping

Create a database from scratch by extracting HTML elements from a webpage.
Modules used: urllib.request, BeautifulSoup, re (regular expressions) and pandas.

Step-by-step walkthrough:

Step 1: pulling HTML out of a webpage.
Step 2: targeting elements of interest inside the HTML.
Step 3: fine-tuning targeted elements with Regex (Regular Expressions), string concatenation and slicing.
Step 4: storing the data inside a DataFrame.
Step 5: exporting DataFrame into a CSV file.
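
The five steps above can be sketched roughly as follows. This is a minimal illustration, not the code from the repository or the video: the sample markup, the class names `item`, `name` and `price`, and the helper names `fetch_html` and `parse_items` are all hypothetical, and an inline HTML string stands in for a live page so the sketch runs offline.

```python
import re
from urllib.request import urlopen

import pandas as pd
from bs4 import BeautifulSoup


def fetch_html(url: str) -> str:
    """Step 1: download a page and return its HTML as text."""
    with urlopen(url) as response:
        return response.read().decode("utf-8")


# Hypothetical sample markup, used instead of a live page.
# For a real page, call fetch_html(url) with the address you want to scrape.
SAMPLE_HTML = """
<ul>
  <li class="item"><span class="name">Widget</span>
      <span class="price">Price: $4.50</span></li>
  <li class="item"><span class="name">Gadget</span>
      <span class="price">Price: $12.00</span></li>
</ul>
"""


def parse_items(html: str) -> pd.DataFrame:
    # Step 2: target the elements of interest inside the HTML.
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for item in soup.find_all("li", class_="item"):
        name = item.find("span", class_="name").get_text(strip=True)
        raw_price = item.find("span", class_="price").get_text(strip=True)
        # Step 3: fine-tune the extracted text with a regular expression,
        # keeping only the numeric part of a string like "Price: $4.50".
        match = re.search(r"\d+(?:\.\d+)?", raw_price)
        price = float(match.group()) if match else None
        rows.append({"name": name, "price": price})
    # Step 4: store the data inside a DataFrame.
    return pd.DataFrame(rows, columns=["name", "price"])


df = parse_items(SAMPLE_HTML)
# Step 5: export the DataFrame as CSV. With no path, to_csv returns the CSV
# text; pass a file name such as "items.csv" to write a file instead.
csv_text = df.to_csv(index=False)
print(csv_text)
```

On a real site the tag names and classes passed to `find_all` and `find` would need to match that page's markup, and the regular expression would depend on how the target text is formatted.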

Also available as a video explanation: https://youtu.be/ySNSY7iiBDY
Author: Mariya Sha
Email: mariyasha888@gmail.com
LinkedIn: www.linkedin.com/in/mariyasha888/
