Create a database from scratch by extracting html elements from a webpage
Modules Used: Urllib.request, BeautifulSoup, Regex and Pandas.
Step by step walk-through:
Step 1: pulling HTML out of a webpage.
Step 2: targeting elements of interest inside the HTML.
Step 3: fine-tuning targeted elements with Regex (Regular Expressions), string concatenation and slicing.
Step 4: storing the data inside a DataFrame.
Step 5: exporting DataFrame into a CSV file.
Also available in a video explination: https://youtu.be/ySNSY7iiBDY
Author: Mariya Sha
Email: mariyasha888@gmail.com
LinkedIn: www.linkedin.com/in/mariyasha888/
MariyaSha/WebScraping
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Create a database from scratch by extracting html elements from a webpage
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published