Skip to content

A web scraper that scrape all the book data from the website and stores in a sqlite3 database.

Notifications You must be signed in to change notification settings

sushil-rgb/himpub-web-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

himpub-web-scraper

A web scraper that scrape all the book data from the himpub and stores in a MYSQL database. The scraper use concurrency features for faster scraping. It currently scrapes around 3K records in 4 minutes.

Instructions

Clone the repository Install necessary modules: pip install requests pip install beautifulsoup4

run the scraperToDb.py file and the scraper will scrape and store the data in MYSQL database.

About

A web scraper that scrape all the book data from the website and stores in a sqlite3 database.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages