Webscraper v4

Webscraping application made in Python to process online marketplace data for different products on various platforms
Uses Selenium and Chrome driver to open webpages in an optionally headless (invisible) browser and access their content
Currently scrapable: Amazon India, Flipkart, BigBasket

USAGE

Setup

Required on your system:

Python (added to PATH): Install from https://www.python.org/downloads/ and add to PATH variable
Chrome: Install from https://www.google.com/intl/en_us/chrome/
Chrome Driver: Download (same version as Chrome!) from https://chromedriver.chromium.org/downloads (versions <= 114) or https://googlechromelabs.github.io/chrome-for-testing (versions >= 115)
Python modules (selenium and tqdm): Run command pip install -r path/to/WebScraper/requirements.txt (see also: Updating)

Edit chrome_driver_path in consts.txt according to your system

Running

Windows users can quickly run by clicking on RUN.bat
Otherwise, run commands cd path/to/WebScraper and python main.py

Updating

If cloned with Git, Windows users can use UPDATE.bat to pull the latest version while preserving consts.txt
Otherwise, run commands cd path/to/WebScraper and python updater.py
This will ask you if you would like to update Python packages as well (note that this is a time-taking process)

Notes

Selenium handshake failure errors can mostly be ignored

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
.gitignore		.gitignore
ChromeDriver v115+ Download Link.url		ChromeDriver v115+ Download Link.url
README.md		README.md
RUN.bat		RUN.bat
UPDATE.bat		UPDATE.bat
amazonScraper.py		amazonScraper.py
bigbasketScraper.py		bigbasketScraper.py
common.py		common.py
consts.txt		consts.txt
familydollarScraper.py		familydollarScraper.py
flipkartScraper.py		flipkartScraper.py
kotsovolosScraper.py		kotsovolosScraper.py
main.py		main.py
requirements.txt		requirements.txt
updater.py		updater.py
zeptoScraper.py		zeptoScraper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Webscraper v4

USAGE

Setup

Running

Updating

Notes

About

Releases

Packages

Languages

kushagraVerma/WebScraper

Folders and files

Latest commit

History

Repository files navigation

Webscraper v4

USAGE

Setup

Running

Updating

Notes

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages