GitHub - karthikn2789/Selenium-Project: This repository contains example for a web scraping tutorial using Selenium.

Selenium Web Scraping Project

This project contains a Selenium example written in Python to demonstrate web scraping. It uses Selenium to extract PM2.5 values from https://openaq.org and stores them in a JSON file.
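The overall flow (load a page with Selenium, collect text from matching elements, write the result to JSON) can be sketched roughly as follows. This is a minimal illustration, not the code in openaq_scraper.py: the function names `scrape_texts` and `save_json`, the headless Chrome setup, and the XPath in the usage comment are all assumptions made for the example.

```python
import json


def save_json(data, path):
    """Write any JSON-serializable object to a file."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(data, f, indent=2)


def scrape_texts(url, xpath):
    """Return the visible text of every element matching xpath on url.

    Selenium is imported inside the function so that save_json can be
    used even on a machine where Selenium is not installed.
    """
    from selenium import webdriver
    from selenium.webdriver.common.by import By

    options = webdriver.ChromeOptions()
    options.add_argument("--headless")  # run without opening a window
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        return [el.text for el in driver.find_elements(By.XPATH, xpath)]
    finally:
        driver.quit()  # always release the browser process


# Hypothetical usage; the XPath is a placeholder, not the site's real markup:
# values = scrape_texts("https://openaq.org", "//h1")
# save_json(values, "example_output.json")
```

Pages that render their content with JavaScript may need an explicit wait before elements can be found; the sketch leaves that out for brevity.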

To run the openaq_scraper.py file, Selenium and a webdriver need to be installed. Selenium can be installed using the following command.

pip install selenium

Selenium supports webdrivers for 5 major browsers. Chromedriver for the Chrome browser can be installed using the following commands.

wget https://chromedriver.storage.googleapis.com/83.0.4103.39/chromedriver_linux64.zip

unzip chromedriver_linux64.zip

sudo mv chromedriver /usr/local/bin/

Geckodriver for Firefox can be installed with the following command.

sudo apt install firefox-geckodriver
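Before running the scraper, it can help to confirm that the installation steps above took effect. The following is a small sketch using only the Python standard library; the function name `check_prerequisites` is made up for this example, and it only checks that the `selenium` package is importable and that the driver executables are on the PATH, not that their versions match the installed browsers.

```python
import importlib.util
import shutil


def check_prerequisites():
    """Report whether Selenium and the browser drivers can be found."""
    return {
        # True if "import selenium" would succeed in this environment
        "selenium": importlib.util.find_spec("selenium") is not None,
        # True if an executable with this name is found on the PATH
        "chromedriver": shutil.which("chromedriver") is not None,
        "geckodriver": shutil.which("geckodriver") is not None,
    }


if __name__ == "__main__":
    for name, found in check_prerequisites().items():
        print(f"{name}: {'found' if found else 'missing'}")
```

Only one of the two drivers is needed, depending on which browser the scraper is run with.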

To run the project, execute the openaq_scraper.py file; it generates 3 JSON files as output. countries_list.json contains the names of all the countries listed on https://openaq.org. The URLs of all the locations reporting PM2.5 values are stored in urls.json, and the extracted PM2.5 values are stored in openaq_data.json.
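After a run, the three output files can be inspected with a few lines of Python. This sketch makes no assumption about the internal layout of each file beyond it being valid JSON, and reports only the top-level type and number of entries; the helper name `summarize_outputs` is hypothetical.

```python
import json
from pathlib import Path

# File names as listed in this README
OUTPUT_FILES = ("countries_list.json", "urls.json", "openaq_data.json")


def summarize_outputs(directory="."):
    """Map each output file to (top-level type, size), or None if absent."""
    summary = {}
    for name in OUTPUT_FILES:
        path = Path(directory) / name
        if not path.exists():
            summary[name] = None
            continue
        with path.open(encoding="utf-8") as f:
            data = json.load(f)
        # Lists and dicts report their length; any other value counts as 1
        size = len(data) if isinstance(data, (list, dict)) else 1
        summary[name] = (type(data).__name__, size)
    return summary


if __name__ == "__main__":
    for name, info in summarize_outputs().items():
        print(name, "->", info if info is not None else "not found")
```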

