Web Scraper Suite

Overview

This repository contains a suite of web scrapers built with Puppeteer for both static and dynamic web pages. The scrapers extract data from several websites and can be adapted to different web data extraction tasks.

Features

Scrapers for both static and dynamic websites
Utilizes Selenium for dynamic page interaction
Extracts data from multiple websites
Easy-to-use interface
Customizable scraping parameters
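
As an illustration of what customizable scraping parameters could look like, here is a minimal sketch that merges caller overrides into a set of defaults and validates the result. Every option name (startUrl, maxPages, delayMs, outputFormat) and the buildOptions helper are assumptions made for this example; they are not taken from the scraper scripts in this repository.

```javascript
// Hypothetical defaults; none of these option names come from the repository.
const DEFAULT_OPTIONS = Object.freeze({
  startUrl: 'https://example.com/',
  maxPages: 5,
  delayMs: 1000,
  outputFormat: 'csv',
});

// Merge caller overrides into the defaults and reject invalid values early.
function buildOptions(overrides = {}) {
  const options = { ...DEFAULT_OPTIONS, ...overrides };
  if (!Number.isInteger(options.maxPages) || options.maxPages < 1) {
    throw new RangeError('maxPages must be a positive integer');
  }
  if (!['csv', 'json'].includes(options.outputFormat)) {
    throw new RangeError('outputFormat must be "csv" or "json"');
  }
  return options;
}

// Example: override two settings and keep the remaining defaults.
const options = buildOptions({ maxPages: 2, outputFormat: 'json' });
console.log(options);
```

Validating up front means a typo in a parameter value fails before any browser session starts, rather than partway through a scraping run.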

Installation

Clone the repository to your local machine.
Install the required dependencies by running npm install in the repository root.
Download the appropriate web driver for your browser and ensure it's in your system PATH.

Usage

Configure the scraping parameters in the scraper scripts according to your requirements.
Run the desired scraper script using node, for example: node politifact.js
The scraped data will be saved in the specified output format (e.g., CSV, JSON).
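
The output step can be sketched as follows, assuming each scraped item is a flat object. The toCsv and escapeCsvField helpers and the sample fields (title, url) are hypothetical and only illustrate one way to produce CSV and JSON text; the actual scripts may format their output differently.

```javascript
// Hypothetical helper: escape one value for use as a CSV field.
function escapeCsvField(value) {
  const text = value === null || value === undefined ? '' : String(value);
  // Quote fields containing commas, quotes, or line breaks; double embedded quotes.
  if (/[",\r\n]/.test(text)) {
    return '"' + text.replace(/"/g, '""') + '"';
  }
  return text;
}

// Hypothetical helper: serialize flat record objects to CSV text with a header row.
function toCsv(records, columns) {
  const header = columns.map(escapeCsvField).join(',');
  const rows = records.map((record) =>
    columns.map((column) => escapeCsvField(record[column])).join(',')
  );
  return [header, ...rows].join('\n') + '\n';
}

// Illustrative records; the field names are assumptions, not the scrapers' schema.
const sample = [
  { title: 'Claim "A", checked', url: 'https://example.com/a' },
  { title: 'Plain title', url: 'https://example.com/b' },
];

const csvText = toCsv(sample, ['title', 'url']);
const jsonText = JSON.stringify(sample, null, 2);
console.log(csvText);
```

Either string could then be written to disk with fs.writeFileSync from Node's built-in fs module. Quoting matters for scraped text in particular, because headlines and captions often contain commas and quotation marks.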

Websites Scraped

The repository includes scraper scripts named politifact.js, Mastodon.js, and Alt news.js.

Blog

For more insights into web scraping techniques, tips, and tutorials, visit medium.com/@aymenmehmood812.

Contributors

Aymen Mehmood

License

This project is licensed under the MIT License - see the LICENSE file for details.

ayemenn/Web-Scrapper
