Welcome to the Web Scraping Scripts repository under the HashSlap Summer of Code (HSSoC) initiative!
This repository houses modular and powerful web scraping scripts to extract useful data from the web — whether you're automating data collection, learning web scraping, or building utilities, this is the perfect playground.
Here you'll find Python scraping scripts for:
- 📰 News websites
- 📦 E-commerce price trackers
- 🧑🎓 GitHub profile scrapers
- 📊 Stock market data grabbers
- 🎥 YouTube metadata collectors
Built using popular tools like requests
, BeautifulSoup
, and Selenium
.
- Provide real-world scraping examples
- Help beginners learn how to parse HTML and interact with websites
- Build a collaborative collection of reliable scrapers
- Support modular, reusable script architecture
.
├── ecommerce/
│ └── amazon_price_tracker.py
├── news/
│ └── bbc_headlines.py
├── social/
│ └── github_stats_scraper.py
├── video/
│ └── youtube_video_data.py
└── README.md
- Use Python 3.x
- Organize scripts by category
- Keep your code clean and documented
- Output must be in JSON, CSV, or pretty printed format
📌 You must only scrape public content & respect robots.txt
.
git clone https://github.com/your-username/scraping-scripts.git
cd scraping-scripts
python3 your-script.py
This project is licensed under the MIT License.