Skip to content

HashSlap-Summer-of-Code/scraping-scripts

🕸️ Web Scraping Scripts

Welcome to the Web Scraping Scripts repository under the HashSlap Summer of Code (HSSoC) initiative!

This repository houses modular and powerful web scraping scripts to extract useful data from the web — whether you're automating data collection, learning web scraping, or building utilities, this is the perfect playground.


📦 What's in This Repo?

Here you'll find Python scraping scripts for:

  • 📰 News websites
  • 📦 E-commerce price trackers
  • 🧑‍🎓 GitHub profile scrapers
  • 📊 Stock market data grabbers
  • 🎥 YouTube metadata collectors

Built using popular tools like requests, BeautifulSoup, and Selenium.


🧠 Goals of This Project

  • Provide real-world scraping examples
  • Help beginners learn how to parse HTML and interact with websites
  • Build a collaborative collection of reliable scrapers
  • Support modular, reusable script architecture

🗂 Folder Structure

.
├── ecommerce/
│   └── amazon_price_tracker.py
├── news/
│   └── bbc_headlines.py
├── social/
│   └── github_stats_scraper.py
├── video/
│   └── youtube_video_data.py
└── README.md

🤝 Contributing Guidelines

  • Use Python 3.x
  • Organize scripts by category
  • Keep your code clean and documented
  • Output must be in JSON, CSV, or pretty printed format

📌 You must only scrape public content & respect robots.txt.


🚀 Getting Started

git clone https://github.com/your-username/scraping-scripts.git
cd scraping-scripts
python3 your-script.py

📜 License

This project is licensed under the MIT License.


About

Repo with scraping scripts to scrape data from websites!

Resources

License

Code of conduct

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 5