Web-Scraper

🚀 A powerful Python web scraping toolkit with CLI & export support. Scrape any website, clean & save data (CSV/JSON/XLSX), schedule jobs, and extend with AI for insights. Perfect for learners, researchers & pros building data-driven apps.

Web Scraping Command-Line Tool 🕵️‍♂️📊

A simple yet powerful Python-based command-line tool for extracting data from websites and presenting it in a clean, tabular format. This project demonstrates the use of requests, BeautifulSoup, and BeautifulTable to fetch, parse, and display web data efficiently.

✨ Features

• 🔎 Fetch and parse live website content

• 📑 Extract structured information from HTML

• 📊 Display data in a neat table format

• 💾 Save scraped data with an alias for later use

• ⚡ Lightweight and easy-to-use command-line interface

🛠️ Tech Stack

• Python 3.9+

• Requests – for making HTTP requests

• BeautifulSoup4 – for HTML parsing

• BeautifulTable – for tabular output

📂 Project Structure

🚀 Installation & Usage

1.Create and activate a virtual environment

python -m venv venv source venv/bin/activate # Mac/Linux venv\Scripts\activate # Windows

2.Install dependencies

Inside your CMD type

pip install requests pip install beautifulsoup4 pip install beautifultable pip install lxml pip install certifi pip install attrs pip install soupsieve

Or instead Installing one by one you can run all at a same time

pip install requests beautifulsoup4 beautifultable lxml certifi attrs soupsieve

Run the script

python web_scraping_command_line_tool.py

📸 Demo

📈 Future Improvements

• 🌍 Multi-website scraping support

• 📊 Export data to CSV, Excel, or JSON

• 🔧 Add custom scraping rules (XPath/CSS selectors)

• ⚡ Parallel scraping for speed

• 🌐 Option to scrape JS-rendered websites (via Selenium/Playwright)

🤝 Contributing

Contributions, issues, and feature requests are welcome! Feel free to fork this repo and submit a pull request.

📜 License

This project is licensed under the MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LICENSE		LICENSE
README.md		README.md
WEB SCRAPING.jpg		WEB SCRAPING.jpg
Web Scraping with BeautifulSoup.ipynb		Web Scraping with BeautifulSoup.ipynb
Web Scraping with BeautifulSoup.py		Web Scraping with BeautifulSoup.py
scrap wikipedia.png		scrap wikipedia.png
scraped_data.json		scraped_data.json
web_scraping_command_line_tool.py		web_scraping_command_line_tool.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Web-Scraper

About

Uh oh!

Releases

Packages

Languages

License

Tanviib12/Web-Scraper

Folders and files

Latest commit

History

Repository files navigation

Web-Scraper

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages