A lightweight Python scraper that turns any webpage into clean, AI-ready data formats: JSON, Markdown, or HTML.
FusionProxy Mini Scraper is a simple but powerful tool for quickly scraping and cleaning content.
It focuses on being minimal, readable, and extendable, making it perfect for learning, prototyping, or building your own scraping toolkit.
With just a few lines of code, you can:
- Fetch and parse any webpage
- Extract title, meta description, and main text
- Convert results into JSON, Markdown, or return raw HTML
- Run it via CLI or as a Python module
Minimal API & CLI
- One main class:
MiniScraper
- CLI for quick use:
python -m src.fusionproxy_scraper https://example.com --mode json