🚀 Python web scraper: a fun, lightweight demo repo that fetches a URL, extracts the main content, converts it to Markdown, and exposes both a Python API and a simple CLI. Built for hackers, tinkerers, and learners.

fusionproxy/fusionproxy-scraper

FusionProxy Mini Scraper

A lightweight Python scraper that turns any webpage into clean, AI-ready data formats: JSON, Markdown, or HTML.

Python License: MIT Tests Style


📖 Overview

FusionProxy Mini Scraper is a small tool for scraping a webpage and cleaning up its content.
It aims to be minimal, readable, and extensible, which makes it a good fit for learning, prototyping, or building your own scraping toolkit.

With just a few lines of code, you can:

  • Fetch and parse any webpage
  • Extract title, meta description, and main text
  • Convert results into JSON, Markdown, or return raw HTML
  • Run it via CLI or as a Python module
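
The steps listed above can be sketched with the standard library alone. This is a minimal illustration, not the repository's implementation: `PageExtractor`, `fetch`, `extract`, and `to_markdown` are hypothetical names chosen for this example, and "main text" is approximated here as all visible text outside `script` and `style` tags.

```python
import json
from html.parser import HTMLParser
from urllib.request import Request, urlopen


class PageExtractor(HTMLParser):
    """Collects the title, meta description, and visible text of a page."""

    SKIPPED = {"script", "style", "noscript"}

    def __init__(self):
        super().__init__()
        self.title = ""
        self.description = ""
        self.chunks = []
        self._in_title = False
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and (attrs.get("name") or "").lower() == "description":
            self.description = (attrs.get("content") or "").strip()
        elif tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in self.SKIPPED and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = " ".join(data.split())  # collapse runs of whitespace
        if not text:
            return
        if self._in_title:
            self.title += text
        elif not self._skip_depth:
            self.chunks.append(text)


def fetch(url, timeout=10):
    """Download a page and decode it (hypothetical helper, not called below)."""
    request = Request(url, headers={"User-Agent": "mini-scraper-sketch"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def extract(html):
    """Parse an HTML string into a plain dict that is ready for json.dumps."""
    parser = PageExtractor()
    parser.feed(html)
    parser.close()
    return {
        "title": parser.title,
        "description": parser.description,
        "text": "\n\n".join(parser.chunks),
    }


def to_markdown(page):
    """Render the extracted dict as a small Markdown document."""
    parts = [f"# {page['title']}"]
    if page["description"]:
        parts.append(f"> {page['description']}")
    parts.append(page["text"])
    return "\n\n".join(parts)


# An inline sample keeps the example runnable without network access.
SAMPLE = """<html><head><title>Example Domain</title>
<meta name="description" content="A sample page.">
<style>body { color: red; }</style></head>
<body><h1>Hello</h1><p>First paragraph.</p><script>var x = 1;</script></body></html>"""

page = extract(SAMPLE)
print(json.dumps(page, indent=2))
print(to_markdown(page))
```

To try it against a live page, pass the result of `fetch("https://example.com")` to `extract` instead of `SAMPLE`. A real main-content extractor would usually be more selective than this, for example by skipping navigation and footer elements.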

✨ Key Features

Minimal API & CLI
  • One main class: MiniScraper
  • CLI for quick use:
    python -m src.fusionproxy_scraper https://example.com --mode json
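
A command line of this shape could be wired up roughly as follows. This is a sketch under assumptions rather than the project's actual entry point: only `--mode json` appears above, so the `markdown` and `html` mode names, the `render` and `fake_scrape` helpers, and the shape of the page dict are all hypothetical.

```python
import argparse
import json


def build_parser():
    """Positional URL plus a --mode switch, mirroring the command shown above."""
    parser = argparse.ArgumentParser(prog="fusionproxy_scraper")
    parser.add_argument("url", help="page to scrape")
    # Assumption: the mode names other than "json" are guesses for illustration.
    parser.add_argument("--mode", choices=["json", "markdown", "html"], default="json")
    return parser


def render(page, mode):
    """Format a scraped page dict (assumed keys: title, text, html) for output."""
    if mode == "json":
        return json.dumps({"title": page["title"], "text": page["text"]}, indent=2)
    if mode == "markdown":
        return f"# {page['title']}\n\n{page['text']}"
    return page["html"]


def fake_scrape(url):
    """Placeholder for the real fetch-and-extract step; returns canned data."""
    return {"url": url, "title": "Example Domain", "text": "Hello.", "html": "<h1>Hello.</h1>"}


def main(argv=None, scrape=fake_scrape):
    """Parse arguments, scrape the URL, and return the rendered output."""
    args = build_parser().parse_args(argv)
    return render(scrape(args.url), args.mode)


print(main(["https://example.com", "--mode", "markdown"]))
```

Passing the scraping function in as a parameter keeps the argument handling testable without network access; a real entry point would swap `fake_scrape` for the actual scraper.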
