scrapoxy/README.md

Scrapoxy

Scrapoxy is a super proxies manager that orchestrates all your proxies into one place 🎯, rather than spreading management across multiple scrapers 🕸️.

Deployed on your own infrastructure, Scrapoxy serves as a single proxy endpoint for your scrapers.

It creates a pool of private proxies from your datacenter subscription 🔒, integrates them with proxy vendors 🔌, handles IP rotation and fingerprinting, and smartly routes traffic to avoid bans 🚫.
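
As an illustration of the single proxy endpoint, here is a minimal sketch of how a Python scraper might be pointed at it. The host, the port and the credentials are placeholders (assumptions, not values taken from this README); use the ones from your own deployment.

```python
from urllib.parse import quote


def build_proxy_settings(host: str, port: int, username: str, password: str) -> dict:
    """Return a proxies mapping in the shape accepted by the `requests` library.

    All four arguments are placeholders: use the address and the
    credentials of your own Scrapoxy deployment.
    """
    # Percent-encode credentials so characters like '@' or ':' stay unambiguous.
    auth = f"{quote(username, safe='')}:{quote(password, safe='')}"
    proxy_url = f"http://{auth}@{host}:{port}"
    # The same HTTP proxy URL is used for both plain and TLS traffic.
    return {"http": proxy_url, "https": proxy_url}


if __name__ == "__main__":
    # Hypothetical values, for illustration only.
    print(build_proxy_settings("localhost", 8888, "USERNAME", "PASSWORD"))
```

The returned mapping can be passed as `requests.get(url, proxies=settings)`, so every scraper shares one endpoint instead of managing its own proxy list.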


🚀🚀 MORE INFO AT SCRAPOXY.IO 🚀🚀

Features

☁️ Datacenter Providers with easy installation ☁️

Scrapoxy supports many datacenter providers like AWS, Azure, or GCP.

It installs a proxy image in each datacenter so that proxy instances can be launched quickly. Traffic is then routed through these instances to provide many IP addresses.

Scrapoxy handles the startup/shutdown of proxy instances to rotate IP addresses effectively.

🌐 Proxy Services 🌐

Scrapoxy supports many proxy services like Rayobyte, IPRoyal or Zyte.

It connects to these services and uses parameters such as country or OS type to build a diverse set of proxies.

💻 Hardware materials 💻

Scrapoxy supports many types of 4G proxy farm hardware, such as Proxidize.

It uses their APIs to handle IP rotation on 4G networks.

📜 Free Proxy Lists 📜

Scrapoxy supports lists of HTTP/HTTPS proxies and SOCKS4/SOCKS5 proxies.

It takes care of testing their connectivity to aggregate them into the proxy pool.
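
The connectivity testing described above can be sketched as follows. This is only an illustration under stated assumptions, not Scrapoxy's actual check: a plain TCP connection stands in for whatever test Scrapoxy really performs, and the names `tcp_reachable` and `keep_reachable` are hypothetical.

```python
import socket
from typing import Callable, Iterable, List, Tuple

Proxy = Tuple[str, int]  # (host, port)


def tcp_reachable(proxy: Proxy, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to the proxy opens within `timeout` seconds."""
    try:
        with socket.create_connection(proxy, timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, unreachable hosts and timeouts.
        return False


def keep_reachable(
    candidates: Iterable[Proxy],
    check: Callable[[Proxy], bool] = tcp_reachable,
) -> List[Proxy]:
    """Keep only the candidates that pass the connectivity check, in order."""
    return [proxy for proxy in candidates if check(proxy)]
```

The check is injectable so that a stricter test, for example a real HTTP request through the proxy, can replace the TCP probe without changing the filtering logic.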

⏰ Timeout free ⏰

Scrapoxy only routes traffic to online proxies.

This feature is especially useful with residential proxies, which can be slow or inactive at times. Scrapoxy detects these offline nodes and excludes them from the proxy pool.

🔄 Auto-Rotate proxies 🔄

Scrapoxy automatically changes IP addresses at regular intervals.

Scrapers can have thousands of IP addresses without managing proxy rotation.
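
Interval-based rotation can be pictured with a small sketch. It assumes rotation is driven by proxy age, which this README does not specify; `ProxyInstance`, `due_for_rotation` and `max_age` are hypothetical names.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ProxyInstance:
    name: str
    started_at: float  # seconds since an arbitrary reference point


def due_for_rotation(proxies: List[ProxyInstance], now: float, max_age: float) -> List[str]:
    """Return the names of proxies that have been alive for at least `max_age` seconds."""
    return [p.name for p in proxies if now - p.started_at >= max_age]
```

A manager would call this periodically and replace the returned proxies with fresh ones, so scrapers never handle rotation themselves.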

πŸƒ Auto-Scale proxies πŸƒ

Scrapoxy monitors incoming traffic and automatically scales the number of proxies according to your needs.

It also reduces proxy count to minimize your costs.
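
The scaling idea can be illustrated with a simple sizing rule. The README does not describe the actual policy, so this sketch assumes one: size the pool from the observed request rate and clamp it between a minimum and a maximum. All names and parameters are hypothetical.

```python
import math


def target_proxy_count(
    requests_per_minute: float,
    capacity_per_proxy: float,
    minimum: int,
    maximum: int,
) -> int:
    """Return how many proxies to run for the observed traffic (assumed policy)."""
    if capacity_per_proxy <= 0:
        raise ValueError("capacity_per_proxy must be positive")
    # Round up so that the pool can absorb the whole load.
    needed = math.ceil(requests_per_minute / capacity_per_proxy)
    # Never go below the floor or above the budget ceiling.
    return max(minimum, min(maximum, needed))
```

With no traffic the result falls back to `minimum`, which is where the cost saving comes from.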

πŸͺ Sticky sessions on Browser πŸͺ

Scrapoxy can keep the same IP address for a scraping session, even for browsers.

It includes an interception mechanism for HTTP requests and responses that injects a session cookie, keeping the same IP address throughout the browser session.
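
The cookie-based stickiness can be sketched as a mapping from a session identifier to a proxy. This is a simplified model, not Scrapoxy's implementation: `StickyRouter` is a hypothetical class, and the round-robin assignment for new sessions is an assumption.

```python
import uuid
from typing import Dict, List, Optional, Tuple


class StickyRouter:
    """Map a session id (carried in a cookie) to one proxy for its whole life."""

    def __init__(self, proxies: List[str]) -> None:
        if not proxies:
            raise ValueError("at least one proxy is required")
        self._proxies = proxies
        self._sessions: Dict[str, str] = {}
        self._next = 0

    def route(self, session_id: Optional[str]) -> Tuple[str, str]:
        """Return (session_id, proxy); known sessions keep their proxy."""
        if session_id is not None and session_id in self._sessions:
            return session_id, self._sessions[session_id]
        # A missing id gets a fresh one; an unknown id is registered as given.
        new_id = session_id or uuid.uuid4().hex
        # Assumed policy: assign proxies to new sessions in round-robin order.
        proxy = self._proxies[self._next % len(self._proxies)]
        self._next += 1
        self._sessions[new_id] = proxy
        return new_id, proxy
```

The returned session id is what would be written into the injected cookie, so that the next request from the same browser is routed to the same proxy.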

🚨 Ban management 🚨

Scrapoxy injects the name of the proxy into the HTTP responses.

When a scraper detects that a ban has occurred, it can notify Scrapoxy to remove the proxy from the pool.
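
On the scraper side, this flow could look like the sketch below. The header name and the status codes treated as bans are assumptions to verify against your deployment and target site, and `report` is a hypothetical callback standing in for the actual notification to Scrapoxy.

```python
from typing import Callable, Mapping, Optional

PROXY_NAME_HEADER = "x-scrapoxy-proxyname"  # assumed header name; check your deployment
BAN_STATUS_CODES = frozenset({403, 429})    # assumed ban signals; sites differ


def banned_proxy(status_code: int, headers: Mapping[str, str]) -> Optional[str]:
    """Return the proxy name if the response looks like a ban, else None."""
    if status_code not in BAN_STATUS_CODES:
        return None
    # HTTP header names are case-insensitive, so compare in lowercase.
    for key, value in headers.items():
        if key.lower() == PROXY_NAME_HEADER:
            return value
    return None


def handle_response(
    status_code: int,
    headers: Mapping[str, str],
    report: Callable[[str], None],
) -> bool:
    """Call `report` with the proxy name when a ban is detected; return whether it was."""
    name = banned_proxy(status_code, headers)
    if name is None:
        return False
    report(name)
    return True
```

Keeping ban detection in the scraper makes sense because only the scraper knows what a ban looks like for a given site, whether that is a status code, a captcha page, or an empty result.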

📑 Traffic interception 📑

Scrapoxy intercepts HTTP requests/responses to modify headers, keeping consistency in your scraping stack. It can add session cookies or specific headers like user-agent.
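
Header rewriting of this kind can be sketched as a pure function. This is only an illustration of the idea, not Scrapoxy's interception code; `apply_header_overrides` is a hypothetical helper.

```python
from typing import Dict, Mapping


def apply_header_overrides(
    headers: Mapping[str, str],
    overrides: Mapping[str, str],
) -> Dict[str, str]:
    """Return a copy of `headers` with `overrides` applied, matching names case-insensitively."""
    override_keys = {name.lower() for name in overrides}
    # Drop any existing header that an override replaces, whatever its casing.
    result = {k: v for k, v in headers.items() if k.lower() not in override_keys}
    result.update(overrides)
    return result
```

Applying the same overrides to every request is what keeps the stack consistent: each scraper then presents the same user-agent regardless of the HTTP client it uses.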

📊 Traffic monitoring 📊

Scrapoxy measures incoming and outgoing traffic to provide an overview of your scraping session.

It tracks metrics such as the number of requests, active proxy count, requests per proxy, and more.
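
The metrics listed above can be modelled with a per-proxy counter. This is a minimal sketch with hypothetical names; here "active" simply means a proxy that has served at least one request, which is an assumption about the definition.

```python
from collections import Counter


class TrafficStats:
    """Count requests per proxy and derive the aggregate metrics."""

    def __init__(self) -> None:
        self.requests_by_proxy: Counter = Counter()

    def record(self, proxy_name: str) -> None:
        """Register one request served through `proxy_name`."""
        self.requests_by_proxy[proxy_name] += 1

    @property
    def total_requests(self) -> int:
        return sum(self.requests_by_proxy.values())

    @property
    def active_proxies(self) -> int:
        # Assumed definition: proxies that served at least one request.
        return len(self.requests_by_proxy)
```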

🌍 Coverage monitoring 🌍

Scrapoxy displays the geographic coverage of your proxies so you can see how they are distributed worldwide.

🚀 Easy-to-use and production-ready 🚀

Scrapoxy is suitable for both beginners and experts.

It can be started in seconds using Docker, or be deployed in a complex, distributed environment with Kubernetes.

🔓 Free 🔓

Scrapoxy is free; you only pay for support.

Documentation

More information is available at scrapoxy.io.

Follow-up

Discord Docker
