scrapy-tor-downloader

Scrapy middleware with TOR support for more robust scrapers or anonymous scraping.

Dependencies 🌐

Installation 📥

This is a python package hosted on pypi, so to install simply run the following command:

pip install scrapy-tor-downloader

Settings

TOR_PROXY_ENABLED

Whether TOR is used to proxy any request (defaults to false).

Meta field to enable/disable this per request is: tor_proxy_enabled

TOR_FALLBACK_ENABLED

Whether TOR is used when a request fails as a fallback (defaults to true).

Meta field to enable/disable this per request is: tor_fallback_enabled

TOR2WEB_PROXY

Whether a tor2web proxy is used for onion address. The value of this setting is the domain for the proxy.

Meta field to add this per request is: tor2web_proxy

tor_reset_session

Whether to reset the TOR session before processing the request. This field only exists in the meta on the request as tor_reset_session and is a boolean.

Usage example 👀

In order to use this plugin simply add the following settings and substitute your variables:

DOWNLOADER_MIDDLEWARES = {
    "tormiddleware.middleware.TORDownloaderMiddleware": 631
}

This will immediately allow you begin using TOR as a fallback when one of your requests fail. In order to use it as a proxy you can add the following to your settings:

TOR_PROXY_ENABLED = True

This will make every request hit TOR for a response. If you have turned the proxy on the TOR fallback is ignored, however if it is off the fallback is still on by default, which means if a request returns an error it will be tried again on TOR. In order to turn this off add the following to your settings:

TOR_FALLBACK_ENABLED = False

If you want to make use of tor2web proxies for onion addresses, you can add it to the settings like so:

TOR2WEB_PROXY = "https://onion.moe"

License 📝

The project is available under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
tormiddleware		tormiddleware
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tormiddleware

tormiddleware

.gitignore

.gitignore

LICENSE

LICENSE

MANIFEST.in

MANIFEST.in

README.md

README.md

requirements.txt

requirements.txt

setup.py

setup.py

Repository files navigation

scrapy-tor-downloader

Dependencies 🌐

Installation 📥

Settings

TOR_PROXY_ENABLED

TOR_FALLBACK_ENABLED

TOR2WEB_PROXY

tor_reset_session

Usage example 👀

License 📝

About

Releases

Packages

Languages

License

8W9aG/scrapy-tor-downloader

Folders and files

Latest commit

History

Repository files navigation

scrapy-tor-downloader

Dependencies 🌐

Installation 📥

Settings

TOR_PROXY_ENABLED

TOR_FALLBACK_ENABLED

TOR2WEB_PROXY

tor_reset_session

Usage example 👀

License 📝

About

Resources

License

Stars

Watchers

Forks

Languages