Scrapyd Redis

Scrapyd is a fantastic open-source application for deploying and running crawlers built with the Scrapy framework. However, its built-in queue management is implemented on top of SQLite, which becomes a problem when you need to scale.

This library is designed to replace the SQLite backend with a Redis backend. In other words, all queue management is done using Redis.
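To make the idea concrete, here is a minimal sketch of a priority queue kept in a single Redis sorted set. It is an illustration only and does not claim to match how this library stores jobs: the class name `RedisPriorityQueue`, the key `scrapyd:queue:default`, and the method names are all assumptions. The `client` argument is expected to behave like a redis-py client (`zadd`, `zpopmax`, `zcard`).

```python
import json


class RedisPriorityQueue:
    """Hypothetical priority queue of JSON messages in one Redis sorted set."""

    def __init__(self, client, key="scrapyd:queue:default"):
        # `client` is any object exposing redis-py's zadd/zpopmax/zcard,
        # for example redis.Redis(host="127.0.0.1", port=6379, db=13).
        # The key name is an assumption made for this sketch.
        self.client = client
        self.key = key

    def put(self, message, priority=0.0):
        # The serialized message is the sorted-set member and the priority
        # is its score. Two identical messages collapse into one member.
        member = json.dumps(message, sort_keys=True)
        self.client.zadd(self.key, {member: priority})

    def pop(self):
        # ZPOPMAX removes and returns the member with the highest score.
        popped = self.client.zpopmax(self.key)
        if not popped:
            return None
        member, _score = popped[0]
        return json.loads(member)

    def __len__(self):
        # ZCARD returns the number of members in the sorted set.
        return self.client.zcard(self.key)
```

Because the queue state lives in Redis rather than in a local SQLite file, several processes could share it. That is the scaling motivation described above.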

This library is a fork of the original implementation at https://github.com/Tiago-Lira/scrapyd-mongodb.

Install

$ pip install git+https://github.com/speakol-ads/scrapyd-redis.git

Config

To start using this library, override the application option in the [scrapyd] section of your scrapy.cfg file:

[scrapyd]
application = scrapyd_redis.application.get_application
...

To customize the Redis connection, add the following options to your scrapy.cfg file:

[scrapyd]
redis_db = 13
redis_host = 127.0.0.1
redis_port = 6379
# redis_pass is optional
redis_pass =
...
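As a rough illustration of how these options could translate into connection parameters, the sketch below parses a `[scrapyd]` section with the standard `configparser` module and builds keyword arguments suitable for redis-py's `redis.Redis(...)`. The helper name `redis_kwargs` and the fallback values in `DEFAULTS` are assumptions made for this example, not the library's documented defaults.

```python
from configparser import ConfigParser

# Hypothetical fallbacks for this sketch; the library's real defaults may differ.
DEFAULTS = {
    "redis_host": "127.0.0.1",
    "redis_port": "6379",
    "redis_db": "0",
    "redis_pass": "",
}


def redis_kwargs(cfg_text):
    """Turn the [scrapyd] redis_* options into redis.Redis keyword arguments."""
    parser = ConfigParser()
    parser.read_string(cfg_text)

    def option(name):
        # Use the fallback when the section or the option is missing.
        return parser.get("scrapyd", name, fallback=DEFAULTS[name]).strip()

    return {
        "host": option("redis_host"),
        "port": int(option("redis_port")),
        "db": int(option("redis_db")),
        # An empty password means "no authentication".
        "password": option("redis_pass") or None,
    }


if __name__ == "__main__":
    sample = """
[scrapyd]
redis_db = 13
redis_host = 127.0.0.1
redis_port = 6379
redis_pass =
"""
    print(redis_kwargs(sample))
```

The resulting dictionary could then be passed as `redis.Redis(**redis_kwargs(text))`. By default, `configparser` treats `#` as a comment only at the start of a line, so a trailing `# ...` after a value would become part of that value.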
