Resque pool is a simple library for managing a pool of resque workers. Given a
a config file (
config/resque-pool.yml), it will manage
your workers for you, starting up the appropriate number of workers for each.
- Less memory consumption - If you are using Ruby Enterprise Edition, or any ruby with copy-on-write safe garbage collection, this could save you a lot of memory when you are managing many workers.
- Simpler (less) config - If you are using monit or an init script to start up your workers, you can start up one pool, and it will manage your workers for you.
- Faster startup - if you are starting many workers at once, you would normally have them competing for CPU as they load their environments. Resque-pool can load the environment once and almost instantaneously fork all of the workers.
How to use
To configure resque-pool, you can use either
config/resque-pool.yml. To use resque-pool, require its rake tasks
resque/pool/tasks) in your rake file, and call the
The YAML file supports both using root level defaults as well as environment
specific overrides. For example, to use resque-pool with rails, in
foo: 1 bar: 2 "foo,bar,baz": 1 production: "foo,bar,baz": 4
require 'resque/pool/tasks' # this task will get called before resque:pool:setup # preload the rails environment in the pool master task "resque:setup" => :environment do # generic worker setup, e.g. Hoptoad for failed jobs end task "resque:pool:setup" do # close any sockets or files in pool master ActiveRecord::Base.connection.disconnect! # and re-open them in the resque worker parent Resque::Pool.after_prefork do |job| ActiveRecord::Base.establish_connection end end
Then you can start the queues via:
rake resque:pool RAILS_ENV=production VERBOSE=1
This will start up seven worker processes, one exclusively for the foo queue, two exclusively for the bar queue, and four workers looking at all queues in priority. This is similar to if you ran the following:
rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=foo & rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=bar & rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=bar & rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=foo,bar,baz & rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=foo,bar,baz & rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=foo,bar,baz & rake resque:work RAILS_ENV=production VERBOSE=1 QUEUES=foo,bar,baz &
Resque already forks for its own child processes. The pool master will stay
around monitoring the resque worker parents, giving three levels: a single pool
master, many worker parents, and a worker child per worker (when the actual job
is being processed). For example,
ps -ef f | grep [r]esque (in Linux) might
return something like the following:
rails 13858 1 0 13:44 ? S 0:02 resque-pool-master: managing [13867, 13875, 13871, 13872, 13868, 13870, 13876] rails 13867 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Waiting for foo rails 13868 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Waiting for bar rails 13870 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Waiting for bar rails 13871 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Waiting for foo,bar,baz rails 13872 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Forked 7481 at 1280343254 rails 7481 13872 0 14:54 ? S 0:00 \_ resque-1.9.9: Processing foo since 1280343254 rails 13875 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Waiting for foo,bar,baz rails 13876 13858 0 13:44 ? S 0:00 \_ resque-1.9.9: Forked 7485 at 1280343255 rails 7485 13876 0 14:54 ? S 0:00 \_ resque-1.9.9: Processing bar since 1280343254
An example startup script, which redirects STDOUT and STDERR and creates a pid file, is given in the examples directory.
The pool master responds to the following signals:
HUP- reload the config file, e.g. to change the number of workers per queue list
QUITto each worker parent and shutdown the master after all workers are done.
QUITto each worker parent and immediately shutdown master
TERMto each worker parent and immediately shutdown master
QUITto each worker, but keep master running (send
HUPto reload config and restart workers)
CONT- send the signal on to all worker parents (see Resque docs).
HUP, workers that are no longer needed will be gracefully shutdown
Workers will watch the pool master, and gracefully shutdown if the master process dies (for whatever reason) before them.
- do appropriate logging (e.g. all to one logfile, each queue to its own
logfile, or each worker to its own logfile). Logfile location must be
configurable, but default to
log/resque-pool.log. Of course, since resque "logs" by writing to $stdout, this is really no more than redirecting stdout to the appropriate logfile.
- (optionally) daemonize, setting a PID file somewhere. configurable, of
course, but default to
- recover gracefully from a malformed config file (on startup and HUP)
- procline for malformed config file, graceful shutdown... and other states?
- figure out a good automated way to test this (cucumber or rspec?)
- clean up the code (I stole most of it from unicorn, and it's still a bit bastardized); excessive use of vim foldmarkers are a code smell.
- incorporate resque-batchworker features? (v2.0)
- web interface for adding and removing workers (etc) (v2.0)
- John Schult (config file can be split by environment)