artofmission / delayed_job forked from tobi/delayed_job

Database based asynchronously priority queue system -- Extracted from Shopify

This URL has Read+Write access

name age message
file MIT-LICENSE Sun Feb 17 13:01:35 -0800 2008 Initial extraction [tobi]
file README.textile Loading commit data...
file init.rb
directory lib/
directory spec/
directory tasks/ Sat Nov 01 06:30:02 -0700 2008 Fix issues with last merge in relation to jobs:... [tedkulp]
README.textile

Delayed::Job

Delated_job (or DJ) encapsulates the common pattern of asynchronously executing longer tasks in the background.

It is a direct extraction from Shopify where the job table is responsible for a multitude of core tasks. Amongst those tasks are:

  • sending massive newsletters
  • image resizing
  • http downloads
  • updating smart collections
  • updating solr, our search server, after product changes
  • batch imports
  • spam checks

Changes

  • 1.8 Added status column and the ability to query a job’s status and set a job’s status to an arbitrary value.
  • 1.7 Added failed_at column which can optionally be set after a certain amount of failed job attempts. By default failed job attempts are destroyed after about a month.
  • 1.6 Renamed locked_until to locked_at. We now store when we start a given job instead of how long it will be locked by the worker. This allows us to get a reading on how long a job took to execute.
  • 1.5 Job runners can now be run in parallel. Two new database columns are needed: locked_until and locked_by. This allows us to use pessimistic locking, which enables us to run as many worker processes as we need to speed up queue processing.
  • 1.0 Initial release

Setup

The library evolves around a delayed_jobs table which looks as follows:

create_table :delayed_jobs, :force => true do |table| table.integer :priority, :default => 0 table.integer :attempts, :default => 0 table.text :handler table.string :last_error table.datetime :run_at table.datetime :locked_at table.datetime :failed_at table.string :locked_by table.string :status table.timestamps end

Usage

Jobs are simple ruby objects with a method called perform. Any object which responds to perform can be stuffed into the jobs table.
Job objects are serialized to yaml so that they can later be resurrected by the job runner.

class NewsletterJob < Struct.new(:text, :emails) def perform emails.each { |e| NewsletterMailer.deliver_text_to_email(text, e) } end end Delayed::Job.enqueue NewsletterJob.new(‘lorem ipsum…’, Customers.find(:all).collect(&:email))

There is also a second way to get jobs in the queue: send_later.

BatchImporter.new(Shop.find(1)).send_later(:import_massive_csv, massive_csv)

This will simply create a Delayed::PerformableMethod job in the jobs table which serializes all the parameters you pass to it. There are some special smarts for active record objects
which are stored as their text representation and loaded from the database fresh when the job is actually run later.

Running the jobs

You can invoke rake jobs:work which will start working off jobs. You can cancel the rake task with CTRL-C.

You can also run by writing a simple script/job_runner, and invoking it externally:


  #!/usr/bin/env ruby
  require File.dirname(__FILE__) + '/../config/environment'
  
  Delayed::Worker.new.start  

Cleaning up

You can invoke rake jobs:clear to delete all jobs in the queue.

Job Status

To get the status of a job, invoke Delayed::Status.get(1). By default, the status will be one of “pending”, “finished”, or “failed”.

You can set the status of a job to any arbitrary text value by invoking Delayed::Status.set(1, "working on it").

If you provide a delayed_job_id in your object, then you can access the job’s status from inside the #perform action. This is useful for long-running tasks where you might want to update the status periodically.

class NewsletterJob < Struct.new(:text, :emails, :delayed_job_id)
def perform
emails.each do |e|
NewsletterMailer.deliver_text_to_email(text, e)
Delayed::Status.set(delayed_job_id, “still going…”)
end
end
end