mperham--sidekiq/lib/sidekiq/middleware/server/retry_jobs.rb

require 'sidekiq/scheduled'

module Sidekiq
  module Middleware
    module Server
      ##
      # Automatically retry jobs that fail in Sidekiq.
      # Sidekiq's retry support assumes a typical development lifecycle:
      # 0. push some code changes with a bug in it
      # 1. bug causes message processing to fail, sidekiq's middleware captures
      #    the message and pushes it onto a retry queue
      # 2. sidekiq retries messages in the retry queue multiple times with
      #    an exponential delay, the message continues to fail
      # 3. after a few days, a developer deploys a fix.  the message is
      #    reprocessed successfully.
      # 4. if 3 never happens, sidekiq will eventually give up and throw the
      #    message away.
      #
      # A message looks like:
      #
      #     { 'class' => 'HardWorker', 'args' => [1, 2, 'foo'] }
      #
      # We'll add a bit more data to the message to support retries:
      #
      #  * 'queue' - the queue to use
      #  * 'retry_count' - number of times we've retried so far.
      #  * 'error_message' - the message from the exception
      #  * 'error_class' - the exception class
      #  * 'failed_at' - the first time it failed
      #  * 'retried_at' - the last time it was retried
      #
      # We don't store the backtrace as that can add a lot of overhead
      # to the message and everyone is using Airbrake, right?
      class RetryJobs
        include Sidekiq::Util

        # delayed_job uses the same basic formula
        MAX_COUNT = 25
        DELAY = proc { |count| (count ** 4) + 15 }

        def call(worker, msg, queue)
          yield
        rescue => e
          raise unless msg['retry']

          msg['queue'] = queue
          msg['error_message'] = e.message
          msg['error_class'] = e.class.name
          count = if msg['retry_count']
            msg['retried_at'] = Time.now.utc
            msg['retry_count'] += 1
          else
            msg['failed_at'] = Time.now.utc
            msg['retry_count'] = 0
          end

          if msg['backtrace'] == true
            msg['error_backtrace'] = e.backtrace
          elsif msg['backtrace'].to_i != 0
            msg['error_backtrace'] = e.backtrace[0..msg['backtrace'].to_i]
          end

          if count <= MAX_COUNT
            delay = DELAY.call(count)
            logger.debug { "Failure! Retry #{count} in #{delay} seconds" }
            retry_at = Time.now.to_f + delay
            payload = Sidekiq.dump_json(msg)
            Sidekiq.redis do |conn|
              conn.zadd('retry', retry_at.to_s, payload)
            end
          else
            # Goodbye dear message, you (re)tried your best I'm sure.
            logger.debug { "Dropping message after hitting the retry maximum: #{msg}" }
          end
          raise
        end

      end
    end
  end
end
Scheduled jobs! Bump to 2.0.0. Performs can now be scheduled at arbitrary points in the future. 2012-05-25 23:21:42 -04:00			`require 'sidekiq/scheduled'`
HOT new automatic retry feature. Needs testing. 2012-03-17 16:41:53 -04:00
			`module Sidekiq`
			`module Middleware`
			`module Server`
			`##`
			`# Automatically retry jobs that fail in Sidekiq.`
Scheduled jobs! Bump to 2.0.0. Performs can now be scheduled at arbitrary points in the future. 2012-05-25 23:21:42 -04:00			`# Sidekiq's retry support assumes a typical development lifecycle:`
			`# 0. push some code changes with a bug in it`
			`# 1. bug causes message processing to fail, sidekiq's middleware captures`
			`# the message and pushes it onto a retry queue`
			`# 2. sidekiq retries messages in the retry queue multiple times with`
			`# an exponential delay, the message continues to fail`
			`# 3. after a few days, a developer deploys a fix. the message is`
			`# reprocessed successfully.`
			`# 4. if 3 never happens, sidekiq will eventually give up and throw the`
			`# message away.`
			`#`
HOT new automatic retry feature. Needs testing. 2012-03-17 16:41:53 -04:00			`# A message looks like:`
			`#`
			`# { 'class' => 'HardWorker', 'args' => [1, 2, 'foo'] }`
			`#`
			`# We'll add a bit more data to the message to support retries:`
			`#`
			`# * 'queue' - the queue to use`
			`# * 'retry_count' - number of times we've retried so far.`
			`# * 'error_message' - the message from the exception`
			`# * 'error_class' - the exception class`
			`# * 'failed_at' - the first time it failed`
			`# * 'retried_at' - the last time it was retried`
			`#`
			`# We don't store the backtrace as that can add a lot of overhead`
			`# to the message and everyone is using Airbrake, right?`
			`class RetryJobs`
Auto failure retry now working! 2012-03-18 02:04:31 -04:00			`include Sidekiq::Util`
Scheduled jobs! Bump to 2.0.0. Performs can now be scheduled at arbitrary points in the future. 2012-05-25 23:21:42 -04:00
			`# delayed_job uses the same basic formula`
			`MAX_COUNT = 25`
			`DELAY = proc { \|count\| (count ** 4) + 15 }`
Auto failure retry now working! 2012-03-18 02:04:31 -04:00
HOT new automatic retry feature. Needs testing. 2012-03-17 16:41:53 -04:00			`def call(worker, msg, queue)`
			`yield`
			`rescue => e`
Client API update: - Add API for configuring options per Worker class - Removed the Client API issues preventing it working on Ruby 1.8 - Cleanups to various APIs for upcoming 1.0 release. 2012-04-01 22:53:45 -04:00			`raise unless msg['retry']`

HOT new automatic retry feature. Needs testing. 2012-03-17 16:41:53 -04:00			`msg['queue'] = queue`
			`msg['error_message'] = e.message`
			`msg['error_class'] = e.class.name`
			`count = if msg['retry_count']`
			`msg['retried_at'] = Time.now.utc`
			`msg['retry_count'] += 1`
			`else`
			`msg['failed_at'] = Time.now.utc`
			`msg['retry_count'] = 0`
			`end`

Implement optional backtrace storage [#155] 2012-04-27 23:25:46 -04:00			`if msg['backtrace'] == true`
			`msg['error_backtrace'] = e.backtrace`
			`elsif msg['backtrace'].to_i != 0`
			`msg['error_backtrace'] = e.backtrace[0..msg['backtrace'].to_i]`
			`end`

Auto failure retry now working! 2012-03-18 02:04:31 -04:00			`if count <= MAX_COUNT`
			`delay = DELAY.call(count)`
			`logger.debug { "Failure! Retry #{count} in #{delay} seconds" }`
			`retry_at = Time.now.to_f + delay`
Refactor to use Sidekiq.dump_json and Sidekiq.load_json These methods perform MultiJson feature detection and can be removed after this library's MultiJson dependency is upgraded to ~> 2.0. 2012-04-22 17:02:35 -04:00			`payload = Sidekiq.dump_json(msg)`
HOT new automatic retry feature. Needs testing. 2012-03-17 16:41:53 -04:00			`Sidekiq.redis do \|conn\|`
Auto failure retry now working! 2012-03-18 02:04:31 -04:00			`conn.zadd('retry', retry_at.to_s, payload)`
HOT new automatic retry feature. Needs testing. 2012-03-17 16:41:53 -04:00			`end`
			`else`
Auto failure retry now working! 2012-03-18 02:04:31 -04:00			`# Goodbye dear message, you (re)tried your best I'm sure.`
			`logger.debug { "Dropping message after hitting the retry maximum: #{msg}" }`
HOT new automatic retry feature. Needs testing. 2012-03-17 16:41:53 -04:00			`end`
			`raise`
			`end`
Auto failure retry now working! 2012-03-18 02:04:31 -04:00
HOT new automatic retry feature. Needs testing. 2012-03-17 16:41:53 -04:00			`end`
			`end`
			`end`
			`end`