f1ae1e39ce
Moving the check out of the general requests makes sure we don't have any slowdown in regular requests. To keep the process that performs these checks small, the check itself is still performed inside a Unicorn worker, but it is called from a separate process running on the same server.

Because the checks are now done outside normal requests, we can have a simpler failure strategy: the check is performed in the background every `circuitbreaker_check_interval`. Failures are logged in Redis and are reset when the check succeeds. Per check we will try `circuitbreaker_access_retries` times within `circuitbreaker_storage_timeout` seconds. When the number of failures exceeds `circuitbreaker_failure_count_threshold`, we will block access to the storage.

After `failure_reset_time` without any checks, we will clear the stored failures. This could happen when the process that performs the checks is not running.
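The failure strategy described above can be sketched as a small in-memory model. This is only an illustration, not the code from this commit: the class `StorageCheckSketch`, its `Settings` struct, and the injected `clock` are hypothetical names, a Hash stands in for Redis, and the `circuitbreaker_storage_timeout` limit on each check is left out for brevity.

```ruby
# Hypothetical in-memory model of the failure strategy from the commit message.
# The real implementation logs failures in Redis; plain Hashes stand in here.
class StorageCheckSketch
  # Field names mirror the settings mentioned in the commit message.
  Settings = Struct.new(:access_retries, :failure_count_threshold,
                        :failure_reset_time, keyword_init: true)

  def initialize(settings, clock: -> { Time.now })
    @settings = settings
    @clock = clock            # injectable so the sketch can be exercised without sleeping
    @failures = Hash.new(0)   # storage name => number of failed checks
    @last_check = {}          # storage name => time of the most recent check
  end

  # One background check. The block returns true when the storage is reachable;
  # it is tried up to `access_retries` times and stops at the first success.
  def perform_check(storage)
    expire_stale_failures(storage)
    @last_check[storage] = @clock.call

    accessible = @settings.access_retries.times.any? { yield }
    if accessible
      @failures.delete(storage)   # a successful check resets the failures
    else
      @failures[storage] += 1
    end
    accessible
  end

  # Access is blocked once the failure count exceeds the threshold.
  def blocked?(storage)
    expire_stale_failures(storage)
    @failures[storage] > @settings.failure_count_threshold
  end

  private

  # Clear stored failures when no check has run for `failure_reset_time`,
  # for example because the checking process is not running.
  def expire_stale_failures(storage)
    last = @last_check[storage]
    return if last.nil?

    @failures.delete(storage) if @clock.call - last > @settings.failure_reset_time
  end
end
```

In this model, a caller would invoke `perform_check` once per `circuitbreaker_check_interval` and consult `blocked?` before touching the storage. Keeping the retry loop inside a single check means one slow storage counts as one failure per interval rather than one per attempt.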
50 lines
1.6 KiB
Ruby
module API
  class CircuitBreakers < Grape::API
    # Every endpoint in this class is restricted to administrators.
    before { authenticated_as_admin! }

    resource :circuit_breakers do
      params do
        requires :type,
                 type: String,
                 desc: "The type of circuitbreaker",
                 values: ['repository_storage']
      end
      resource ':type' do
        namespace '', requirements: { type: 'repository_storage' } do
          helpers do
            def failing_storage_health
              @failing_storage_health ||= Gitlab::Git::Storage::Health.for_failing_storages
            end

            # Memoized in its own instance variable so it never shares a
            # cached value with failing_storage_health.
            def storage_health
              @storage_health ||= Gitlab::Git::Storage::Health.for_all_storages
            end
          end

          desc 'Get all git storages' do
            detail 'This feature was introduced in GitLab 9.5'
            success Entities::RepositoryStorageHealth
          end
          get do
            present storage_health, with: Entities::RepositoryStorageHealth
          end

          desc 'Get all failing git storages' do
            detail 'This feature was introduced in GitLab 9.5'
            success Entities::RepositoryStorageHealth
          end
          get 'failing' do
            present failing_storage_health, with: Entities::RepositoryStorageHealth
          end

          desc 'Reset all storage failures and open circuitbreaker' do
            detail 'This feature was introduced in GitLab 9.5'
          end
          delete do
            Gitlab::Git::Storage::FailureInfo.reset_all!
          end
        end
      end
    end
  end
end
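The three routes defined above can be exercised from a client roughly as follows. This is a sketch under assumptions: the helper `circuit_breaker_request` is a hypothetical name, the `/api/v4` prefix and the `PRIVATE-TOKEN` header are assumed to match the GitLab instance being called, and the host and token are placeholders. Since the endpoints call `authenticated_as_admin!`, the token would need to belong to an administrator.

```ruby
require 'net/http'
require 'uri'

# Hypothetical helper: builds (but does not send) a request for one of the
# circuit breaker routes. `action` is :all, :failing, or :reset.
def circuit_breaker_request(base_url, token, action)
  path = '/api/v4/circuit_breakers/repository_storage' # assumed API prefix

  request_class, suffix =
    case action
    when :all     then [Net::HTTP::Get, '']          # GET    .../repository_storage
    when :failing then [Net::HTTP::Get, '/failing']  # GET    .../repository_storage/failing
    when :reset   then [Net::HTTP::Delete, '']       # DELETE .../repository_storage
    else raise ArgumentError, "unknown action: #{action.inspect}"
    end

  uri = URI.join(base_url, path + suffix)
  request = request_class.new(uri)
  request['PRIVATE-TOKEN'] = token # assumed authentication header
  request
end

# Sending a request would look like this (not executed here):
#
#   request = circuit_breaker_request('https://gitlab.example.com', '<admin token>', :failing)
#   uri = request.uri
#   response = Net::HTTP.start(uri.hostname, uri.port, use_ssl: true) do |http|
#     http.request(request)
#   end
#   puts response.body
```

Separating request construction from sending keeps the route mapping easy to inspect without a running server.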