gitlab-org--gitlab-foss/app/services/projects/update_remote_mirror_service.rb

# frozen_string_literal: true

module Projects
  class UpdateRemoteMirrorService < BaseService
    MAX_TRIES = 3

    def execute(remote_mirror, tries)
      return success unless remote_mirror.enabled?

      if Gitlab::UrlBlocker.blocked_url?(CGI.unescape(Gitlab::UrlSanitizer.sanitize(remote_mirror.url)))
        return error("The remote mirror URL is invalid.")
      end

      update_mirror(remote_mirror)

      success
    rescue Gitlab::Git::CommandError => e
      # This happens if one of the gitaly calls above fail, for example when
      # branches have diverged, or the pre-receive hook fails.
      retry_or_fail(remote_mirror, e.message, tries)

      error(e.message)
    rescue => e
      remote_mirror.mark_as_failed!(e.message)
      raise e
    end

    private

    def update_mirror(remote_mirror)
      remote_mirror.update_start!
      remote_mirror.ensure_remote!

      # LFS objects must be sent first, or the push has dangling pointers
      send_lfs_objects!(remote_mirror)

      response = remote_mirror.update_repository

      if response.divergent_refs.any?
        message = "Some refs have diverged and have not been updated on the remote:"
        message += "\n\n#{response.divergent_refs.join("\n")}"

        remote_mirror.mark_as_failed!(message)
      else
        remote_mirror.update_finish!
      end
    end

    def send_lfs_objects!(remote_mirror)
      return unless Feature.enabled?(:push_mirror_syncs_lfs, project)
      return unless project.lfs_enabled?

      # TODO: Support LFS sync over SSH
      # https://gitlab.com/gitlab-org/gitlab/-/issues/249587
      return unless remote_mirror.url =~ /\Ahttps?:\/\//i
      return unless remote_mirror.password_auth?

      Lfs::PushService.new(
        project,
        current_user,
        url: remote_mirror.bare_url,
        credentials: remote_mirror.credentials
      ).execute
    end

    def retry_or_fail(mirror, message, tries)
      if tries < MAX_TRIES
        mirror.mark_for_retry!(message)
      else
        # It's not likely we'll be able to recover from this ourselves, so we'll
        # notify the users of the problem, and don't trigger any sidekiq retries
        # Instead, we'll wait for the next change to try the push again, or until
        # a user manually retries.
        mirror.mark_as_failed!(message)
      end
    end
  end
end
Enable more frozen string in app/services/*/.rb Partially addresses #47424. 2018-07-17 12:50:37 -04:00			`# frozen_string_literal: true`

Backports every CE related change from ee-5484 to CE 2018-05-03 08:55:14 -04:00			`module Projects`
			`class UpdateRemoteMirrorService < BaseService`
Rework retry strategy for remote mirrors Prevention of running 2 simultaneous updates Instead of using `RemoteMirror#update_status` and raise an error if it's already running to prevent the same mirror being updated at the same time we now use `Gitlab::ExclusiveLease` for that. When we fail to obtain a lease in 3 tries, 30 seconds apart, we bail and reschedule. We'll reschedule faster for the protected branches. If the mirror already ran since it was scheduled, the job will be skipped. Error handling: Remote side When an update fails because of a `Gitlab::Git::CommandError`, we won't track this error in sentry, this could be on the remote side: for example when branches have diverged. In this case, we'll try 3 times scheduled 1 or 5 minutes apart. In between, the mirror is marked as "to_retry", the error would be visible to the user when they visit the settings page. After 3 tries we'll mark the mirror as failed and notify the user. We won't track this error in sentry, as it's not likely we can help it. The next event that would trigger a new refresh. Error handling: our side If an unexpected error occurs, we mark the mirror as failed, but we'd still retry the job based on the regular sidekiq retries with backoff. Same as we used to The error would be reported in sentry, since its likely we need to do something about it. 2019-08-13 16:52:01 -04:00			`MAX_TRIES = 3`
Backports every CE related change from ee-5484 to CE 2018-05-03 08:55:14 -04:00
Rework retry strategy for remote mirrors Prevention of running 2 simultaneous updates Instead of using `RemoteMirror#update_status` and raise an error if it's already running to prevent the same mirror being updated at the same time we now use `Gitlab::ExclusiveLease` for that. When we fail to obtain a lease in 3 tries, 30 seconds apart, we bail and reschedule. We'll reschedule faster for the protected branches. If the mirror already ran since it was scheduled, the job will be skipped. Error handling: Remote side When an update fails because of a `Gitlab::Git::CommandError`, we won't track this error in sentry, this could be on the remote side: for example when branches have diverged. In this case, we'll try 3 times scheduled 1 or 5 minutes apart. In between, the mirror is marked as "to_retry", the error would be visible to the user when they visit the settings page. After 3 tries we'll mark the mirror as failed and notify the user. We won't track this error in sentry, as it's not likely we can help it. The next event that would trigger a new refresh. Error handling: our side If an unexpected error occurs, we mark the mirror as failed, but we'd still retry the job based on the regular sidekiq retries with backoff. Same as we used to The error would be reported in sentry, since its likely we need to do something about it. 2019-08-13 16:52:01 -04:00			`def execute(remote_mirror, tries)`
Backports every CE related change from ee-5484 to CE 2018-05-03 08:55:14 -04:00			`return success unless remote_mirror.enabled?`

Add latest changes from gitlab-org/gitlab@master 2020-09-02 11:10:54 -04:00			`if Gitlab::UrlBlocker.blocked_url?(CGI.unescape(Gitlab::UrlSanitizer.sanitize(remote_mirror.url)))`
			`return error("The remote mirror URL is invalid.")`
			`end`

Rework retry strategy for remote mirrors Prevention of running 2 simultaneous updates Instead of using `RemoteMirror#update_status` and raise an error if it's already running to prevent the same mirror being updated at the same time we now use `Gitlab::ExclusiveLease` for that. When we fail to obtain a lease in 3 tries, 30 seconds apart, we bail and reschedule. We'll reschedule faster for the protected branches. If the mirror already ran since it was scheduled, the job will be skipped. Error handling: Remote side When an update fails because of a `Gitlab::Git::CommandError`, we won't track this error in sentry, this could be on the remote side: for example when branches have diverged. In this case, we'll try 3 times scheduled 1 or 5 minutes apart. In between, the mirror is marked as "to_retry", the error would be visible to the user when they visit the settings page. After 3 tries we'll mark the mirror as failed and notify the user. We won't track this error in sentry, as it's not likely we can help it. The next event that would trigger a new refresh. Error handling: our side If an unexpected error occurs, we mark the mirror as failed, but we'd still retry the job based on the regular sidekiq retries with backoff. Same as we used to The error would be reported in sentry, since its likely we need to do something about it. 2019-08-13 16:52:01 -04:00			`update_mirror(remote_mirror)`
Synchronize the default branch when updating a remote mirror 2018-09-10 15:12:49 -04:00
Rework retry strategy for remote mirrors Prevention of running 2 simultaneous updates Instead of using `RemoteMirror#update_status` and raise an error if it's already running to prevent the same mirror being updated at the same time we now use `Gitlab::ExclusiveLease` for that. When we fail to obtain a lease in 3 tries, 30 seconds apart, we bail and reschedule. We'll reschedule faster for the protected branches. If the mirror already ran since it was scheduled, the job will be skipped. Error handling: Remote side When an update fails because of a `Gitlab::Git::CommandError`, we won't track this error in sentry, this could be on the remote side: for example when branches have diverged. In this case, we'll try 3 times scheduled 1 or 5 minutes apart. In between, the mirror is marked as "to_retry", the error would be visible to the user when they visit the settings page. After 3 tries we'll mark the mirror as failed and notify the user. We won't track this error in sentry, as it's not likely we can help it. The next event that would trigger a new refresh. Error handling: our side If an unexpected error occurs, we mark the mirror as failed, but we'd still retry the job based on the regular sidekiq retries with backoff. Same as we used to The error would be reported in sentry, since its likely we need to do something about it. 2019-08-13 16:52:01 -04:00			`success`
			`rescue Gitlab::Git::CommandError => e`
			`# This happens if one of the gitaly calls above fail, for example when`
			`# branches have diverged, or the pre-receive hook fails.`
			`retry_or_fail(remote_mirror, e.message, tries)`
Backports every CE related change from ee-5484 to CE 2018-05-03 08:55:14 -04:00
Rework retry strategy for remote mirrors Prevention of running 2 simultaneous updates Instead of using `RemoteMirror#update_status` and raise an error if it's already running to prevent the same mirror being updated at the same time we now use `Gitlab::ExclusiveLease` for that. When we fail to obtain a lease in 3 tries, 30 seconds apart, we bail and reschedule. We'll reschedule faster for the protected branches. If the mirror already ran since it was scheduled, the job will be skipped. Error handling: Remote side When an update fails because of a `Gitlab::Git::CommandError`, we won't track this error in sentry, this could be on the remote side: for example when branches have diverged. In this case, we'll try 3 times scheduled 1 or 5 minutes apart. In between, the mirror is marked as "to_retry", the error would be visible to the user when they visit the settings page. After 3 tries we'll mark the mirror as failed and notify the user. We won't track this error in sentry, as it's not likely we can help it. The next event that would trigger a new refresh. Error handling: our side If an unexpected error occurs, we mark the mirror as failed, but we'd still retry the job based on the regular sidekiq retries with backoff. Same as we used to The error would be reported in sentry, since its likely we need to do something about it. 2019-08-13 16:52:01 -04:00			`error(e.message)`
			`rescue => e`
			`remote_mirror.mark_as_failed!(e.message)`
			`raise e`
			`end`

			`private`

			`def update_mirror(remote_mirror)`
			`remote_mirror.update_start!`
			`remote_mirror.ensure_remote!`
Add latest changes from gitlab-org/gitlab@master 2020-06-02 02:08:01 -04:00
Add latest changes from gitlab-org/gitlab@master 2020-09-17 14:10:12 -04:00			`# LFS objects must be sent first, or the push has dangling pointers`
			`send_lfs_objects!(remote_mirror)`

Add latest changes from gitlab-org/gitlab@master 2020-04-28 17:09:35 -04:00			`response = remote_mirror.update_repository`
Backports every CE related change from ee-5484 to CE 2018-05-03 08:55:14 -04:00
Add latest changes from gitlab-org/gitlab@master 2020-04-28 17:09:35 -04:00			`if response.divergent_refs.any?`
			`message = "Some refs have diverged and have not been updated on the remote:"`
			`message += "\n\n#{response.divergent_refs.join("\n")}"`
Rework retry strategy for remote mirrors Prevention of running 2 simultaneous updates Instead of using `RemoteMirror#update_status` and raise an error if it's already running to prevent the same mirror being updated at the same time we now use `Gitlab::ExclusiveLease` for that. When we fail to obtain a lease in 3 tries, 30 seconds apart, we bail and reschedule. We'll reschedule faster for the protected branches. If the mirror already ran since it was scheduled, the job will be skipped. Error handling: Remote side When an update fails because of a `Gitlab::Git::CommandError`, we won't track this error in sentry, this could be on the remote side: for example when branches have diverged. In this case, we'll try 3 times scheduled 1 or 5 minutes apart. In between, the mirror is marked as "to_retry", the error would be visible to the user when they visit the settings page. After 3 tries we'll mark the mirror as failed and notify the user. We won't track this error in sentry, as it's not likely we can help it. The next event that would trigger a new refresh. Error handling: our side If an unexpected error occurs, we mark the mirror as failed, but we'd still retry the job based on the regular sidekiq retries with backoff. Same as we used to The error would be reported in sentry, since its likely we need to do something about it. 2019-08-13 16:52:01 -04:00
Add latest changes from gitlab-org/gitlab@master 2020-04-28 17:09:35 -04:00			`remote_mirror.mark_as_failed!(message)`
			`else`
			`remote_mirror.update_finish!`
			`end`
Rework retry strategy for remote mirrors Prevention of running 2 simultaneous updates Instead of using `RemoteMirror#update_status` and raise an error if it's already running to prevent the same mirror being updated at the same time we now use `Gitlab::ExclusiveLease` for that. When we fail to obtain a lease in 3 tries, 30 seconds apart, we bail and reschedule. We'll reschedule faster for the protected branches. If the mirror already ran since it was scheduled, the job will be skipped. Error handling: Remote side When an update fails because of a `Gitlab::Git::CommandError`, we won't track this error in sentry, this could be on the remote side: for example when branches have diverged. In this case, we'll try 3 times scheduled 1 or 5 minutes apart. In between, the mirror is marked as "to_retry", the error would be visible to the user when they visit the settings page. After 3 tries we'll mark the mirror as failed and notify the user. We won't track this error in sentry, as it's not likely we can help it. The next event that would trigger a new refresh. Error handling: our side If an unexpected error occurs, we mark the mirror as failed, but we'd still retry the job based on the regular sidekiq retries with backoff. Same as we used to The error would be reported in sentry, since its likely we need to do something about it. 2019-08-13 16:52:01 -04:00			`end`

Add latest changes from gitlab-org/gitlab@master 2020-09-17 14:10:12 -04:00			`def send_lfs_objects!(remote_mirror)`
			`return unless Feature.enabled?(:push_mirror_syncs_lfs, project)`
			`return unless project.lfs_enabled?`

			`# TODO: Support LFS sync over SSH`
			`# https://gitlab.com/gitlab-org/gitlab/-/issues/249587`
			`return unless remote_mirror.url =~ /\Ahttps?:\/\//i`
			`return unless remote_mirror.password_auth?`

			`Lfs::PushService.new(`
			`project,`
			`current_user,`
			`url: remote_mirror.bare_url,`
			`credentials: remote_mirror.credentials`
			`).execute`
			`end`

Rework retry strategy for remote mirrors Prevention of running 2 simultaneous updates Instead of using `RemoteMirror#update_status` and raise an error if it's already running to prevent the same mirror being updated at the same time we now use `Gitlab::ExclusiveLease` for that. When we fail to obtain a lease in 3 tries, 30 seconds apart, we bail and reschedule. We'll reschedule faster for the protected branches. If the mirror already ran since it was scheduled, the job will be skipped. Error handling: Remote side When an update fails because of a `Gitlab::Git::CommandError`, we won't track this error in sentry, this could be on the remote side: for example when branches have diverged. In this case, we'll try 3 times scheduled 1 or 5 minutes apart. In between, the mirror is marked as "to_retry", the error would be visible to the user when they visit the settings page. After 3 tries we'll mark the mirror as failed and notify the user. We won't track this error in sentry, as it's not likely we can help it. The next event that would trigger a new refresh. Error handling: our side If an unexpected error occurs, we mark the mirror as failed, but we'd still retry the job based on the regular sidekiq retries with backoff. Same as we used to The error would be reported in sentry, since its likely we need to do something about it. 2019-08-13 16:52:01 -04:00			`def retry_or_fail(mirror, message, tries)`
			`if tries < MAX_TRIES`
			`mirror.mark_for_retry!(message)`
Backports every CE related change from ee-5484 to CE 2018-05-03 08:55:14 -04:00			`else`
Rework retry strategy for remote mirrors Prevention of running 2 simultaneous updates Instead of using `RemoteMirror#update_status` and raise an error if it's already running to prevent the same mirror being updated at the same time we now use `Gitlab::ExclusiveLease` for that. When we fail to obtain a lease in 3 tries, 30 seconds apart, we bail and reschedule. We'll reschedule faster for the protected branches. If the mirror already ran since it was scheduled, the job will be skipped. Error handling: Remote side When an update fails because of a `Gitlab::Git::CommandError`, we won't track this error in sentry, this could be on the remote side: for example when branches have diverged. In this case, we'll try 3 times scheduled 1 or 5 minutes apart. In between, the mirror is marked as "to_retry", the error would be visible to the user when they visit the settings page. After 3 tries we'll mark the mirror as failed and notify the user. We won't track this error in sentry, as it's not likely we can help it. The next event that would trigger a new refresh. Error handling: our side If an unexpected error occurs, we mark the mirror as failed, but we'd still retry the job based on the regular sidekiq retries with backoff. Same as we used to The error would be reported in sentry, since its likely we need to do something about it. 2019-08-13 16:52:01 -04:00			`# It's not likely we'll be able to recover from this ourselves, so we'll`
			`# notify the users of the problem, and don't trigger any sidekiq retries`
			`# Instead, we'll wait for the next change to try the push again, or until`
			`# a user manually retries.`
			`mirror.mark_as_failed!(message)`
Backports every CE related change from ee-5484 to CE 2018-05-03 08:55:14 -04:00			`end`
			`end`
			`end`
			`end`