gitlab-org--gitlab-foss/doc/update/with_downtime.md

Ignoring revisions in .git-blame-ignore-revs. Click here to bypass and see the normal blame view.

330 lines
10 KiB
Markdown
Raw Normal View History

---
stage: Systems
group: Distribution
info: To determine the technical writer assigned to the Stage/Group associated with this page, see https://about.gitlab.com/handbook/engineering/ux/technical-writing/#assignments
---
# Multi-node upgrades with downtime **(FREE SELF)**
NOTE:
This process is a work in progress. You're welcome to provide feedback by either raising a ticket to support,
or [commenting on this issue](https://gitlab.com/gitlab-org/omnibus-gitlab/-/issues/6244).
While you can upgrade a multi-node GitLab deployment [with zero downtime](zero_downtime.md),
there are a number of constraints. In particular, you can upgrade to only one minor release
at a time, for example, from 14.6 to 14.7, then to 14.8, etc.
If you want to upgrade to more than one minor release at a time (for example, from 14.6 to 14.9),
you must take your GitLab instance offline, which implies downtime.
Before starting this process, verify the
[version specific upgrading instructions](index.md#version-specific-upgrading-instructions)
relevant to your [upgrade path](index.md#upgrade-paths).
For a single node installation, you must only [upgrade the GitLab package](package/index.md).
The process for upgrading a number of components of a multi-node GitLab
installation is the same as for zero downtime upgrades.
The differences relate to the servers running Rails (Puma/Sidekiq) and
the order of events.
At a high level, the process is:
1. Shut down the GitLab application.
1. Upgrade your Consul servers.
1. Upgrade the other back-end components:
- Gitaly, Rails PostgreSQL, Redis, PgBouncer: these can be upgraded in any order.
- If you use PostgreSQL or Redis from your cloud platform and upgrades are required,
substitute the instructions for Omnibus GitLab with your cloud provider's instructions.
1. Upgrade the GitLab application (Sidekiq, Puma) and start the application up.
If you are a Community Edition user, replace `gitlab-ee` with
`gitlab-ce` in the following commands.
## Stop writes to the database
Shut down Puma and Sidekiq on all servers running these processes:
```shell
sudo gitlab-ctl stop sidekiq
sudo gitlab-ctl stop puma
```
## Upgrade the Consul nodes
[Consult the Consul documentation for the complete instructions](../administration/consul.md#upgrade-the-consul-nodes).
In summary:
1. Check the Consul nodes are all healthy.
1. Upgrade the GitLab package on all your Consul servers:
```shell
# Debian/Ubuntu
sudo apt-get update && sudo apt-get install gitlab-ee
# Centos/RHEL
sudo yum install gitlab-ee
```
1. Restart all GitLab services **one node at a time**:
```shell
sudo gitlab-ctl restart
```
If your Consul cluster processes are not on their own servers, and are shared
with another service such as Redis HA or Patroni, ensure that you follow the
following principles when upgrading those servers:
- Do not restart services more than one server at a time.
- Check the Consul cluster is healthy before upgrading or restarting services.
## Upgrade the Gitaly nodes (Praefect / Gitaly Cluster)
If you're running Gitaly cluster, follow the [zero downtime process](zero_downtime.md#gitaly-or-gitaly-cluster)
for Gitaly cluster.
If you are using Amazon Machine Images (AMIs) on AWS, the Gitaly nodes
**should not be upgraded via the AMI process**. Gitaly nodes should **only**
be upgraded using the package upgrade because:
- Praefect tracks replicas of Git repositories by server hostname.
- Redeployment using AMIs issues the nodes with new hostnames.
- Even though the storage is the same, Gitaly cluster does not work after this.
The Praefect nodes, however, can be upgraded via an AMI redeployment process:
1. The AMI redeployment process must include `gitlab-ctl reconfigure`.
Set `praefect['auto_migrate'] = false` on the AMI so all nodes get this. This
prevents `reconfigure` from automatically running database migrations.
1. The first node to be redeployed with the upgraded image should be your
deploy node.
1. After it's deployed, set `praefect['auto_migrate'] = true` in `gitlab.rb`
and apply with `gitlab-ctl reconfigure`. This runs the database
migrations.
1. Redeploy your other Praefect nodes.
## Upgrade the Gitaly nodes not part of Gitaly cluster
For Gitaly servers which are not part of Gitaly cluster, update the GitLab package:
```shell
# Debian/Ubuntu
sudo apt-get update && sudo apt-get install gitlab-ee
# Centos/RHEL
sudo yum install gitlab-ee
```
If you have multiple Gitaly shards or have multiple load-balanced Gitaly nodes
using NFS, it doesn't matter in which order you upgrade the Gitaly servers.
## Upgrade the PostgreSQL nodes
For unclustered PostgreSQL servers:
1. Upgrade the GitLab package:
```shell
# Debian/Ubuntu
sudo apt-get update && sudo apt-get install gitlab-ee
# Centos/RHEL
sudo yum install gitlab-ee
```
1. The upgrade process does not restart PostgreSQL when the binaries are upgraded.
Restart to load the new version:
```shell
sudo gitlab-ctl restart
```
## Upgrade the Patroni node
Patroni is used to achieve high availability with PostgreSQL.
If a PostgreSQL major version upgrade is required,
[follow the major version process](../administration/postgresql/replication_and_failover.md#upgrading-postgresql-major-version-in-a-patroni-cluster).
The upgrade process for all other versions is performed on all replicas first.
After they're upgraded, a cluster failover occurs from the leader to one of the upgraded
replicas. This ensures that only one failover is needed, and once complete the new
leader is upgraded.
Follow the following process:
1. Identify the leader and replica nodes, and [verify that the cluster is healthy](../administration/postgresql/replication_and_failover.md#check-replication-status).
Run on a database node:
```shell
sudo gitlab-ctl patroni members
```
1. Upgrade the GitLab package on one of the replica nodes:
```shell
# Debian/Ubuntu
sudo apt-get update && sudo apt-get install gitlab-ee
# Centos/RHEL
sudo yum install gitlab-ee
```
1. Restart to load the new version:
```shell
sudo gitlab-ctl restart
```
1. [Verify that the cluster is healthy](../administration/postgresql/replication_and_failover.md#check-replication-status).
1. Repeat these steps for the other replica: upgrade, restart, health check.
1. Upgrade the leader node following the same package upgrade as the replicas.
1. Restart all services on the leader node to load the new version, and also
trigger a cluster failover:
```shell
sudo gitlab-ctl restart
```
1. [Check the cluster is healthy](../administration/postgresql/replication_and_failover.md#check-replication-status)
## Upgrade the PgBouncer nodes
If you run PgBouncer on your Rails (application) nodes, then
PgBouncer are upgraded as part of the application server upgrade.
Upgrade the PgBouncer nodes:
```shell
# Debian/Ubuntu
sudo apt-get update && sudo apt-get install gitlab-ee
# Centos/RHEL
sudo yum install gitlab-ee
```
## Upgrade the Redis node
Upgrade a standalone Redis server by updating the GitLab package:
```shell
# Debian/Ubuntu
sudo apt-get update && sudo apt-get install gitlab-ee
# Centos/RHEL
sudo yum install gitlab-ee
```
## Upgrade Redis HA (using Sentinel) **(PREMIUM SELF)**
Follow [the zero downtime instructions](zero_downtime.md#redis-ha-using-sentinel)
for upgrading your Redis HA cluster.
## Upgrade the Rails nodes (Puma / Sidekiq)
All the Puma and Sidekiq processes were previously shut down. On each node:
1. Ensure `/etc/gitlab/skip-auto-reconfigure` does not exist.
1. Check that Puma and Sidekiq are shut down:
```shell
ps -ef | egrep 'puma: | puma | sidekiq '
```
Select one node that runs Puma. This is your deploy node, and is responsible for
running all database migrations. On the deploy node:
1. Ensure the server is configured to permit regular migrations. Check that
`/etc/gitlab/gitlab.rb` does not contain `gitlab_rails['auto_migrate'] = false`.
Either set it specifically `gitlab_rails['auto_migrate'] = true` or omit it
for the default behavior (`true`).
1. If you're using PgBouncer:
You must bypass PgBouncer and connect directly to PostgreSQL
before running migrations.
Rails uses an advisory lock when attempting to run a migration to prevent
concurrent migrations from running on the same database. These locks are
not shared across transactions, resulting in `ActiveRecord::ConcurrentMigrationError`
and other issues when running database migrations using PgBouncer in transaction
pooling mode.
1. If you're running Patroni, find the leader node. Run on a database node:
```shell
sudo gitlab-ctl patroni members
```
1. Update `gitlab.rb` on the deploy node. Change `gitlab_rails['db_host']`
and `gitlab_rails['db_port']` to either:
- The host and port for your database server (unclustered PostgreSQL).
- The host and port for your cluster leader if you're running Patroni.
1. Apply the changes:
```shell
sudo gitlab-ctl reconfigure
```
1. Upgrade the GitLab package:
```shell
# Debian/Ubuntu
sudo apt-get update && sudo apt-get install gitlab-ee
# Centos/RHEL
sudo yum install gitlab-ee
```
1. If you modified `gitlab.rb` on the deploy node to bypass PgBouncer:
1. Update `gitlab.rb` on the deploy node. Change `gitlab_rails['db_host']`
and `gitlab_rails['db_port']` back to your PgBouncer settings.
1. Apply the changes:
```shell
sudo gitlab-ctl reconfigure
```
1. To ensure all services are running the upgraded version, and (if applicable) accessing
the database using PgBouncer, restart all services on the deploy node:
```shell
sudo gitlab-ctl restart
```
Next, upgrade all the other Puma and Sidekiq nodes. The setting `gitlab_rails['auto_migrate']` can be
set to anything in `gitlab.rb` on these nodes.
They can be upgraded in parallel:
1. Upgrade the GitLab package:
```shell
# Debian/Ubuntu
sudo apt-get update && sudo apt-get install gitlab-ee
# Centos/RHEL
sudo yum install gitlab-ee
```
1. Ensure all services are restarted:
```shell
sudo gitlab-ctl restart
```
## Upgrade the Monitor node
Upgrade the GitLab package:
```shell
# Debian/Ubuntu
sudo apt-get update && sudo apt-get install gitlab-ee
# Centos/RHEL
sudo yum install gitlab-ee
```