kotovalexarian-likes-gitlab/gitlab-org--gitlab-foss

GitLab Bot 6d533fe8b4 Add latest changes from gitlab-org/gitlab@master

2021-02-12 18:08:59 +00:00

84 KiB

Raw Blame History

reading_time	stage	group	info
true	Enablement	Distribution	To determine the technical writer assigned to the Stage/Group associated with this page, see https://about.gitlab.com/handbook/engineering/ux/technical-writing/#assignments

Reference architecture: up to 50,000 users (PREMIUM SELF)

This page describes GitLab reference architecture for up to 50,000 users. For a full list of reference architectures, see Available reference architectures.

Supported users (approximate): 50,000

High Availability: Yes

Test requests per second (RPS) rates: API: 1000 RPS, Web: 100 RPS, Git (Pull): 100 RPS, Git (Push): 20 RPS

Service	Nodes	Configuration	GCP	AWS	Azure
External load balancing node	1	8 vCPU, 7.2 GB memory	n1-highcpu-8	`c5.2xlarge`	F8s v2
Consul	3	2 vCPU, 1.8 GB memory	n1-highcpu-2	`c5.large`	F2s v2
PostgreSQL	3	16 vCPU, 60 GB memory	n1-standard-16	`m5.4xlarge`	D16s v3
PgBouncer	3	2 vCPU, 1.8 GB memory	n1-highcpu-2	`c5.large`	F2s v2
Internal load balancing node	1	8 vCPU, 7.2 GB memory	n1-highcpu-8	`c5.2xlarge`	F8s v2
Redis - Cache	3	4 vCPU, 15 GB memory	n1-standard-4	`m5.xlarge`	D4s v3
Redis - Queues / Shared State	3	4 vCPU, 15 GB memory	n1-standard-4	`m5.xlarge`	D4s v3
Redis Sentinel - Cache	3	1 vCPU, 1.7 GB memory	g1-small	`t3.small`	B1MS
Redis Sentinel - Queues / Shared State	3	1 vCPU, 1.7 GB memory	g1-small	`t3.small`	B1MS
Gitaly	2 (minimum)	64 vCPU, 240 GB memory	n1-standard-64	`m5.16xlarge`	D64s v3
Sidekiq	4	4 vCPU, 15 GB memory	n1-standard-4	`m5.xlarge`	D4s v3
GitLab Rails	12	32 vCPU, 28.8 GB memory	n1-highcpu-32	`c5.9xlarge`	F32s v2
Monitoring node	1	4 vCPU, 3.6 GB memory	n1-highcpu-4	`c5.xlarge`	F4s v2
Object storage	n/a	n/a	n/a	n/a	n/a
NFS server	1	4 vCPU, 3.6 GB memory	n1-highcpu-4	`c5.xlarge`	F4s v2

stateDiagram-v2
    [*] --> LoadBalancer
    LoadBalancer --> ApplicationServer

    ApplicationServer --> BackgroundJobs
    ApplicationServer --> Gitaly
    ApplicationServer --> Redis_Cache
    ApplicationServer --> Redis_Queues
    ApplicationServer --> PgBouncer
    PgBouncer --> Database
    ApplicationServer --> ObjectStorage
    BackgroundJobs --> ObjectStorage

    ApplicationMonitoring -->ApplicationServer
    ApplicationMonitoring -->PgBouncer
    ApplicationMonitoring -->Database
    ApplicationMonitoring -->BackgroundJobs

    ApplicationServer --> Consul

    Consul --> Database
    Consul --> PgBouncer
    Redis_Cache --> Consul
    Redis_Queues --> Consul
    BackgroundJobs --> Consul

    state Consul {
      "Consul_1..3"
    }

    state Database {
      "PG_Primary_Node"
      "PG_Secondary_Node_1..2"
    }

    state Redis_Cache {
      "R_Cache_Primary_Node"
      "R_Cache_Replica_Node_1..2"
      "R_Cache_Sentinel_1..3"
    }

    state Redis_Queues {
      "R_Queues_Primary_Node"
      "R_Queues_Replica_Node_1..2"
      "R_Queues_Sentinel_1..3"
    }

    state Gitaly {
      "Gitaly_1..2"
    }

    state BackgroundJobs {
      "Sidekiq_1..4"
    }

    state ApplicationServer {
      "GitLab_Rails_1..12"
    }

    state LoadBalancer {
      "LoadBalancer_1"
    }

    state ApplicationMonitoring {
      "Prometheus"
      "Grafana"
    }

    state PgBouncer {
      "Internal_Load_Balancer"
      "PgBouncer_1..3"
    }

The Google Cloud Platform (GCP) architectures were built and tested using the Intel Xeon E5 v3 (Haswell) CPU platform. On different hardware you may find that adjustments, either lower or higher, are required for your CPU or node counts. For more information, see our Sysbench-based CPU benchmark.

Due to better performance and availability, for data objects (such as LFS, uploads, or artifacts), using an object storage service is recommended instead of using NFS. Using an object storage service also doesn't require you to provision and maintain a node.

Setup components

To set up GitLab and its components to accommodate up to 50,000 users:

Configure the external load balancing node to handle the load balancing of the GitLab application services nodes.
Configure Consul.
Configure PostgreSQL, the database for GitLab.
Configure PgBouncer.
Configure the internal load balancing node.
Configure Redis.
Configure Gitaly, which provides access to the Git repositories.
Configure Sidekiq.
Configure the main GitLab Rails application to run Puma/Unicorn, Workhorse, GitLab Shell, and to serve all frontend requests (which include UI, API, and Git over HTTP/SSH).
Configure Prometheus to monitor your GitLab environment.
Configure the object storage used for shared data objects.
Configure Advanced Search (optional) for faster, more advanced code search across your entire GitLab instance.
Configure NFS (optional, and not recommended) to have shared disk storage service as an alternative to Gitaly or object storage. You can skip this step if you're not using GitLab Pages (which requires NFS).

The servers start on the same 10.6.0.0/24 private network range, and can connect to each other freely on these addresses.

The following list includes descriptions of each server and its assigned IP:

10.6.0.10: External Load Balancer
10.6.0.11: Consul 1
10.6.0.12: Consul 2
10.6.0.13: Consul 3
10.6.0.21: PostgreSQL primary
10.6.0.22: PostgreSQL secondary 1
10.6.0.23: PostgreSQL secondary 2
10.6.0.31: PgBouncer 1
10.6.0.32: PgBouncer 2
10.6.0.33: PgBouncer 3
10.6.0.40: Internal Load Balancer
10.6.0.51: Redis - Cache Primary
10.6.0.52: Redis - Cache Replica 1
10.6.0.53: Redis - Cache Replica 2
10.6.0.71: Sentinel - Cache 1
10.6.0.72: Sentinel - Cache 2
10.6.0.73: Sentinel - Cache 3
10.6.0.61: Redis - Queues Primary
10.6.0.62: Redis - Queues Replica 1
10.6.0.63: Redis - Queues Replica 2
10.6.0.81: Sentinel - Queues 1
10.6.0.82: Sentinel - Queues 2
10.6.0.83: Sentinel - Queues 3
10.6.0.91: Gitaly 1
10.6.0.92: Gitaly 2
10.6.0.101: Sidekiq 1
10.6.0.102: Sidekiq 2
10.6.0.103: Sidekiq 3
10.6.0.104: Sidekiq 4
10.6.0.111: GitLab application 1
10.6.0.112: GitLab application 2
10.6.0.113: GitLab application 3
10.6.0.121: Prometheus

Configure the external load balancer

In an active/active GitLab configuration, you'll need a load balancer to route traffic to the application servers. The specifics on which load balancer to use or its exact configuration is beyond the scope of GitLab documentation. We hope that if you're managing multi-node systems like GitLab, you already have a load balancer of choice. Some load balancer examples include HAProxy (open-source), F5 Big-IP LTM, and Citrix Net Scaler. This documentation outline the ports and protocols needed for use with GitLab.

This architecture has been tested and validated with HAProxy as the load balancer. Although other load balancers with similar feature sets could also be used, those load balancers have not been validated.

The next question is how you will handle SSL in your environment. There are several different options:

The application node terminates SSL.
The load balancer terminates SSL without backend SSL and communication is not secure between the load balancer and the application node.
The load balancer terminates SSL with backend SSL and communication is secure between the load balancer and the application node.

Application node terminates SSL

Configure your load balancer to pass connections on port 443 as TCP rather than HTTP(S) protocol. This will pass the connection to the application node's NGINX service untouched. NGINX will have the SSL certificate and listen on port 443.

See the NGINX HTTPS documentation for details on managing SSL certificates and configuring NGINX.

Load balancer terminates SSL without backend SSL

Configure your load balancer to use the HTTP(S) protocol rather than TCP. The load balancer will then be responsible for managing SSL certificates and terminating SSL.

Since communication between the load balancer and GitLab will not be secure, there is some additional configuration needed. See the NGINX proxied SSL documentation for details.

Load balancer terminates SSL with backend SSL

Configure your load balancer(s) to use the 'HTTP(S)' protocol rather than 'TCP'. The load balancer(s) will be responsible for managing SSL certificates that end users will see.

Traffic will also be secure between the load balancer(s) and NGINX in this scenario. There is no need to add configuration for proxied SSL since the connection will be secure all the way. However, configuration will need to be added to GitLab to configure SSL certificates. See NGINX HTTPS documentation for details on managing SSL certificates and configuring NGINX.

Readiness checks

Ensure the external load balancer only routes to working services with built in monitoring endpoints. The readiness checks all require additional configuration on the nodes being checked, otherwise, the external load balancer will not be able to connect.

Ports

The basic ports to be used are shown in the table below.

LB Port	Backend Port	Protocol
80	80	HTTP (1)
443	443	TCP or HTTPS (1) (2)
22	22	TCP

(1): Web terminal support requires your load balancer to correctly handle WebSocket connections. When using HTTP or HTTPS proxying, this means your load balancer must be configured to pass through the Connection and Upgrade hop-by-hop headers. See the web terminal integration guide for more details.
(2): When using HTTPS protocol for port 443, you will need to add an SSL certificate to the load balancers. If you wish to terminate SSL at the GitLab application server instead, use TCP protocol.

If you're using GitLab Pages with custom domain support you will need some additional port configurations. GitLab Pages requires a separate virtual IP address. Configure DNS to point the pages_external_url from /etc/gitlab/gitlab.rb at the new virtual IP address. See the GitLab Pages documentation for more information.

LB Port	Backend Port	Protocol
80	Varies (1)	HTTP
443	Varies (1)	TCP (2)

(1): The backend port for GitLab Pages depends on the gitlab_pages['external_http'] and gitlab_pages['external_https'] setting. See GitLab Pages documentation for more details.
(2): Port 443 for GitLab Pages should always use the TCP protocol. Users can configure custom domains with custom SSL, which would not be possible if SSL was terminated at the load balancer.

Alternate SSH Port

Some organizations have policies against opening SSH port 22. In this case, it may be helpful to configure an alternate SSH hostname that allows users to use SSH on port 443. An alternate SSH hostname will require a new virtual IP address compared to the other GitLab HTTP configuration above.

Configure DNS for an alternate SSH hostname such as altssh.gitlab.example.com.

LB Port	Backend Port	Protocol
443	22	TCP

Object storage type	Supported by consolidated configuration?
Backups	No
Job artifacts including archived job logs	Yes
LFS objects	Yes
Uploads	Yes
Container Registry (optional feature)	No
Merge request diffs	Yes
Mattermost	No
Packages (optional feature)	Yes
Dependency Proxy (optional feature)	Yes
Pseudonymizer (optional feature) (ULTIMATE SELF)	No
Autoscale runner caching (optional for improved performance)	No
Terraform state files	Yes

84 KiB Raw Blame History

Reference architecture: up to 50,000 users (PREMIUM SELF)

Setup components

Configure the external load balancer

Application node terminates SSL

Load balancer terminates SSL without backend SSL

Load balancer terminates SSL with backend SSL

Readiness checks

Ports

Alternate SSH Port

Configure Consul

Configure PostgreSQL

Provide your own PostgreSQL instance

Standalone PostgreSQL using Omnibus GitLab

PostgreSQL nodes

PostgreSQL post-configuration

Configure PgBouncer

Configure the internal load balancer

Configure Redis

Providing your own Redis instance

Configure the Redis and Sentinel Cache cluster

Configure the primary Redis Cache node

Configure the replica Redis Cache nodes

Configure the Sentinel Cache nodes

Configure the Redis and Sentinel Queues cluster

Configure the primary Redis Queues node

Configure the replica Redis Queues nodes

Configure the Sentinel Queues nodes

Configure Gitaly

Gitaly TLS support

Configure Sidekiq

Configure GitLab Rails

GitLab Rails post-configuration

Configure Prometheus

Configure the object storage

Configure Advanced Search (PREMIUM SELF)

Configure NFS (optional)

Troubleshooting

84 KiB

Raw Blame History