kotovalexarian-likes-gitlab/gitlab-org--gitlab-foss

GitLab Bot 2b349d9a94 Add latest changes from gitlab-org/gitlab@master

2020-08-25 18:10:49 +00:00

82 KiB

Raw Blame History

reading_time	stage	group	info
true	Enablement	Distribution	To determine the technical writer assigned to the Stage/Group associated with this page, see https://about.gitlab.com/handbook/engineering/ux/technical-writing/#designated-technical-writers

Reference architecture: up to 50,000 users (PREMIUM ONLY)

This page describes GitLab reference architecture for up to 50,000 users. For a full list of reference architectures, see Available reference architectures.

Supported users (approximate): 50,000

High Availability: Yes

Test requests per second (RPS) rates: API: 1000 RPS, Web: 100 RPS, Git: 100 RPS

Service	Nodes	Configuration	GCP	AWS	Azure
External load balancing node	1	8 vCPU, 7.2GB memory	n1-highcpu-8	c5.2xlarge	F8s v2
Consul	3	2 vCPU, 1.8GB memory	n1-highcpu-2	c5.large	F2s v2
PostgreSQL	3	16 vCPU, 60GB memory	n1-standard-16	m5.4xlarge	D16s v3
PgBouncer	3	2 vCPU, 1.8GB memory	n1-highcpu-2	c5.large	F2s v2
Internal load balancing node	1	8 vCPU, 7.2GB memory	n1-highcpu-8	c5.2xlarge	F8s v2
Redis - Cache	3	4 vCPU, 15GB memory	n1-standard-4	m5.xlarge	D4s v3
Redis - Queues / Shared State	3	4 vCPU, 15GB memory	n1-standard-4	m5.xlarge	D4s v3
Redis Sentinel - Cache	3	1 vCPU, 1.7GB memory	g1-small	t2.small	B1MS
Redis Sentinel - Queues / Shared State	3	1 vCPU, 1.7GB memory	g1-small	t2.small	B1MS
Gitaly	2 (minimum)	64 vCPU, 240GB memory	n1-standard-64	m5.16xlarge	D64s v3
Sidekiq	4	4 vCPU, 15GB memory	n1-standard-4	m5.xlarge	D4s v3
GitLab Rails	12	32 vCPU, 28.8GB memory	n1-highcpu-32	c5.9xlarge	F32s v2
Monitoring node	1	4 vCPU, 3.6GB memory	n1-highcpu-4	c5.xlarge	F4s v2
Object Storage	n/a	n/a	n/a	n/a	n/a
NFS Server	1	4 vCPU, 3.6GB memory	n1-highcpu-4	c5.xlarge	F4s v2

The Google Cloud Platform (GCP) architectures were built and tested using the Intel Xeon E5 v3 (Haswell) CPU platform. On different hardware you may find that adjustments, either lower or higher, are required for your CPU or node counts. For more information, see our Sysbench-based CPU benchmark.

For data objects (such as LFS, Uploads, or Artifacts), an object storage service is recommended instead of NFS where possible, due to better performance and availability. Since this doesn't require a node to be set up, Object Storage is noted as not applicable (n/a) in the previous table.

Setup components

To set up GitLab and its components to accommodate up to 50,000 users:

Configure the external load balancing node that will handle the load balancing of the three GitLab application services nodes.
Configure Consul.
Configure PostgreSQL, the database for GitLab.
Configure PgBouncer.
Configure the internal load balancing node
Configure Redis.
Configure Gitaly, which provides access to the Git repositories.
Configure Sidekiq.
Configure the main GitLab Rails application to run Puma/Unicorn, Workhorse, GitLab Shell, and to serve all frontend requests (UI, API, Git over HTTP/SSH).
Configure Prometheus to monitor your GitLab environment.
Configure the Object Storage used for shared data objects.
Configure NFS (Optional) to have shared disk storage service as an alternative to Gitaly and/or Object Storage (although not recommended). NFS is required for GitLab Pages, you can skip this step if you're not using that feature.

We start with all servers on the same 10.6.0.0/24 private network range, they can connect to each other freely on those addresses.

Here is a list and description of each machine and the assigned IP:

10.6.0.10: External Load Balancer
10.6.0.11: Consul 1
10.6.0.12: Consul 2
10.6.0.13: Consul 3
10.6.0.21: PostgreSQL primary
10.6.0.22: PostgreSQL secondary 1
10.6.0.23: PostgreSQL secondary 2
10.6.0.31: PgBouncer 1
10.6.0.32: PgBouncer 2
10.6.0.33: PgBouncer 3
10.6.0.40: Internal Load Balancer
10.6.0.51: Redis - Cache Primary
10.6.0.52: Redis - Cache Replica 1
10.6.0.53: Redis - Cache Replica 2
10.6.0.71: Sentinel - Cache 1
10.6.0.72: Sentinel - Cache 2
10.6.0.73: Sentinel - Cache 3
10.6.0.61: Redis - Queues Primary
10.6.0.62: Redis - Queues Replica 1
10.6.0.63: Redis - Queues Replica 2
10.6.0.81: Sentinel - Queues 1
10.6.0.82: Sentinel - Queues 2
10.6.0.83: Sentinel - Queues 3
10.6.0.91: Gitaly 1
10.6.0.92: Gitaly 2
10.6.0.101: Sidekiq 1
10.6.0.102: Sidekiq 2
10.6.0.103: Sidekiq 3
10.6.0.104: Sidekiq 4
10.6.0.111: GitLab application 1
10.6.0.112: GitLab application 2
10.6.0.113: GitLab application 3
10.6.0.121: Prometheus

Configure the external load balancer

NOTE: Note: This architecture has been tested and validated with HAProxy as the load balancer. Although other load balancers with similar feature sets could also be used, those load balancers have not been validated.

In an active/active GitLab configuration, you will need a load balancer to route traffic to the application servers. The specifics on which load balancer to use or the exact configuration is beyond the scope of GitLab documentation. We hope that if you're managing multi-node systems like GitLab you have a load balancer of choice already. Some examples including HAProxy (open-source), F5 Big-IP LTM, and Citrix Net Scaler. This documentation will outline what ports and protocols you need to use with GitLab.

The next question is how you will handle SSL in your environment. There are several different options:

The application node terminates SSL.
The load balancer terminates SSL without backend SSL and communication is not secure between the load balancer and the application node.
The load balancer terminates SSL with backend SSL and communication is secure between the load balancer and the application node.

Application node terminates SSL

Configure your load balancer to pass connections on port 443 as TCP rather than HTTP(S) protocol. This will pass the connection to the application node's NGINX service untouched. NGINX will have the SSL certificate and listen on port 443.

See the NGINX HTTPS documentation for details on managing SSL certificates and configuring NGINX.

Load balancer terminates SSL without backend SSL

Configure your load balancer to use the HTTP(S) protocol rather than TCP. The load balancer will then be responsible for managing SSL certificates and terminating SSL.

Since communication between the load balancer and GitLab will not be secure, there is some additional configuration needed. See the NGINX proxied SSL documentation for details.

Load balancer terminates SSL with backend SSL

Configure your load balancer(s) to use the 'HTTP(S)' protocol rather than 'TCP'. The load balancer(s) will be responsible for managing SSL certificates that end users will see.

Traffic will also be secure between the load balancer(s) and NGINX in this scenario. There is no need to add configuration for proxied SSL since the connection will be secure all the way. However, configuration will need to be added to GitLab to configure SSL certificates. See NGINX HTTPS documentation for details on managing SSL certificates and configuring NGINX.

Ports

The basic ports to be used are shown in the table below.

LB Port	Backend Port	Protocol
80	80	HTTP (1)
443	443	TCP or HTTPS (1) (2)
22	22	TCP

(1): Web terminal support requires your load balancer to correctly handle WebSocket connections. When using HTTP or HTTPS proxying, this means your load balancer must be configured to pass through the Connection and Upgrade hop-by-hop headers. See the web terminal integration guide for more details.
(2): When using HTTPS protocol for port 443, you will need to add an SSL certificate to the load balancers. If you wish to terminate SSL at the GitLab application server instead, use TCP protocol.

If you're using GitLab Pages with custom domain support you will need some additional port configurations. GitLab Pages requires a separate virtual IP address. Configure DNS to point the pages_external_url from /etc/gitlab/gitlab.rb at the new virtual IP address. See the GitLab Pages documentation for more information.

LB Port	Backend Port	Protocol
80	Varies (1)	HTTP
443	Varies (1)	TCP (2)

(1): The backend port for GitLab Pages depends on the gitlab_pages['external_http'] and gitlab_pages['external_https'] setting. See GitLab Pages documentation for more details.
(2): Port 443 for GitLab Pages should always use the TCP protocol. Users can configure custom domains with custom SSL, which would not be possible if SSL was terminated at the load balancer.

Alternate SSH Port

Some organizations have policies against opening SSH port 22. In this case, it may be helpful to configure an alternate SSH hostname that allows users to use SSH on port 443. An alternate SSH hostname will require a new virtual IP address compared to the other GitLab HTTP configuration above.

Configure DNS for an alternate SSH hostname such as altssh.gitlab.example.com.

LB Port	Backend Port	Protocol
443	22	TCP

82 KiB Raw Blame History

Reference architecture: up to 50,000 users (PREMIUM ONLY)

Setup components

Configure the external load balancer

Application node terminates SSL

Load balancer terminates SSL without backend SSL

Load balancer terminates SSL with backend SSL

Ports

Alternate SSH Port

Configure Consul

Configure PostgreSQL

Provide your own PostgreSQL instance

Standalone PostgreSQL using Omnibus GitLab

PostgreSQL primary node

PostgreSQL secondary nodes

PostgreSQL post-configuration

Configure PgBouncer

Configure the internal load balancer

Configure Redis

Configure the Redis and Sentinel Cache cluster

Configure the primary Redis Cache node

Configure the replica Redis Cache nodes

Configure the Sentinel Cache nodes

Configure the Redis and Sentinel Queues cluster

Configure the primary Redis Queues node

Configure the replica Redis Queues nodes

Configure the Sentinel Queues nodes

Configure Gitaly

Gitaly TLS support

Configure Sidekiq

Configure GitLab Rails

GitLab Rails post-configuration

Configure Prometheus

Configure the object storage

Configure NFS (optional)

Troubleshooting

82 KiB

Raw Blame History