gitlab-org--gitlab-foss/doc/development/rolling_out_changes_using_feature_flags.md

# Rolling out changes using feature flags

[Feature flags](feature_flags.md) can be used to gradually roll out changes, be
it a new feature, or a performance improvement. By using feature flags, we can
comfortably measure the impact of our changes, while still being able to easily
disable those changes, without having to revert an entire release.

## When to use feature flags

Starting with GitLab 11.4, developers are required to use feature flags for
non-trivial changes. Such changes include:

- New features (e.g. a new merge request widget, epics, etc).
- Complex performance improvements that may require additional testing in
  production, such as rewriting complex queries.
- Invasive changes to the user interface, such as a new navigation bar or the
  removal of a sidebar.
- Adding support for importing projects from a third-party service.

In all cases, those working on the changes can best decide if a feature flag is
necessary. For example, changing the color of a button doesn't need a feature
flag, while changing the navigation bar definitely needs one. In case you are
uncertain if a feature flag is necessary, simply ask about this in the merge
request, and those reviewing the changes will likely provide you with an answer.

When using a feature flag for UI elements, make sure to _also_ use a feature
flag for the underlying backend code, if there is any. This ensures there is
absolutely no way to use the feature until it is enabled.

## The cost of feature flags

When reading the above, one might be tempted to think this procedure is going to
add a lot of work. Fortunately, this is not the case, and we'll show why. For
this example we'll specify the cost of the work to do as a number, ranging from
0 to infinity. The greater the number, the more expensive the work is. The cost
does _not_ translate to time, it's just a way of measuring complexity of one
change relative to another.

Let's say we are building a new feature, and we have determined that the cost of
this is 10. We have also determined that the cost of adding a feature flag check
in a variety of places is 1. If we do not use feature flags, and our feature
works as intended, our total cost is 10. This however is the best case scenario.
Optimising for the best case scenario is guaranteed to lead to trouble, whereas
optimising for the worst case scenario is almost always better.

To illustrate this, let's say our feature causes an outage, and there's no
immediate way to resolve it. This means we'd have to take the following steps to
resolve the outage:

1. Revert the release.
1. Perform any cleanups that might be necessary, depending on the changes that
   were made.
1. Revert the commit, ensuring the "master" branch remains stable. This is
   especially necessary if solving the problem can take days or even weeks.
1. Pick the revert commit into the appropriate stable branches, ensuring we
   don't block any future releases until the problem is resolved.

As history has shown, these steps are time consuming, complex, often involve
many developers, and worst of all: our users will have a bad experience using
GitLab.com until the problem is resolved.

Now let's say that all of this has an associated cost of 10. This means that in
the worst case scenario, which we should optimise for, our total cost is now 20.

If we had used a feature flag, things would have been very different. We don't
need to revert a release, and because feature flags are disabled by default we
don't need to revert and pick any Git commits. In fact, all we have to do is
disable the feature, and in the worst case, perform cleanup. Let's say that 
the cost of this is 2. In this case, our best case cost is 11: 10 to build the 
feature, and 1 to add the feature flag. The worst case cost is now 13: 10 to 
build the feature, 1 to add the feature flag, and 2 to disable and clean up.

Here we can see that in the best case scenario the work necessary is only a tiny
bit more compared to not using a feature flag. Meanwhile, the process of
reverting our changes has been made significantly and reliably cheaper.

In other words, feature flags do not slow down the development process. Instead,
they speed up the process as managing incidents now becomes _much_ easier. Once
continuous deployments are easier to perform, the time to iterate on a feature
is reduced even further, as you no longer need to wait weeks before your changes
are available on GitLab.com.

## Rolling out changes

The procedure of using feature flags is straightforward, and similar to not
using them. You add the necessary tests (make sure to test both the on and off
states of your feature flag(s)), make sure they all pass, have the code
reviewed, etc. You then submit your merge request, and add the ~"feature flag"
label. This label is used to signal to release managers that your changes are
hidden behind a feature flag and that it is safe to pick the MR into a stable
branch, without the need for an exception request.

When the changes are deployed it is time to start rolling out the feature to our
users. The exact procedure of rolling out a change is unspecified, as this can
vary from change to change. However, in general we recommend rolling out changes
incrementally, instead of enabling them for everybody right away. We also
recommend you to _not_ enable a feature _before_ the code is being deployed.
This allows you to separate rolling out a feature from a deploy, making it
easier to measure the impact of both separately.

GitLab's feature library (using
[Flipper](https://github.com/jnunemaker/flipper), and covered in the [Feature
Flags](feature_flags.md) guide) supports rolling out changes to a percentage of
users. This in turn can be controlled using [GitLab
chatops](../ci/chatops/README.md).

For an up to date list of feature flag commands please see [the source
code](https://gitlab.com/gitlab-com/chatops/blob/master/lib/chatops/commands/feature.rb).
Note that all the examples in that file must be preceded by
`/chatops run`.

If you get an error "Whoops! This action is not allowed. This incident
will be reported." that means your Slack account is not allowed to
change feature flags. To test if you are allowed to do anything at all,
run:

```
/chatops run feature --help
```

For example, to enable a feature for 25% of all users, run the following in
Slack:

```
/chatops run feature set new_navigation_bar 25
```

This will enable the feature for GitLab.com, with `new_navigation_bar` being the
name of the feature. We can also enable the feature for <https://dev.gitlab.org>
or <https://staging.gitlab.com>:

```
/chatops run feature set new_navigation_bar 25 --dev
/chatops run feature set new_navigation_bar 25 --staging
```

If you are not certain what percentages to use, simply use the following steps:

1. 25%
1. 50%
1. 75%
1. 100%

Between every step you'll want to wait a little while and monitor the
appropriate graphs on <https://dashboards.gitlab.net>. The exact time to wait
may differ. For some features a few minutes is enough, while for others you may
want to wait several hours or even days. This is entirely up to you, just make
sure it is clearly communicated to your team, and the Production team if you
anticipate any potential problems.

Feature gates can also be actor based, for example a feature could first be
enabled for only the `gitlab-ce` project. The project is passed by supplying a
`--project` flag:

```
/chatops run feature set --project=gitlab-org/gitlab-ce some_feature true
```

For groups the `--group` flag is available:

```
/chatops run feature set --group=gitlab-org some_feature true
```

Once a change is deemed stable, submit a new merge request to remove the
feature flag. This ensures the change is available to all users and self-hosted
instances. Make sure to add the ~"feature flag" label to this merge request so
release managers are aware the changes are hidden behind a feature flag. If the
merge request has to be picked into a stable branch (e.g. after the 7th), make
sure to also add the appropriate "Pick into X" label (e.g. "Pick into 11.4").

One might be tempted to think this will delay the release of a feature by at
least one month (= one release). This is not the case. A feature flag does not
have to stick around for a specific amount of time (e.g. at least one release),
instead they should stick around until the feature is deemed stable. Stable
means it works on GitLab.com without causing any problems, such as outages. In
most cases this will translate to a feature (with a feature flag) being shipped
in RC1, followed by the feature flag being removed in RC2. This in turn means
the feature will be stable by the time we publish a stable package around the
22nd of the month.

## Implicit feature flags

The [`Project#feature_available?`][project-fa],
[`Namespace#feature_available?`][namespace-fa] (EE), and
[`License.feature_available?`][license-fa] (EE) methods all implicitly check for
a feature flag by the same name as the provided argument.

For example if a feature is license-gated, there's no need to add an additional
explicit feature flag check since the flag will be checked as part of the
`License.feature_available?` call. Similarly, there's no need to "clean up" a
feature flag once the feature has reached general availability.

You'd still want to use an explicit `Feature.enabled?` check if your new feature
isn't gated by a License or Plan.

[project-fa]: https://gitlab.com/gitlab-org/gitlab-ee/blob/4cc1c62918aa4c31750cb21dfb1a6c3492d71080/app/models/project_feature.rb#L63-68
[namespace-fa]: https://gitlab.com/gitlab-org/gitlab-ee/blob/4cc1c62918aa4c31750cb21dfb1a6c3492d71080/ee/app/models/ee/namespace.rb#L71-85
[license-fa]: https://gitlab.com/gitlab-org/gitlab-ee/blob/4cc1c62918aa4c31750cb21dfb1a6c3492d71080/ee/app/models/license.rb#L293-300

### Undefined feature flags default to "on"

An important side-effect of the [implicit feature flags](#implicit-feature-flags) 
mentioned above is that unless the feature is explicitly disabled or limited to a 
percentage of users, the feature flag check will default to `true`.

As an example, if you were to ship the backend half of a feature behind a flag,
you'd want to explicitly disable that flag until the frontend half is also ready
to be shipped. You can do this via ChatOps:

```
/chatops run feature set some_feature 0
```

Note that you can do this at any time, even before the merge request using the
flag has been merged!

### Cleaning up

When a feature gate has been removed from the code base, the value still exists
in the database. This can be removed through ChatOps:

```
/chatops run feature delete some_feature
```
Document the need for feature flags This adds a development guide explaining that we are going to use feature flags more often, why, what the benefits are, and so on. See https://gitlab.com/gitlab-org/gitlab-ce/issues/49619 for more information. 2018-09-03 11:35:44 -04:00			`# Rolling out changes using feature flags`

			`[Feature flags](feature_flags.md) can be used to gradually roll out changes, be`
			`it a new feature, or a performance improvement. By using feature flags, we can`
			`comfortably measure the impact of our changes, while still being able to easily`
			`disable those changes, without having to revert an entire release.`

			`## When to use feature flags`

			`Starting with GitLab 11.4, developers are required to use feature flags for`
			`non-trivial changes. Such changes include:`

Make unordered lists conform to styleguide - Also makes other minor Markdown fixes that were near the main fixes. 2018-11-13 01:07:16 -05:00			`- New features (e.g. a new merge request widget, epics, etc).`
			`- Complex performance improvements that may require additional testing in`
Document the need for feature flags This adds a development guide explaining that we are going to use feature flags more often, why, what the benefits are, and so on. See https://gitlab.com/gitlab-org/gitlab-ce/issues/49619 for more information. 2018-09-03 11:35:44 -04:00			`production, such as rewriting complex queries.`
Make unordered lists conform to styleguide - Also makes other minor Markdown fixes that were near the main fixes. 2018-11-13 01:07:16 -05:00			`- Invasive changes to the user interface, such as a new navigation bar or the`
Document the need for feature flags This adds a development guide explaining that we are going to use feature flags more often, why, what the benefits are, and so on. See https://gitlab.com/gitlab-org/gitlab-ce/issues/49619 for more information. 2018-09-03 11:35:44 -04:00			`removal of a sidebar.`
Make unordered lists conform to styleguide - Also makes other minor Markdown fixes that were near the main fixes. 2018-11-13 01:07:16 -05:00			`- Adding support for importing projects from a third-party service.`
Document the need for feature flags This adds a development guide explaining that we are going to use feature flags more often, why, what the benefits are, and so on. See https://gitlab.com/gitlab-org/gitlab-ce/issues/49619 for more information. 2018-09-03 11:35:44 -04:00
			`In all cases, those working on the changes can best decide if a feature flag is`
			`necessary. For example, changing the color of a button doesn't need a feature`
			`flag, while changing the navigation bar definitely needs one. In case you are`
			`uncertain if a feature flag is necessary, simply ask about this in the merge`
			`request, and those reviewing the changes will likely provide you with an answer.`

			`When using a feature flag for UI elements, make sure to _also_ use a feature`
			`flag for the underlying backend code, if there is any. This ensures there is`
			`absolutely no way to use the feature until it is enabled.`

			`## The cost of feature flags`

			`When reading the above, one might be tempted to think this procedure is going to`
			`add a lot of work. Fortunately, this is not the case, and we'll show why. For`
			`this example we'll specify the cost of the work to do as a number, ranging from`
			`0 to infinity. The greater the number, the more expensive the work is. The cost`
			`does _not_ translate to time, it's just a way of measuring complexity of one`
			`change relative to another.`

			`Let's say we are building a new feature, and we have determined that the cost of`
			`this is 10. We have also determined that the cost of adding a feature flag check`
			`in a variety of places is 1. If we do not use feature flags, and our feature`
			`works as intended, our total cost is 10. This however is the best case scenario.`
			`Optimising for the best case scenario is guaranteed to lead to trouble, whereas`
			`optimising for the worst case scenario is almost always better.`

			`To illustrate this, let's say our feature causes an outage, and there's no`
			`immediate way to resolve it. This means we'd have to take the following steps to`
			`resolve the outage:`

			`1. Revert the release.`
			`1. Perform any cleanups that might be necessary, depending on the changes that`
			`were made.`
			`1. Revert the commit, ensuring the "master" branch remains stable. This is`
			`especially necessary if solving the problem can take days or even weeks.`
			`1. Pick the revert commit into the appropriate stable branches, ensuring we`
			`don't block any future releases until the problem is resolved.`

			`As history has shown, these steps are time consuming, complex, often involve`
			`many developers, and worst of all: our users will have a bad experience using`
			`GitLab.com until the problem is resolved.`

			`Now let's say that all of this has an associated cost of 10. This means that in`
			`the worst case scenario, which we should optimise for, our total cost is now 20.`

			`If we had used a feature flag, things would have been very different. We don't`
			`need to revert a release, and because feature flags are disabled by default we`
			`don't need to revert and pick any Git commits. In fact, all we have to do is`
Don't undermine the point with biased logic I.e. Include cleanup in both sides of the comparison. 2019-02-28 15:54:46 -05:00			`disable the feature, and in the worst case, perform cleanup. Let's say that`
			`the cost of this is 2. In this case, our best case cost is 11: 10 to build the`
			`feature, and 1 to add the feature flag. The worst case cost is now 13: 10 to`
			`build the feature, 1 to add the feature flag, and 2 to disable and clean up.`
Document the need for feature flags This adds a development guide explaining that we are going to use feature flags more often, why, what the benefits are, and so on. See https://gitlab.com/gitlab-org/gitlab-ce/issues/49619 for more information. 2018-09-03 11:35:44 -04:00
			`Here we can see that in the best case scenario the work necessary is only a tiny`
			`bit more compared to not using a feature flag. Meanwhile, the process of`
Don't undermine the point with biased logic I.e. Include cleanup in both sides of the comparison. 2019-02-28 15:54:46 -05:00			`reverting our changes has been made significantly and reliably cheaper.`
Document the need for feature flags This adds a development guide explaining that we are going to use feature flags more often, why, what the benefits are, and so on. See https://gitlab.com/gitlab-org/gitlab-ce/issues/49619 for more information. 2018-09-03 11:35:44 -04:00
			`In other words, feature flags do not slow down the development process. Instead,`
			`they speed up the process as managing incidents now becomes _much_ easier. Once`
			`continuous deployments are easier to perform, the time to iterate on a feature`
			`is reduced even further, as you no longer need to wait weeks before your changes`
			`are available on GitLab.com.`

			`## Rolling out changes`

			`The procedure of using feature flags is straightforward, and similar to not`
			`using them. You add the necessary tests (make sure to test both the on and off`
			`states of your feature flag(s)), make sure they all pass, have the code`
			`reviewed, etc. You then submit your merge request, and add the ~"feature flag"`
			`label. This label is used to signal to release managers that your changes are`
			`hidden behind a feature flag and that it is safe to pick the MR into a stable`
			`branch, without the need for an exception request.`

			`When the changes are deployed it is time to start rolling out the feature to our`
			`users. The exact procedure of rolling out a change is unspecified, as this can`
			`vary from change to change. However, in general we recommend rolling out changes`
			`incrementally, instead of enabling them for everybody right away. We also`
			`recommend you to _not_ enable a feature _before_ the code is being deployed.`
			`This allows you to separate rolling out a feature from a deploy, making it`
			`easier to measure the impact of both separately.`

			`GitLab's feature library (using`
			`[Flipper](https://github.com/jnunemaker/flipper), and covered in the [Feature`
			`Flags](feature_flags.md) guide) supports rolling out changes to a percentage of`
			`users. This in turn can be controlled using [GitLab`
Use relative URLs in development docs This is part of https://gitlab.com/gitlab-org/gitlab-ce/issues/61945 2019-05-19 19:27:22 -04:00			`chatops](../ci/chatops/README.md).`
Document the need for feature flags This adds a development guide explaining that we are going to use feature flags more often, why, what the benefits are, and so on. See https://gitlab.com/gitlab-org/gitlab-ce/issues/49619 for more information. 2018-09-03 11:35:44 -04:00
Add documentation reference for chatops feature flag commands 2019-03-21 10:19:00 -04:00			`For an up to date list of feature flag commands please see [the source`
			`code](https://gitlab.com/gitlab-com/chatops/blob/master/lib/chatops/commands/feature.rb).`
			`Note that all the examples in that file must be preceded by`
			`/chatops run`.

			`If you get an error "Whoops! This action is not allowed. This incident`
			`will be reported." that means your Slack account is not allowed to`
			`change feature flags. To test if you are allowed to do anything at all,`
			`run:`

			```
			`/chatops run feature --help`
			```

Document the need for feature flags This adds a development guide explaining that we are going to use feature flags more often, why, what the benefits are, and so on. See https://gitlab.com/gitlab-org/gitlab-ce/issues/49619 for more information. 2018-09-03 11:35:44 -04:00			`For example, to enable a feature for 25% of all users, run the following in`
			`Slack:`

			```
			`/chatops run feature set new_navigation_bar 25`
			```

			This will enable the feature for GitLab.com, with `new_navigation_bar` being the
			`name of the feature. We can also enable the feature for <https://dev.gitlab.org>`
			`or <https://staging.gitlab.com>:`

			```
			`/chatops run feature set new_navigation_bar 25 --dev`
			`/chatops run feature set new_navigation_bar 25 --staging`
			```

			`If you are not certain what percentages to use, simply use the following steps:`

			`1. 25%`
			`1. 50%`
			`1. 75%`
			`1. 100%`

			`Between every step you'll want to wait a little while and monitor the`
			`appropriate graphs on <https://dashboards.gitlab.net>. The exact time to wait`
			`may differ. For some features a few minutes is enough, while for others you may`
			`want to wait several hours or even days. This is entirely up to you, just make`
			`sure it is clearly communicated to your team, and the Production team if you`
			`anticipate any potential problems.`

Document ChatOps for actor feature gates The documentations only covered the percentage based feature gates. These don't work really well for a number of reasons, for example; a new visual element in GitLab would be weird to see only 25% of the time. Allowing feature gates based on projects and groups has been supported, now it's documented. 2019-03-06 10:11:54 -05:00			`Feature gates can also be actor based, for example a feature could first be`
			enabled for only the `gitlab-ce` project. The project is passed by supplying a
			`--project` flag:

			```
			`/chatops run feature set --project=gitlab-org/gitlab-ce some_feature true`
			```

			For groups the `--group` flag is available:

			```
			`/chatops run feature set --group=gitlab-org some_feature true`
			```

Document the need for feature flags This adds a development guide explaining that we are going to use feature flags more often, why, what the benefits are, and so on. See https://gitlab.com/gitlab-org/gitlab-ce/issues/49619 for more information. 2018-09-03 11:35:44 -04:00			`Once a change is deemed stable, submit a new merge request to remove the`
			`feature flag. This ensures the change is available to all users and self-hosted`
			`instances. Make sure to add the ~"feature flag" label to this merge request so`
			`release managers are aware the changes are hidden behind a feature flag. If the`
			`merge request has to be picked into a stable branch (e.g. after the 7th), make`
			`sure to also add the appropriate "Pick into X" label (e.g. "Pick into 11.4").`

			`One might be tempted to think this will delay the release of a feature by at`
			`least one month (= one release). This is not the case. A feature flag does not`
			`have to stick around for a specific amount of time (e.g. at least one release),`
			`instead they should stick around until the feature is deemed stable. Stable`
			`means it works on GitLab.com without causing any problems, such as outages. In`
			`most cases this will translate to a feature (with a feature flag) being shipped`
			`in RC1, followed by the feature flag being removed in RC2. This in turn means`
			`the feature will be stable by the time we publish a stable package around the`
			`22nd of the month.`
Documentation for feature flags defaulting to on Closes https://gitlab.com/gitlab-org/gitlab-ee/issues/7883 2018-10-05 12:12:24 -04:00
Document implicit feature flag checks 2018-10-29 15:54:35 -04:00			`## Implicit feature flags`
Documentation for feature flags defaulting to on Closes https://gitlab.com/gitlab-org/gitlab-ee/issues/7883 2018-10-05 12:12:24 -04:00
Document implicit feature flag checks 2018-10-29 15:54:35 -04:00			The [`Project#feature_available?`][project-fa],
Documentation for feature flags defaulting to on Closes https://gitlab.com/gitlab-org/gitlab-ee/issues/7883 2018-10-05 12:12:24 -04:00			[`Namespace#feature_available?`][namespace-fa] (EE), and
Document implicit feature flag checks 2018-10-29 15:54:35 -04:00			[`License.feature_available?`][license-fa] (EE) methods all implicitly check for
			`a feature flag by the same name as the provided argument.`

			`For example if a feature is license-gated, there's no need to add an additional`
			`explicit feature flag check since the flag will be checked as part of the`
			`License.feature_available?` call. Similarly, there's no need to "clean up" a
			`feature flag once the feature has reached general availability.`

			You'd still want to use an explicit `Feature.enabled?` check if your new feature
			`isn't gated by a License or Plan.`

			`[project-fa]: https://gitlab.com/gitlab-org/gitlab-ee/blob/4cc1c62918aa4c31750cb21dfb1a6c3492d71080/app/models/project_feature.rb#L63-68`
			`[namespace-fa]: https://gitlab.com/gitlab-org/gitlab-ee/blob/4cc1c62918aa4c31750cb21dfb1a6c3492d71080/ee/app/models/ee/namespace.rb#L71-85`
			`[license-fa]: https://gitlab.com/gitlab-org/gitlab-ee/blob/4cc1c62918aa4c31750cb21dfb1a6c3492d71080/ee/app/models/license.rb#L293-300`

			`### Undefined feature flags default to "on"`

Update invalid anchor link 2019-06-03 12:04:07 -04:00			`An important side-effect of the [implicit feature flags](#implicit-feature-flags)`
			`mentioned above is that unless the feature is explicitly disabled or limited to a`
			percentage of users, the feature flag check will default to `true`.
Documentation for feature flags defaulting to on Closes https://gitlab.com/gitlab-org/gitlab-ee/issues/7883 2018-10-05 12:12:24 -04:00
			`As an example, if you were to ship the backend half of a feature behind a flag,`
			`you'd want to explicitly disable that flag until the frontend half is also ready`
			`to be shipped. You can do this via ChatOps:`

			```
			`/chatops run feature set some_feature 0`
			```

			`Note that you can do this at any time, even before the merge request using the`
			`flag has been merged!`
Document ChatOps for actor feature gates The documentations only covered the percentage based feature gates. These don't work really well for a number of reasons, for example; a new visual element in GitLab would be weird to see only 25% of the time. Allowing feature gates based on projects and groups has been supported, now it's documented. 2019-03-06 10:11:54 -05:00
			`### Cleaning up`

			`When a feature gate has been removed from the code base, the value still exists`
			`in the database. This can be removed through ChatOps:`

			```
			`/chatops run feature delete some_feature`
			```