2019-09-24 02:06:02 -04:00
---
2020-10-30 20:09:08 -04:00
stage: none
group: unassigned
2020-11-26 01:09:20 -05:00
info: To determine the technical writer assigned to the Stage/Group associated with this page, see https://about.gitlab.com/handbook/engineering/ux/technical-writing/#assignments
2019-09-24 02:06:02 -04:00
type: reference
---
# Testing Rails migrations at GitLab
In order to reliably check Rails migrations, we need to test them against
a database schema.
## When to write a migration test
- Post migrations (`/db/post_migrate`) and background migrations
(`lib/gitlab/background_migration`) **must** have migration tests performed.
- If your migration is a data migration then it **must** have a migration test.
- Other migrations may have a migration test if necessary.
## How does it work?
Adding a `:migration` tag to a test signature enables some custom RSpec
`before` and `after` hooks in our
2020-03-11 17:09:19 -04:00
[`spec/support/migration.rb` ](https://gitlab.com/gitlab-org/gitlab/-/blob/f81fa6ab1dd788b70ef44b85aaba1f31ffafae7d/spec/support/migration.rb )
2019-09-24 02:06:02 -04:00
to run.
2020-12-14 13:09:48 -05:00
A `before` hook reverts all migrations to the point that a migration
2019-09-24 02:06:02 -04:00
under test is not yet migrated.
2020-12-14 13:09:48 -05:00
In other words, our custom RSpec hooks finds a previous migration, and
2019-09-24 02:06:02 -04:00
migrate the database **down** to the previous migration version.
With this approach you can test a migration against a database schema.
2021-02-04 16:09:06 -05:00
An `after` hook migrates the database **up** and restores the latest
2019-09-24 02:06:02 -04:00
schema version, so that the process does not affect subsequent specs and
ensures proper isolation.
## Testing an `ActiveRecord::Migration` class
To test an `ActiveRecord::Migration` class (i.e., a
regular migration `db/migrate` or a post-migration `db/post_migrate` ), you
2020-12-14 13:09:48 -05:00
must load the migration file by using the `require_migration!` helper
2020-08-19 17:10:32 -04:00
method because it is not autoloaded by Rails.
Example:
2019-09-24 02:06:02 -04:00
```ruby
2020-08-19 17:10:32 -04:00
require 'spec_helper'
require_migration!
RSpec.describe ...
2019-09-24 02:06:02 -04:00
```
2019-09-26 20:06:23 -04:00
### Test helpers
2020-08-19 17:10:32 -04:00
#### `require_migration!`
2020-12-14 13:09:48 -05:00
Since the migration files are not autoloaded by Rails, you must manually
2020-08-19 17:10:32 -04:00
load the migration file. To do so, you can use the `require_migration!` helper method
2020-12-11 01:10:17 -05:00
which can automatically load the correct migration file based on the spec filename.
2020-08-19 17:10:32 -04:00
2021-10-18 20:10:29 -04:00
In GitLab 14.4 and later, you can use `require_migration!` to load migration files from spec files
that contain the schema version in the filename (for example,
`2021101412150000_populate_foo_column_spec.rb` ).
2020-08-19 17:10:32 -04:00
2021-10-18 20:10:29 -04:00
```ruby
# frozen_string_literal: true
require 'spec_helper'
require_migration!
RSpec.describe PopulateFooColumn do
...
end
```
In some cases, you must require multiple migration files to use them in your specs. Here, there's no
pattern between your spec file and the other migration file. You can provide the migration filename
like so:
2020-08-19 17:10:32 -04:00
```ruby
2021-10-18 20:10:29 -04:00
# frozen_string_literal: true
require 'spec_helper'
require_migration!
require_migration!('populate_bar_column')
RSpec.describe PopulateFooColumn do
...
end
2020-08-19 17:10:32 -04:00
```
2019-09-26 20:06:23 -04:00
#### `table`
2019-09-24 02:06:02 -04:00
Use the `table` helper to create a temporary `ActiveRecord::Base` -derived model
2020-02-25 04:09:10 -05:00
for a table. [FactoryBot ](best_practices.md#factories )
2020-11-26 07:09:48 -05:00
**should not** be used to create data for migration specs because it relies on
application code which can change after the migration has run, and cause the test
to fail. For example, to create a record in the `projects` table:
2019-09-24 02:06:02 -04:00
```ruby
project = table(:projects).create!(id: 1, name: 'gitlab1', path: 'gitlab1')
```
2019-09-26 20:06:23 -04:00
#### `migrate!`
2020-12-14 13:09:48 -05:00
Use the `migrate!` helper to run the migration that is under test. It
runs the migration and bumps the schema version in the `schema_migrations`
2019-09-24 02:06:02 -04:00
table. It is necessary because in the `after` hook we trigger the rest of
the migrations, and we need to know where to start. Example:
```ruby
it 'migrates successfully' do
# ... pre-migration expectations
migrate!
# ... post-migration expectations
end
```
2019-09-26 20:06:23 -04:00
#### `reversible_migration`
Use the `reversible_migration` helper to test migrations with either a
2020-12-14 13:09:48 -05:00
`change` or both `up` and `down` hooks. This tests that the state of
2019-09-26 20:06:23 -04:00
the application and its data after the migration becomes reversed is the
same as it was before the migration ran in the first place. The helper:
1. Runs the `before` expectations before the **up** migration.
1. Migrates **up** .
1. Runs the `after` expectations.
1. Migrates **down** .
1. Runs the `before` expectations a second time.
Example:
```ruby
reversible_migration do |migration|
migration.before -> {
# ... pre-migration expectations
}
migration.after -> {
# ... post-migration expectations
}
end
```
2021-08-13 14:09:11 -04:00
### Custom matchers for post-deployment migrations
We have some custom matchers in
[`spec/support/matchers/background_migrations_matchers.rb` ](https://gitlab.com/gitlab-org/gitlab/blob/v14.1.0-ee/spec/support/matchers/background_migrations_matchers.rb )
to verify background migrations were correctly scheduled from a post-deployment migration, and
receive the correct number of arguments.
All of them use the internal matcher `be_background_migration_with_arguments` , which verifies that
the `#perform` method on your migration class doesn't crash when receiving the provided arguments.
#### `be_scheduled_migration`
Verifies that a Sidekiq job was queued with the expected class and arguments.
This matcher usually makes sense if you're queueing jobs manually, rather than going through our helpers.
```ruby
# Migration
BackgroundMigrationWorker.perform_async('MigrationClass', args)
# Spec
expect('MigrationClass').to be_scheduled_migration(*args)
```
#### `be_scheduled_migration_with_multiple_args`
Verifies that a Sidekiq job was queued with the expected class and arguments.
This works the same as `be_scheduled_migration` , except that the order is ignored when comparing
array arguments.
```ruby
# Migration
BackgroundMigrationWorker.perform_async('MigrationClass', ['foo', [3, 2, 1]])
# Spec
expect('MigrationClass').to be_scheduled_migration_with_multiple_args('foo', [1, 2, 3])
```
#### `be_scheduled_delayed_migration`
Verifies that a Sidekiq job was queued with the expected delay, class, and arguments.
This can also be used with `queue_background_migration_jobs_by_range_at_intervals` and related helpers.
```ruby
# Migration
BackgroundMigrationWorker.perform_in(delay, 'MigrationClass', args)
# Spec
expect('MigrationClass').to be_scheduled_delayed_migration(delay, *args)
```
#### `have_scheduled_batched_migration`
Verifies that a `BatchedMigration` record was created with the expected class and arguments.
The `*args` are additional arguments passed to the `MigrationClass` , while `**kwargs` are any other
attributes to be verified on the `BatchedMigration` record (Example: `interval: 2.minutes` ).
```ruby
# Migration
queue_batched_background_migration(
'MigrationClass',
table_name,
column_name,
*args,
**kwargs
)
# Spec
expect('MigrationClass').to have_scheduled_batched_migration(
table_name: table_name,
column_name: column_name,
job_arguments: args,
**kwargs
)
```
### Examples of migration tests
Migration tests depend on what the migration does exactly, the most common types are data migrations and scheduling background migrations.
#### Example of a data migration test
2019-09-24 02:06:02 -04:00
This spec tests the
2019-11-21 10:06:17 -05:00
[`db/post_migrate/20170526185842_migrate_pipeline_stages.rb` ](https://gitlab.com/gitlab-org/gitlab-foss/blob/v11.6.5/db/post_migrate/20170526185842_migrate_pipeline_stages.rb )
2019-09-24 02:06:02 -04:00
migration. You can find the complete spec in
2019-11-21 10:06:17 -05:00
[`spec/migrations/migrate_pipeline_stages_spec.rb` ](https://gitlab.com/gitlab-org/gitlab-foss/blob/v11.6.5/spec/migrations/migrate_pipeline_stages_spec.rb ).
2019-09-24 02:06:02 -04:00
```ruby
require 'spec_helper'
2020-08-19 17:10:32 -04:00
require_migration!
2019-09-24 02:06:02 -04:00
2020-06-22 17:08:42 -04:00
RSpec.describe MigratePipelineStages do
2019-09-24 02:06:02 -04:00
# Create test data - pipeline and CI/CD jobs.
let(:jobs) { table(:ci_builds) }
let(:stages) { table(:ci_stages) }
let(:pipelines) { table(:ci_pipelines) }
let(:projects) { table(:projects) }
before do
projects.create!(id: 123, name: 'gitlab1', path: 'gitlab1')
pipelines.create!(id: 1, project_id: 123, ref: 'master', sha: 'adf43c3a')
jobs.create!(id: 1, commit_id: 1, project_id: 123, stage_idx: 2, stage: 'build')
jobs.create!(id: 2, commit_id: 1, project_id: 123, stage_idx: 1, stage: 'test')
end
2019-09-26 20:06:23 -04:00
# Test just the up migration.
2019-09-24 02:06:02 -04:00
it 'correctly migrates pipeline stages' do
expect(stages.count).to be_zero
migrate!
expect(stages.count).to eq 2
expect(stages.all.pluck(:name)).to match_array %w[test build]
end
2019-09-26 20:06:23 -04:00
# Test a reversible migration.
it 'correctly migrates up and down pipeline stages' do
reversible_migration do |migration|
# Expectations will run before the up migration,
# and then again after the down migration
migration.before -> {
expect(stages.count).to be_zero
}
# Expectations will run after the up migration.
migration.after -> {
expect(stages.count).to eq 2
expect(stages.all.pluck(:name)).to match_array %w[test build]
}
end
2019-09-24 02:06:02 -04:00
end
```
2021-08-13 14:09:11 -04:00
#### Example of a background migration scheduling test
To test these you usually have to:
- Create some records.
- Run the migration.
- Verify that the expected jobs were scheduled, with the correct set
of records, the correct batch size, interval, etc.
The behavior of the background migration itself needs to be verified in a [separate
test for the background migration class](#example-background-migration-test).
This spec tests the
[`db/post_migrate/20210701111909_backfill_issues_upvotes_count.rb` ](https://gitlab.com/gitlab-org/gitlab/-/blob/v14.1.0-ee/db/post_migrate/20210701111909_backfill_issues_upvotes_count.rb )
post-deployment migration. You can find the complete spec in
[`spec/migrations/backfill_issues_upvotes_count_spec.rb` ](https://gitlab.com/gitlab-org/gitlab/blob/v14.1.0-ee/spec/spec/migrations/backfill_issues_upvotes_count_spec.rb ).
```ruby
require 'spec_helper'
require_migration!
RSpec.describe BackfillIssuesUpvotesCount do
let(:migration) { described_class.new }
let(:issues) { table(:issues) }
let(:award_emoji) { table(:award_emoji) }
let!(:issue1) { issues.create! }
let!(:issue2) { issues.create! }
let!(:issue3) { issues.create! }
let!(:issue4) { issues.create! }
let!(:issue4_without_thumbsup) { issues.create! }
let!(:award_emoji1) { award_emoji.create!( name: 'thumbsup', awardable_type: 'Issue', awardable_id: issue1.id) }
let!(:award_emoji2) { award_emoji.create!( name: 'thumbsup', awardable_type: 'Issue', awardable_id: issue2.id) }
let!(:award_emoji3) { award_emoji.create!( name: 'thumbsup', awardable_type: 'Issue', awardable_id: issue3.id) }
let!(:award_emoji4) { award_emoji.create!( name: 'thumbsup', awardable_type: 'Issue', awardable_id: issue4.id) }
it 'correctly schedules background migrations', :aggregate_failures do
stub_const("#{described_class.name}::BATCH_SIZE", 2)
Sidekiq::Testing.fake! do
freeze_time do
migrate!
expect(described_class::MIGRATION).to be_scheduled_migration(issue1.id, issue2.id)
expect(described_class::MIGRATION).to be_scheduled_migration(issue3.id, issue4.id)
expect(BackgroundMigrationWorker.jobs.size).to eq(2)
end
end
end
end
```
2019-09-24 02:06:02 -04:00
## Testing a non-`ActiveRecord::Migration` class
To test a non-`ActiveRecord::Migration` test (a background migration),
2020-12-14 13:09:48 -05:00
you must manually provide a required schema version. Please add a
2020-03-17 17:09:16 -04:00
`schema` tag to a context that you want to switch the database schema within.
If not set, `schema` defaults to `:latest` .
2019-09-24 02:06:02 -04:00
Example:
```ruby
2020-03-11 17:09:19 -04:00
describe SomeClass, schema: 20170608152748 do
2019-09-24 02:06:02 -04:00
# ...
end
```
### Example background migration test
This spec tests the
2019-11-21 10:06:17 -05:00
[`lib/gitlab/background_migration/archive_legacy_traces.rb` ](https://gitlab.com/gitlab-org/gitlab-foss/blob/v11.6.5/lib/gitlab/background_migration/archive_legacy_traces.rb )
2019-09-24 02:06:02 -04:00
background migration. You can find the complete spec on
2019-11-21 10:06:17 -05:00
[`spec/lib/gitlab/background_migration/archive_legacy_traces_spec.rb` ](https://gitlab.com/gitlab-org/gitlab-foss/blob/v11.6.5/spec/lib/gitlab/background_migration/archive_legacy_traces_spec.rb )
2019-09-24 02:06:02 -04:00
```ruby
require 'spec_helper'
2020-03-11 17:09:19 -04:00
describe Gitlab::BackgroundMigration::ArchiveLegacyTraces, schema: 20180529152628 do
2019-09-24 02:06:02 -04:00
include TraceHelpers
let(:namespaces) { table(:namespaces) }
let(:projects) { table(:projects) }
let(:builds) { table(:ci_builds) }
let(:job_artifacts) { table(:ci_job_artifacts) }
before do
namespaces.create!(id: 123, name: 'gitlab1', path: 'gitlab1')
projects.create!(id: 123, name: 'gitlab1', path: 'gitlab1', namespace_id: 123)
@build = builds.create!(id: 1, project_id: 123, status: 'success', type: 'Ci::Build')
end
context 'when trace file exists at the right place' do
before do
create_legacy_trace(@build, 'trace in file')
end
it 'correctly archive legacy traces' do
expect(job_artifacts.count).to eq(0)
expect(File.exist?(legacy_trace_path(@build))).to be_truthy
described_class.new.perform(1, 1)
expect(job_artifacts.count).to eq(1)
expect(File.exist?(legacy_trace_path(@build))).to be_falsy
expect(File.read(archived_trace_path(job_artifacts.first))).to eq('trace in file')
end
end
end
```
These tests do not run within a database transaction, as we use a deletion database
cleanup strategy. Do not depend on a transaction being present.