92 lines
No EOL
3.4 KiB
Markdown
92 lines
No EOL
3.4 KiB
Markdown
# Migration Style Guide
|
|
|
|
When writing migrations for GitLab, you have to take into account that
|
|
these will be ran by hundreds of thousands of organizations of all sizes, some with
|
|
many years of data in their database.
|
|
|
|
In addition, having to take a server offline for a an upgrade small or big is
|
|
a big burden for most organizations. For this reason it is important that your
|
|
migrations are written carefully, can be applied online and adhere to the style guide below.
|
|
|
|
It's advised to have offline migrations only in major GitLab releases.
|
|
|
|
When writing your migrations, also consider that databases might have stale data
|
|
or inconsistencies and guard for that. Try to make as little assumptions as possible
|
|
about the state of the database.
|
|
|
|
Please don't depend on GitLab specific code since it can change in future versions.
|
|
If needed copy-paste GitLab code into the migration to make it forward compatible.
|
|
|
|
## Comments in the migration
|
|
|
|
Each migration you write needs to have the two following pieces of information
|
|
as comments.
|
|
|
|
### Online, Offline, errors?
|
|
|
|
First, you need to provide information on whether the migration can be applied:
|
|
|
|
1. online without errors (works on previous version and new one)
|
|
2. online with errors on old instances after migrating
|
|
3. online with errors on new instances while migrating
|
|
4. offline (needs to happen without app servers to prevent db corruption)
|
|
|
|
It is always preferable to have a migration run online. If you expect the migration
|
|
to take particularly long (for instance, if it loops through all notes),
|
|
this is valuable information to add.
|
|
|
|
If you don't provide the information it means that a migration is safe to run online.
|
|
|
|
### Reversibility
|
|
|
|
Your migration should be reversible. This is very important, as it should
|
|
be possible to downgrade in case of a vulnerability or bugs.
|
|
|
|
In your migration, add a comment describing how the reversibility of the
|
|
migration was tested.
|
|
|
|
|
|
## Removing indices
|
|
|
|
If you need to remove index, please add a condition like in following example:
|
|
|
|
```
|
|
remove_index :namespaces, column: :name if index_exists?(:namespaces, :name)
|
|
```
|
|
|
|
## Adding indices
|
|
|
|
If you need to add an unique index please keep in mind there is possibility of existing duplicates. If it is possible write a separate migration for handling this situation. It can be just removing or removing with overwriting all references to these duplicates depend on situation.
|
|
|
|
## Testing
|
|
|
|
Make sure that your migration works with MySQL and PostgreSQL with data. An empty database does not guarantee that your migration is correct.
|
|
|
|
Make sure your migration can be reversed.
|
|
|
|
## Data migration
|
|
|
|
Please prefer Arel and plain SQL over usual ActiveRecord syntax. In case of using plain SQL you need to quote all input manually with `quote_string` helper.
|
|
|
|
Example with Arel:
|
|
|
|
```
|
|
users = Arel::Table.new(:users)
|
|
users.group(users[:user_id]).having(users[:id].count.gt(5))
|
|
|
|
#updtae other tables with this results
|
|
```
|
|
|
|
Example with plain SQL and `quote_string` helper:
|
|
|
|
```
|
|
select_all("SELECT name, COUNT(id) as cnt FROM tags GROUP BY name HAVING COUNT(id) > 1").each do |tag|
|
|
tag_name = quote_string(tag["name"])
|
|
duplicate_ids = select_all("SELECT id FROM tags WHERE name = '#{tag_name}'").map{|tag| tag["id"]}
|
|
origin_tag_id = duplicate_ids.first
|
|
duplicate_ids.delete origin_tag_id
|
|
|
|
execute("UPDATE taggings SET tag_id = #{origin_tag_id} WHERE tag_id IN(#{duplicate_ids.join(",")})")
|
|
execute("DELETE FROM tags WHERE id IN(#{duplicate_ids.join(",")})")
|
|
end
|
|
``` |