
sql-blog-migration (github.com/antonio-alexander/go-blog-data-consistency)

This is a companion repository that's meant to serve as a deep dive into the basics of migration (with my own spin). The goal of this repository is to show some solutions for database migration and describe lessons I learn along the way.

DISCLAIMER: the sole database technology used by this repository is MySQL; different DBMSs have different rules, and we're ONLY taking the rules of MySQL/MariaDB into account. In addition, this repository is written from my perspective as a developer, rather than from the perspective of a database administrator/architect.

Migration is one of three administrative operations for databases; it involves executing sql or commands that modify data, schemas and/or tables using DDL (data definition language). A migration is by definition NECESSARY: something has occurred where the data HAS to change. The other two administrative operations are upgrades (e.g., upgrading the version of MySQL) and maintenance (e.g., vacuuming the database tables). One doesn't simply do a migration; not all migrations are painful, and some (if done correctly) are just tedious. Here are some situations where migrations are required (a few quick sketches follow the list):

  • adding new column (with default)
  • increasing column size
  • adding or removing a table index
  • adding new table
  • adding new constraints (with some caveats)
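
To give a quick flavor of what a few of these look like in MySQL, here are some sketches against a hypothetical employees table (we'll design the real one below); treat them as illustrations rather than a recipe:

-- adding a new column (with a default)
ALTER TABLE employees ADD COLUMN middle_name VARCHAR(50) DEFAULT '';

-- increasing a column's size
ALTER TABLE employees MODIFY COLUMN email_address VARCHAR(100) NOT NULL;

-- adding (and removing) a table index
CREATE INDEX idx_last_name ON employees(last_name);
DROP INDEX idx_last_name ON employees;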

For purposes of this repository, we'll assume that although a migration can be reverted, in practice you'll never revert a successful migration. As such, even if a migration is backwards compatible, that shouldn't be a focus. It's one thing to delay a migration (maybe it's an offline database, or you want to migrate when you update the software), but a migration is expected to be necessary and never undone once it's been successfully verified.

I also want to keep "offline" databases in mind; for purposes of this discussion, an "offline" database is one that isn't being actively maintained. It could be a database used by an application that has to be migrated at an irregular cadence (e.g., when the user wants to update the application), or it could be a database that generally runs without any internet connectivity for extended periods of time and can only be updated at opportune moments. Offline databases are a bit unique because the migration almost requires automation.

Bibliography

These are some links I found that helped me figure out how to do something or gave me context as to why something really mattered.

Getting Started

Unfortunately, this repository is NOT simple, BUT a lot of the work has already been done for you. I've tried to simplify some of the common operations you'll want to do (so you don't have to figure them out). From the root of the repository, you can do the following to set up your environment (you'll need Docker and Go):

make check-goose
make build
make run

This will build and run the custom mysql image included in this repository; if you're curious about what the make commands are doing, you can look at the Makefile. Once the mysql image is running, you can use the commands below to interact with it.

Interactive MySQL shell (find credentials in .env and docker-compose.yml):

docker exec -it mysql mysql -u<USERNAME> -p<PASSWORD>

If you'd like to attempt to use goose to perform migration, you can enter one (or a combination) of these commands. To get the migration status, you can run:

make goose-status

or

docker exec -it mysql goose mysql "root:<PASSWORD>@tcp(localhost:3306)/sql_blog_migration?parseTime=true&&multiStatements=true" status

The connection strings used above are a bit complex/esoteric (and required); just copy+paste them.

Data Architecture (Domain Analysis)

For any table that's going to get anything more than cursory use, a domain analysis must be done to determine how the table should be architected to meet its common use cases and survive along with the application. Although applications often go through major changes, databases with proper domain analysis generally don't. Although it may sound a bit draconian, the same is true for applications: breaking contracts, or changing the function of an application enough that your domain analysis becomes invalid, is a big deal and should be avoided at all costs.

In general, through domain analysis, you want to have enough foresight that as you need to change your data, you can make forward-compatible changes to it. Data, unlike APIs/contracts, is almost ALWAYS only forward compatible. Once you modify the schema (how the data is organized) and add/change data, you've created a change that you can't undo without destroying data; this is my underlying reason why changes to the schema/data should ONLY ever be forward compatible, and why reverting them is a big, big, big no-no.

I won't lie to you and tell you that you can mitigate data/schema changes such that you never have to migrate, but by showing some examples of data and schema changes, I can provide some context to help you better understand when a migration is required. Let's start with this contract: first in JSON, then the same thing in SQL.

{
    "employee" : {
        "id": "86fa2f09-d260-11ec-bd5d-0242c0a8e002",
        "first_name": "John",
        "last_name": "Connor",
        "email_address": "[email protected]",
        "last_updated": 1652417242000,
        "last_updated_by": "sql-blog-migration",
        "version": 1
    }
}

This is an "employee" which in terms of domain analysis (from the womb to the tomb); it should:

  • an employee will be created
  • that employee will be "unique" via their email address
  • we may need to add more properties to the employee in the future, but are certain that the email address will be the natural key for an employee
  • we want to avoid a situation where email addresses are used "on the wire" for applications involving employees, so we want a unique id that will serve as a surrogate key and be safe to travel on the wire
  • we have a desire to be able to audit when a given employee was last modified (and knowledge of what was modified); we've included the audit fields last_updated, last_updated_by and version to this effect
  • we will start with the fields first name, last name and email address
  • we expect to search for an employee via first name, last name and email address, but expect email address to be the only field that will always return one or zero rows
  • we expect that when an employee leaves the company, their employee record will be deleted (and ostensibly their email address can be re-used within the context of the database)
  • we expect that we're at the mercy of HR, but will do our best to adhere to our initial domain analysis

With this in mind, we can put together the following sql:

-- DROP DATABASE IF EXISTS sql_blog_migration;
CREATE DATABASE IF NOT EXISTS sql_blog_migration;

-- use the database
USE sql_blog_migration;

-- DROP TABLE IF EXISTS employees;
CREATE TABLE IF NOT EXISTS employees (
    id VARCHAR(36) PRIMARY KEY NOT NULL DEFAULT (UUID()),
    first_name VARCHAR(50) DEFAULT '',
    last_name VARCHAR(50) DEFAULT '',
    email_address VARCHAR(50) NOT NULL,
    aux_id BIGINT AUTO_INCREMENT,
    version INT NOT NULL DEFAULT 1,
    last_updated DATETIME(6) NOT NULL DEFAULT CURRENT_TIMESTAMP(6),
    last_updated_by VARCHAR(25) NOT NULL DEFAULT CURRENT_USER,
    UNIQUE(email_address),
    INDEX idx_aux_id (aux_id)
) ENGINE = InnoDB;

-- DROP TABLE IF EXISTS employees_audit;
CREATE TABLE IF NOT EXISTS employees_audit (
    employee_id VARCHAR(36) NOT NULL,
    first_name TEXT,
    last_name TEXT,
    email_address TEXT,
    version INT NOT NULL,
    last_updated DATETIME(6) NOT NULL,
    last_updated_by TEXT NOT NULL,
    PRIMARY KEY (employee_id, version),
    FOREIGN KEY (employee_id) REFERENCES employees(id) ON DELETE CASCADE
) ENGINE = InnoDB;

-- DROP TRIGGER IF EXISTS employees_audit_info_update;
CREATE TRIGGER employees_audit_info_update
BEFORE UPDATE ON employees FOR EACH ROW
    SET new.id = old.id, new.aux_id = old.aux_id, new.version = old.version+1, new.last_updated = CURRENT_TIMESTAMP(6), new.last_updated_by = CURRENT_USER;

-- DROP TRIGGER IF EXISTS employees_audit_insert;
CREATE TRIGGER employees_audit_insert
AFTER INSERT ON employees FOR EACH ROW
    INSERT INTO employees_audit(employee_id, first_name, last_name, email_address, version, last_updated, last_updated_by)
     VALUES (new.id, new.first_name,  new.last_name, new.email_address, new.version, new.last_updated, new.last_updated_by);

-- DROP TRIGGER IF EXISTS employees_audit_update;
CREATE TRIGGER employees_audit_update
AFTER UPDATE ON employees FOR EACH ROW
    INSERT INTO employees_audit(employee_id, first_name, last_name, email_address, version, last_updated, last_updated_by)
     VALUES(new.id, new.first_name,  new.last_name, new.email_address, new.version, new.last_updated, new.last_updated_by);

The above sql implements the domain analysis we did earlier:

  • the employees table provides an id, first/last name and email address (along with audit info) for a given employee
  • id being a uuid/guid and a primary key ensures that this surrogate key is unique for every row
  • the unique clause on email addresses ensures that no two employees can have the same email address
  • the employees_audit table keeps track of individual mutations of every employee; its primary key is a combination of the employee id and version
  • the triggers ensure that every time an employee is inserted or updated, the employees_audit table has an updated row

You can "inject" some data by executing the following sql:

INSERT INTO
    employees(first_name, last_name, email_address)
VALUES
    ('Isshin', 'Kurosaki', '[email protected]'),
    ('Masaki', 'Kurosaki', '[email protected]'),
    ('Ichigo', 'Kurosaki', '[email protected]'),
    ('Karin', 'Kurosaki', '[email protected]'),
    ('Yuzu', 'Kurosaki', '[email protected]'),
    ('Orihime', 'Inoue', '[email protected]'),
    ('Kazui', 'Kurosaki', '[email protected]');

This will be our starting point for when we discuss manual and automatic migration.
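
If you'd like to confirm that the triggers are doing their job, a quick sanity check (just a suggestion, not part of the repository's tooling) is to confirm that every employee you just inserted has a version 1 row in the audit table:

SELECT employee_id, first_name, last_name, version, last_updated_by
 FROM employees_audit
 ORDER BY last_updated;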

Migration Do's and Don'ts

As you (like me) may be very new to migrations, I wanted to go over some basic do's and don'ts for database migration. To provide some context or glue, all of the do's and don'ts are motivated by the idea that you:

  • shouldn't perform any operations that destroy data
  • shouldn't perform any operations that can potentially destroy relationships
  • shouldn't perform any operations that put the database in an inconsistent state for an extended period of time (unnecessarily)

The kinds of operations that can destroy data are:

  • column/table drops
  • modification of a column type such that it truncates the data
  • modification of a column type such that its data is no longer compatible

The kinds of operations that can destroy relationships are:

  • modification of any key or id (i.e., foreign/primary keys)
  • moving a table to another database (MySQL) or schema (Postgres)

I'm not saying that it's impossible to do any of the above; I'm just saying that whenever you perform a migration that can potentially destroy data or relationships, you create opportunities to be unsuccessful.

Things that are within the realm of reason when it comes to migrations are:

  • adding new [nullable] column
  • increasing column size
  • adding/removing index
  • adding constraints (with some caveats)

Again, these operations are safe(r) because they are less likely to destroy data and there's less opportunity for significant database downtime; certain operations can cause the database/table to be unavailable while the change is occurring.

Finally, one of the biggest don'ts for a migration is migrating down. Once you've performed the migration, verified that it works as expected and unlocked the tables, you can't easily (and shouldn't) revert the changes. This is an important consideration during the domain analysis that comes with performing a migration.

Backing up the Database

From my IT background, I know that it's VERY important to ALWAYS back up before doing anything...so you have a way to attempt to undo what you did. A backup will ensure that your data is "correct", but that doesn't always mean it's simple to restore; sometimes it may take longer to perform a restore than to successfully complete the migration.

This command can be used to backup the sql_blog_migration database:

mysqldump -u root -p sql_blog_migration > /tmp/sql_blog_migration.sql
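
If you ever need to fall back to that backup, the restore is roughly the reverse of the dump (a sketch; it assumes the dump file above and will overwrite the current contents of the database):

mysql -u root -p sql_blog_migration < /tmp/sql_blog_migration.sql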

Manual Migration

The purpose of this section is to show how we'd perform a migration manually. I come from a background where there was only first/second-party support, and in general, when you wanted a tool or functionality, it was much faster to build your own than to look for someone else's solution and tweak it for your use case. As Golang has MUUUCH better support than LabVIEW, I don't have the same problem anymore, but the evolution of that habit is that I always want to know how I'd do something manually, just in case.

This section is going to be use-case driven and will only go over the migrations that are practical/safe. I KNOW there are migrations that are unsafe or just barely possible; to avoid distracting from the topic at hand and to avoid giving you bad habits, I won't cover them.

Manually: adding middle_name column

What if we wanted to add a new column to store an employee's middle name? We can "manually" make this change to the employees table by executing a number of steps:

  1. lock the tables to prevent modification while migrating
  2. update the employees table to add the middle_name column
  3. update the employees_audit table to add the middle_name column
  4. drop the existing insert trigger since it won't include middle_name
  5. re-create the trigger for insert
  6. drop the existing update trigger since it won't include middle_name
  7. re-create the trigger for update
  8. verify that the change doesn't break inserts/updates and that you can still select existing data
  9. unlock the tables so they can be modified

One of the caveats to keep in mind is that we HAVE to lock the tables; otherwise, you may get some inconsistency with the audit tables (any newly inserted/updated rows could be missed). This is what the SQL would look like:

-- lock the table to prevent modification while migrating
LOCK TABLES employees_audit WRITE, employees WRITE;

-- update the employees table to add middle_name column
ALTER TABLE employees ADD COLUMN middle_name VARCHAR(50) DEFAULT '';

-- update the employees_audit table to add middle_name column
ALTER TABLE employees_audit ADD COLUMN middle_name VARCHAR(50) DEFAULT '';

-- drop the existing insert trigger since it won't include middle_name
DROP TRIGGER employees_audit_insert;

-- re-create the trigger for insert
CREATE TRIGGER employees_audit_insert
AFTER INSERT ON employees FOR EACH ROW
    INSERT INTO employees_audit(employee_id, first_name, last_name, email_address, version, last_updated, last_updated_by, middle_name)
     VALUES (new.id, new.first_name, new.last_name, new.email_address, new.version, new.last_updated, new.last_updated_by, new.middle_name);

-- drop the existing update trigger since it won't include middle_name
DROP TRIGGER employees_audit_update;

-- re-create the trigger for update
CREATE TRIGGER employees_audit_update
AFTER UPDATE ON employees FOR EACH ROW
    INSERT INTO employees_audit(employee_id, first_name, last_name, email_address, version, last_updated, last_updated_by, middle_name)
     VALUES(new.id, new.first_name, new.last_name, new.email_address, new.version, new.last_updated, new.last_updated_by, new.middle_name);

-- perform verification that the change doesn't modify updates/inserts
-- and that you can select existing data

-- unlock the tables so it can be modified
UNLOCK TABLES;

The above solution is not the ONLY possible solution; there are a myriad of different approaches, but one technique you'll see employed often is ETL: Extract (Copy), Transform (Change), Load. It can reduce overall downtime and "automatically" do some of the background maintenance you'd otherwise have to do for tables (e.g., garbage collection). For certain changes to be "performant", you'd have to perform admin operations like vacuuming to "reclaim" space on disk; creating new tables and swapping the names of the old tables is significantly faster (in some cases).

Below, we'll create a migration that uses the ETL technique:

  1. lock the existing tables so they can't be modified while we work
  2. create a "new" employees table with the updated schema
  3. select from the employees table and insert into the "new" table
  4. create a "new" employees_audit table with the updated schema
  5. select from the employees_audit table and insert into the "new" table
  6. drop the insert/update triggers
  7. rename the "old" tables out of the way and the "new" tables into place
  8. re-create the triggers with the updated schemas
  9. verify, drop the "old" tables, and unlock

This is what the SQL would look like:

-- lock the tables so they can't be modified while we're
-- making changes
LOCK TABLES employees_audit WRITE, employees WRITE;

-- create a "new" employees table with the additional
-- column for middle-name
CREATE TABLE IF NOT EXISTS employees_new (
    id VARCHAR(36) PRIMARY KEY NOT NULL DEFAULT (UUID()),
    first_name VARCHAR(50) DEFAULT '',
    last_name VARCHAR(50) DEFAULT '',
    email_address VARCHAR(50) NOT NULL,
    aux_id BIGINT AUTO_INCREMENT,
    version INT NOT NULL DEFAULT 1,
    last_updated DATETIME(6) NOT NULL DEFAULT CURRENT_TIMESTAMP(6),
    last_updated_by VARCHAR(25) NOT NULL DEFAULT CURRENT_USER,
    middle_name VARCHAR(50) DEFAULT '',
    UNIQUE(email_address),
    INDEX idx_aux_id (aux_id)
) ENGINE = InnoDB;

-- lock the "new" table so there are no hiccups
LOCK TABLES employees_new WRITE;

-- select the data from the current employees table and insert into
-- the new employees table
INSERT INTO employees_new (id, first_name, last_name, email_address, aux_id, version, last_updated, last_updated_by)
 SELECT id, first_name, last_name, email_address, aux_id, version, last_updated, last_updated_by
 FROM employees;

-- create a "new" employees_audit table with the additional
-- column for middle-name
CREATE TABLE IF NOT EXISTS employees_audit_new (
    employee_id VARCHAR(36) NOT NULL,
    first_name VARCHAR(50),
    last_name VARCHAR(50),
    email_address VARCHAR(50),
    version INT NOT NULL,
    last_updated DATETIME(6) NOT NULL,
    last_updated_by VARCHAR(25) NOT NULL,
    middle_name VARCHAR(50),
    PRIMARY KEY (employee_id, version),
    FOREIGN KEY (employee_id) REFERENCES employees_new(id) ON DELETE CASCADE
) ENGINE = InnoDB;

-- lock the "new" table so there are no hiccups
LOCK TABLES employees_audit_new WRITE;

-- select the data from the current employees_audit table and insert into
-- the new employees_audit table
INSERT INTO employees_audit_new (employee_id, first_name, last_name, email_address, version, last_updated, last_updated_by)
 SELECT employee_id, first_name, last_name, email_address, version, last_updated, last_updated_by
 FROM employees_audit;

-- drop the existing triggers
DROP TRIGGER employees_audit_insert;
DROP TRIGGER employees_audit_update;

-- rename the tables
RENAME TABLE employees_audit TO employees_audit_old;
RENAME TABLE employees_audit_new TO employees_audit;
RENAME TABLE employees TO employees_old;
RENAME TABLE employees_new TO employees;

-- re-create the triggers
CREATE TRIGGER employees_audit_insert
AFTER INSERT ON employees FOR EACH ROW
    INSERT INTO employees_audit(employee_id, first_name, last_name, email_address, version, last_updated, last_updated_by, middle_name)
     VALUES (new.id, new.first_name, new.last_name, new.email_address, new.version, new.last_updated, new.last_updated_by, new.middle_name);
CREATE TRIGGER employees_audit_update
AFTER UPDATE ON employees FOR EACH ROW
    INSERT INTO employees_audit(employee_id, first_name, last_name, email_address, version, last_updated, last_updated_by, middle_name)
     VALUES(new.id, new.first_name, new.last_name, new.email_address, new.version, new.last_updated, new.last_updated_by, new.middle_name);

-- perform verification that the change doesn't modify updates/inserts
-- and that you can select existing data

-- drop the tables if you no longer need them
DROP TABLE employees_audit_old;
DROP TABLE employees_old;

-- unlock the tables so they can be used again
UNLOCK TABLES;

I can't stress enough how important verification is; once you've made your changes, you need to verify that they're successful. In this case, if you did the following, you could feel fairly certain that you haven't broken anything:

  1. insert into the employees table without middle_name
  2. verify that the employees_audit table is updated with the version of the employee just inserted
  3. update the employees table to update the middle_name
  4. verify that the employees_audit table is updated with the version of the employee just updated
  5. insert into the employees table with middle_name
  6. verify that the employees_audit table is updated with the version of the employee just inserted

In SQL, that verification could look something like the following (a sketch; the employee values are placeholders):
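
-- 1. insert an employee without a middle_name
INSERT INTO employees(first_name, last_name, email_address)
 VALUES ('Sarah', 'Connor', 'sarah.connor@example.com');

-- 2. verify the audit table has a version 1 row for that employee
SELECT version, middle_name FROM employees_audit
 WHERE employee_id = (SELECT id FROM employees WHERE email_address = 'sarah.connor@example.com');

-- 3. update the employee to set a middle_name
UPDATE employees SET middle_name = 'Jeanette'
 WHERE email_address = 'sarah.connor@example.com';

-- 4. verify the audit table now has a version 2 row for that employee
SELECT version, middle_name FROM employees_audit
 WHERE employee_id = (SELECT id FROM employees WHERE email_address = 'sarah.connor@example.com');

-- 5. insert an employee with a middle_name
INSERT INTO employees(first_name, middle_name, last_name, email_address)
 VALUES ('John', 'Henry', 'Connor', 'john.connor@example.com');

-- 6. verify the audit table has a version 1 row for that employee
SELECT version, middle_name FROM employees_audit
 WHERE employee_id = (SELECT id FROM employees WHERE email_address = 'john.connor@example.com');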

Remember the earlier conversation about NOT wanting to destroy data, and the related point that migrations could go both ways but in practice only go forward? Keep in mind that although this is a "forward compatible" migration, to "undo" or revert it you have to destroy data (and break a contract); so although you could revert this change, once you unlock the tables, you can't do so without destroying data. I use this example mostly to make you aware of what's going on: MySQL won't tell you that you're destroying data, it'll just do it, so you have to keep that in mind.

Manually: adding new indexes

Indexes can increase the overall performance when reading data from a given table, but they have a "cost" in that they must be built and maintained and take up disk space. From a domain analysis perspective, we could say that feedback from HR indicates that they expect to search by first name + last name and NOT in terms of auditing. Currently our table has no index covering those columns, so we'll create an index on first name and last name.

This (by comparison) is probably the easiest thing to do; we accomplish it by doing the following:

  1. Lock the table to prevent modification while doing the migration
  2. Alter the table to add the index
  3. Unlock the table

Note that I didn't mention a verification step; the expectation is that you did your due diligence beforehand and confirmed that adding the index will reduce the overall query time; this is a strong assumption (and beyond the scope of this repository). In SQL, this migration would look like the following:

-- lock the tables so they can't be modified while we're modifying it
LOCK TABLES employees WRITE;

-- create a composite index with first_name and last_name
CREATE INDEX idx_first_last_name ON employees(first_name, last_name);

-- verify that the index has been created

-- unlock the tables since we're finished
UNLOCK TABLES;

Verification could take the form of confirming that the index exists with this sql:

SHOW INDEXES FROM employees;

You could also attempt some sample queries with existing data to confirm a reduction in query time or use EXPLAIN to confirm that expected queries are now using the newly created index.
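
For example, a quick check with EXPLAIN (using one of the employees inserted earlier); you'd expect to see idx_first_last_name show up under possible_keys/key:

EXPLAIN SELECT id, email_address
 FROM employees
 WHERE first_name = 'Ichigo' AND last_name = 'Kurosaki';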

Manually: adding new constraint

Automatic Migration Using Goose

With those examples out of the way, and having seen how complicated the process can be, you may want to automate it so it's one click up and one click down; you may also realize that some of your offline installations may be MANY migrations behind where you are currently. Being able to automate those (thoroughly tested) migrations can reduce your administrative overhead, since you can schedule and plan mass migrations to be done at the click of a button rather than having to log into each and every installation. In this section we'll be making some strong assumptions, but I think they'll push you in a direction that simplifies/automates this process:

  • We'll create a custom MySQL Docker image
  • Our Docker image will have the most recent schema on it
  • Our Docker image will have a startup script that will automatically migrate the database from whatever state it's in to the most recent schema

Our tool of choice will be Goose because it does a lot of the work for us. Before we can start doing stuff, we'll need to install goose and then validate its connection to the database:

make check-goose
goose version
goose mysql "root:mysql@/sql_blog_migration?parseTime=true&&multiStatements=true" status

Once we have goose installed, we can start the process of creating migrations for Goose.

Automatically: adding a middle_name column

We're going to automate the migration we performed manually earlier: adding the middle_name column. Once we've installed goose, we'll need to go into the migration folder and create a new migration (starting from the root of the repo):

goose create add_middle_name_column sql

This should create a file prefixed with a timestamp; the sql file initially contains only a template (see below):

-- +goose Up
-- +goose StatementBegin
SELECT 'up SQL query';
-- +goose StatementEnd

-- +goose Down
-- +goose StatementBegin
SELECT 'down SQL query';
-- +goose StatementEnd

What we'll do from here is copy+paste the code we created above in the Manual Migration section into the goose Up block, so goose up can migrate forward (from wherever the database currently is to the most recent migration). It'll look like the following:

-- +goose Up
-- +goose StatementBegin
-- lock the table to prevent modification while migrating
LOCK TABLES employees_audit WRITE, employees WRITE;

-- update the employees table to add middle_name column
ALTER TABLE employees ADD COLUMN middle_name VARCHAR(50) DEFAULT '';

-- update the employees_audit table to add middle_name column
ALTER TABLE employees_audit ADD COLUMN middle_name VARCHAR(50) DEFAULT '';

-- drop the existing insert trigger since it won't include middle_name
DROP TRIGGER employees_audit_insert;

-- re-create the trigger for insert
CREATE TRIGGER employees_audit_insert
AFTER INSERT ON employees FOR EACH ROW
    INSERT INTO employees_audit(employee_id, first_name, last_name, email_address, version, last_updated, last_updated_by, middle_name)
     VALUES (new.id, new.first_name, new.last_name, new.email_address, new.version, new.last_updated, new.last_updated_by, new.middle_name);

-- drop the existing update trigger since it won't include middle_name
DROP TRIGGER employees_audit_update;

-- re-create the trigger for update
CREATE TRIGGER employees_audit_update
AFTER UPDATE ON employees FOR EACH ROW
    INSERT INTO employees_audit(employee_id, first_name, last_name, email_address, version, last_updated, last_updated_by, middle_name)
     VALUES(new.id, new.first_name, new.last_name, new.email_address, new.version, new.last_updated, new.last_updated_by, new.middle_name);

-- unlock the tables so it can be modified
UNLOCK TABLES;
-- +goose StatementEnd

-- +goose Down
-- +goose StatementBegin
SIGNAL SQLSTATE '45000' SET MESSAGE_TEXT = 'Cannot migrate down';
-- +goose StatementEnd

Two things of note: (1) we're effectively disabling the ability to revert or migrate down by using SIGNAL SQLSTATE; if you attempt to migrate down, it'll raise an error, and (2) there's no atomic way to verify between the steps (like we did during the manual migration).

To migrate up (forward) by one version, we can enter the following command:

goose mysql "root:mysql@/sql_blog_migration?parseTime=true&&multiStatements=true" up-by-one

It will have the following output:

2023/04/07 17:53:09 OK   20230407165454_add_middle_name_column.sql (1.12s)

Once the migration is done, you can also check the status of the migration with the following command:

goose mysql "root:mysql@/sql_blog_migration?parseTime=true&&multiStatements=true" status
2023/04/07 18:43:04     Applied At                  Migration
2023/04/07 18:43:04     =======================================
2023/04/07 18:43:04     Fri Apr  7 22:53:10 2023 -- 20230407165454_add_middle_name_column.sql

In case you wanted to know how Goose "works":

Goose "knows" what version(s) of the database have been deployed by querying a goose specific table (the default is goose_db_version). It's possible to provide a specific table to goose with command arguments if you want to have different table versions within th same database.

Automatically: adding new indexes

Similarly to how we did the previous migration, we'd go through the same steps to create a new migration file and populate it with the sql to perform the migration.

goose create add_new_index sql

The contents of the file will be as follows (taken from Manually: adding new indexes):

-- +goose Up
-- +goose StatementBegin
-- lock the tables so they can't be modified while we're modifying it
LOCK TABLES employees WRITE;

-- create a composite index with first_name and last_name
CREATE INDEX idx_first_last_name ON employees(first_name, last_name);

-- unlock the tables since we're finished
UNLOCK TABLES;
-- +goose StatementEnd

-- +goose Down
-- +goose StatementBegin
SIGNAL SQLSTATE '45000' SET MESSAGE_TEXT = 'Cannot migrate down';
-- +goose StatementEnd

Similarly, you can execute goose up-by-one and then check goose status:

goose mysql "root:sql_blog_migration@tcp(localhost:3306)/sql_blog_migration?parseTime=true&&multiStatements=true" up-by-one
goose mysql "root:sql_blog_migration@tcp(localhost:3306)/sql_blog_migration?parseTime=true&&multiStatements=true" status
2023/04/07 19:42:57 OK   20230407185110_add_new_index.sql (60.57ms)
2023/04/07 19:43:01     Applied At                  Migration
2023/04/07 19:43:01     =======================================
2023/04/07 19:43:01     Fri Apr  7 22:53:10 2023 -- 20230407165454_add_middle_name_column.sql
2023/04/07 19:43:01     Sat Apr  8 00:42:57 2023 -- 20230407185110_add_new_index.sql

Automatically: adding a new constraint

Integration (a complete solution)

This section won't be as interactive as the other sections, but I'll include the example code within this repository. You may be going through this entire repo and wondering to yourself: how would I put all of this together to create a solution that's maintainable, auditable and reasonable enough to allow other people to migrate "offline" databases?

TLDR; we use the following solutions to "solve" this problem:

  • We use Docker to create a versioned image that will contain the database, starting sql AND migration scripts
  • We use a two-stage docker build to build goose for the appropriate architecture and inject into the final image
  • On start we run a script that will attempt to migrate the database as needed

This isn't a very complex solution (when you know what you're looking at) and borrows heavily from https://github.com/antonio-alexander/go-bludgeon/tree/main/mysql. Some of the things we've modified are:

  1. installing goose to the Dockerfile when building
  2. the docker-entrypoint script to create the initial database (if it doesn't already exist)
  3. the auto_migration script to run goose up once the MySQL service is running

Although it's totally possible to build a mysql image from scratch, starting from yobasystems/alpine-mariadb is much easier and a good starting point.

run.sh: this file replaces the startup script for the image; we've made some modifications from the original so that it executes our scripts and sql files in the appropriate context:

  • on startup, if no database currently exists, it will create the MYSQL_DATABASE database
  • on startup, if no database currently exists, it will create the sql_blog_employees database associated with the configured version of the employees database
  • the post install scripts are run AFTER mysql is up and running (and accessible via mysqladmin ping)
  • if auto migration is enabled, it'll automatically execute goose up

The Dockerfile is a two-stage build that will generate the executable for goose and then inject it into the mariadb image with our updated scripts. Some of the important aspects of this Dockerfile:

  • we use sed to clean up the sql and shell scripts to fix the line endings
  • we install goose in a separate stage so there are no Go dependencies in the output image
  • goose is installed using a slightly interesting trick; this should work for both amd64 (intel) and arm images
  • we copy over multiple copies of employees.sql with 003_employees.sql being the default

The docker-compose.yml puts everything together to both build and run the image with defaults:

  • you can use this to build and run the mysql image

The sql files 001_employees.sql, 002_employees.sql, and 003_employees.sql are used to load a specific version of the employees database.

With these solutions above, you should be able to run the make build command and have a local image you can use to interact with sql_blog_migration.

Something you may notice is that, in practice, you can get inconsistent behavior with goose on new databases. In general, for "new" installations, you wouldn't want to apply a really old version of the database and then migrate it through the steps (although this does work); it'd make more sense to simply apply the latest version of the database (since there's "nothing" to migrate). However, even though the database is up-to-date, when you run goose it'll attempt to migrate as if you were running the older version of the database; goose assumes that no migrations have occurred, so it'll attempt to run all of the migrations.

To resolve this issue, you have to manually create the goose migration table for the database so that goose knows NOT to run those migrations. For example, if you wanted to indicate that the database had never been migrated, you could do nothing (goose will automatically create this table) or create an empty table:

CREATE TABLE IF NOT EXISTS goose_db_version (
    id BIGINT(20) UNSIGNED PRIMARY KEY AUTO_INCREMENT NOT NULL,
    version_id BIGINT(20) NOT NULL,
    is_applied TINYINT(1) NOT NULL,
    tstamp TIMESTAMP NULL DEFAULT current_timestamp() 
) ENGINE = InnoDB;

Alternatively, if this were a much more recent version that had already been migrated, you could do the following:

-- create the goose table
CREATE TABLE IF NOT EXISTS goose_db_version (
    id BIGINT(20) UNSIGNED PRIMARY KEY AUTO_INCREMENT NOT NULL,
    version_id BIGINT(20) NOT NULL,
    is_applied TINYINT(1) NOT NULL,
    tstamp TIMESTAMP NULL DEFAULT current_timestamp() 
) ENGINE = InnoDB;

-- insert data communicating that the database has been migrated with these versions
INSERT INTO goose_db_version(id, version_id, is_applied, tstamp) VALUES
    ('1','0','1','2023-04-07 22:53:09'),
    ('2','20230407165454','1','2023-04-07 22:53:10'),
    ('3','20230407185110','1', '2023-04-08 00:42:57');

As a result, part of the workflow for updating your sql image/dependencies must ALSO be to manage one or more tables associated with the goose db version. You may also ask yourself, "Do I need to organize my tables into their own databases?" The short answer is no, but you have to be a little more creative in how you organize things, since you can add an argument to goose to set the table:

goose mysql "root:<PASSWORD>@tcp(localhost:3306)/sql_blog_migration?parseTime=true&&multiStatements=true" --table employees_goose_db_version status

This gives you a bit more flexibility in how you organize your migration files; you won't be limited to a single database or schema.

Security Considerations

I think this comes with any kind of database how-to/opinion: in an ideal situation, you'd migrate using a user with only the specific permissions needed to perform that migration on that database. This takes some significant effort (to figure out what permissions are needed), and I took a shortcut in using root; even using the MYSQL_USER wouldn't have worked in this case. Although I think you'll only have to do it once, if you care about security in your offline (and online/cloud) databases, this should be really high on your list of things to do.

As an aside (this is totally my opinion): it's better to separate code that performs DDL (data definition language) from code that performs DML (data manipulation language). It's TOTALLY possible...and easy to integrate goose into your application's executable and have it perform the migration on startup. This has a handful of problems:

  • You'd have to provide your application with a set of credentials that can perform both DML and DDL commands
  • Depending on your orchestration tool (e.g., Docker, Kubernetes); there may be situations (out of your control) that would abort the migration process and leave the database in a broken state (requiring manual intervention/recovery)
  • If your orchestration tool creates multiple instances, you have to worry about multiple instances of the migration running simultaneously, or you'll need some way to synchronize the goose process. ALTER commands can be done within a lock, but it takes significantly more effort to make migrations idempotent

This opinion is rather inherent in the Integration (a complete solution) section; I specifically integrated goose into the custom mysql image rather than into the application that would interact with the table. Although this solution is specific to "offline" databases, the work to get there provides the following benefits:

  • You could migrate separately from deploying/updating the application (you uncouple the two activities)
  • You can protect the elevated credentials required to perform DDL operations from those needed to perform DML
  • You can avoid the synchronization issue that may result depending on your orchestration tool
  • You have an opportunity to perform validation prior to making the database ready for public consumption

Your results may vary, but a "one-size-fits-all" solution doesn't scale well when you need it most.

Issues Found Worth Mentioning

This is a list of issues I found while putting this together; it may give you more context or communicate an "a-ha" moment:

  • the original run.sh script would execute sql using mysqld directly using skip-names, so the definer would be set to a user that didn't exist and the migration would fail unless you dropped the view, re-created it, and THEN ran the migration

To resolve this issue, I modified the run.sh script to execute the sql scripts as a known user rather than using mysqld behind the scenes; it was an easy problem to resolve interactively (by dropping the view and re-creating it) but was almost impossible to automate. I think in most online databases this isn't an issue, but for an offline database, it's paramount to be able to automate first-time database creation.

Thanks for reading.
