core: Save schema version for intermediate steps #364

digitalcircuit · 2018-05-31T18:21:36Z

In short

Update schemaversion for each successful intermediate upgrade
- Better chance of auto-recovering by running core after interrupt
- Aids debugging after the fact
- Makes downgrades after interrupted upgrade more explicit, requiring setting schema version
Update logging to be more explicit about what schema version fails

Criteria	Rank	Reason
Impact	★★★ 3/3	Reduces risk of breaking database with interrupted upgrades
Risk	★★☆ 2/3	Schema version might not be stored correctly
Intrusiveness	★☆☆ 1/3	Minor changes, shouldn't interfere with other pull requests

Rationale

These changes

When running upgrade queries, update the schema version for each intermediate step. This ensures that any interrupted upgrades have a greater chance of resuming correctly after core restart.

Almost all databases make single queries atomic (fully works or fully fails, no partial), and with many of the longest migrations being a single query, this makes upgrade interruptions much more likely to leave the database in a valid intermediate schema version.

As a side effect, downgrading from a partial upgrade will now require setting schema version manually, instead of it possibly happening to work depending on what changed. An unsupported operation failing in a known, recoverable way seems better than failing in an unknown, possibly recoverable way.

Future work

In the future, for databases that support it (e.g. almost only PostgreSQL), we may want to wrap upgrades in a transaction. This will need careful testing of potential additional space requirements and any database modifications that might not be allowed in a transaction.

For sake of time, the above was copied from the commit message.

Examples

Interrupted upgrade procedure

Steps

Show current schema version
Run core, forcibly (kill -9) interrupting upgrade when it reaches a long single-query migration
Run core, letting it finish

Commands run
run-profile.sh is a script to simplify managing multiple testing copies of Quassel and is equivalent to calling the right quasselcore with the right options

sqlite> select * from coreinfo;
schemaversion|26

$ (sleep 0.5 && pkill -9 quasselcore ) & \
  ./run-profile.sh master core local

[terminated]

sqlite> select * from coreinfo;
schemaversion|30

$ ./run-profile.sh master core local

sqlite> select * from coreinfo;
schemaversion|31

Interrupted upgrade log

2018-05-31 12:49:20 Warning: Installed Schema (version 26) is not up to date. Upgrading to version 31...  This may take a while for major upgrades. 
[terminated]
2018-05-31 12:54:29 Warning: Installed Schema (version 30) is not up to date. Upgrading to version 31...  This may take a while for major upgrades. 
2018-05-31 12:54:29 Warning: Installed Schema successfully upgraded to version 31. 
2018-05-31 12:54:29 Info: SQLite storage backend is ready. Schema version: 31

When running upgrade queries, update the schema version for each intermediate step. This ensures that any interrupted upgrades have a greater chance of resuming correctly after core restart. Almost all databases make single queries atomic (fully works or fully fails, no partial), and with many of the longest migrations being a single query, this makes upgrade interruptions much more likely to leave the database in a valid intermediate schema version. As a side effect, downgrading from a partial upgrade will now require setting schema version manually, instead of it possibly happening to work depending on what changed. An unsupported operation failing in a known, recoverable way seems better than failing in an unknown, possibly recoverable way. Update logging to be more explicit about what fails. Tested by kill -9'ng Quasselcore during migration, then re-running core afterwards. In the future, for databases that support it (e.g. almost only PostgreSQL), we may want to wrap upgrades in a transaction. This will need careful testing of potential additional space requirements and any database modifications that might not be allowed in a transaction. sqlite> select * from coreinfo; schemaversion|26 $ (sleep 0.5 && pkill -9 quasselcore ) & \ ./run-profile.sh master core local sqlite> select * from coreinfo; schemaversion|30 $ ./run-profile.sh master core local sqlite> select * from coreinfo; schemaversion|31

digitalcircuit · 2018-06-03T21:14:01Z

Pretty sure we're already in favor of this, but another point for the release notes:
Upgrades may take a long time, and may temporarily more than double the size of the database.

Just had someone on IRC run into a failed upgrade due to out of space - 22 GB database, 26 GB free space.

digitalcircuit force-pushed the fix-schema-version-storing branch from 17a0884 to 5472d27 Compare May 31, 2018 23:21

Sput42 merged commit f10304a into quassel:master Jun 6, 2018

digitalcircuit deleted the fix-schema-version-storing branch June 7, 2018 01:07

This was referenced Jan 29, 2019

core: Track upgrade step within each schema version #473

Merged

[0.13] core: Track upgrade step within each schema version #474

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

core: Save schema version for intermediate steps #364

core: Save schema version for intermediate steps #364

digitalcircuit commented May 31, 2018 •

edited

digitalcircuit commented Jun 3, 2018

core: Save schema version for intermediate steps #364

core: Save schema version for intermediate steps #364

Conversation

digitalcircuit commented May 31, 2018 • edited

In short

Rationale

These changes

Future work

Examples

Interrupted upgrade procedure

Interrupted upgrade log

digitalcircuit commented Jun 3, 2018

digitalcircuit commented May 31, 2018 •

edited