
Use local DB and clock process in every container #5236

Closed
wants to merge 6 commits into
base: master
from

Conversation

@pmac
Member

pmac commented Nov 1, 2017

Removes our reliance on an external database server. A.K.A:

THE SQLITENING

@pmac
Member

pmac commented Nov 2, 2017

@jgmize I've made some changes that might seem unrelated, but this effort has uncovered some things. For example, the way we handle Git repos for management commands has changed: since the initial population of the DB doesn't have the app config, it could set the wrong remote. We now skip git remotes entirely, since remotes are really just conveniences, and use the git repo URL directly.
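Fetching by URL instead of by remote name could look something like this sketch (the function name, repo paths, and ref handling are illustrative, not bedrock's actual management-command code):

```python
import subprocess

def update_repo(repo_dir, repo_url, ref="master"):
    """Update a checkout by fetching from the repo URL directly,
    ignoring whatever remotes the clone happens to have configured."""
    # Fetching by URL works even when the clone was created under a
    # different config and its named remote points at the wrong place.
    subprocess.run(["git", "fetch", repo_url, ref], cwd=repo_dir, check=True)
    subprocess.run(["git", "checkout", "-f", "FETCH_HEAD"], cwd=repo_dir, check=True)
```

Since FETCH_HEAD is checked out directly, the clone never needs a correctly named remote at all.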

I also further improved the cron health check: it now gets the task names and expected run intervals from the job configs themselves. It's possibly a tad hacky, but it has the advantage that the first run of the health check view sets the last-run file mtimes, so it won't fail immediately on deploy if the app starts before the cron process.

tl;dr: coming along nicely


@pmac
Member

pmac commented Nov 2, 2017

You can see the new fancy task health check on my demo:

https://bedrock-demo-pmac.us-west.moz.works/healthz-cron/


@pmac
Member

pmac commented Nov 3, 2017

Fixes issue #5235


@pmac
Member

pmac commented Nov 3, 2017

Did some testing with a local container and ab. I had the cron process constantly reloading the security advisories data from disk into the db, and ran the following while that was happening:

$ ab -t 120 -c 4 http://localhost:8000/en-US/security/advisories/
This is ApacheBench, Version 2.3 <$Revision: 1706008 $>
Copyright 1996 Adam Twiss, Zeus Technology Ltd, http://www.zeustech.net/
Licensed to The Apache Software Foundation, http://www.apache.org/

Benchmarking localhost (be patient)
Finished 809 requests


Server Software:        meinheld/0.6.1
Server Hostname:        localhost
Server Port:            8000

Document Path:          /en-US/security/advisories/
Document Length:        302890 bytes

Concurrency Level:      4
Time taken for tests:   120.097 seconds
Complete requests:      809
Failed requests:        0
Total transferred:      246914078 bytes
HTML transferred:       245643298 bytes
Requests per second:    6.74 [#/sec] (mean)
Time per request:       593.807 [ms] (mean)
Time per request:       148.452 [ms] (mean, across all concurrent requests)
Transfer rate:          2007.76 [Kbytes/sec] received

Connection Times (ms)
              min  mean[+/-sd] median   max
Connect:        0    0   0.0      0       0
Processing:   119  588 549.5    441    3759
Waiting:      119  412 405.2    301    3758
Total:        119  589 549.5    441    3759

Percentage of the requests served within a certain time (ms)
  50%    441
  66%    564
  75%    671
  80%    751
  90%   1122
  95%   1594
  98%   2471
  99%   3288
 100%   3759 (longest request)

Looks pretty good to me.


@pmac
Member

pmac commented Nov 15, 2017

After talking with @jgmize, we'd like to try another direction. The current implementation has every bedrock container fetching data from all of the sources. This is pretty inefficient, as well as potentially more error-prone, since they're all fetching data across the internet. We should be able to scale down the number of running containers with this change, but it's still not great.

New Proposal

We should have a single container (or Jenkins job) running in a single region that does the DB updates. This container will upload the db file to S3 whenever it changes, naming it after the git-sha of the running bedrock code. Naming it after the git-sha ensures that we don't run into any schema-change issues.

The bedrock containers, meanwhile, will have a process that checks for updated database files on a schedule. When the check returns a new file, it will save it and swap the new db file for the old one via symlinks. It should work something like:

  1. The bedrock SQLite db file is generated during deployment and included in the built image. This db file is named after a hash of its contents and symlinked to bedrock.db, which bedrock is configured to use.
  2. The db-updater process will check for a new db file at the S3 URL, using ETags to fetch it only when it has changed. If the response is a 404, it will do nothing, assuming the updater for this particular git-sha build of bedrock hasn't found any data updates yet.
  3. When it gets a new file, it will name it with the new hash of the database file, check the integrity of the new file, and swap the bedrock.db symlink to the new file.
  4. It will then delete the old database file.
  5. Profit.
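Step 2's conditional fetch can be sketched as a plain conditional GET; the function name and ETag bookkeeping here are assumptions, not the actual updater:

```python
import urllib.error
import urllib.request

def fetch_if_changed(url, last_etag=None):
    """Return (body, etag) when the object changed, else (None, last_etag).
    A 404 means no update has been published for this build yet."""
    req = urllib.request.Request(url)
    if last_etag:
        # S3 answers 304 Not Modified when the ETag still matches.
        req.add_header("If-None-Match", last_etag)
    try:
        with urllib.request.urlopen(req) as resp:
            return resp.read(), resp.headers.get("ETag")
    except urllib.error.HTTPError as e:
        if e.code in (304, 404):  # unchanged, or no update published yet
            return None, last_etag
        raise
```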

This will mean that we have only a single updater process to monitor, rather than n = len(bedrock_containers). And it will mean that a good, up-to-date database file is publicly available for everyone, which should make our dev environment much easier to set up.
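The hash-naming, integrity check, and symlink swap in steps 3 and 4 might look roughly like this (a sketch; the bedrock-&lt;hash&gt;.db naming and function shape are illustrative):

```python
import hashlib
import os
import sqlite3

def swap_db(db_bytes, data_dir, link_name="bedrock.db"):
    """Write the new DB under a content-hash name, verify it,
    atomically repoint the symlink, then drop the old file."""
    digest = hashlib.sha256(db_bytes).hexdigest()[:16]
    new_path = os.path.join(data_dir, "bedrock-%s.db" % digest)
    with open(new_path, "wb") as fh:
        fh.write(db_bytes)
    # Step 3: verify the download before swapping it in.
    con = sqlite3.connect(new_path)
    ok = con.execute("PRAGMA integrity_check").fetchone()[0]
    con.close()
    if ok != "ok":
        os.remove(new_path)
        raise ValueError("downloaded db failed integrity check")
    link = os.path.join(data_dir, link_name)
    old_target = os.path.realpath(link) if os.path.islink(link) else None
    # Swap via a temporary symlink + rename so readers never see a
    # missing or half-written bedrock.db.
    tmp_link = link + ".tmp"
    if os.path.lexists(tmp_link):
        os.remove(tmp_link)
    os.symlink(new_path, tmp_link)
    os.replace(tmp_link, link)
    # Step 4: delete the superseded database file.
    if old_target and old_target != os.path.realpath(new_path) and os.path.exists(old_target):
        os.remove(old_target)
    return new_path
```

The rename-over-symlink is what makes the swap safe for a running app: bedrock's open connections keep their old file handle, and any new connection sees a complete database.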


pmac added some commits Oct 30, 2017

* Use local DB and clock process in every container
  Removes our reliance on an external database server. Adds a /healthz-cron/ URL that will 500 if no data updates have happened in more than 10 minutes.
* Make cron health check more comprehensive
  Will now check and report on each cron task individually. Also includes the data git repos for prod-details, security advisories, and release notes in the docker image.
* Git repos avoid using remotes
  Git repos could end up in a strange state when their initial setup happened under a different config than the running container. This fixes that by using repo URLs directly.
* Get cron config for health check from cron file
  cron.py will now print a CSV of its config to a tmp file at startup, which the health check view will read.
* Move to a multistage dockerfile and build
  Moves us from 4 dockerfiles to 1 \o/
@pmac
Member

pmac commented Dec 6, 2017

Closing in favor of #5334


@pmac pmac closed this Dec 6, 2017
