scheduled task to conditionally scale up the houndigrade cluster #229

infinitewarp · 2018-04-25T23:34:37Z

This PR adds our first support for scheduled Celery tasks using beat. As described in #176, this task checks the current cluster auto scaling size, checks the message queue for volume IDs, scales up to 1, and runs a (currently stubbed-out) async task that will be responsible for actually "running" houndigrade in the cluster.

Note: this PR builds upon the organize-aws-helpers branch and needs to merge after #224 merges.

Demo: coming soon!

codecov · 2018-04-25T23:50:18Z

Codecov Report

Merging #229 into master will increase coverage by 0.2%.
The diff coverage is 100%.

@@            Coverage Diff            @@
##           master     #229     +/-   ##
=========================================
+ Coverage   98.94%   99.15%   +0.2%     
=========================================
  Files          48       50      +2     
  Lines        1807     2012    +205     
  Branches       94       99      +5     
=========================================
+ Hits         1788     1995    +207     
+ Misses         13       11      -2     
  Partials        6        6

Impacted Files	Coverage Δ
cloudigrade/util/aws/__init__.py	`100% <100%> (ø)`	⬆️
cloudigrade/util/tests/test_aws_autoscaling.py	`100% <100%> (ø)`
cloudigrade/util/exceptions.py	`100% <100%> (ø)`	⬆️
cloudigrade/account/tests/test_tasks.py	`98.97% <100%> (+0.35%)`	⬆️
cloudigrade/util/aws/autoscaling.py	`100% <100%> (ø)`
cloudigrade/account/util.py	`98.27% <100%> (+0.71%)`	⬆️
cloudigrade/account/tasks.py	`100% <100%> (ø)`	⬆️
cloudigrade/account/tests/test_util.py	`100% <100%> (ø)`	⬆️
... and 1 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 24c8bd5...ec4821a. Read the comment docs.

katherine-black · 2018-04-26T13:25:33Z

cloudigrade/config/settings/base.py

+)
+HOUNDIGRADE_AWS_VOLUME_BATCH_SIZE = env.int(
+    'HOUNDIGRADE_AWS_VOLUME_BATCH_SIZE',
+    default=5


Do we not want to do 32 like we originally talked about?

Good catch. 👍 I accidentally overlooked that requirement.

katherine-black · 2018-04-26T14:06:44Z

cloudigrade/account/tasks.py

+    )
+
+    if len(messages) == 0:
+        # Quietly exit and let a future run check for messages.


Not sure if this is the right time, but I'd love to see some debug logging happen in situations like this.

I'd considered about the same thing when I wrote this. I think I'll add some in now.

info logs added in the latest commits. I figure start louder at info and we can lower it to debug when we decide it's too noisy.

katherine-black · 2018-04-26T14:07:50Z

cloudigrade/config/settings/base.py

+CELERYBEAT_SCHEDULE = {
+    'scale_up_inspection_cluster_every_5_min': {
+        'task': 'account.tasks.scale_up_inspection_cluster',
+        'schedule': 60 * 5,  # seconds


Regarding timing, I think we originally were talking about trying to run the cluster once every 30 mins or an hour, maybe have it be configurable? Either way 5 minutes seems awfully small with how long snapshot copy operations can take.

katherine-black

Throwing request changes on here until individual comments are resolved.

… because pypy3 doesn't (yet?) support the former

infinitewarp · 2018-04-27T19:39:59Z

Recorded demos here: #176 (comment)

katherine-black

Thanks for adding the test for the limit 👍

https://asciinema.org/a/178750 - limit message dequeue to batch size https://asciinema.org/a/178752 - successful scale up https://asciinema.org/a/178755 - empty queue exits https://asciinema.org/a/178767 - non-zero scale exits cloudigrade/cloudigrade#229 cloudigrade/cloudigrade#176

infinitewarp changed the title ~~176 scale cluster~~ WIP 176 scale cluster Apr 25, 2018

infinitewarp changed the title ~~WIP 176 scale cluster~~ scheduled task to conditionally scale up the houndigrade cluster Apr 25, 2018

infinitewarp requested review from katherine-black, kholdaway, abaiken and a team April 25, 2018 23:55

katherine-black reviewed Apr 26, 2018

View reviewed changes

infinitewarp added this to the sprint 2018-04-23 milestone Apr 26, 2018

katherine-black reviewed Apr 26, 2018

View reviewed changes

katherine-black suggested changes Apr 26, 2018

View reviewed changes

infinitewarp changed the base branch from organize-aws-helpers to master April 26, 2018 14:12

infinitewarp force-pushed the 176-scale-cluster branch from 42edf6e to cc237c1 Compare April 26, 2018 14:15

infinitewarp requested a review from noahl April 26, 2018 17:44

infinitewarp added 8 commits April 27, 2018 13:26

add util function for read messages from queue

9749b31

remove unused function argument "result_key"

bda0e95

add django-celery-beat for scheduling tasks

776589e

add scheduled task to scale up the aws houndigrade cluster

8f22aeb

replace assert_called_once with more explicit assert_called_once_with…

91091fc

… because pypy3 doesn't (yet?) support the former

increase default volume batch size to 32

a3f0b60

extend time to scale up inspection cluster to 60 minutes

87a9c01

log info when we do not scale up the cluster

d5f3cae

infinitewarp force-pushed the 176-scale-cluster branch from 8d6da12 to d5f3cae Compare April 27, 2018 17:26

katherine-black previously approved these changes Apr 27, 2018

View reviewed changes

the official celery docs lie! fix "incorrect" config variable name

c1bbdbb

infinitewarp dismissed katherine-black’s stale review via c1bbdbb April 27, 2018 19:18

katherine-black previously approved these changes Apr 27, 2018

View reviewed changes

add test to show read_messages_from_queue stops at given limit

ec4821a

infinitewarp dismissed katherine-black’s stale review via ec4821a April 27, 2018 20:35

katherine-black approved these changes Apr 27, 2018

View reviewed changes

infinitewarp merged commit 2fcd3d6 into master Apr 27, 2018

infinitewarp deleted the 176-scale-cluster branch April 27, 2018 20:46

infinitewarp mentioned this pull request May 1, 2018

Celery worker missing from OpenShift environment #243

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scheduled task to conditionally scale up the houndigrade cluster #229

scheduled task to conditionally scale up the houndigrade cluster #229

infinitewarp commented Apr 25, 2018 •

edited

codecov bot commented Apr 25, 2018 •

edited

katherine-black Apr 26, 2018

infinitewarp Apr 26, 2018

katherine-black Apr 26, 2018

infinitewarp Apr 26, 2018

infinitewarp Apr 26, 2018

katherine-black Apr 26, 2018

katherine-black left a comment

infinitewarp commented Apr 27, 2018

katherine-black left a comment

scheduled task to conditionally scale up the houndigrade cluster #229

scheduled task to conditionally scale up the houndigrade cluster #229

Conversation

infinitewarp commented Apr 25, 2018 • edited

codecov bot commented Apr 25, 2018 • edited

Codecov Report

katherine-black Apr 26, 2018

Choose a reason for hiding this comment

infinitewarp Apr 26, 2018

Choose a reason for hiding this comment

katherine-black Apr 26, 2018

Choose a reason for hiding this comment

infinitewarp Apr 26, 2018

Choose a reason for hiding this comment

infinitewarp Apr 26, 2018

Choose a reason for hiding this comment

katherine-black Apr 26, 2018

Choose a reason for hiding this comment

katherine-black left a comment

Choose a reason for hiding this comment

infinitewarp commented Apr 27, 2018

katherine-black left a comment

Choose a reason for hiding this comment

infinitewarp commented Apr 25, 2018 •

edited

codecov bot commented Apr 25, 2018 •

edited