Releases: health-checks, auto-rollback, gradual rollout and a/b releases #139

friism · 2023-02-07T22:33:41Z

Required Terms

I agree to follow this project's Code of Conduct
I have read and accept the Salesforce Program Agreement

What service(s) is this request for?

runtime

Tell us about what you're trying to solve. What challenges are you facing?

We should improve how code changes (releases) are rolled out with Heroku. We should consider adding:

Healthchecks: Currently Heroku deems a dyno healthy once it has bound to $PORT. This is too simplistic since the app may not actually be ready to serve traffic by then
Auto-rollback: We don't automatically fail a release and auto-rollback, even when the platform can determine that dynos running the new release are crashing
Gradual rollout: We have three different mechanisms for rollouts. The standard common runtime behavior, Preboot in common runtime and gradual rollout in Private Spaces (PS). The PS behavior is often not gradual enough and causes latency spikes apps under load
A/B releases and canary releases: For even more assurance and flexibility

For non-web dynos, we should also establish a healthcheck convention and support rolling deploys (currently non-web dynos don't support any form of gradual rollout)

The text was updated successfully, but these errors were encountered:

trevorturk · 2023-02-17T20:19:16Z

I'm excited to see this on the roadmap!

I'm especially interested in gradual rollout and canary deploys. This sort of thing has been on my Heroku wishlist for a long time.

I'd also like to suggest considering an "adaptive preboot" for example using Rails recent addition rails/rails#46936

If an app had a standard endpoint that could return 200 OK when everything is booted, we could make the zero downtime deploy via preboot much quicker, instead of waiting for a static 3 minutes. This could also be leveraged for auto-rollback, as in, don't switch over to the new code unless the health/heartbeat/up endpoint responds 200 OK.

Also worth mentioning is that I'd like to see a an option added to rollback which would bypass the preboot delay for emergency use.

Thanks!

stevenharman · 2023-03-17T21:12:37Z

re: Gradual Rollout.

I'd be happy just to see preboot get the boot, and instead see Common Runtime have a rolling restart like Private Spaces does. A cherry on top would be the ability to configure the percentage of the roll - it's hard-coded to 25% on Dogwood, IIRC. But in a large enough formation, it'd be nice to tune that down even further.

locofocos · 2023-08-16T17:39:03Z

I'd be excited to start with a limited version of this: a healthcheck endpoint + auto-rollback. There are cases where we have pushed code changes that caused our Rails application to fail to boot. A simple GET to a healthcheck endpoint would have returned a 500. I would love if heroku would make such a request to our new dynos during preboot, then halt the rest of the deploy if it can't get a 200 response.

nightpool · 2023-11-03T17:53:41Z

For canary deployments, having something gradual that would be controlled by the error rate of each release would be great.

friism added the Proposed label Feb 7, 2023

friism assigned elimchaysengSF Feb 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: health-checks, auto-rollback, gradual rollout and a/b releases #139

Releases: health-checks, auto-rollback, gradual rollout and a/b releases #139

friism commented Feb 7, 2023 •

edited

trevorturk commented Feb 17, 2023

stevenharman commented Mar 17, 2023

locofocos commented Aug 16, 2023

nightpool commented Nov 3, 2023

Releases: health-checks, auto-rollback, gradual rollout and a/b releases #139

Releases: health-checks, auto-rollback, gradual rollout and a/b releases #139

Comments

friism commented Feb 7, 2023 • edited

Required Terms

What service(s) is this request for?

Tell us about what you're trying to solve. What challenges are you facing?

trevorturk commented Feb 17, 2023

stevenharman commented Mar 17, 2023

locofocos commented Aug 16, 2023

nightpool commented Nov 3, 2023

friism commented Feb 7, 2023 •

edited