fix(saga): Handle concurrency issue when same op is sent more than once #4229

robzienert · 2019-12-18T19:40:20Z

We experienced an issue with sagas where Orca submitted the same operation to clouddriver twice at nearly the same time (a few milliseconds apart). The second request returned a 500 due to a SQL integrity constraint violation, but the work was already being handled successfully by the other clouddriver instance.

What should happen is that the second instance returns a pointer to the original task, so that Orca doesn't have to reason about its mistake and can just carry on monitoring the operation that's being performed.
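The intended behavior can be sketched roughly as follows. This is an illustration only, not the code in this PR: `TaskStore`, `OperationSubmitter`, `DuplicateKeyException`, and the idea of keying tasks by a request ID are all hypothetical, and an in-memory map stands in for the SQL table and its unique constraint.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical stand-in for the SQL integrity constraint violation. */
class DuplicateKeyException extends RuntimeException {
  DuplicateKeyException(String key) {
    super("duplicate key: " + key);
  }
}

/** Hypothetical task store; putIfAbsent emulates a unique index on the request ID. */
class TaskStore {
  private final Map<String, String> taskIdByRequestId = new ConcurrentHashMap<>();

  /** Inserts a row, failing the way a unique index would if the key already exists. */
  void insert(String requestId, String taskId) {
    if (taskIdByRequestId.putIfAbsent(requestId, taskId) != null) {
      throw new DuplicateKeyException(requestId);
    }
  }

  /** Returns the task ID recorded for the request, or null if there is none. */
  String findTaskId(String requestId) {
    return taskIdByRequestId.get(requestId);
  }
}

/** Hypothetical submitter: one instance per clouddriver node, sharing one store. */
class OperationSubmitter {
  private final TaskStore store;

  OperationSubmitter(TaskStore store) {
    this.store = store;
  }

  /** Returns the ID of the task handling the request, whether new or pre-existing. */
  String submit(String requestId, String newTaskId) {
    try {
      store.insert(requestId, newTaskId);
      return newTaskId;
    } catch (DuplicateKeyException e) {
      // Another instance won the race: point the caller at its task instead of failing.
      String existing = store.findTaskId(requestId);
      if (existing == null) {
        throw e; // Nothing to point at, so surface the original error.
      }
      return existing;
    }
  }
}
```

With two submitters sharing one store, the second `submit` for the same request ID returns the first task's ID rather than raising, which is the "pointer to the original task" described above.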

Also increased the duration that Kato Tasks are kept around. 1 hour is too short to be notified of an error and diagnose it. 4 days seems a reasonable default, so that if an error occurs at the end of the week or on the weekend, there's still sufficient time to get information the following week.
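To make the retention reasoning concrete, here is a small sketch under assumed names. `TaskRetention` and `isExpired` are hypothetical and do not reflect how clouddriver actually configures or runs task cleanup; only the 1 hour and 4 day values come from the description above.

```java
import java.time.Duration;
import java.time.Instant;

/** Hypothetical helper illustrating the old and new retention windows. */
class TaskRetention {
  static final Duration OLD_RETENTION = Duration.ofHours(1);
  static final Duration NEW_RETENTION = Duration.ofDays(4);

  /** A task is eligible for cleanup once its retention window has fully elapsed. */
  static boolean isExpired(Instant completedAt, Instant now, Duration retention) {
    return completedAt.plus(retention).isBefore(now);
  }
}
```

For a task that fails on a Friday at 23:00 UTC and is investigated the following Monday at 17:00 UTC (66 hours later), the 1 hour window has long expired, while the 4 day (96 hour) window still covers it.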

ajordens · 2019-12-18T19:46:19Z

Thanks for increasing the task cleanup interval.

spinnakerbot · 2019-12-18T19:50:32Z

The following commits need their title changed:

f01f774: Update SqlEventRepository.kt

Please format your commit title into the form:

<type>(<scope>): <subject>, e.g. fix(kubernetes): address NPE in status check

This allows us to easily generate changelogs & determine semantic version numbers when cutting releases. You can read more about commit conventions here.
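As a rough local check before pushing, the title format can be approximated with a regular expression. This is a sketch only: `CommitTitleCheck` is a hypothetical name, and the pattern is an assumption about the format shown above, not the bot's actual rule (which may, for example, restrict the allowed types).

```java
import java.util.regex.Pattern;

/** Hypothetical checker approximating the <type>(<scope>): <subject> format. */
class CommitTitleCheck {
  // Lowercase type, a parenthesized scope, a colon and a space, then a non-empty subject.
  private static final Pattern TITLE = Pattern.compile("[a-z]+\\([^()]+\\): \\S.*");

  /** Returns true if the whole title matches the assumed format. */
  static boolean isConventional(String title) {
    return TITLE.matcher(title).matches();
  }
}
```

Under this pattern, `fix(kubernetes): address NPE in status check` is accepted and `Update SqlEventRepository.kt` is rejected.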

jonsie

LGTM

fix(saga): Handle concurrency issue when same op is sent more than once

2191322

robzienert requested review from cfieber, ajordens, srekapalli, dreynaud and jonsie December 18, 2019 19:40

Update SqlEventRepository.kt

f01f774

ajordens approved these changes Dec 18, 2019

robzienert added the ready to merge label (Approved and ready for a merge) Dec 18, 2019

mergify bot added the auto merged label (Merged automatically by a bot) Dec 18, 2019

jonsie approved these changes Dec 18, 2019

Merge branch 'master' into fix-dupe-key-exception

1dbe2f8

mergify bot merged commit 1ab574c into spinnaker:master Dec 18, 2019

spinnakerbot added the target-release/1.18 label Dec 18, 2019
