[PR] Prevent repeated resumes of a resource by remembering its resumed flag #230

kopf-archiver · 2020-08-18T20:01:18Z

A pull request by nolar at 2019-11-13 11:20:13+00:00
Original URL: zalando-incubator/kopf#230
Merged by nolar at 2019-11-13 12:54:04+00:00

Issue : #113 #223 #214

Description

This PR does two things:

First, it adds a per-resource in-memory container, which is remembered for the whole lifecycle of an operator. This can lead to more memory consumption on big clusters.

It is different from the persistent storage of the object's data directly on the object's status, as the values stored in the in-memory container are specific to that operator process only, and sometimes not serialisable (e.g. threads, tasks, asyncio events, locks, etc).

Second, it remember every object's resuming status in-memory. Once resumed, the object is never resumed again. Otherwise (i.e. currently), the resuming happens on every reconnection of the API watch-stream (partially remedied by #229, but not fully).

More on that, currently (i.e. before fixing), if there are multiple resume handlers or retries or sub-handlers, only the first attempt will be executed. Also, if the on-resume handlers go after the on-create/on-update handlers, they are ignored. All other attempts after the initial listing will not be interpreted as "initial" (due to event's type not being None anymore), thus not detected as resuming.

With this PR, the resume handlers are included into the selection until all of them succeed at least one — even for the regular watch-events with ['type']!=None (i.e. after patching from the previous attempts).

Such approach should improve the following for the @on.resume handlers:

Executed only once per operator process life time (in case of success).
Retried until timeout or retries limit (in case of failures).
Any order of handlers supported (e.g. on-resume handlers after the on-create handlers).
Multiple on-resume handlers supported.
Sub-handlers in the on-resume handlers supported.

Types of Changes

Bug fix (non-breaking change which fixes an issue)
Refactor/improvements

Review

List of tasks the reviewer must do to review the PR

Tests
Documentation

The text was updated successfully, but these errors were encountered:

kopf-archiver bot added the archive label Aug 18, 2020

kopf-archiver bot closed this as completed Aug 18, 2020

This was referenced Aug 19, 2020

Is resume called before create/update? #112

Closed

Resuming handler will not be retried if it fails #113

Closed

Sub-handlers in a resume handlers are executed not in parallel #214

Closed

kopf-archiver bot changed the title ~~[archival placeholder]~~ [PR] Prevent repeated resumes of a resource by remembering its resumed flag Aug 19, 2020

kopf-archiver bot added bug Something isn't working enhancement New feature or request labels Aug 19, 2020

This was referenced Aug 19, 2020

[PR] Skip resumes for deleted objects, unless explicitly marked for selection #233

Closed

Handlers for resuming only unmodified/idling/sleeping resources #241

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PR] Prevent repeated resumes of a resource by remembering its resumed flag #230

[PR] Prevent repeated resumes of a resource by remembering its resumed flag #230

kopf-archiver bot commented Aug 18, 2020 •

edited

[PR] Prevent repeated resumes of a resource by remembering its resumed flag #230

[PR] Prevent repeated resumes of a resource by remembering its resumed flag #230

Comments

kopf-archiver bot commented Aug 18, 2020 • edited

Description

Types of Changes

Review

kopf-archiver bot commented Aug 18, 2020 •

edited