fc-agent: improve maintenance scheduling #671
Merged
osnyx
requested changes
Jul 19, 2023
- New requests can now be merged with existing ones if their activities have the same type. Significant changes to an activity cause the updated request to be postponed.
- Requests can be cancelled by requesting an activity which nullifies the original activity (for example, resetting the channel back to the current system channel cancels the planned update).
- UpdateActivity with metadata and better comment generation replaces dumb shell scripts for planned system updates.
- VMChangeActivity with metadata replaces RebootActivity for memory and core changes.
- All activities can request a reboot, which will be done after all due requests have been executed.
- Continuously scheduled requests will be executed in one go if at least the first request is due, avoiding repeated switching to maintenance mode in a short time frame and possibly unnecessary reboots.
- Overdue requests (more than 30 minutes after the scheduled start time) will be postponed to avoid overrunning the planned maintenance window or interfering with other machines going into maintenance mode.
- Maintenance preparation time and request execution time are different concepts now. Execution of requests is typically quite fast, but there may be commands delaying the execution of all requests. Directory doesn't support this distinction yet, so we just report the sum of preparation time and estimated execution time (but at least 15 minutes).
- Untangled maintenance code and manage.py: all maintenance requests are now generated in maintenance.py.
- Fixed handling of postponed requests and cleaned up state updates in the process. tempfail and retrylimit don't exist anymore as dedicated states.
- Update shortcut that saves time: if the new channel of an UpdateActivity results in the same system, just set the system channel and forget about the update.
- Explicitly exit after calling the reboot command.
- Reduced the number of channel URL resolve calls (which impact Hydra); UpdateActivity expects a resolved URL now.

PL-129777
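The selection rules described in the commit message (run continuously scheduled requests in one go, postpone overdue ones, report at least 15 minutes) could be sketched roughly as follows. This is an illustration, not the fc-agent implementation: the `Request` class, `split_requests` and `reported_duration` are hypothetical names, and "continuously scheduled" is assumed to mean that a request starts no later than the estimated end of the requests already selected.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta

OVERDUE_AFTER = timedelta(minutes=30)  # threshold named in the commit message
MIN_REPORTED = timedelta(minutes=15)   # minimum duration reported to the directory


@dataclass
class Request:
    """Hypothetical stand-in for a maintenance request."""
    name: str
    start: datetime
    duration: timedelta  # estimated execution time


def split_requests(requests, now):
    """Return (runnable, overdue) for the given point in time.

    A request is overdue if it is more than 30 minutes past its start.
    A request is runnable if it is due, or if it follows directly on
    the requests already selected (assumed meaning of "continuously
    scheduled").
    """
    runnable, overdue = [], []
    chain_end = None
    for req in sorted(requests, key=lambda r: r.start):
        if now - req.start > OVERDUE_AFTER:
            overdue.append(req)
            continue
        due = req.start <= now
        chained = chain_end is not None and req.start <= chain_end
        if due or chained:
            runnable.append(req)
            end = req.start + req.duration
            chain_end = end if chain_end is None else max(chain_end, end)
    return runnable, overdue


def reported_duration(preparation, execution):
    """Sum of preparation and estimated execution time, at least 15 minutes."""
    return max(preparation + execution, MIN_REPORTED)
```

With this sketch, a request that is due pulls in the request scheduled right after it, while a request an hour in the future stays scheduled and a request an hour in the past ends up in the overdue list.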
osnyx
approved these changes
Jul 19, 2023
osnyx
approved these changes
Jul 19, 2023
@flyingcircusio/release-managers
Release process
Impact:
Changelog:
Reboots for kernel updates now happen directly after system updates, avoiding scheduling another maintenance for the reboot. This also fixes the long-standing bug that delayed activities could be executed outside of maintenance windows. Activities that are overdue (more than 30min after planned time) are postponed for at least 8 hours and scheduled again (PL-129777).
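As a rough illustration of the "reboot directly after the updates" behaviour, the following sketch runs all due activities first and triggers a single reboot at the end if any of them asked for one. The `Activity` class, its `wants_reboot` flag and `run_due_activities` are hypothetical names chosen for this example and are not taken from the fc-agent code.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Activity:
    """Hypothetical stand-in for a maintenance activity."""
    name: str
    wants_reboot: bool = False

    def run(self, log: List[str]) -> None:
        # A real activity would update the system, change VM settings, etc.
        log.append(f"run {self.name}")


def run_due_activities(
    activities: List[Activity],
    reboot: Callable[[], None],
    log: List[str],
) -> bool:
    """Run every activity, then reboot at most once if any requested it."""
    reboot_needed = False
    for activity in activities:
        activity.run(log)
        reboot_needed = reboot_needed or activity.wants_reboot
    if reboot_needed:
        reboot()
    return reboot_needed
```

Collecting the reboot requests and acting on them once means that two activities which both need a reboot cause only one, and no separate maintenance has to be scheduled for it.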
Security implications