Move restart policy to systemd #263

siegfriedweber · 2021-08-17T14:56:43Z

Description

Fixes #207
Tested by stackabletech/agent-integration-tests#61

Changed

Handling of service restarts moved from the Stackable agent to systemd.

Removed

Check removed if a service starts up correctly within 10 seconds. systemd manages restarts now and the Stackable agent cannot detect if a service is in a restart loop.

Review Checklist

Code contains useful comments
(Integration-)Test cases added (or not applicable)
Documentation added (or not applicable)
Changelog updated (or not applicable)

soenkeliebau

LGTM overall, I've left a few questions that I'd like to clarify, but most of those will probably simply be resolveable.

As far as I can tell this would not notice if a service enters into a permanent restart loop, right? It would update the restart count on the pod, but never set a failed condition or something similar?

src/provider/states/pod/starting.rs

src/provider/states/pod/running.rs

src/provider/systemdmanager/service.rs

razvan

I didn't look closely at the code but the corresponding integration tests are green.

siegfriedweber · 2021-08-24T10:02:52Z

As far as I can tell this would not notice if a service enters into a permanent restart loop, right? It would update the restart count on the pod, but never set a failed condition or something similar?

Right. If the restart policy of the pod is set to Always or OnFailure then this is the desired behavior, I think. The Stackable Agent also does not have a notion what a "restart loop" is. We regard a restart every 10 seconds as a restart loop but a restart every 10 minutes probably not.

soenkeliebau

LGTM

The running state is no longer responsible for deciding if a service should be restarted. This is done by systemd. Therefore the transition from the running state to the starting state was removed. The service_state function was added to the SystemdManager to check if a service is running or terminated. This function is used for the transition from the running to the terminated state and for patching the container state.

When a systemd service is started then the agent does not wait anymore ten seconds to check if it was successfully started because systemd manages restarts now and the agent cannot detect if the service is in a restart loop.

siegfriedweber requested a review from a team August 17, 2021 14:56

siegfriedweber self-assigned this Aug 17, 2021

siegfriedweber mentioned this pull request Aug 17, 2021

Test restart policy stackabletech/agent-integration-tests#61

Merged

1 task

siegfriedweber force-pushed the move_restart_policy_to_systemd branch from 4310fa5 to 2fc6ef8 Compare August 19, 2021 07:21

soenkeliebau reviewed Aug 19, 2021

View reviewed changes

src/provider/states/pod/starting.rs Show resolved Hide resolved

src/provider/states/pod/running.rs Show resolved Hide resolved

src/provider/states/pod/running.rs Show resolved Hide resolved

src/provider/systemdmanager/service.rs Show resolved Hide resolved

siegfriedweber force-pushed the move_restart_policy_to_systemd branch from 2fc6ef8 to e48e1ea Compare August 23, 2021 10:39

razvan previously approved these changes Aug 23, 2021

View reviewed changes

siegfriedweber dismissed razvan’s stale review via 6b74ec9 August 24, 2021 12:37

siegfriedweber force-pushed the move_restart_policy_to_systemd branch 3 times, most recently from a205512 to abeac58 Compare August 26, 2021 14:25

siegfriedweber requested a review from soenkeliebau August 26, 2021 14:51

soenkeliebau previously approved these changes Aug 30, 2021

View reviewed changes

siegfriedweber added 14 commits August 30, 2021 10:43

Set TimeoutStopSec only once

49c9a4e

Map restart policy to systemd service

bb03232

Service state "Created" added

71558d8

Waiting after systemd service startup removed

fe40635

When a systemd service is started then the agent does not wait anymore ten seconds to check if it was successfully started because systemd manages restarts now and the agent cannot detect if the service is in a restart loop.

Set RestartSec to 2 in unit files

b53d5f5

Start rate limiting of systemd units disabled

1df17da

Tests fixed

12676bc

Changelog updated

39516c9

Remove the waiting phase also from the docs

e5748b9

Set restart count in the container status

72abc6f

Add pull request to the changelog

68731bb

Changelog updated

2b3a628

Set correct systemd default target

3b67ff7

siegfriedweber dismissed soenkeliebau’s stale review via 3b67ff7 August 30, 2021 08:45

siegfriedweber force-pushed the move_restart_policy_to_systemd branch from abeac58 to 3b67ff7 Compare August 30, 2021 08:45

siegfriedweber requested a review from soenkeliebau August 30, 2021 11:02

lfrancke approved these changes Aug 30, 2021

View reviewed changes

siegfriedweber merged commit 98bf547 into main Aug 30, 2021

siegfriedweber deleted the move_restart_policy_to_systemd branch August 30, 2021 11:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move restart policy to systemd #263

Move restart policy to systemd #263

siegfriedweber commented Aug 17, 2021 •

edited by razvan

Loading

soenkeliebau left a comment

razvan left a comment

siegfriedweber commented Aug 24, 2021

soenkeliebau left a comment

Move restart policy to systemd #263

Move restart policy to systemd #263

Conversation

siegfriedweber commented Aug 17, 2021 • edited by razvan Loading

Description

Changed

Removed

Review Checklist

soenkeliebau left a comment

Choose a reason for hiding this comment

razvan left a comment

Choose a reason for hiding this comment

siegfriedweber commented Aug 24, 2021

soenkeliebau left a comment

Choose a reason for hiding this comment

siegfriedweber commented Aug 17, 2021 •

edited by razvan

Loading