Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[4.8] cgroupv1 freezer: thaw to increase freeze chances #41

Merged
merged 1 commit into from Mar 2, 2021

Conversation

kolyshkin
Copy link
Collaborator

This backports opencontainers/runc#2791 to rhaos-4.8 branch (i.e. on top of runc 1.0.0-rc93). In case we won't be upgrading to rc94 (once out) in 4.8, this may be of use.

This should fix https://bugzilla.redhat.com/show_bug.cgi?id=1903228

Original description follows


It appears that briefly thawing the cgroup while freezing
greatly increases its chances to freeze successfully.

The test case I used is doing runc exec in a look parallel with runc
pause/resume in another loop, and the failure to freeze rate reduced
from 40 to 0 per minute (tested inside a VM using a busybox container
running sleep 1h, doing about 1500 pause/resumes and 650 execs per
minute), with max retries being 150 (of 1000).

This is still a game of chances, so failures are possible.

Signed-off-by: Kir Kolyshkin kolyshkin@gmail.com
(cherry picked from commit d1007b0)
Signed-off-by: Kir Kolyshkin kolyshkin@gmail.com

@haircommander
Copy link
Collaborator

huh, we have gh actions now, and they're also failing :)

@kolyshkin
Copy link
Collaborator Author

huh, we have gh actions now, and they're also failing :)

Ah, this is because of rc1 which got removed. Fixed by #42

It appears that briefly thawing the cgroup while freezing
greatly increases its chances to freeze successfully.

The test case I used is doing runc exec in a look parallel with runc
pause/resume in another loop, and the failure to freeze rate reduced
from 40 to 0 per minute (tested inside a VM using a busybox container
running sleep 1h, doing about 1500 pause/resumes and 650 execs per
minute), with max retries being 150 (of 1000).

This is still a game of chances, so failures are possible.

Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>
(cherry picked from commit d1007b0)
Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>
@kolyshkin
Copy link
Collaborator Author

rebased on top of merged #42 to make ci green

@haircommander
Copy link
Collaborator

LGTM, I'll merge with green CI

@haircommander haircommander merged commit 5e27a39 into projectatomic:rhaos-4.8 Mar 2, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants