Support restarting containerd in tests, add restart test case #1188

kevpar · 2021-10-05T23:26:19Z

This PR adds support in the cri-containerd test suite to allow tests to restart containerd.
A test case is also added that validates containerd properly terminates running containers
when it is restarted.

Note this test case depends on functionality in the kevpar/cri windows_port branch, and
that the terminate_containers_on_restart option is set on the plugins.cri section of the
containerd config.

See commits for more details.

Previously main.go didn't have _test suffix, so it was not considered a test file. Notably this meant that TestMain was never actually invoked because it must be in a test file. It seems we were fortunate that there was nothing in TestMain that wasn't done automatically by the Go test infrastructure. As the cri-containerd directory is all test code, it is probably safe to rename all other files in the directory to be test code as well. However, that is left for a future change. Signed-off-by: Kevin Parsons <kevpar@microsoft.com>

This change lets the cri-containerd tests start/stop containerd as needed, rather than assuming it is always running. This is done through the addition of startContainerd/stopContainerd functions which can be called from tests. As all of the existing tests need containerd to be running, this currently is not used in any tests. Future tests can take advantage of this functionality. Tests assume that containerd is running when they start, and should not need to explicitly start containerd before calling into it. This means that if a test stops containerd, it needs to ensure containerd is started again. If containerd crashes during a test, then subsequent tests will fail, but that's the same as the current behavior. An unfortunate side effect of this change is that, due to a standing issue with Go's service support and containerd, the service can sometimes exit with ERROR_PROCESS_ABORTED when it is stopped. Combined with the fact that recovery actions are used for containerd, this can result in the service being restarted by the service control manager. To work around this, we need to first disable recovery actions for the service before running tests. This can be done with: sc failure containerd reset=0 actions= command= Signed-off-by: Kevin Parsons <kevpar@microsoft.com>

dcantah · 2021-10-07T00:39:48Z

Needs a rebase I believe. The pullRequiredLcowImages function was renamed to have Lcow be capitalized here 8debf44#diff-a8c710332541b245cc336ff7f50e609f30213b47a1f67abe39908fe212862322

Edit: Err not a rebase, this has the changes. Just need to update your uses of the old function name

Adds a test case that runs a pod+container, restarts containerd, then verifies that the pod+container were terminated. This validates the change made in the CRI fork [1] to terminate containers when containerd is restarted. [1]: kevpar/cri@f8e83e6 Signed-off-by: Kevin Parsons <kevpar@microsoft.com>

kevpar · 2021-10-07T01:01:44Z

Needs a rebase I believe. The pullRequiredLcowImages function was renamed to have Lcow be capitalized here 8debf44#diff-a8c710332541b245cc336ff7f50e609f30213b47a1f67abe39908fe212862322

Edit: Err not a rebase, this has the changes. Just need to update your uses of the old function name

Aha, I hadn't checked back on CI yet.

test/cri-containerd/containerdrestart_test.go

Signed-off-by: Kevin Parsons <kevpar@microsoft.com>

ambarve · 2021-10-17T21:53:47Z

test/cri-containerd/containerdrestart_test.go

+	defer stopContainer(t, client, ctx, containerID)
+
+	t.Log("Restart containerd")
+	stopContainerd(t)


Usually, cri-containerd.test.exe runs 4 tests in parallel (IIRC), This would break other tests that are running in parallel, right?

Does it? I thought running tests in parallel required you to sprinkle the ones that you'd like to make eligible with t.Parallel

I see. I thought just setting -test.parallel (which has default value of 4) to value greater than 1 does it.

My understanding is also that tests must explicitly opt into being run in parallel. I agree running in parallel would cause problems given the service is global state.

Related work items: microsoft#1067, microsoft#1097, microsoft#1119, microsoft#1170, microsoft#1176, microsoft#1180, microsoft#1181, microsoft#1182, microsoft#1183, microsoft#1184, microsoft#1185, microsoft#1186, microsoft#1187, microsoft#1188, microsoft#1189, microsoft#1191, microsoft#1193, microsoft#1194, microsoft#1195, microsoft#1196, microsoft#1197, microsoft#1200, microsoft#1201, microsoft#1202, microsoft#1203, microsoft#1204, microsoft#1205, microsoft#1206, microsoft#1207, microsoft#1209, microsoft#1210, microsoft#1211, microsoft#1218, microsoft#1219, microsoft#1220, microsoft#1223

kevpar requested a review from a team as a code owner October 5, 2021 23:26

kevpar added 2 commits October 5, 2021 16:27

kevpar force-pushed the restart-tests branch from 5f6bbff to 1ebba5b Compare October 5, 2021 23:30

kevpar changed the title ~~Add support for restarting containerd in tests, add restart test case~~ Support restarting containerd in tests, add restart test case Oct 5, 2021

dcantah approved these changes Oct 7, 2021

View reviewed changes

dcantah self-assigned this Oct 7, 2021

kevpar force-pushed the restart-tests branch from 1ebba5b to 2d35b70 Compare October 7, 2021 01:01

anmaxvl reviewed Oct 7, 2021

View reviewed changes

test/cri-containerd/containerdrestart_test.go Outdated Show resolved Hide resolved

kevpar added 2 commits October 15, 2021 14:07

Address PR feedback

7b098b0

Signed-off-by: Kevin Parsons <kevpar@microsoft.com>

Add TerminateOnRestart feature flag for new test

5ec59dc

Signed-off-by: Kevin Parsons <kevpar@microsoft.com>

ambarve reviewed Oct 17, 2021

View reviewed changes

ambarve mentioned this pull request Oct 18, 2021

Separate volume tests #1198

Closed

anmaxvl approved these changes Oct 19, 2021

View reviewed changes

kevpar merged commit b406abf into microsoft:master Oct 19, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support restarting containerd in tests, add restart test case #1188

Support restarting containerd in tests, add restart test case #1188

kevpar commented Oct 5, 2021

dcantah commented Oct 7, 2021 •

edited

Loading

kevpar commented Oct 7, 2021

ambarve Oct 17, 2021

dcantah Oct 18, 2021

ambarve Oct 18, 2021

kevpar Oct 19, 2021

Support restarting containerd in tests, add restart test case #1188

Support restarting containerd in tests, add restart test case #1188

Conversation

kevpar commented Oct 5, 2021

dcantah commented Oct 7, 2021 • edited Loading

kevpar commented Oct 7, 2021

ambarve Oct 17, 2021

Choose a reason for hiding this comment

dcantah Oct 18, 2021

Choose a reason for hiding this comment

ambarve Oct 18, 2021

Choose a reason for hiding this comment

kevpar Oct 19, 2021

Choose a reason for hiding this comment

dcantah commented Oct 7, 2021 •

edited

Loading