CI: K8sServicesTest Checks service across nodes Tests NodePort BPF Tests with direct routing Tests with secondary NodePort device: Exitcode: 42 #12690

Closed
aanm opened this issue Jul 28, 2020 · 4 comments
Labels: area/CI (Continuous Integration testing issue or flake), ci/flake (This is a known failure that occurs in the tree. Please investigate me!), sig/datapath (Impacts bpf/ or low-level forwarding details, including map management and monitor messages.)

Comments

@aanm
Member

aanm commented Jul 28, 2020

/home/jenkins/workspace/Cilium-PR-K8s-1.12-net-next/src/github.com/cilium/cilium/test/ginkgo-ext/scopes.go:514
Request from testclient-l9jhj pod to service tftp://10.102.0.52:10069/hello failed
Expected command: kubectl exec -n default testclient-l9jhj -- /bin/bash -c 'fails=""; id=$RANDOM; for i in $(seq 1 10); do if curl --path-as-is -s -D /dev/stderr --fail --connect-timeout 5 --max-time 20 tftp://10.102.0.52:10069/hello -H "User-Agent: cilium-test-$id/$i"; then echo "Test round $id/$i exit code: $?"; else fails=$fails:$id/$i=$?; fi; done; if [ -n "$fails" ]; then echo "failed: $fails"; fi; cnt="${fails//[^:]}"; if [ ${#cnt} -gt 0 ]; then exit 42; fi' 
To succeed, but it failed:
Exitcode: 42 
Err: exit status 42
Stdout:
 	 
	 Hostname: testds-9gdwc
	 
	 Request Information:
	 	client_address=10.0.1.205
	 	client_port=49833
	 	real path=/hello
	 	request_scheme=tftp
	 
	 Test round 26023/1 exit code: 0
	 
	 Hostname: testds-4cvff
	 
	 Request Information:
	 	client_address=10.0.1.205
	 	client_port=38054
	 	real path=/hello
	 	request_scheme=tftp
	 
	 Test round 26023/2 exit code: 0
	 
	 Hostname: testds-9gdwc
	 
	 Request Information:
	 	client_address=10.0.1.205
	 	client_port=50368
	 	real path=/hello
	 	request_scheme=tftp
	 
	 Test round 26023/3 exit code: 0
	 
	 Hostname: testds-9gdwc
	 
	 Request Information:
	 	client_address=10.0.1.205
	 	client_port=60718
	 	real path=/hello
	 	request_scheme=tftp
	 
	 Test round 26023/4 exit code: 0
	 
	 Hostname: testds-9gdwc
	 
	 Request Information:
	 	client_address=10.0.1.205
	 	client_port=52418
	 	real path=/hello
	 	request_scheme=tftp
	 
	 Test round 26023/5 exit code: 0
	 
	 Hostname: testds-4cvff
	 
	 Request Information:
	 	client_address=10.0.1.205
	 	client_port=38251
	 	real path=/hello
	 	request_scheme=tftp
	 
	 Test round 26023/6 exit code: 0
	 
	 Hostname: testds-4cvff
	 
	 Request Information:
	 	client_address=10.0.1.205
	 	client_port=34753
	 	real path=/hello
	 	request_scheme=tftp
	 
	 Test round 26023/8 exit code: 0
	 
	 Hostname: testds-9gdwc
	 
	 Request Information:
	 	client_address=10.0.1.205
	 	client_port=58050
	 	real path=/hello
	 	request_scheme=tftp
	 
	 Test round 26023/9 exit code: 0
	 
	 Hostname: testds-4cvff
	 
	 Request Information:
	 	client_address=10.0.1.205
	 	client_port=55828
	 	real path=/hello
	 	request_scheme=tftp
	 
	 Test round 26023/10 exit code: 0
	 failed: :26023/7=72
	 
Stderr:
 	 command terminated with exit code 42
	 

/home/jenkins/workspace/Cilium-PR-K8s-1.12-net-next/src/github.com/cilium/cilium/test/k8sT/Services.go:756
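
For triage, the stdout above can be summarized mechanically. The following is a minimal sketch, not part of the Cilium test suite: the names `summarize`, `SAMPLE_STDOUT`, and `CURL_EXIT_CODES` are hypothetical, and the sample is a shortened excerpt of the log above. It relies only on the two line formats the test command emits (`Test round <id>/<i> exit code: <n>` and `failed: :<id>/<i>=<code>`). According to the curl man page, exit code 72 means "Unknown TFTP transfer ID".

```python
import re

# Small subset of curl exit codes, taken from the curl man page.
# 72 is the code reported for round 7 in the log above.
CURL_EXIT_CODES = {
    7: "Failed to connect to host",
    28: "Operation timeout",
    72: "Unknown TFTP transfer ID",
}

# Line formats emitted by the test command's shell loop.
ROUND_OK = re.compile(r"Test round (\d+)/(\d+) exit code: (\d+)")
FAILED_LINE = re.compile(r"^\s*failed: (\S+)\s*$", re.MULTILINE)
FAILED_ENTRY = re.compile(r"(\d+)/(\d+)=(\d+)")


def summarize(stdout):
    """Return (sorted successful round numbers, {failed round: curl exit code})."""
    ok = sorted(int(m.group(2)) for m in ROUND_OK.finditer(stdout))
    failures = {}
    m = FAILED_LINE.search(stdout)
    if m:
        # The loop appends ":<id>/<round>=<code>" once per failed round.
        for _run_id, round_no, code in FAILED_ENTRY.findall(m.group(1)):
            failures[int(round_no)] = int(code)
    return ok, failures


# Shortened excerpt of the stdout above (assumption: only the summary lines matter).
SAMPLE_STDOUT = (
    "\tTest round 26023/1 exit code: 0\n"
    "\tTest round 26023/2 exit code: 0\n"
    "\tTest round 26023/3 exit code: 0\n"
    "\tfailed: :26023/7=72\n"
)

if __name__ == "__main__":
    ok, failures = summarize(SAMPLE_STDOUT)
    print("succeeded rounds:", ok)
    for round_no, code in failures.items():
        meaning = CURL_EXIT_CODES.get(code, "see the curl man page")
        print(f"round {round_no}: curl exit {code} ({meaning})")
```

Applied to the full stdout, this would report nine successful rounds and a single failure in round 7, which matches the gap between rounds 6 and 8 in the log.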

https://jenkins.cilium.io/job/Cilium-PR-K8s-1.12-net-next/30/testReport/junit/Suite-k8s-1/12/K8sServicesTest_Checks_service_across_nodes_Tests_NodePort_BPF_Tests_with_direct_routing_Tests_with_secondary_NodePort_device/

fd3841ce_K8sServicesTest_Checks_service_across_nodes_Tests_NodePort_BPF_Tests_with_direct_routing_Tests_with_secondary_NodePort_device.zip

aanm added the area/CI and ci/flake labels Jul 28, 2020
brb assigned brb and borkmann Jul 28, 2020
borkmann added this to TODO (untriaged & unsorted) in 1.9 kube-proxy removal & general dp optimization Jul 28, 2020
borkmann moved this from TODO (untriaged & unsorted) to Priority unclear in 1.9 kube-proxy removal & general dp optimization Sep 4, 2020
joestringer changed the title from "CI: K8sServicesTest Checks service across nodes Tests NodePort BPF Tests with direct routing Tests with secondary NodePort device" to "CI: K8sServicesTest Checks service across nodes Tests NodePort BPF Tests with direct routing Tests with secondary NodePort device: Exitcode: 42" Oct 23, 2020
@pchaigno
Member

pchaigno commented Apr 6, 2021

I assume variants of this flake with HTTP and VXLAN (e.g., https://jenkins.cilium.io/job/Cilium-PR-K8s-1.20-kernel-4.19/80/testReport/junit/Suite-k8s-1/20/K8sServicesTest_Checks_service_across_nodes_Tests_NodePort_BPF_Tests_with_secondary_NodePort_device/) likely have the same root cause. Or do I need to open a separate issue?

@pchaigno
Member

pchaigno commented Jan 7, 2022

@twpayne Should we disable this test?

@twpayne
Contributor

twpayne commented Jan 7, 2022

> @twpayne Should we disable this test?

Yes, given that it's been flaky for 18 months, it's right to disable it.

brb removed this from Priority unclear in 1.9 kube-proxy removal & general dp optimization Feb 17, 2022
brb added the sig/datapath label Feb 17, 2022
@brb
Member

brb commented Feb 17, 2022

Considering the recent K8sServices suite refactoring, the flake might have been fixed. Let's reopen if we hit it on the master branch again.

brb closed this as completed Feb 17, 2022
1.11 CI automation moved this from Investigating to Evaluate to exit quarantine Feb 17, 2022