e2e flake: Proxy version v1 [It] should proxy through a service and a pod #21484

Closed
gmarek opened this issue Feb 18, 2016 · 44 comments
Assignees
Labels
kind/flake Categorizes issue or PR as related to a flaky test. priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now.

Comments

@gmarek
Contributor

gmarek commented Feb 18, 2016

https://storage.googleapis.com/kubernetes-jenkins/pr-logs/pull/21251/kubernetes-pull-build-test-e2e-gce/29155/build-log.txt

Failure [132.205 seconds]
Proxy
/go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/proxy.go:40
  version v1
  /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/proxy.go:39
    should proxy through a service and a pod [Conformance] [It]
    /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/proxy.go:224

    0: path /api/v1/namespaces/e2e-tests-proxy-ehd91/pods/proxy-service-yckdo-cd23f/proxy/ gave error: an error on the server has prevented the request from succeeding

    /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/proxy.go:222

cc @kubernetes/goog-cluster

@gmarek gmarek added priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. team/cluster kind/flake Categorizes issue or PR as related to a flaky test. labels Feb 18, 2016
@lavalamp
Member

#17493 merged recently, but it looks like after this flake happened, so it shouldn't be the cause.

@lavalamp
Member

Huh, somehow we returned a 0 status code.

Feb 18 07:48:29.875: INFO: /api/v1/namespaces/e2e-tests-proxy-ehd91/pods/proxy-service-yckdo-cd23f/proxy/: no body (0; 3.900034ms)

@wojtek-t
Member

@lavalamp lavalamp self-assigned this Feb 22, 2016
@bprashanth
Contributor

Can we dupe this with #21484, or change the title to reflect the 0 status code issue?

@lavalamp
Member

@bprashanth this is #21484, did you mean something else?

@bprashanth
Contributor

Bah, I meant #18963.

@lavalamp
Member

OK, so I misread. The '0' is the index of the attempt, not the status code.

@lavalamp
Member

I think the pod probably just needs a readiness check.

@lavalamp
Member

I added one and tried it. Indeed, the pod spent a fair amount of time in a running-but-not-ready state. PR sent.

@caesarxuchao
Member

Occurred again:

• Failure [154.406 seconds]
Proxy
/go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/proxy.go:40
  version v1
  /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/proxy.go:39
    should proxy through a service and a pod [Conformance] [It]
    /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/proxy.go:244

    0: path /api/v1/proxy/namespaces/e2e-tests-proxy-2gz9u/pods/proxy-service-j63nq-ur6c1:80/ gave error: an error on the server has prevented the request from succeeding

    /go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/proxy.go:242
------------------------------

https://console.cloud.google.com/storage/browser/kubernetes-jenkins/pr-logs/pull/21175/kubernetes-pull-build-test-e2e-gce/30727/?debugUI=CLOUD

@lavalamp
Member

lavalamp commented Mar 1, 2016

Huh, it worked through the service but not through the pod directly.

@janetkuo
Member

janetkuo commented Mar 1, 2016

https://pantheon.corp.google.com/storage/browser/kubernetes-jenkins/pr-logs/pull/22202/kubernetes-pull-build-test-e2e-gce/31071/artifacts/?debugUI=CLOUD

/go/src/k8s.io/kubernetes/_output/dockerized/go/src/k8s.io/kubernetes/test/e2e/proxy.go:244
0: path /api/v1/namespaces/e2e-tests-proxy-31hny/pods/http:proxy-service-b9v0f-v5kdh:80/proxy/ gave error: an error on the server has prevented the request from succeeding

@a-robinson
Contributor

@spxtr
Contributor

spxtr commented May 9, 2016

This test is exceptionally flaky on GKE (~6% flake rate). Here is a recent flake.

@roberthbailey
Contributor

/cc @cjcullen

@gmarek
Contributor Author

gmarek commented May 9, 2016

This has been a P0 since Mar 4th, so for over 2 months now. Can we really, really, really prioritize this? @goltermann

@bprashanth
Contributor

I think the first problem is wrong triage. Who owns apiserver proxy? I debugged it simply because it was labelled team/cluster and figured it isn't something that worried me. In fact it seemed like a test fix (#21484 (comment))

@gmarek
Contributor Author

gmarek commented May 16, 2016

@thockin - can you please reassign if the cluster team is the wrong one? This is noticeably flaky.

@gmarek
Contributor Author

gmarek commented May 19, 2016

@lavalamp
Member

@gmarek
Contributor Author

gmarek commented May 20, 2016

@lavalamp - should this be assigned to the API machinery team?

@lavalamp
Member

@gmarek yes, probably :(

@lavalamp lavalamp assigned caesarxuchao and unassigned thockin May 20, 2016
This was referenced May 21, 2016
@sttts
Contributor

sttts commented May 24, 2016

Some analysis:

@caesarxuchao
Member

@sttts do you want to take this one? I haven't started working on it (I was OOO until this week) and have no prior knowledge.

@krousey krousey assigned krousey and unassigned caesarxuchao May 24, 2016
@sttts
Contributor

sttts commented May 25, 2016

@caesarxuchao @krousey I can continue to investigate that. Feel free to assign it to me.

@dims
Member

dims commented May 25, 2016

Is this a dup of the bot-proposed one? #26210

@krousey
Contributor

krousey commented May 25, 2016

Closing in favor of the automated report #26210

@krousey krousey closed this as completed May 25, 2016
k8s-github-robot pushed a commit that referenced this issue May 26, 2016
Automatic merge from submit-queue

Flake 21484: retrieve pod log during e2e error

Print the pod log when an error occurs in

> Proxy version 1 should proxy through a service and a pod [Conformance]

e2e test. This will help to understand flake #21484 better.