-
Notifications
You must be signed in to change notification settings - Fork 785
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Found two or more with same ip #7897
Comments
Found some logs from kuberhealthy It seems to be related to the issue. |
did the last boot job succeed (see the end of the log of the most recent |
which install instructions are you following? how did you install kuberhealthy? |
HI James, I installed the server including kuberhealthy by steps of https://jenkins-x.io/v3/admin/setup/operator/ It seems my installation boot job succeeds with some errors: |
could you do a dummy commit in your git repository (e.g. modify the README.md) and do |
which git repository template did you start from? the on premise one right? https://github.com/jx3-gitops-repositories/jx3-kubernetes whats the output of:
I don't really understand why kuberhealthy is not working in your cluster. The daemonset / deployment checks (the first 2 health checks) are pure vanilla kuberhealthy + k8s checks and have absolutely nothing to do with jenkins x at all - they verify k8s stuff |
also try
it could be your cluster is out of capacity? |
Yes I'm using the on premise one. https://github.com/bvboca/jx3-kubernetes |
It seems the resources are fine now. Resource Requests Limits cpu 5270m (65%) 17320m (216%) |
James, I checked the failed pods' logs. Most of them are about the kuberhealthy status reporting : Here's the log from jx-bot-token: Log from jx-webhooks: Log from jx-webhook-events: Log from jx-pod-status: |
there is a kuberhealthy service running in the kuberhealthy namespace right? can you try curl http://kuberhealthy.kuberhealthy.svc.cluster.local from inside a pod in the cluster? I wonder if there's an issue with service + DNS in your cluster? |
e.g. run kubectl exec -it jx-build-controller-XXXX bash then run curl -v http://kuberhealthy.kuberhealthy.svc.cluster.local |
you should get a 200 with json output |
$ curl http://kuberhealthy.kuberhealthy.svc.cluster.local |
James, I think the pod DNS should be fine. The above comment shows the response from kuberhealthy service. And I have other apps depending on DNS and running well. Is it related to kuberhealthy/kuberhealthy#858? |
We are now disabling kuberhealthy. It seems to cause more headaches that it solves. |
logs from deployment pod in kuberhealthy
time="2021-07-24T15:01:05Z" level=info msg="Successfully hit service endpoint."
time="2021-07-24T15:01:05Z" level=info msg="Rolling update option is enabled. Performing roll."
time="2021-07-24T15:01:05Z" level=info msg="Creating deployment resource with 4 replica(s) in kuberhealthy namespace using image [nginxinc/nginx-unprivileged:1.17.9] with environment variables: map[]"
time="2021-07-24T15:01:05Z" level=info msg="Creating container using image [nginxinc/nginx-unprivileged:1.17.9] with environment variables: map[]"
time="2021-07-24T15:01:05Z" level=info msg="Created rolling-update deployment resource."
time="2021-07-24T15:01:05Z" level=info msg="Performing rolling-update on deployment deployment-deployment to [nginxinc/nginx-unprivileged:1.17.9]"
time="2021-07-24T15:01:26Z" level=info msg="Rolled deployment in kuberhealthy namespace: deployment-deployment"
time="2021-07-24T15:01:26Z" level=info msg="Looking for a response from the endpoint."
time="2021-07-24T15:01:26Z" level=info msg="Beginning backoff loop for HTTP GET request."
time="2021-07-24T15:01:26Z" level=info msg="Successfully made an HTTP request on attempt: 1"
time="2021-07-24T15:01:26Z" level=info msg="Got a 200 with a GET to http://10.108.6.162"
time="2021-07-24T15:01:26Z" level=info msg="Got a result from GET request backoff: 200 OK"
time="2021-07-24T15:01:26Z" level=info msg="Successfully hit service endpoint after rolling-update."
time="2021-07-24T15:01:26Z" level=info msg="Cleaning up deployment and service."
time="2021-07-24T15:01:26Z" level=info msg="Attempting to delete service deployment-svc in kuberhealthy namespace."
time="2021-07-24T15:01:31Z" level=info msg="Attempting to delete deployment in kuberhealthy namespace."
time="2021-07-24T15:01:36Z" level=info msg="Attempting to delete deployment in kuberhealthy namespace."
time="2021-07-24T15:01:41Z" level=info msg="Finished clean up process."
time="2021-07-24T15:01:41Z" level=info msg="Reporting success to Kuberhealthy."
time="2021-07-24T15:02:42Z" level=fatal msg="error reporting to kuberhealthy: bad status code from kuberhealthy status reporting url: [400] 400 Bad Request
The text was updated successfully, but these errors were encountered: