Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

503 intermitent no healthy upstream error in istio #50026

Open
ritesh58 opened this issue Mar 21, 2024 · 0 comments
Open

503 intermitent no healthy upstream error in istio #50026

ritesh58 opened this issue Mar 21, 2024 · 0 comments

Comments

@ritesh58
Copy link

ritesh58 commented Mar 21, 2024

We have an issue which is giving 503 intermittent “no healthy upstreams” from istio gateway which is causing issues . and need some assistant Please see the case study:

1 . No healthy upstream errors is showing on istio gateway logs only. When we check the application pod log itself. No errors are showing. Only the healthy 200 ones are showing in application pod

The errors starts showing when istio ingress gateway pod replica is increased

We checked the existing Istio installation , seems good and healthy. also upgraded istio to new version

following are the more details

istioctl analyze --all-namespaces not showing an obvious errors

its clearly showing the envoy [istio-proxy] pod being disconnected from the Mesh

Kube-proxy logs showing a healthy iptables sync

We enabled the debug logs for istio-proxy to understand more about the errors using istioctl proxy-config log -n --level=debug .

We noticed a high error rate for xds server connecting to isitod pilot service

2024-03-19T15:59:35.670788Z error xdsproxy upstream [6] error: rpc error: code = ResourceExhausted desc = request rate limit exceeded: rate: Wait(n=1) would exceed context deadline
2024-03-19T15:59:35.670816Z warn xdsproxy upstream [6] terminated with unexpected error rpc error: code = ResourceExhausted desc = request rate limit exceeded: rate: Wait(n=1) would exceed context deadline
2024-03-19T15:59:35.671186Z warning envoy config external/envoy/source/extensions/config_subscription/grpc/grpc_stream.h:177 StreamAggregatedResources gRPC config stream to xds-grpc closed: 8, request rate limit exceeded: rate: Wait(n=1) would exceed context deadline (previously 14, closing transport due to: connection error: desc = "error reading from server: EOF", received prior goaway: code: NO_ERROR, debug data: "graceful_stop" since 0s ago) thread=14
2024-03-19T15:59:56.177601Z warning envoy config external/envoy/source/extensions/config_subscription/grpc/grpc_stream.h:177 StreamAggregatedResources gRPC config stream to xds-grpc closed: 14, connection error: desc = "transport: Error while dialing: dial tcp 10.15.1.156:15012: i/o timeout" (previously 8, request rate limit exceeded: rate: Wait(n=1) would exceed context deadline since 20s ago) thread=14
2024-03-19T16:21:54.333274Z error xdsproxy upstream [12] error: rpc error: code = ResourceExhausted desc = request rate limit exceeded: rate: Wait(n=1) would exceed context deadline
2024-03-19T16:21:54.333299Z warn xdsproxy upstream [12] terminated with unexpected error rpc error: code = ResourceExhausted desc = request rate limit exceeded: rate: Wait(n=1) would exceed context deadline
2024-03-19T16:21:54.333713Z warning envoy config external/envoy/source/extensions/config_subscription/grpc/grpc_stream.h:177 StreamAggregatedResources gRPC config stream to xds-grpc closed: 8, request rate limit exceeded: rate: Wait(n=1) would exceed context deadline (previously 14, connection error: desc = "transport: Error while dialing: dial tcp 10.15.1.156:15012: connect: connection refused" since 0s ago) thread=14

image
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant