-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DNS Resolution Fails Sporadically - Crashes Server #10093
Comments
Same issue. However, we have always happened that the gateway service is not normal and cannot connect to the internal readis cluster
|
Hi @NickTheSecurityDude @0xRook1e could yoy attach a tcpdump during the time the error happened (when Kong receives error/timeout, but normal nslookup is fine)? |
Related issue #9959 |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Is there an existing issue for this?
Kong version (
$ kong version
)Kong Proxy 3.1.0 Kong Ingress 2.8.0
Current Behavior
Kong keeps crashing with DNS resolution errors.
If I restart kong it will work for 15 min or so, then all requests will fail and the website goes down.
If I exec into the kong pod directly I’m able to run nslookup commands fine.
Does anyone know why this may be happening? EKS 1.24 Kong Proxy 3.1.0 Kong Ingress 2.8.0
[notice] 1132#0: *436443 [kong] handler.lua:181 [mysite-auth] :mys_lua: resty - validation api call encountered error [cosocket] DNS resolution failed: dns lookup pool exceeded retries (1): timeout. Tried: [“(short)auth.web.svc.mysite.com:(na) - cache-miss”,“auth.web.svc.mysite.com.kong.svc.mysite.com:33
The other error I’m seeing frequently is:
[warn] 1132#0: * [lua] batch_queue.lua:183: failed to process entries: nil, context: ngx.timer”
Expected Behavior
No DNS Resolution Errors.
Steps To Reproduce
It happens sporadically, if I delete the kong pod and let it recreate it, it will work for some time before showing those errors and taking the web site offline.
Anything else?
EKS 1.24
The text was updated successfully, but these errors were encountered: