-
Notifications
You must be signed in to change notification settings - Fork 345
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
k8s readiness check failing #1975
Comments
Hi @pocesar! Thanks for opening an issue on the Cloud SQL Proxy. Couple quick questions for you to hopefully understand your scenario a bit better:
We are currently updating our Health Check sample and this work is being tracked here #1950 |
Hi @pocesar, If your application can't connect, and also the proxy readiness check fails, then there may be a network configuration problem preventing the proxy from establishing a connection to the Cloud SQL Instance. About readiness checks: We discourage the use of the readiness check in most applications. If you are running an application like a website or API service, you usually want the application container to provide a readiness check that includes connecting through the proxy to the database. (see Readiness Check Configuration). The example readiness check configuration is tuned for batch jobs. In this configuration, the readiness check will attempt to connect to the instance for a full 60 seconds, failing 6 times before marking the pod as "not ready". |
@hessjcg I see, so the recommended manifest should leave the |
Yes -- readiness is a tricky check to get right. @pocesar Can your app reach your database otherwise? |
Closing due to inactivity. |
Question
When running on k8s, I'm now seeing
[Health Check] Readiness failed: Dial error: handshake failed (connection name = "xxx:xxx:xxx"): context canceled
only and it never recovers.I'm using the config from here
https://github.com/GoogleCloudPlatform/cloud-sql-proxy/blob/main/examples/k8s-health-check/proxy_with_http_health_check.yaml
Also, the pod is showing as healthy, but it's not, since the application isn't being able to connect to it.
Nothing changed in the configuration and started failing this weekend on GCP
Code
No response
Additional Details
No response
The text was updated successfully, but these errors were encountered: