[vtadmin-api] vtctld proxy dialer should check that gRPC connection is ready #9422

doeg · 2021-12-17T20:50:01Z

Currently, vtadmin-api's vtctld proxy's Dial function will only reinitialize its vtctld connection if (a) it does not have a cached connection (i.e., it's the first time Dial has been called), or (b) if the vtctld connection is explicitly closed.

As a result, if any of the vtctlds to which VTAdmin is connected go away (are deprovisioned, the vtctld service crashes, etc.) then any vtadmin-api endpoint that the vtctlds (keyspaces, workflows, schemas, etc.) will time out.

Ideally, Dial should also check that the gRPC connection is "ready for work", i.e., its connectivity state is ready/idle. If it is in a failure state, then the Dial function should close the connection and rediscover a new vtctld.

This is fairly easy to reproduce locally:

Bring up two vtctlds (e.g., using vtctld-up.sh)
Update VTAdmin's discovery.json file to include both:

{
    "vtctlds": [
        {
            "host": {
                "fqdn": "localhost:15000",
                "hostname": "localhost:15999"
            }
        },
        {
            "host": {
                "fqdn": "localhost:19000",
                "hostname": "localhost:19999"
            }
        }
    ],
    "vtgates": [
        {
            "host": {
                "hostname": "localhost:15991"
            }
        }
    ]
}

Bring up vtadmin-api with ./scripts/vtadmin-up.sh
Make a request against http://localhost:14200/api/keyspaces, which will call Dial and discover one of the two vtctlds. Additional logging to show:

I1217 14:04:51.390344   53482 config.go:122] [rbac]: loaded authorizer with 1 rules
I1217 14:04:51.390402   53482 config.go:146] [rbac]: no authenticator implementation specified
I1217 14:04:51.396443   53482 server.go:240] server vtadmin listening on :14200
I1217 14:04:56.160507   53482 proxy.go:140] Discovering vtctld to dial...
I1217 14:04:56.160575   53482 proxy.go:147] Discovered vtctld localhost:19999
; dialing...
I1217 14:04:56.161394   53482 proxy.go:173] Established connection to vtctld localhost:19999

kill -9 whichever vtctld it established a connection to
Make another request against http://localhost:14200/api/keyspaces to redial

At this point, ideal behaviour is that vtadmin-api will detect that vtctld is no longer available, close the gRPC connection, and then rediscover the other vtctld.

What currently happens is that the gRPC connection just retries forever:

W1217 14:06:34.506722   53482 component.go:41] [core] grpc: addrConn.createTransport failed to connect to {localhost:19999 localhost:19999 <nil> 0 <nil>}. Err: connection error: desc = "transport: Error while dialing dial tcp [::1]:19999: connect: connection refused". Reconnecting...
W1217 14:06:39.317060   53482 component.go:41] [core] grpc: addrConn.createTransport failed to connect to {localhost:19999 localhost:19999 <nil> 0 <nil>}. Err: connection error: desc = "transport: Error while dialing dial tcp [::1]:19999: connect: connection refused". Reconnecting...
W1217 14:06:45.755668   53482 component.go:41] [core] grpc: addrConn.createTransport failed to connect to {localhost:19999 localhost:19999 <nil> 0 <nil>}. Err: connection error: desc = "transport: Error while dialing dial tcp [::1]:19999: connect: connection refused". Reconnecting...
I1217 14:06:50.956459   53482 client.go:86] WaitForReady ClientConn status: TRANSIENT_FAILURE

The text was updated successfully, but these errors were encountered:

doeg added the Type: Bug label Dec 17, 2021

doeg self-assigned this Dec 17, 2021

doeg added the Component: VTAdmin VTadmin interface label Dec 17, 2021

vitessio deleted a comment from Manasi25 Jan 28, 2022

doeg mentioned this issue Mar 18, 2022

[vtadmin] Update vtctld dialer to validate connectivity #9915

Merged

3 tasks

doeg closed this as completed in #9915 Mar 22, 2022

ajm188 mentioned this issue Mar 30, 2022

[vtadmin] custom discovery resolver #9977

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[vtadmin-api] vtctld proxy dialer should check that gRPC connection is ready #9422

[vtadmin-api] vtctld proxy dialer should check that gRPC connection is ready #9422

doeg commented Dec 17, 2021 •

edited

Loading

[vtadmin-api] vtctld proxy dialer should check that gRPC connection is ready #9422

[vtadmin-api] vtctld proxy dialer should check that gRPC connection is ready #9422

Comments

doeg commented Dec 17, 2021 • edited Loading

doeg commented Dec 17, 2021 •

edited

Loading