cli: `cockroach quit` should wait until the server has effectively stopped #6585

knz · 2016-05-09T16:30:42Z

Scripts like e.g. the Jepsen tests will start and then restart a server. Now if the following commands are issued quickly in sequence:

cockroach start
cockroach quit
cockroach start

then the 3rd sometimes fails with an error "the server is already started". This is because "quit" merely issues a termination request and terminates (and gives control back to the shell) before the db server has effectively terminated.

Script writers need a way to synchronize on the termination of the db server. The easiest way forward would be to have "quit" wait until the shutdown request has been processed and the connection to the server is lost.

Reported by @dationl.

The text was updated successfully, but these errors were encountered:

tamird · 2016-07-02T02:12:25Z

I believe this was fixed by #7483.

tbg · 2016-07-02T02:18:35Z

No, not fixed yet. Technically quit only waits until the request is processed, but the process might still take a little bit of time to actually shut down (it would be safer to poll the port and wait for it to close, but that isn't really that idiomatic as well). I wonder how other software does this.

tamird · 2016-07-02T02:28:23Z

Could make quit a streaming RPC and wait for the stream to close?

tbg · 2016-07-02T02:51:56Z

But the stream would always close slightly before the process (pid)
disappears, no?

On Fri, Jul 1, 2016 at 10:28 PM Tamir Duberstein notifications@github.com
wrote:

Could make quit a streaming RPC and wait for the stream to close?

—
You are receiving this because you modified the open/close state.

Reply to this email directly, view it on GitHub
#6585 (comment),
or mute the thread
https://github.com/notifications/unsubscribe/AE135OYXOJZ6RmJVXA14ADv4qzvgQ10bks5qRczSgaJpZM4IaThe
.

-- Tobias

tamird · 2016-07-02T03:23:42Z

Yeah, but that's the best we're ever going to do, right?

knz · 2016-07-02T11:25:19Z

No that's not the best. You can also use the syscall kill(2) with SIGCONT or some other innocuous signal until your kernel tells you the process does not exist any more. That's the best way.

tamird · 2016-07-02T13:01:55Z

Doesn't that assume that the cli process is on the same machine as the server?

knz · 2016-07-02T13:52:37Z

Well ok if it isn't then what you said is the best. But it local control not the common case?

bdarnell · 2016-07-02T14:48:24Z

I wonder how other software does this.

Most software doesn't have an equivalent command (and I consider our quit command to generally be a bad idea). Servers are run under a process manager, and you shut them down by asking that process manager to stop them, not by asking the server to stop itself. (or running locally without a process manager, you just kill the process and poll for its PID to disappear)

Without a process manager, it's difficult to manage the full lifecycle precisely. You can either start with cockroach start & and use the shell's wait command at shutdown, (which means that the server may not be ready when cockroach start & returns) or you start with cockroach start --background and stop with cockroach quit, which gives you a precise startup notification but nothing at shutdown.

If Go supported fork, we could fork off a child process and respond to the quit command when the parent process was gone. Without fork, all our options are hacks on top of hacks and it's probably better to just encourage the use of a proper process manager.

knz · 2016-07-02T15:12:57Z

As I explain in #7603 (comment) you can also count on the server's OS to signal process termination by shutting down all open sockets. That is detectable over the network.

knz added C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. E-easy Easy issue to tackle, requires little or no CockroachDB experience labels May 9, 2016

tamird closed this as completed Jul 2, 2016

tbg reopened this Jul 2, 2016

tamird mentioned this issue Jul 2, 2016

cli: quit command waits for server shutdown #7603

Merged

tamird closed this as completed in #7603 Jul 5, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cli: `cockroach quit` should wait until the server has effectively stopped #6585

cli: `cockroach quit` should wait until the server has effectively stopped #6585

knz commented May 9, 2016

tamird commented Jul 2, 2016

tbg commented Jul 2, 2016

tamird commented Jul 2, 2016

tbg commented Jul 2, 2016

tamird commented Jul 2, 2016

knz commented Jul 2, 2016

tamird commented Jul 2, 2016

knz commented Jul 2, 2016

bdarnell commented Jul 2, 2016

knz commented Jul 2, 2016

cli: cockroach quit should wait until the server has effectively stopped #6585

cli: cockroach quit should wait until the server has effectively stopped #6585

Comments

knz commented May 9, 2016

tamird commented Jul 2, 2016

tbg commented Jul 2, 2016

tamird commented Jul 2, 2016

tbg commented Jul 2, 2016

tamird commented Jul 2, 2016

knz commented Jul 2, 2016

tamird commented Jul 2, 2016

knz commented Jul 2, 2016

bdarnell commented Jul 2, 2016

knz commented Jul 2, 2016

cli: `cockroach quit` should wait until the server has effectively stopped #6585

cli: `cockroach quit` should wait until the server has effectively stopped #6585