server/{dex,market}: cleaner shutdown #787
Conversation
comms server shuts down after the subsystems that use it (all but assets)
Is this to allow the sub-systems to continue sending messages? Seems like we'd want to immediately end incoming messages before shutting down the lower-level systems.
server/dex/dex.go
Outdated
// Client comms RPC server.
server, err := comms.NewServer(cfg.CommsCfg)
if err != nil {
	abort()
	return nil, fmt.Errorf("NewServer failed: %v", err)
}
startSubSys("Comms Server", server)
I see how this would achieve the desired effect of shutting down the server last, but is it ok to start the server before the lower-level systems that it relies on? I realize of course that sub-system Run methods are run in goroutines with no guarantee of sequence anyway, but we still want to start the server last, no?
That's a good point about the comms order. The issue I noticed was that messages at shutdown are dropped if comms goes down first. But your point about start-up is taken.
The comms subsys is moved back to its original location, but I'll let you resolve this comment in case you have thoughts about the outgoing messages on shutdown.
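For context, a minimal runnable sketch of the pattern under discussion, with illustrative names rather than dcrdex's actual types: when every subsystem's Run is launched in a goroutine against one shared context, cancellation imposes no stop ordering at all, which is why comms can drop outgoing messages on the way down.

package main

import (
	"context"
	"fmt"
	"sync"
)

// subsystem pairs a name with a Run method that blocks until shutdown.
type subsystem struct {
	name string
	run  func(ctx context.Context)
}

func main() {
	ctx, cancel := context.WithCancel(context.Background())
	var wg sync.WaitGroup

	// Started in dependency order, comms last, so incoming messages only
	// arrive once the subsystems that handle them are running.
	stack := []subsystem{
		{"DB", func(ctx context.Context) { <-ctx.Done() }},
		{"Markets", func(ctx context.Context) { <-ctx.Done() }},
		{"Comms Server", func(ctx context.Context) { <-ctx.Done() }},
	}
	for _, s := range stack {
		wg.Add(1)
		go func(s subsystem) {
			defer wg.Done()
			s.run(ctx)
			fmt.Println(s.name, "stopped")
		}(s)
	}

	// Cancel (e.g. ctrl+c): all three unblock concurrently, in no
	// particular order, so comms may exit before the markets have
	// flushed their final outgoing messages.
	cancel()
	wg.Wait()
}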
server/market/market.go
Outdated
// Revoke any unmatched epoch orders (this path is reached when the context
// was canceled, not when a clean suspend stopped the market).
for oid, ord := range m.epochOrders {
	log.Infof("Dropping epoch order %v", oid)
	if co, ok := ord.(*order.CancelOrder); ok {
		if err := m.storage.FailCancelOrder(co); err != nil {
			log.Errorf("Failed to set orphaned epoch cancel order %v as failed: %v", oid, err)
		}
		continue
	}
	if err := m.storage.ExecuteOrder(ord); err != nil {
		log.Errorf("Failed to set orphaned epoch trade order %v as executed: %v", oid, err)
	}
}
The suspend command should be used to complete the epoch if that is desired, but ctrl+c should at least not ditch orders in epoch status even though they do not go through matching.
Just noticing the difference between the executed status here, and the revoked status when we do a book purge. Seems like similar situations.
Yeah, maybe revoked is better for the trades too. Will switch it. Really just avoiding eternal epoch status orders though.
OK, so we could use RevokeOrder{Uncounted} for the trade order, but that creates a cancel order to go with it. The general idea of order revocation is that it's a forced unbooking, a server-generated cancellation. So while executed may not fit exactly here, going from epoch->executed is more in line with how this is handled elsewhere, such as epoch orders that don't match becoming executed, not revoked. This is another (wacky) way epoch orders can become executed, just without the matcher.
Would an operator have any way to know by looking at the order row in the DB whether this order was executed via matching vs. dirty shutdown?
No preimage.
Hmm, need to double check that these won't count as a preimage miss, but it had better be pretty darn uncommon to do a dirty shutdown without suspending the markets first.
OK, so by making it executed we also escape having it counted as a preimage miss because the preimage miss query looks for revoked status orders.
Still thinking through other pitfalls with executed status orders that have no preimage stored.
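To make the operator-facing distinction concrete, here's a sketch of the kind of query being described, using database/sql under an assumed schema (the table and column names are illustrative, not dcrdex's actual tables): an executed order with no stored preimage never went through the matcher.

// Operator-side sketch; imports "context" and "database/sql". Table and
// column names are assumptions, not dcrdex's actual schema. Since the
// preimage-miss query looks for revoked orders, these rows also escape
// being counted as misses.
func dirtyShutdownOrders(ctx context.Context, db *sql.DB) ([]string, error) {
	rows, err := db.QueryContext(ctx,
		`SELECT oid FROM orders WHERE status = 'executed' AND preimage IS NULL`)
	if err != nil {
		return nil, err
	}
	defer rows.Close()
	var oids []string
	for rows.Next() {
		var oid string
		if err := rows.Scan(&oid); err != nil {
			return nil, err
		}
		oids = append(oids, oid)
	}
	return oids, rows.Err()
}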
server/dex
This reorders the subsystem stack so that DEX manager shutdown is properly ordered. Namely:
- (edited) comms server shuts down after the subsystems that use it (all but assets). In the end, comms didn't move, because we want it started after the subsystems that use it are up for the sake of incoming message handling, even though stopping it before those dependent subsystems stop can result in dropped outgoing messages (see the sketch below).
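A minimal sketch of that trade-off, with illustrative types rather than dcrdex's actual ones (imports "context"): if each subsystem carries its own cancel function, Stop can unwind the stack in reverse start order, but then starting comms last necessarily stops it first.

// Illustrative types, not dcrdex's actual ones.
type subsystem struct {
	name   string
	cancel context.CancelFunc
	done   chan struct{} // closed when the subsystem's Run returns
}

type DEX struct {
	subsystems []subsystem // in start order
}

// Stop unwinds the stack in reverse start order, waiting for each Run to
// return before cancelling the next. Because comms is started last (for
// the sake of incoming message handling), it is also stopped first, which
// is where outgoing messages can be dropped.
func (d *DEX) Stop() {
	for i := len(d.subsystems) - 1; i >= 0; i-- {
		sub := d.subsystems[i]
		sub.cancel()
		<-sub.done
	}
}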
server/market
When a market is stopped by context cancellation (not by a market suspend admin command), it does not wait for the current epoch to close, so any orders in that truncated epoch were just being dropped without changing their status from epoch to executed (how a no-match/fail is normally handled when epochs are processed by the matcher). This updates the primary defer function in (*Market).Run so that if there are orders left in the epochOrders map after the closed epoch pipeline is drained, their statuses are changed in the DB so that they aren't reported to clients as active orders for eternity.
Resolves #474, although it no longer hangs anyway.
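For illustration, a sketch of the two stop paths (all names assumed, not dcrdex's actual (*Market).Run; imports "context" and "log"): a suspend lets the current epoch close and drain normally, while context cancellation truncates it, so the deferred cleanup must deal with whatever is left in epochOrders.

type epochOrder struct{ id string }

// runMarket sketches the two stop paths. On a clean suspend the epoch
// pipeline drains before return, so the map is empty; on cancellation the
// epoch is truncated and leftovers must be handled.
func runMarket(ctx context.Context, suspend <-chan struct{}, incoming <-chan epochOrder) {
	epochOrders := make(map[string]epochOrder)
	defer func() {
		for oid := range epochOrders {
			// In the real code these are set to failed (cancels) or
			// executed (trades) in the DB rather than just logged.
			log.Printf("dropping truncated-epoch order %v", oid)
		}
	}()
	for {
		select {
		case <-suspend: // clean suspend: epoch closes first
			return
		case <-ctx.Done(): // e.g. ctrl+c: epoch truncated
			return
		case ord := <-incoming:
			epochOrders[ord.id] = ord
		}
	}
}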