server/auth: scoring for offline users must load history from DB #1083

chappjc · 2021-05-10T21:17:16Z

This resolves an elusive and long-standing issue where users would seemingly not have their score offset by match successes in their history. The cause is that offline users do not have entries in the AuthManager's matchOutcomes and preimageOutcomes maps, but registerMatchOutcome and registerPreimageOutcome, which are often called when a user is offline, handle the lack of map entries by creating fresh histories that just included the outcome being registered at the time.

This PR resolves the issue by loading the user's match and preimage history from DB when an outcome is registered while they are offline and these outcome tracking map entries are non-existent. This involves some minor refactoring to the loadUserScore method to separate the DB loading component (loadUserOutcomes) from score computation (integrateOutcomes) and insertion of outcome map entries. I considered just deferring account scoring until the next login, but it is important for unbooking orders of suspended users.

This also modifies handleConnect so that an account will automatically be reinstated if computed user score indicates their account should not be closed but it is. This is not just for bug recovery, but to allow operator changing the ban score threshold without having to manually apply the change to suspended accounts.

This also modifies the add methods of both latest outcome tracking structures so that it is not possible to add duplicate matches or orders. This is done to ensure concurrent login/connect and match or swap outcome registration cannot create duplicate entries. With small outcome slices (~100 elements) this dumb search is inexpensive.

chappjc · 2021-05-10T21:23:36Z

server/auth/auth.go

-	if !found {
-		outcomes = newLatestMatchOutcomes(scoringMatchLimit)
-		auth.matchOutcomes[user] = outcomes
-	}


This is the gist of the bug. I suspect it was written (by me) with the thinking "no known outcomes, so this must be a new user with no previous outcomes" but the absence of an entry really means offline user.

chappjc · 2021-05-10T21:26:04Z

server/auth/auth.go

+	// Make outcome entries for the user to optimize subsequent outcomes calls
+	// while they are disconnected? This could lead to adding duplicate outcomes
+	// with a concurrent connect/login or subsequent outcomes while offline.
+	//
+	// auth.matchOutcomes[user] = matchOutcomes
+	// auth.preimgOutcomes[user] = piOutcomes


This would be safe with the new duplicate checks in the outcome structs' add methods, but it's a little messy and these map entries are assumed not to be present for a disconnected user. (The entries would remain in the map for eternity if they never logged back in.)

Even for the largest matches table so far, the loadUserOutcomes query is ~30ms on my (admittedly fast) machine and ~200ms on a closer-to-production-equivalent machine. Even the occasional string of a dozen back-to-back outcomes for offline users (match failures or preimage misses) should be tolerable. We can experiment with this optimization or even separate outcome maps if it becomes slow to hit the DB for each outcome for an offline user.

chappjc · 2021-05-10T21:28:03Z

server/auth/auth.go

+	} else if score < int32(auth.banScore) && !open {
+		if err = auth.Unban(user); err == nil {
+			log.Warnf("Restoring suspended account %v (score = %d).", acctInfo.ID, score)
+		} else {
+			log.Errorf("Failed to restore suspended account %v (score = %d): %v.",
+				acctInfo.ID, score, err)
+		}


banScore is an operator setting, but even so this branch would have automated account restoration on reconnect in the event that this bug manifested.

chappjc · 2021-05-10T21:29:01Z

server/market/market.go

+		// Register the preimage miss violation, adjusting the user's score.
+		m.auth.MissedPreimage(ord.User(), epochEnd, ord.ID())


Moved below the DB storage of the preimage miss.

chappjc · 2021-05-10T21:29:38Z

server/swap/swap.go

+	// Credit the user for completing the swap, adjusting the user's score.
+	if actor.user != counterParty.user || newStatus == order.MatchComplete { // if user is both sides, only credit on MatchComplete (taker done too)
+		s.authMgr.SwapSuccess(actor.user, db.MatchID(match.Match), match.Quantity, redeemTime) // maybe call this in swapDone callback
+	}


Fairly inconsequential for a success, but moved below the DB store for consistency. Note that the relevant callback for match failure outcomes is the Inaction call in (*Swapper).failMatch, which was already called after the DB update via db.SetMatchInactive.

An alternative approach that moves the responsibility squarely into the auth manager was to still attempt adding the outcome after loading match history from DB, allowing the add method to filter out a duplicate add if it was already in the DB. Still may consider this...

JoeGruffins

Looks good and working well for me.

chappjc · 2021-05-11T15:58:07Z

Also verified fix on simnet. And deployed on dex-test (testnet). Will merge shortly to include in release-v0.2

test auto-unban

chappjc · 2021-05-11T16:30:51Z

Tests updated for the auto-unban change, with minor fix applied to avoid need to reconnect in this case.

auth: integrate outcomes for registerMatchOutcome

26858db

chappjc changed the title ~~auth: integrate outcomes for registerMatchOutcome~~ server/auth: integrate outcomes for registerMatchOutcome May 10, 2021

chappjc modified the milestones: 0.2, 0.2.1 May 10, 2021

chappjc commented May 10, 2021

View reviewed changes

JoeGruffins approved these changes May 11, 2021

View reviewed changes

chappjc changed the title ~~server/auth: integrate outcomes for registerMatchOutcome~~ server/auth: scoring for offline users must load history from DB May 11, 2021

flip suspended=false in auto-unban since client is not in map yet

7a52265

test auto-unban

grammar

848ee24

buck54321 approved these changes May 11, 2021

View reviewed changes

chappjc merged commit ba59397 into decred:master May 11, 2021

chappjc deleted the integrate-outcomes branch May 11, 2021 17:09

chappjc removed this from the 0.2.1 milestone Jul 20, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

server/auth: scoring for offline users must load history from DB #1083

server/auth: scoring for offline users must load history from DB #1083

chappjc commented May 10, 2021 •

edited

chappjc May 10, 2021 •

edited

chappjc May 10, 2021 •

edited

chappjc May 10, 2021

chappjc May 10, 2021

chappjc May 10, 2021 •

edited

JoeGruffins left a comment

chappjc commented May 11, 2021

chappjc commented May 11, 2021

		// Register the preimage miss violation, adjusting the user's score.
		m.auth.MissedPreimage(ord.User(), epochEnd, ord.ID())

server/auth: scoring for offline users must load history from DB #1083

server/auth: scoring for offline users must load history from DB #1083

Conversation

chappjc commented May 10, 2021 • edited

chappjc May 10, 2021 • edited

Choose a reason for hiding this comment

chappjc May 10, 2021 • edited

Choose a reason for hiding this comment

chappjc May 10, 2021

Choose a reason for hiding this comment

chappjc May 10, 2021

Choose a reason for hiding this comment

chappjc May 10, 2021 • edited

Choose a reason for hiding this comment

JoeGruffins left a comment

Choose a reason for hiding this comment

chappjc commented May 11, 2021

chappjc commented May 11, 2021

chappjc commented May 10, 2021 •

edited

chappjc May 10, 2021 •

edited

chappjc May 10, 2021 •

edited

chappjc May 10, 2021 •

edited