
DRIVERS-2035 Use minimum RTT for CSOT maxTimeMS calculation instead of 90th percentile #1350

Merged
6 commits merged on Feb 16, 2023

Conversation

ShaneHarvey
Member

@ShaneHarvey ShaneHarvey commented Nov 18, 2022

https://jira.mongodb.org/browse/DRIVERS-2035

Please complete the following before merging:

Contributor

@benjirewis benjirewis left a comment


Thanks for making these changes!

A previous version of this spec used the 90th percentile RTT to short
circuit operations that might otherwise fail with a socket timeout.
We decided to change this logic to avoid canceling operations that may
have a high chance of succeeding and also removes a dependency on t-digest.
Contributor


Suggested change
have a high chance of succeeding and also removes a dependency on t-digest.
have a high chance of succeeding; this change also removes a dependency on t-digest.

@@ -39,7 +39,7 @@ tests:
failCommands: ["hello", "isMaster"]
Contributor


I have yet to sync these tests in the Go driver as we do not use the short-circuiting logic until we have at least 10 RTT samples. The tests do not allow enough time (~100s) to reach this sample count.

@matthewdale do you know why we have this 10-sample minimum in the first place? Will minimum RTT be significantly different before the first 10 samples than after? I imagine the answer is highly dependent on the network context, but is the potential downside of not waiting for 10 samples worth the complexity of implementation and testing?

Member Author


10 samples gives a 100-second window by default, which seems fine. I also do not agree with using 0 when there are fewer than 10 samples; I think we should use the RTT even with only 1 sample. Otherwise, the RTT + maxTimeMS logic will be disabled for the first 100 seconds after a server is (re)discovered. That will be difficult to test and confusing to reason about (both internally and externally).

In the end any heuristic we choose here is somewhat arbitrary. We should avoid adding complexity to these heuristics and keep them as simple as possible. That's why I'm suggesting a fixed window that's not based on time.

Contributor


The 10 sample minimum is an attempt to reduce the probability that early RTT outliers cause unnecessary operation skipping. Keep in mind that the "min RTT" logic currently in the Go driver was not added as part of CSOT, but was an optimization added with Go driver connection pool improvements. I decided to be conservative with the implementation because it was a novel and untested concept at the time and impacts all Go driver users.

We haven't encountered any known issues with that "min RTT" implementation, so it makes sense to be less conservative with the opt-in CSOT behavior. However, trusting the first RTT sample on startup seems dangerous and may lead to increased operation errors immediately after connecting, until more RTT samples are collected. I recommend we collect at least 2 samples before assuming it's an accurate representation of the actual RTT.

I don't have any stats on the impact of requiring 1 vs 2 RTT samples, only the observation that networks are unreliable and that any single RTT sample may have a nontrivial probability of being bogus. That may be especially true while an application is starting up.

As far as using a fixed number of samples (e.g. 10), that seems totally reasonable.

Member Author


Updated the spec to use a min of 2 and max of 10 samples. I also updated the tests to wait (heuristically) for >=2 samples.
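The short-circuit behavior under discussion might look something like the following (a minimal sketch, not the spec's pseudocode; the function and exception names are invented for illustration, and a min RTT of 0 here represents the "fewer than 2 samples collected" case where the short circuit stays disabled):

```python
class CSOTTimeoutError(Exception):
    """Raised when the remaining client-side timeout cannot cover the min RTT."""


def compute_max_time_ms(remaining_timeout_ms, min_rtt_ms):
    """Sketch of the CSOT maxTimeMS calculation using minimum RTT.

    If the time remaining in the client-side timeout is not enough to
    cover even the minimum observed round trip, fail fast rather than
    send an operation that cannot succeed. A min_rtt_ms of 0 (fewer
    than 2 RTT samples collected) effectively disables the short circuit.
    """
    max_time_ms = remaining_timeout_ms - min_rtt_ms
    if max_time_ms <= 0:
        raise CSOTTimeoutError(
            "remaining timeout %sms <= min RTT %sms"
            % (remaining_timeout_ms, min_rtt_ms))
    return max_time_ms
```

For example, with 100ms remaining and a 10ms minimum RTT, the operation is sent with maxTimeMS of 90; with only 10ms remaining and a 15ms minimum RTT, the client raises a timeout error without sending the operation.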

@@ -812,24 +810,25 @@ on a dedicated connection, for example:
 helloOk = stableApi != Null
 lock = Mutex()
 movingAverage = MovingAverage()
-rttDigest = TDigest() # for 90th percentile RTT calculation
+rttMin = MinWindow(max_window_size=10) # for min RTT calculation
Contributor


As discussed offline, I think we should document what MinWindow does. Based on context, I'm assuming it takes the minimum RTT value of at most the past max_window_size samples. That seems to be an ok definition to me, but see my other comment about calculating min when we have less than 10 samples available.

Member Author


Fixed. The pseudo-Python is now explicit about the behavior, using a deque: https://docs.python.org/3/library/collections.html#deque-objects
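For reference, the deque-based behavior described in this thread might be sketched as follows (an illustration, not the spec's exact pseudocode; the min-of-window semantics and the 2-sample minimum follow the discussion above, and the `min_samples` parameter is this sketch's own addition):

```python
from collections import deque


class MinWindow:
    """Tracks the minimum RTT over the most recent samples.

    Keeps at most max_window_size samples (a deque with maxlen evicts
    the oldest sample automatically) and reports the minimum of those,
    or 0.0 until at least min_samples have been collected, which keeps
    the maxTimeMS short-circuit logic disabled early on.
    """

    def __init__(self, max_window_size=10, min_samples=2):
        self.samples = deque(maxlen=max_window_size)
        self.min_samples = min_samples

    def add_sample(self, rtt):
        self.samples.append(rtt)

    def get(self):
        if len(self.samples) < self.min_samples:
            return 0.0  # not enough data; disable the short circuit
        return min(self.samples)
```

Using `deque(maxlen=...)` makes the sliding-window eviction explicit, which is the documentation improvement requested in the comment above.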

@ShaneHarvey ShaneHarvey marked this pull request as ready for review February 9, 2023 21:24
@ShaneHarvey ShaneHarvey requested review from a team as code owners February 9, 2023 21:24
@ShaneHarvey ShaneHarvey requested review from nbbeeken and matthewdale and removed request for a team February 9, 2023 21:24
@ShaneHarvey
Member Author

@matthewdale this is ready for another look. I've updated the spec to require at least 2 RTT samples before enabling the short-circuit logic and updated the tests to account for this.

Contributor

@matthewdale matthewdale left a comment


One question, but otherwise looks good 👍

-clients MUST use average and 90th percentile round trip times from the RTT
-task.
+clients MUST set the average and minimum round trip times from the RTT task as the
+"roundTripTime" and "minRoundTripTime" fields, respectively.
Contributor


This requirement has actually led to significant confusion when implementing features in the Go driver that depend on up-to-date avg/min RTT values. Using values for average and minimum RTT pulled from server descriptions can lead to difficult-to-diagnose bugs unless devs realize that the RTT values on server descriptions are never updated after the handshake. The scope for Go Driver 2.0 currently includes removing all values from server descriptions that are not derived directly from the connection/handshake process. See GODRIVER-2691 for more details.

Should we remove this requirement and the corresponding roundTripTime and minRoundTripTime fields from a ServerDescription?

P.S. I realize this is only updating the existing requirement, but this could be a good time to reconsider the need for these fields. I created DRIVERS-2552 to continue this discussion if this isn't a good forum.

Member Author


I don't agree, because even fields that are reported by the server in the hello command response can become stale: for example, hosts, isWritablePrimary, $clusterTime, operationTime, etc. In fact, almost every single field can become stale right after the server sends it.
