chore(spanner): add LatencyTracker interface and default implementation by olavloite · Pull Request #12729 · googleapis/google-cloud-java

olavloite · 2026-04-09T13:46:20Z

Adds an internal LatencyTracker interface and a default implementation that allows the client to track the latency of requests. This can be used for automatic replica selection and load balancing.

gemini-code-assist

Code Review

This pull request introduces a new latency tracking mechanism using Exponentially Weighted Moving Average (EWMA), including a LatencyTracker interface, its EwmaLatencyTracker implementation, and comprehensive unit tests. The primary feedback concerns the initial state of the EwmaLatencyTracker, specifically that an uninitialized tracker currently returns a score of 0.0. This is problematic as it implies a perfect score, potentially leading to incorrect load balancing decisions. It is suggested that an uninitialized tracker should instead return Double.POSITIVE_INFINITY to accurately reflect its unmeasured state, and a new test case should be added to verify this behavior.
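To make the concern about the initial state concrete, here is a minimal sketch of an EWMA tracker whose unmeasured state is reported as Double.POSITIVE_INFINITY, as the review suggests. It is not the code from this pull request: the class name SketchEwmaTracker, the alpha parameter, and the choice of nanoseconds as the score unit are assumptions made for illustration.

```java
import java.time.Duration;

/** Illustrative EWMA tracker (hypothetical); not the implementation in this PR. */
final class SketchEwmaTracker {
  private final double alpha; // assumed smoothing factor in (0, 1]
  private final Object lock = new Object();
  private boolean initialized;
  private double scoreNanos;

  SketchEwmaTracker(double alpha) {
    if (!(alpha > 0.0 && alpha <= 1.0)) {
      throw new IllegalArgumentException("alpha must be in (0, 1]");
    }
    this.alpha = alpha;
  }

  /** Folds one observed latency into the moving average. */
  void update(Duration latency) {
    double sample = latency.toNanos();
    synchronized (lock) {
      if (!initialized) {
        // The first sample seeds the average directly.
        scoreNanos = sample;
        initialized = true;
      } else {
        scoreNanos = alpha * sample + (1.0 - alpha) * scoreNanos;
      }
    }
  }

  /** Lower is better; an unmeasured tracker never looks like a perfect score. */
  double getScore() {
    synchronized (lock) {
      return initialized ? scoreNanos : Double.POSITIVE_INFINITY;
    }
  }
}
```

With this shape, a selector that prefers the lowest score cannot mistake "never measured" for "zero latency", which is the failure mode the review describes for a default of 0.0.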

...r/google-cloud-spanner/src/main/java/com/google/cloud/spanner/spi/v1/EwmaLatencyTracker.java

...ogle-cloud-spanner/src/test/java/com/google/cloud/spanner/spi/v1/EwmaLatencyTrackerTest.java

rahul2393

Thanks, this looks good to me overall, with some questions still open. I would like to see follow-up PRs soon that answer them.

Open questions: eligibility filtering for stale reads/queries only, score ownership, score updates from successful and failed routed calls, and the actual Po2 selection logic.

rahul2393 · 2026-04-10T10:12:43Z

...anner/google-cloud-spanner/src/main/java/com/google/cloud/spanner/spi/v1/LatencyTracker.java

+   *
+   * @param latencyMillis the observed latency in milliseconds.
+   */
+  void update(long latencyMillis);


This flattens most “60us vs 500us vs 700us” differences into the same bucket. If we want this to drive bypass selection, the score needs to be at least in microseconds, and ideally in nanoseconds or a Duration.

Good point, changed to Duration.
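The resolution problem raised above can be reproduced with java.time.Duration alone. The sketch below is illustrative only; the class and method names are hypothetical and do not come from the pull request.

```java
import java.time.Duration;

/** Hypothetical demo of how a millisecond score hides sub-millisecond differences. */
final class LatencyResolutionDemo {
  /** Score as a millisecond-based API would see it; toMillis() truncates. */
  static long millisScore(Duration latency) {
    return latency.toMillis();
  }

  /** Score at nanosecond resolution, which preserves the differences. */
  static long nanosScore(Duration latency) {
    return latency.toNanos();
  }

  public static void main(String[] args) {
    Duration[] samples = {
      Duration.ofNanos(60_000), // 60us
      Duration.ofNanos(500_000), // 500us
      Duration.ofNanos(700_000) // 700us
    };
    for (Duration sample : samples) {
      System.out.println(
          nanosScore(sample) + " ns is seen as " + millisScore(sample) + " ms");
    }
  }
}
```

All three samples truncate to 0 ms, so a millisecond score cannot rank them, while the nanosecond values remain distinct.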

rahul2393 · 2026-04-10T10:14:40Z

...r/google-cloud-spanner/src/main/java/com/google/cloud/spanner/spi/v1/EwmaLatencyTracker.java

+  @Override
+  public double getScore() {
+    synchronized (lock) {
+      return initialized ? score : Double.MAX_VALUE;


getScore() returns Double.MAX_VALUE until the tracker has seen traffic. That means a new endpoint, or one recreated after eviction, will always lose against any sampled endpoint that has historical data. In other words, it never gets traffic, so it never learns. We should add probing or low-rate exploration so a replica can “come back to the game”; this implementation bakes in starvation unless some separate mechanism guarantees exploration.

Yeah, that is a good point. We will fix this in a follow-up PR, in combination with the ReplicaSelector, by allowing some of the traffic to just choose a random endpoint.
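One way the exploration idea discussed above could look is sketched here: a small fraction of calls picks a uniformly random endpoint, and the rest compare two random candidates and keep the lower score. This is an assumption-laden illustration, not the planned ReplicaSelector; the class ExploringSelector, the explorationRate parameter, and the scoring function are all hypothetical.

```java
import java.util.List;
import java.util.Random;
import java.util.function.ToDoubleFunction;

/** Hypothetical selector: random exploration plus a two-candidate comparison. */
final class ExploringSelector<E> {
  private final double explorationRate; // assumed fraction of calls routed at random
  private final Random random;
  private final ToDoubleFunction<E> score; // lower is better

  ExploringSelector(double explorationRate, Random random, ToDoubleFunction<E> score) {
    this.explorationRate = explorationRate;
    this.random = random;
    this.score = score;
  }

  E select(List<E> endpoints) {
    if (endpoints.isEmpty()) {
      throw new IllegalArgumentException("no endpoints");
    }
    if (endpoints.size() == 1) {
      return endpoints.get(0);
    }
    int first = random.nextInt(endpoints.size());
    if (random.nextDouble() < explorationRate) {
      // Exploration: ignore scores so unmeasured endpoints still receive samples.
      return endpoints.get(first);
    }
    // Pick a second, distinct candidate and keep the one with the lower score.
    int second = random.nextInt(endpoints.size() - 1);
    if (second >= first) {
      second++;
    }
    E a = endpoints.get(first);
    E b = endpoints.get(second);
    return score.applyAsDouble(a) <= score.applyAsDouble(b) ? a : b;
  }
}
```

With explorationRate set to 0, an endpoint that reports Double.MAX_VALUE only wins when both candidates are unmeasured, which is the starvation described above; any positive rate gives it a chance to collect its first sample.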

rahul2393 · 2026-04-10T10:17:03Z

...anner/google-cloud-spanner/src/main/java/com/google/cloud/spanner/spi/v1/LatencyTracker.java

+import com.google.api.core.InternalApi;
+
+/**
+ * Interface for tracking latency scores of Spanner servers.


nit: The abstraction is attached to the wrong identity unless you are very careful in the follow-up work. The doc wants a score “for a given spanner server”, but this branch introduces a generic tracker with no ownership model.

In the current routing code, CachedTablet instances are reused across cache updates and can change serverAddress in place. If someone later stores the EWMA on CachedTablet, the latency history from the old server will bleed into the new one after a cache update. The stable identity in this codebase is the cached per-address endpoint, not the tablet object.
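The ownership concern can be illustrated with a registry keyed by server address, so that a tablet whose address changes picks up the tracker of its new server instead of carrying the old history along. This is only a sketch under stated assumptions: PerAddressRegistry is a hypothetical class, and using a plain String address as the key stands in for whatever endpoint identity the real routing code uses.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.function.Supplier;

/** Hypothetical registry that owns one tracker per server address. */
final class PerAddressRegistry<T> {
  private final ConcurrentMap<String, T> byAddress = new ConcurrentHashMap<>();
  private final Supplier<T> factory;

  PerAddressRegistry(Supplier<T> factory) {
    this.factory = factory;
  }

  /** Returns the tracker for this address, creating it on first use. */
  T forAddress(String serverAddress) {
    return byAddress.computeIfAbsent(serverAddress, unused -> factory.get());
  }

  /** Drops the history for an address, for example when its endpoint is evicted. */
  void evict(String serverAddress) {
    byAddress.remove(serverAddress);
  }
}
```

A caller would look up the tracker by the tablet's current address at routing time, rather than holding a tracker reference on the tablet object itself.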

olavloite requested review from a team as code owners April 9, 2026 13:46

gemini-code-assist bot reviewed Apr 9, 2026

olavloite force-pushed the spanner-latency-tracker branch from bc8842b to 274308d Compare April 9, 2026 14:33

chore(spanner): add LatencyTracker interface and default implementation

c50bb2e

olavloite force-pushed the spanner-latency-tracker branch from 274308d to c50bb2e Compare April 9, 2026 14:36

rahul2393 approved these changes Apr 10, 2026

chore(spanner): address review comments

0d14d73

olavloite enabled auto-merge (squash) April 10, 2026 11:08

olavloite merged commit c29b99f into main Apr 10, 2026
102 of 103 checks passed

olavloite deleted the spanner-latency-tracker branch April 10, 2026 11:41

olavloite merged 2 commits into main from spanner-latency-tracker