adding retryable to scan #456

Open · wants to merge 7 commits into master

Conversation

@limbooverlambda (Contributor):

Solves: #455

We were running into an issue where scans would periodically return empty results for datasets that were present in the cluster. The hypothesis was that the scans returned empties while regions were undergoing splits. While trying to reproduce the issue (by issuing splits from pd-ctl), we found that when there is a region error (epoch_version_mismatch et al.), scan_inner does not trigger any cache invalidation or retry; the scan simply returns an empty result. This PR fixes the issue by triggering the invalidation and retrying on such errors.
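For illustration, the shape of the fix (a minimal sketch, not the exact scan_inner code: region_store, kv_scan, the response shape, and the error variant are assumptions; only plan::handle_region_error mirrors the code under review):

```rust
// Minimal sketch of the retry-on-region-error loop this PR introduces.
// Helper names (region_store, kv_scan) and the response/error shapes are
// illustrative, not the exact client-rust API.
async fn scan_inner(&self, args: ScanInnerArgs) -> Result<Vec<KvPair>> {
    loop {
        let store = self.region_store(args.start_key.clone()).await?;
        let resp = self.kv_scan(&store, &args).await?;
        if let Some(err) = resp.region_error {
            // Previously this path fell through and the scan returned
            // empty; now we invalidate stale region info and retry.
            let retryable =
                plan::handle_region_error(self.rpc.clone(), err.clone(), store.clone())
                    .await?;
            if retryable {
                continue; // re-resolve the region and scan again
            }
            return Err(Error::RegionError(Box::new(err)));
        }
        return Ok(resp.kvs);
    }
}
```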

@pingyu @ekexium @andylokandy.

Signed-off-by: limbooverlambda <schakra1@gmail.com>
@pingyu (Collaborator) left a comment:

Thanks for your contribution!

The PR overall looks good; I left some minor suggestions.

Besides, if you would like to verify the correctness when a region error happens, consider using a failpoint. Please refer to failpoint_test.rs. It's not a must before the PR is accepted.
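For reference, such a failpoint test might look like this (hedged sketch using the fail crate as failpoint_test.rs does; the failpoint name "region-error-once" and the pd_addrs() helper are assumptions):

```rust
// Hedged sketch of an injected-region-error test. The failpoint name is
// hypothetical, and pd_addrs() stands in for the helper that supplies PD
// endpoints in the integration tests.
#[tokio::test]
async fn test_scan_retries_on_injected_region_error() -> Result<()> {
    // Fire the failpoint once, then let subsequent attempts pass.
    fail::cfg("region-error-once", "1*return").unwrap();

    let client = RawClient::new(pd_addrs()).await?;
    // The first attempt hits the injected region error; the retry added
    // in this PR should still return the data.
    let kvs = client.scan("k1".to_owned().."k9".to_owned(), 10).await?;
    assert!(!kvs.is_empty());

    fail::remove("region-error-once");
    Ok(())
}
```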

src/raw/client.rs — 3 review threads (outdated, resolved)
plan::handle_region_error(self.rpc.clone(), err.clone(), store.clone())
    .await?;
return if status {
    self.retryable_scan(scan_args.clone()).await
@pingyu (Collaborator):

I suggest eliminating the recursion and letting the caller do the retry, to reduce overhead.

@limbooverlambda (Contributor, author):

Removed the recursion. In the initial implementation I followed the pattern outlined in `async fn single_plan_handler(`, where a mutual recursion is performed to retry the flow; the idea was to shield the caller from performing the retries. In subsequent PRs I would have proposed some enhancements to control the retry logic. I can remove the retry if that seems more prudent.
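For what it's worth, the caller-side loop can stay small (sketch; ScanOutcome is a hypothetical enum standing in for however retryable_scan reports a retryable region error):

```rust
// Sketch: retry in the caller with a loop instead of recursing inside
// retryable_scan. Async recursion also forces boxing the future
// (Box::pin), which the loop avoids. ScanOutcome is hypothetical.
let kvs = loop {
    match self.retryable_scan(scan_args.clone()).await? {
        ScanOutcome::Done(kvs) => break kvs,
        // The region error was already handled (cache invalidated), so
        // just go around again with fresh region info.
        ScanOutcome::Retry => continue,
    }
};
```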

limbooverlambda and others added 4 commits June 25, 2024 13:13
Signed-off-by: limbooverlambda <schakra1@gmail.com>
Signed-off-by: limbooverlambda <schakra1@gmail.com>
Signed-off-by: limbooverlambda <schakra1@gmail.com>
@limbooverlambda (Contributor, author):

Hi @pingyu, are there any more changes you need me to make before this change can be merged? Thanks for taking the time to look at this.

let scan_args = ScanInnerArgs {
    start_key: current_key.clone(),
    range: range.clone(),
    limit,
@pingyu (Collaborator):

It would be better to use current_limit here; otherwise it would return more kv pairs than limit. Moreover, the following `current_limit -= kvs.len()` would overflow.
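Concretely, the underflow (standalone sketch; the values are made up):

```rust
fn main() {
    // If the RPC is issued with the original `limit`, one scan can return
    // more pairs than the remaining budget, and the subtraction underflows.
    let mut current_limit: u32 = 3; // remaining budget
    let returned: u32 = 5; // server honored `limit`, not `current_limit`
    // current_limit -= returned; // panics in debug builds, wraps in release
    current_limit = current_limit.saturating_sub(returned); // defensive form
    assert_eq!(current_limit, 0);
}
```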

@limbooverlambda (Contributor, author):

Good catch. Changed.

    current_limit -= kvs.len() as u32;
    result.append(&mut kvs);
}
if end_key
@pingyu (Collaborator):

Nit, suggested change:

- if end_key
+ if end_key.is_some_and(|ek| ek <= next_key.as_ref())

@limbooverlambda (Contributor, author):

Thanks. Changed. end_key had to be cloned, however (is_some_and takes the Option by value).

while current_limit > 0 {
    let scan_args = ScanInnerArgs {
        start_key: current_key.clone(),
        range: range.clone(),
@pingyu (Collaborator):

I think we should scan from start_key. Otherwise, if there is a region merge during the scan, we would get duplicated kv pairs, and lose some others if the limit is reached.

@limbooverlambda (Contributor, author):

I was trying to trace through the logic, and from what I understand we only loop if the scan's limit has not been exhausted. So regardless of a split or merge, won't we resume the next scan from the end_key returned by the previous scan call? If the first scan returns an end_key of "foo", doesn't the system guarantee that starting the next scan from "foo" returns all results lexicographically larger than "foo" and smaller than whatever end_key was provided, regardless of whether the underlying regions are undergoing churn (due to splits or merges)? There may be gaps in my understanding, so I'd be more than happy to get more feedback here.

@@ -953,4 +1016,63 @@ mod tests {
);
Ok(())
}

#[tokio::test]
async fn test_raw_scan_retryer() -> Result<()> {
@pingyu (Collaborator):

This test case looks a little unnecessary to me: it just checks that we call the error handler on a region error, but it introduces more complexity.

The core change in this PR is the retry on region error, so if we want to verify the correctness, I think we need to simulate that scene.

For example, we first put some small entries and perform a scan to fill the region cache. Then we put many more (or larger) entries to trigger a region split (see tests::common::init()), and scan again. The later scan should meet region errors.

The test case would be a little complex to implement, so it's OK with me if we don't have it in this PR.
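A rough shape for that test (sketch; put_small_entries and put_entries_until_split are hypothetical helpers, and pd_addrs() stands in for the integration-test helper that supplies PD endpoints):

```rust
// Sketch of the split-during-scan test outlined above. Helper names are
// hypothetical; tests::common::init() shows how to write enough data to
// force a region split.
#[tokio::test]
async fn test_raw_scan_across_region_split() -> Result<()> {
    let client = RawClient::new(pd_addrs()).await?;

    // 1. Seed a few small entries and scan once to warm the region cache.
    put_small_entries(&client).await?;
    let first = client.scan("a".to_owned().."z".to_owned(), 100).await?;
    assert!(!first.is_empty());

    // 2. Write enough (or large enough) entries to trigger a region split.
    put_entries_until_split(&client).await?;

    // 3. The cached region info is now stale, so this scan should hit a
    //    region error internally, retry, and still return everything.
    let second = client.scan("a".to_owned().."z".to_owned(), 10_000).await?;
    assert!(second.len() >= first.len());
    Ok(())
}
```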

@limbooverlambda (Contributor, author):

Good point. The current setup, to your point, is a bit cumbersome. I have removed this test case. I will add the test as you outlined in a subsequent PR.

Signed-off-by: limbooverlambda <schakra1@gmail.com>
@limbooverlambda (Contributor, author):

@pingyu Thanks for your feedback. It would be great if you could take another look.
