Expose timeout query param for search requests #2748

coszio · 2023-09-29T22:11:06Z

Builds on #2293 and resolves #2623.

Exposes a timeout param for search requests:

in query params for REST
in top-level args for gRPC

The timeout applies at the local shard level and to internal client requests. For group requests, it applies at the GroupBy level.
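As a rough illustration of the REST side, the sketch below builds a search URL with an optional timeout query param. The helper name `search_url`, the host and collection values, and the assumption that the value is given in whole seconds are all illustrative and not taken from this PR; the endpoint path is assumed to follow the usual `/collections/{name}/points/search` shape.

```rust
/// Build a REST search URL, optionally appending a `timeout` query param.
/// Hypothetical helper: the name, the base URL, and the unit (assumed to be
/// whole seconds) are illustrative assumptions, not part of this PR.
pub fn search_url(base_url: &str, collection: &str, timeout_secs: Option<u64>) -> String {
    // Normalize the base so a trailing slash does not produce "//".
    let mut url = format!(
        "{}/collections/{}/points/search",
        base_url.trim_end_matches('/'),
        collection
    );
    // Only add the param when the caller asked for a specific timeout;
    // otherwise the server-side default is expected to apply.
    if let Some(secs) = timeout_secs {
        url.push_str(&format!("?timeout={secs}"));
    }
    url
}
```

Calling `search_url("http://localhost:6333", "demo", Some(5))` yields a URL ending in `/points/search?timeout=5`, while passing `None` leaves the query string off entirely.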

Checklist:

Expose for search and batch search
Expose for recommend and batch recommend
Expose for group (search/recommend) requests
Implement timeout for GroupBy
Make tests

Caveats

Search and recommend requests return a 500 Service Error status because of this conversion, even when it is just a timeout error. This is different from what grouping requests return now, which is a standard 408 Request Timeout. We can make both return 500 if desired, but I feel we should fix the former case.
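To make the caveat concrete, here is a minimal sketch of how a lossy error conversion could turn a timeout into a 500. The `ReadError` enum, `http_status`, and `lossy_convert` are hypothetical names invented for illustration; they are not the project's actual error types or conversion code.

```rust
/// Hypothetical error type for read requests (not the project's real type).
#[derive(Debug, Clone, PartialEq, Eq)]
pub enum ReadError {
    Timeout { description: String },
    ServiceError { description: String },
}

/// Map an error to an HTTP status code: timeouts get 408, the rest 500.
pub fn http_status(err: &ReadError) -> u16 {
    match err {
        ReadError::Timeout { .. } => 408,
        ReadError::ServiceError { .. } => 500,
    }
}

/// A lossy conversion of the kind the caveat describes: every error,
/// including a timeout, is flattened into a generic service error,
/// so the caller can no longer tell that it was a timeout.
pub fn lossy_convert(err: ReadError) -> ReadError {
    match err {
        ReadError::Timeout { description } | ReadError::ServiceError { description } => {
            ReadError::ServiceError { description }
        }
    }
}
```

Under these assumptions, a timeout that passes through `lossy_convert` maps to 500, while one that keeps its variant maps to 408. Returning 408 consistently would mean preserving the timeout variant through every conversion step.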

All Submissions:

Contributions should target the dev branch. Did you create your branch from dev?
Have you followed the guidelines in our Contributing document?
Have you checked to ensure there aren't other open Pull Requests for the same update/change?

New Feature Submissions:

Does your submission pass tests?
Have you formatted your code locally using cargo +nightly fmt --all command prior to submission?
Have you checked your code using cargo clippy --all --all-features command?

Changes to Core Features:

Have you added an explanation of what your changes do and why you'd like us to include them?
Have you written new tests for your core changes, as applicable?
Have you successfully run tests with your changes locally?

timvisee · 2023-10-04T15:17:19Z

This is different from what grouping requests return now, which is a standard 408 Request Timeout. We can make both return 500 if desired, but I feel we should fix the former case.

I haven't looked into the details yet, but I'd also prefer the former case returning HTTP 408 if we can keep backwards compatibility in a reasonable manner. We can potentially explore this in another PR.

timvisee

Nice work! I haven't been able to actually test it yet, but I did want to post my review comments thus far.

I'll try to test tomorrow.

src/actix/api/read_params.rs

lib/api/build.rs

openapi/openapi-main.ytt.yaml

timvisee · 2023-10-05T07:09:30Z

Sadly, the recent #2761 and #2762 merges brought some conflicts.

coszio · 2023-10-05T19:45:30Z

An update:
@generall noted that the current changes do not extend the default timeout for remote shard communication.

Here's what I've tried:

With these changes, we add a timeout to the Request that the internal gRPC client makes to the remote shard.
We also have our with_channel_timeout method available, but using it doesn't solve our problem.

In the first scenario, we are limited by the default max_timeout of with_channel_timeout, so we sometimes get an error like

"Timeout {}ms reached for uri: {}"

or

"Tonic status error: The operation was cancelled"

In the second scenario, we only get the second error.

So what's happening? with_channel_timeout only applies a timeout of its own, separate from the actual channel timeout.

I propose to deal with that complexity in another PR, and to leave the constraint that, for remote shards, the timeout will be the lower of the global timeout and the query param.
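The interim constraint for remote shards could be sketched roughly as below. The function name `effective_remote_timeout` and its signature are assumptions made for illustration; the actual code in this PR may be structured differently.

```rust
use std::time::Duration;

/// Hypothetical helper: pick the timeout used for a remote shard request
/// under the interim constraint, i.e. the lower of the global timeout and
/// the timeout passed as a query param (if any).
pub fn effective_remote_timeout(global: Duration, query: Option<Duration>) -> Duration {
    match query {
        // A query param can shorten the timeout, but cannot extend it
        // beyond the global one.
        Some(requested) => requested.min(global),
        // No query param: fall back to the global timeout.
        None => global,
    }
}
```

With a global timeout of 10 seconds, a requested 3 seconds is honored, while a requested 30 seconds is capped at 10. That cap is the limitation this comment proposes to lift in a follow-up.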

My second proposal is to refactor away the custom timeout layer in with_channel_timeout, so that the only way to affect the timeout is to set the Request timeout.

Lastly, to actually extend the channel timeout we would need to create a new channel every time, which is not desirable. A solution would be to set the channel timeout very high by default, and to make sure we always set the Request timeout to a default maximum when none was specified.
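A minimal sketch of that last idea, assuming a very high channel-level timeout and a per-request default, might look like this. The constant names and their values are placeholders chosen for illustration, not the project's configuration.

```rust
use std::time::Duration;

/// Placeholder: a deliberately high channel-level timeout, so that the
/// channel itself is effectively never the limiting factor.
pub const CHANNEL_TIMEOUT: Duration = Duration::from_secs(60 * 60);

/// Placeholder: the default applied to a request that did not specify
/// a timeout of its own.
pub const DEFAULT_REQUEST_TIMEOUT: Duration = Duration::from_secs(60);

/// Hypothetical helper: resolve the timeout to set on an individual request.
/// An explicit value may exceed the default, because the only hard ceiling
/// left is the (very high) channel timeout.
pub fn request_timeout(requested: Option<Duration>) -> Duration {
    requested
        .unwrap_or(DEFAULT_REQUEST_TIMEOUT)
        .min(CHANNEL_TIMEOUT)
}
```

The design point is that the request, not the channel, carries the effective timeout, so a caller can ask for more than the default without a new channel having to be created.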

cc @timvisee

timvisee · 2023-10-06T07:51:50Z

After some investigation from my end I came to the same conclusion.

I agree that the right approach is to remove (or make very high) the channel timeout and set it per request everywhere.

Thanks for sharing your findings!

timvisee · 2023-10-09T07:58:04Z

#2771 implements the above suggestion. When we merge it, I think we can merge this as well.

coszio · 2023-11-02T15:31:05Z

Now that #2771 has been merged, I've rebased this one, and manually tested the case of setting timeout to more than the configured one. It now works in both cases.


* add timeout query param for search requests * enable timeout for recommend requests * Add query timeout for group by requests * update openapi models * Don't decrease timeout after recommend preprocessing * Add openapi test * code review * add timeout to individual group by requests, non-decreasing * handle timeout for discover * Update timeout field tag in SearchBatchPoints message

coszio force-pushed the search-timeout branch from 36239a0 to bfba469 September 29, 2023 22:13

coszio requested a review from timvisee October 4, 2023 13:12

coszio marked this pull request as ready for review October 4, 2023 13:12

coszio requested a review from generall October 4, 2023 13:12

timvisee reviewed Oct 4, 2023


coszio force-pushed the search-timeout branch from 5ca3a14 to 63f3da6 October 5, 2023 13:58

timvisee mentioned this pull request Oct 9, 2023

Refactor with_channel_timeout #2771

Merged

9 tasks

This was referenced Oct 18, 2023

Indexing VS Search #2822

Closed

Timeout config doesn't affect search_bulk #2853

Closed

coszio force-pushed the search-timeout branch from 72c10d2 to 10a2eb1 November 1, 2023 14:54

coszio added 9 commits November 2, 2023 15:27

add timeout query param for search requests

c455775

enable timeout for recommend requests

b8eb09d

Add query timeout for group by requests

dae5348

update openapi models

e842f38

Don't decrease timeout after recommend preprocessing

a296e4a

Add openapi test

2688f7f

code review

543927d

add timeout to individual group by requests, non-decreasing

13f39d2

handle timeout for discover

2d81e40

coszio force-pushed the search-timeout branch from 10102c9 to 2d81e40 November 2, 2023 15:30

generall approved these changes Nov 2, 2023

lib/api/src/grpc/proto/points.proto

Update timeout field tag in SearchBatchPoints message

3e04876

coszio merged commit 4700e2a into dev Nov 2, 2023
18 checks passed

coszio deleted the search-timeout branch November 2, 2023 16:45

This was referenced Nov 2, 2023

Use consistent timeout status between groups api and other reads #2920

Open

Explicit timeout parameter for search requests. #2623

Closed
