An exciting and daring idea about the trade-off between query latency and accuracy. #1465

Closed
xuguruogu opened this issue Dec 17, 2019 · 0 comments
Labels: type/question (Type: question about the product), wontfix (Solution: this will not be worked on recently)

Comments

xuguruogu (Collaborator) commented Dec 17, 2019

Our query latency requirements are strict; at the same time, we can tolerate lower accuracy from missing some records. In most cases we would discard part of the result set anyway if it is larger than expected.

To avoid uncontrollably large intermediate output, why not assign a time limit to each step in the job DAG / non-parallelizable execution plan? If the maximum execution time is reached, simply move on to the next step and drop the remaining replies from peers (see the sketch below).

As TCP is inherently non-parallelizable, making full use of UDP may be a better solution.
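
This is not Nebula's actual implementation, just a minimal Go sketch of the per-step deadline idea; `queryPeer` and the per-peer latencies are invented for illustration. The coordinator fans a step out to its peers, keeps whatever replies arrive within the time budget, and drops the stragglers before moving on to the next step:

```go
package main

import (
	"context"
	"fmt"
	"time"
)

// queryPeer is a hypothetical stand-in for one peer's share of a DAG step;
// latency simulates how long that peer takes to reply.
func queryPeer(ctx context.Context, peer string, latency time.Duration) (string, error) {
	select {
	case <-time.After(latency):
		return peer + ":records", nil
	case <-ctx.Done():
		return "", ctx.Err() // reply abandoned: the step has already moved on
	}
}

// runStep fans a step out to all peers and keeps whatever arrives before
// maxExecTime elapses; late replies are dropped, trading accuracy for latency.
func runStep(latencies map[string]time.Duration, maxExecTime time.Duration) []string {
	ctx, cancel := context.WithTimeout(context.Background(), maxExecTime)
	defer cancel()

	results := make(chan string, len(latencies)) // buffered: late goroutines never block
	for peer, lat := range latencies {
		go func(peer string, lat time.Duration) {
			if r, err := queryPeer(ctx, peer, lat); err == nil {
				results <- r
			}
		}(peer, lat)
	}

	var partial []string
	for range latencies {
		select {
		case r := <-results:
			partial = append(partial, r)
		case <-ctx.Done():
			return partial // deadline reached: continue to the next step
		}
	}
	return partial // every peer replied in time: no accuracy lost
}

func main() {
	peers := map[string]time.Duration{
		"peer-a": 20 * time.Millisecond,
		"peer-b": 40 * time.Millisecond,
		"peer-c": 500 * time.Millisecond, // straggler: its reply will be dropped
	}
	fmt.Println(runStep(peers, 100*time.Millisecond))
}
```

Running this prints only peer-a's and peer-b's rows; peer-c's reply misses the 100 ms budget and is silently discarded, exactly the latency-for-accuracy trade described above.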

I wonder whether the problem could be solved by providing max_exec_time / discard_rest_after_sometime semantics in the DSL.
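
Purely as illustration (neither keyword exists in nGQL; this is only what the proposal might look like):

```ngql
# hypothetical keyword: abandon this traversal step after 50 ms
# and continue with whatever partial results have arrived
GO 2 STEPS FROM 100 OVER follow MAX_EXEC_TIME 50ms YIELD follow._dst
```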

jude-zhu added the v2.0 label May 26, 2020
CPWstatic added the type/question (Type: question about the product) label Aug 28, 2021
Sophie-Xie added the wontfix (Solution: this will not be worked on recently) label Aug 31, 2021