Optimize query filtering on the result of another query

Example query:
`SELECT COUNT(*) FROM table1 WHERE id1 IN (SELECT id2 FROM table2 WHERE ...) ...`

A naive solution would be sending the sub-query first, gather the result, then use the result to construct the main query. It works for cases where the result for the sub-query is small (< 100), but when the result size becomes big (> 1000), the cost of ser/de, query compilation and query processing will be too high.

In order to optimize this query, we need to reduce the number of ids to process. We can rely on the partitioning to achieve that. When all the segments for a partition is on a single server, we can solve the query for the partition on the server side without sending and merging the result of the sub-query on the broker.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize query filtering on the result of another query #5925

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Optimize query filtering on the result of another query #5925

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions