Introduce a query backlog in the index #1896

tobim · 2021-10-01T13:58:24Z

📔 Description

Before this change the index would push queries back into its own message queue when no worker was available. This means we don't have to skip lots of queries any more whenever the index receives a new message.

The change also introduces a stricter restriction on the number of parallel queries. The old version would interleave the work for all queries, i.e. a worker would switch to a random work package from the pending queries map when it was done with the one before, but now a worker gets assigned to a query until that is completely done.

📝 Checklist

All user-facing changes have changelog entries.
The PR description contains instructions for the reviewer, if necessary.

🎯 Review Instructions

dominiklohmann · 2021-10-01T16:18:15Z

@tobim and I collaborated on the earlier. The code looks good to me, but it needs testing (and performance testing!) before merging, and also a changelog entry for the changed behavior of vast.max-queries if this turns out to be a performance-relevant change.

Before this change the index would push queries back into its own message queue when no worker was available. This means we don't have to skip lots of queries any more whenever the index receives a new message. The change also introduces a stricter restriction on the number of parallel queries. The old version would interleave the work for all queries, i.e. a worker would switch to a random work package from the pending queries map when it was done with the one before, but now a worker gets assigned to a query until that is completely done.

dominiklohmann

I think this can safely be merged. In limited testing, I did not notice a major overall performance impact, although the changed behavior can easily be observed: When issuing thousands of queries at around the same time, the average response time for earlier queries is lower compared to before this change, while it is higher for later queries.

I think additional logging makes sense on the verbose level. We should notify the user when …

… a query is pushed onto the backlog (trigger: new query, no worker available)
… a query is popped from the backlog (trigger: new worker, query backlog not empty)
… a query is executed immediately because workers were available (trigger: new query, worker available)
… a query has finished, i.e., a worker goes back into the pool of available workers (trigger: new worker, query backlog empty)

I am approving this as-is, but please make the requested changes before merging.

libvast/src/system/index.cpp

... and demote another one to DEBUG.

This reverts commit 62248a1, reversing changes made to 3f602a2.

tobim added the maintenance Tasks for keeping up the infrastructure label Oct 1, 2021

tobim requested a review from a team October 1, 2021 13:58

tobim changed the title ~~Story/ch28702/query backlog~~ Introduce a query backlog in the index Oct 1, 2021

tobim added 4 commits October 11, 2021 11:22

Use a less powerful cast to delegate in the index

6437efa

Modify mock indexes to add the new query handler

18b86e2

Add a changelog entry

73f399d

tobim force-pushed the story/ch28702/query-backlog branch from 8219d62 to 73f399d Compare October 11, 2021 10:01

dominiklohmann approved these changes Oct 12, 2021

View reviewed changes

libvast/src/system/index.cpp Outdated Show resolved Hide resolved

libvast/src/system/index.cpp Outdated Show resolved Hide resolved

dominiklohmann added the performance Improvements or regressions of performance label Oct 12, 2021

tobim added 2 commits October 18, 2021 08:48

Add log messages to trace the backlog

c070218

Remove a nonsensical log message

554dcfc

... and demote another one to DEBUG.

tobim enabled auto-merge October 19, 2021 07:46

tobim merged commit 62248a1 into master Oct 19, 2021

tobim deleted the story/ch28702/query-backlog branch October 19, 2021 08:46

tobim added a commit that referenced this pull request Nov 8, 2021

Revert "Merge pull request #1896"

85f1233

This reverts commit 62248a1, reversing changes made to 3f602a2.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce a query backlog in the index #1896

Introduce a query backlog in the index #1896

tobim commented Oct 1, 2021 •

edited

dominiklohmann commented Oct 1, 2021

dominiklohmann left a comment

Introduce a query backlog in the index #1896

Introduce a query backlog in the index #1896

Conversation

tobim commented Oct 1, 2021 • edited

📔 Description

📝 Checklist

🎯 Review Instructions

dominiklohmann commented Oct 1, 2021

dominiklohmann left a comment

Choose a reason for hiding this comment

tobim commented Oct 1, 2021 •

edited