Data flow: Performance tuning #7232

aschackmull · 2021-11-24T13:47:02Z

This is a sequence of smaller performance tweaks (commit-by-commit review is encouraged). All of them relate to reducing work by avoiding splits of the tuple stream in the main recursive pipelines. Either by making sure negations are simple anti-joins (thereby also avoiding materialisation) or by avoiding tuple duplication arising from disjunctive filters.

MathiasVP

LGTM if DCA is happy! I'm curious about the performance implications of removing disjunction-induced tuple duplication. (Could we have a ql-for-ql query for this?)

hvitved

LGTM

atorralba

FWIW, LGTM

RasmusWL

👍 from Python (after also talking through what some of the changes do 👍)

aschackmull added 6 commits November 23, 2021 11:35

Dataflow: Remove negation materialization.

e711ba9

Dataflow: Pull ccc.matchesCall(call) from the recursive loop.

f5f67dd

Dataflow: Remove disjunction-induced tuple duplication.

822890f

Dataflow: Improve barrier handling.

4efdcc2

Dataflow: Remove more disjunction-induced tuple duplication.

a7ec0fa

Dataflow: Sync.

7ca3407

aschackmull added the no-change-note-required This PR does not need a change note label Nov 24, 2021

aschackmull requested review from a team as code owners November 24, 2021 13:47

github-actions bot added C# C++ Java Python Ruby labels Nov 24, 2021

MathiasVP approved these changes Nov 24, 2021

View reviewed changes

hvitved approved these changes Nov 25, 2021

View reviewed changes

atorralba approved these changes Nov 25, 2021

View reviewed changes

RasmusWL approved these changes Nov 25, 2021

View reviewed changes

aschackmull merged commit a066429 into github:main Nov 25, 2021

aschackmull deleted the dataflow/perf branch November 25, 2021 14:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Data flow: Performance tuning #7232

Data flow: Performance tuning #7232

Uh oh!

aschackmull commented Nov 24, 2021

Uh oh!

MathiasVP left a comment

Uh oh!

hvitved left a comment

Uh oh!

atorralba left a comment

Uh oh!

RasmusWL left a comment

Uh oh!

Uh oh!

Data flow: Performance tuning #7232

Data flow: Performance tuning #7232

Uh oh!

Conversation

aschackmull commented Nov 24, 2021

Uh oh!

MathiasVP left a comment

Choose a reason for hiding this comment

Uh oh!

hvitved left a comment

Choose a reason for hiding this comment

Uh oh!

atorralba left a comment

Choose a reason for hiding this comment

Uh oh!

RasmusWL left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!