Update API for requesting dataflow plan optimization #1278

Merged (1 commit) on Jun 25, 2024

@tlento tlento commented Jun 14, 2024

In order to fully support predicate pushdown via the DataflowPlanOptimizer
framework we need two things:

  1. Support for optimization in distinct values queries
  2. The ability to share components between the DataflowPlanBuilder
    and the PredicatePushdownOptimizer

This update addresses both of these concerns by doing a small restructure
of the DataflowPlanBuilder interface for accepting optimizers. Instead of
accepting a sequence of optimizer instances, the build_plan method will now
accept a sequence of optimization enumerations. Those will then be converted
to instances via the factory class added in this change.
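
The shape of this change can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: every name below (`Optimization`, `OptimizerFactory`, `SharedResolver`, `PlanBuilder`, `build_plan_for_distinct_values`) is a hypothetical stand-in, and plans are represented as plain strings to keep the example self-contained.

```python
from enum import Enum
from typing import Sequence


class Optimization(Enum):
    """Hypothetical enumeration of optimizations a caller can request."""

    SOURCE_SCAN = "source_scan"
    PREDICATE_PUSHDOWN = "predicate_pushdown"


class SharedResolver:
    """Stand-in for a component shared by the builder and an optimizer."""


class SourceScanOptimizer:
    def optimize(self, plan: str) -> str:
        return f"source_scan({plan})"


class PredicatePushdownOptimizer:
    def __init__(self, resolver: SharedResolver) -> None:
        # The optimizer reuses the builder's resolver instead of creating its own.
        self.resolver = resolver

    def optimize(self, plan: str) -> str:
        return f"pushdown({plan})"


class OptimizerFactory:
    """Converts enumeration values into optimizer instances (assumed design)."""

    def __init__(self, resolver: SharedResolver) -> None:
        self._resolver = resolver

    def get_optimizer(self, optimization: Optimization):
        if optimization is Optimization.SOURCE_SCAN:
            return SourceScanOptimizer()
        if optimization is Optimization.PREDICATE_PUSHDOWN:
            return PredicatePushdownOptimizer(self._resolver)
        raise ValueError(f"Unsupported optimization: {optimization}")


class PlanBuilder:
    def __init__(self) -> None:
        self._resolver = SharedResolver()
        self._factory = OptimizerFactory(self._resolver)

    def _apply(self, plan: str, optimizations: Sequence[Optimization]) -> str:
        # Optimizers are instantiated on demand and applied in the order given.
        for optimization in optimizations:
            plan = self._factory.get_optimizer(optimization).optimize(plan)
        return plan

    def build_plan(self, query: str, optimizations: Sequence[Optimization] = ()) -> str:
        return self._apply(f"plan[{query}]", optimizations)

    def build_plan_for_distinct_values(
        self, query: str, optimizations: Sequence[Optimization] = ()
    ) -> str:
        # The distinct values path accepts the same enumeration sequence.
        return self._apply(f"distinct_plan[{query}]", optimizations)
```

Under these assumptions, callers pass enumeration values rather than constructing optimizers themselves, which leaves the builder free to hand its own internal components to any optimizer that needs them.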

From there the update to the distinct values plan method signature was
a trivial addition.

Note - snapshot updates should be limited to ID numbers due to the added
call to the DataflowPlanNodeOutputDataSetResolver in the distinct values
plan.

Thank you for your pull request! We could not find a changelog entry for this change. For details on how to document a change, see the contributing guide.

@courtneyholcomb courtneyholcomb left a comment

LGTM!

tlento commented Jun 25, 2024

Merge activity

  • Jun 24, 8:29 PM PDT: @tlento started a stack merge that includes this pull request via Graphite.
  • Jun 24, 8:52 PM PDT: Graphite rebased this pull request as part of a merge.
  • Jun 24, 8:56 PM PDT: @tlento merged this pull request with Graphite.

@tlento tlento force-pushed the consolidate-where-filter-merging branch from 451ab0d to ad0298d Compare June 25, 2024 03:48
Base automatically changed from consolidate-where-filter-merging to main June 25, 2024 03:51
@tlento tlento force-pushed the update-dataflow-optimizer-call-mechanism branch from ae3d6ff to 8d4d7fd Compare June 25, 2024 03:52