Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-31027] [SQL] Refactor DataSourceStrategy to be more extendable #27778

Closed
wants to merge 10 commits into from

Conversation

dbtsai
Copy link
Member

@dbtsai dbtsai commented Mar 3, 2020

What changes were proposed in this pull request?

Refactor DataSourceStrategy.scala and DataSourceStrategySuite.scala so it's more extendable to implement nested predicate pushdown.

Why are the changes needed?

To support nested predicate pushdown.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Existing tests and new tests.

@dbtsai dbtsai changed the title [SPARK-31027] [SQL] Refactor DataSourceStrategy.scala to minimize the changes to support nested predicate pushdown [SPARK-31027] [SQL] Refactor DataSourceStrategy to be more extendable Mar 3, 2020
@dbtsai
Copy link
Member Author

dbtsai commented Mar 3, 2020

@SparkQA

This comment has been minimized.

Copy link
Member

@dongjoon-hyun dongjoon-hyun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1, LGTM. Thank you, @dbtsai .

@SparkQA

This comment has been minimized.

Copy link
Member

@viirya viirya left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Only one minor comment.

@SparkQA

This comment has been minimized.

@dbtsai dbtsai requested a review from cloud-fan March 4, 2020 04:51
@dbtsai
Copy link
Member Author

dbtsai commented Mar 4, 2020

@HyukjinKwon @dongjoon-hyun @viirya @rdblue @cloud-fan

Some changes based on @HyukjinKwon 's suggestion to use extractor pattern have been done. Would like to have your final reviews. Thanks,

Copy link
Member

@viirya viirya left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@dbtsai dbtsai requested a review from HyukjinKwon March 4, 2020 06:03
Copy link
Member

@HyukjinKwon HyukjinKwon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks okay to me.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@SparkQA

This comment has been minimized.

@cloud-fan
Copy link
Contributor

retest this please

Copy link
Contributor

@cloud-fan cloud-fan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM except a few minor comments

@SparkQA

This comment has been minimized.

@SparkQA
Copy link

SparkQA commented Mar 4, 2020

Test build #119293 has finished for PR 27778 at commit cad92d6.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@HyukjinKwon
Copy link
Member

Merged to master.

@dongjoon-hyun
Copy link
Member

Thank you all. Looks much better!

dbtsai added a commit that referenced this pull request Mar 4, 2020
### What changes were proposed in this pull request?
Refactor `DataSourceStrategy.scala` and `DataSourceStrategySuite.scala` so it's more extendable to implement nested predicate pushdown.

### Why are the changes needed?
To support nested predicate pushdown.

### Does this PR introduce any user-facing change?
No.

### How was this patch tested?
Existing tests and new tests.

Closes #27778 from dbtsai/SPARK-31027.

Authored-by: DB Tsai <d_tsai@apple.com>
Signed-off-by: HyukjinKwon <gurwls223@apache.org>
@dbtsai dbtsai deleted the SPARK-31027 branch March 4, 2020 17:32
@dbtsai
Copy link
Member Author

dbtsai commented Mar 4, 2020

Also merged into 3.0 branch since it's just refactoring without introducing new feature. It will make code maintenance easier for 3.0 and future master. Thanks all for reviewing.

sjincho pushed a commit to sjincho/spark that referenced this pull request Apr 15, 2020
### What changes were proposed in this pull request?
Refactor `DataSourceStrategy.scala` and `DataSourceStrategySuite.scala` so it's more extendable to implement nested predicate pushdown.

### Why are the changes needed?
To support nested predicate pushdown.

### Does this PR introduce any user-facing change?
No.

### How was this patch tested?
Existing tests and new tests.

Closes apache#27778 from dbtsai/SPARK-31027.

Authored-by: DB Tsai <d_tsai@apple.com>
Signed-off-by: HyukjinKwon <gurwls223@apache.org>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
8 participants