Skip to content

Comments

[HUDI-3695] Add a ORC reader in HoodieBaseRelation#5111

Closed
miomiocat wants to merge 1 commit intoapache:masterfrom
miomiocat:orc-reader
Closed

[HUDI-3695] Add a ORC reader in HoodieBaseRelation#5111
miomiocat wants to merge 1 commit intoapache:masterfrom
miomiocat:orc-reader

Conversation

@miomiocat
Copy link
Contributor

@miomiocat miomiocat commented Mar 23, 2022

What is the purpose of the pull request

Add a orc reader in HoodieBaseRelation

Brief change log

Add a orc reader in HoodieBaseRelation

Verify this pull request

This pull request is already covered by existing tests

Committer checklist

  • Has a corresponding JIRA in PR title & commit

  • Commit message is descriptive of the change

  • CI is green

  • Necessary doc changes done or have another open PR

  • For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.

@miomiocat
Copy link
Contributor Author

@alexeykudinkin @YannByron PTAL

filters: Seq[Filter],
options: Map[String, String],
hadoopConf: Configuration): PartitionedFile => Iterator[InternalRow] = {
val hfileReader = createHFileReader(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The reason why it was called here as opposed to call w/in the returned lambda -- is to avoid serialization of all these passed in parameters down to Spark executor. This is likely the reason why your build is failing at the moment.

Let's keep it as is, and please add a comment elaborating why this should be done in here (and executed w/in Spark Driver)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for your advise 👍

Modified according to your comments, PTAL

@miomiocat miomiocat force-pushed the orc-reader branch 2 times, most recently from ff29427 to a9ddb49 Compare March 24, 2022 02:57
@miomiocat miomiocat changed the title [HUDI-3695]Add orc reader in HoodieBaseRelation [HUDI-3695] Add a ORC reader in HoodieBaseRelation Mar 24, 2022
@XuQianJin-Stars
Copy link
Contributor

@miomiocat Add UT Test for this PR?

@nsivabalan
Copy link
Contributor

@alexeykudinkin : can you follow up on the review when you get a chance.
@miomiocat : can you rebase w/ latest master

@miomiocat
Copy link
Contributor Author

@alexeykudinkin : can you follow up on the review when you get a chance. @miomiocat : can you rebase w/ latest master

Thanks for your reply.

Rebased, PTAL

@XuQianJin-Stars
Copy link
Contributor

hi @miomiocat fix the CI failed.

@miomiocat
Copy link
Contributor Author

hi @miomiocat fix the CI failed.

fixed, PTAL

@miomiocat
Copy link
Contributor Author

@alexeykudinkin @XuQianJin-Stars Could you help to review this PR?

@hudi-bot
Copy link
Collaborator

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

@alexeykudinkin
Copy link
Contributor

@miomiocat can you please rebase this one?

@miomiocat miomiocat closed this Aug 16, 2022
@miomiocat
Copy link
Contributor Author

@miomiocat can you please rebase this one?

Thanks for your reply, but I don't have much time to work on this PR recently. closed it.

@miomiocat miomiocat deleted the orc-reader branch August 19, 2022 06:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants