[HUDI-0001] DO NOT MERGE Ci not working test #9983

Closed
jonvex wants to merge 13 commits into apache:master from jonvex:ci_not_working_test

Conversation

@jonvex
Contributor

@jonvex jonvex commented Nov 3, 2023

Change Logs

testassdfa

Impact

sdafdsaf

Risk level (write none, low, medium or high below)

none
asdfasd

Documentation Update

asfdfad

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

linliu-code and others added 13 commits October 27, 2023 23:02
To support incremental queries, we do the following:

1. Extend HoodieFileIndex as HoodieIncrementalFileIndex, in which
   we override the listFiles and inputFiles methods to return only
   the files affected during the specified time range.
2. Build a factory class that creates a different HadoopFsRelation for
   each query type.
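
The two steps above can be illustrated with a minimal, self-contained sketch. It does not use the Hudi or Spark APIs: `DataFile`, `FileIndex`, `IncrementalFileIndex`, `QueryType`, and `createIndex` are hypothetical stand-ins, and the range semantics (exclusive start, inclusive end, instants compared as strings) are an assumption rather than something the commit message states.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class IncrementalIndexSketch {
    // Hypothetical stand-in for a data file tagged with the commit instant that wrote it.
    static final class DataFile {
        final String path;
        final String commitTime;
        DataFile(String path, String commitTime) {
            this.path = path;
            this.commitTime = commitTime;
        }
    }

    // Hypothetical base index: lists every file it knows about.
    static class FileIndex {
        protected final List<DataFile> files;
        FileIndex(List<DataFile> files) { this.files = files; }
        List<DataFile> listFiles() { return new ArrayList<>(files); }
    }

    // Incremental variant: overrides listFiles to keep only files written in (start, end].
    static class IncrementalFileIndex extends FileIndex {
        private final String startExclusive;
        private final String endInclusive;
        IncrementalFileIndex(List<DataFile> files, String startExclusive, String endInclusive) {
            super(files);
            this.startExclusive = startExclusive;
            this.endInclusive = endInclusive;
        }
        @Override
        List<DataFile> listFiles() {
            List<DataFile> affected = new ArrayList<>();
            for (DataFile f : files) {
                // Assumption: instants are fixed-width timestamps, so string order is time order.
                if (f.commitTime.compareTo(startExclusive) > 0
                        && f.commitTime.compareTo(endInclusive) <= 0) {
                    affected.add(f);
                }
            }
            return affected;
        }
    }

    enum QueryType { SNAPSHOT, INCREMENTAL }

    // Factory: picks the index implementation from the query type.
    static FileIndex createIndex(QueryType type, List<DataFile> files, String start, String end) {
        return type == QueryType.INCREMENTAL
                ? new IncrementalFileIndex(files, start, end)
                : new FileIndex(files);
    }

    public static void main(String[] args) {
        List<DataFile> files = Arrays.asList(
                new DataFile("a.parquet", "20231027230200"),
                new DataFile("b.parquet", "20231101120000"),
                new DataFile("c.parquet", "20231103222800"));
        FileIndex index = createIndex(QueryType.INCREMENTAL, files, "20231027230200", "20231101120000");
        for (DataFile f : index.listFiles()) {
            System.out.println(f.path);
        }
    }
}
```

Keeping the filtering inside an overridden `listFiles` means callers that only know the base type need no changes; the factory is the single place that decides which variant a query gets.
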
To do this, we did the following:

1. Duplicated the CDCFileGroupIterator logic.
2. Created HoodieCDCFileIndex, which lists files by HoodieFileGroupId; the ID is embedded in the
   partitionValues of PartitionedFiles.
3. Created a HadoopFsRelation in DefaultSource based on the table type: COW or MOR.
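
As a rough illustration of step 2, the sketch below groups files by a file-group key so that each group can be handed to a reader as one unit. It is not the Hudi implementation: `ChangeFile`, `groupByFileGroup`, and the `partitionPath/fileId` key format are hypothetical, chosen only to mimic the idea of carrying a file group ID alongside each listed file.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class CdcGroupingSketch {
    // Hypothetical change file; partition path plus file id identify its file group.
    static final class ChangeFile {
        final String partitionPath;
        final String fileId;
        final String path;
        ChangeFile(String partitionPath, String fileId, String path) {
            this.partitionPath = partitionPath;
            this.fileId = fileId;
            this.path = path;
        }
    }

    // Groups file paths under a "partitionPath/fileId" key (an assumed key format),
    // preserving the order in which file groups are first seen.
    static Map<String, List<String>> groupByFileGroup(List<ChangeFile> files) {
        Map<String, List<String>> groups = new LinkedHashMap<>();
        for (ChangeFile f : files) {
            String key = f.partitionPath + "/" + f.fileId;
            groups.computeIfAbsent(key, k -> new ArrayList<>()).add(f.path);
        }
        return groups;
    }

    public static void main(String[] args) {
        List<ChangeFile> files = Arrays.asList(
                new ChangeFile("2023/11/01", "fg-1", "fg-1_001.parquet"),
                new ChangeFile("2023/11/01", "fg-2", "fg-2_001.parquet"),
                new ChangeFile("2023/11/01", "fg-1", "fg-1_002.log"));
        for (Map.Entry<String, List<String>> e : groupByFileGroup(files).entrySet()) {
            System.out.println(e.getKey() + " -> " + e.getValue());
        }
    }
}
```
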
…s because df.count() is wrong while df.rdd.count() is correct
@jonvex jonvex closed this Nov 3, 2023
@jonvex jonvex deleted the ci_not_working_test branch November 3, 2023 22:28

2 participants