-
Notifications
You must be signed in to change notification settings - Fork 2.5k
[HUDI-8990] PartitionBucketIndexManager Enhance #13106
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@hudi-bot run azure |
|
|
||
| public static List<PartitionBucketIndexHashingConfig> getAllHashingConfig(HoodieTableMetaClient metaClient) throws IOException { | ||
| String basePath = metaClient.getBasePath().toString(); | ||
| List<StoragePath> allHashingConfig = getCommittedHashingConfigInstants(metaClient).stream().map(instant -> { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
allHashingConfig -> allHashingConfigPaths
| ValidationUtils.checkArgument(instantToLoad.isPresent(), "Can not load latest hashing config " + instantToLoad); | ||
| Option<PartitionBucketIndexHashingConfig> latestHashingConfig = loadHashingConfig(metaClient.getStorage(), getHashingConfigPath(metaClient.getBasePath().toString(), instantToLoad.get())); | ||
| ValidationUtils.checkArgument(latestHashingConfig.isPresent(), "Can not load latest hashing config " + instantToLoad); | ||
| Option<StoragePath> hashingConfigToLoad = getHashingConfigToLoad(metaClient, Option.empty()); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
hashingConfigPathToLoad
| Option<PartitionBucketIndexHashingConfig> latestHashingConfig = loadHashingConfig(metaClient.getStorage(), | ||
| getHashingConfigPath(metaClient.getBasePath().toString(), hashingConfigInstantToLoad.get())); | ||
| ValidationUtils.checkArgument(latestHashingConfig.isPresent(), "Can not load hashing config " + hashingConfigInstantToLoad + " based on " + instant); | ||
| Option<StoragePath> hashingConfigToLoad = getHashingConfigToLoad(metaClient, Option.of(instant)); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
hashingConfigPathToLoad
| dataFrame) | ||
|
|
||
| val details = s"Expression: $expression, Bucket Number: $bucketNumber, Dry Run: $dryRun" | ||
| val details = s"Expression: $expression, Bucket Number: $bucketNumber, Dry Run: $dryRun" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is the option val case-sensitive from users?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As a config name, it is case-sensitive.
To make it more user-friendly, change the config name from dryRun to dry_run like other Hoodie call command configs such as RollbackToSavepointProcedure => instant_time config name
|
@hudi-bot run azure |
|
The Azure CI failure is a known env issue: https://dev.azure.com/apachehudi/hudi-oss-ci/_build/results?buildId=4805&view=results |
Change Logs
Enhance PartitionBucketIndexManager Support
Impact
PartitionBucketIndexManager
Risk level (write none, low medium or high below)
low
Documentation Update
Describe any necessary documentation update if there is any new feature, config, or user-facing change. If not, put "none".
ticket number here and follow the instruction to make
changes to the website.
Contributor's checklist