Skip to content

fix: Fixing record index related configs and enums#14180

Merged
bhasudha merged 4 commits intoapache:masterfrom
nsivabalan:partitionedRLI_Rename
Nov 5, 2025
Merged

fix: Fixing record index related configs and enums#14180
bhasudha merged 4 commits intoapache:masterfrom
nsivabalan:partitionedRLI_Rename

Conversation

@nsivabalan
Copy link
Contributor

@nsivabalan nsivabalan commented Oct 29, 2025

Describe the issue this Pull Request addresses

Standardize config names and enums related to record level index.

Index type (hoodie.index.type):
Prior to this patch:
RECORD_INDEX - > refers to global version of RLI
PARTITIONED_RECORD_INDEX -> refers to non global version of RLI

Change with this PR:
GLOBAL_RECORD_LEVEL_INDEX -> refers to global version of RLI
RECORD_LEVEL_INDEX -> refers to non global version of RLI

Writer property to enable for mdt writes:
Prior to this patch:
hoodie.metadata.record.index.enable -> enables global version of RLI with mdt writes.
hoodie.metadata.partitioned.record.index.enable -> enables non global version of RLI with mdt writes.

Change with this PR:
hoodie.metadata.global.record.level.index.enable -> enables global version of RLI with mdt writes.
hoodie.metadata.record.level.index.enable -> enables non global version of RLI with mdt writes.

Summary and Changelog

Switching index type of global version of RLI from RECORD_INDEX to GLOBAL_RECORD_LEVEL_INDEX. (Added b/w compatibilty support as well).
Switching index type of non-global version of RLI from PARTITIONED_RECORD_INDEX to RECORD_LEVEL_INDEX.

Switching writer property to enable global RLI with mdt writes from hoodie.metadata.record.index.enable to hoodie.metadata.global.record.level.index.enable. Added b/w compatibility support as well.
Switching writer property to enable non global RLI with mdt writes from hoodie.metadata.partitioned.record.index.enable to hoodie.metadata.record.level.index.enable.

Impact

Standard naming conventions in line with other indexes.

Risk Level

low.

Documentation Update

Contributor's checklist

  • Read through contributor's guide
  • Enough context is provided in the sections above
  • Adequate tests were added if applicable

@github-actions github-actions bot added the size:XL PR with lines of changes > 1000 label Oct 29, 2025
@nsivabalan nsivabalan force-pushed the partitionedRLI_Rename branch from 1aa75f5 to 6352a9b Compare October 29, 2025 22:39
DataSourceWriteOptions.TABLE_TYPE.key -> HoodieTableType.MERGE_ON_READ.name(),
"hoodie.metadata.index.column.stats.enable" -> "false",
HoodieMetadataConfig.RECORD_INDEX_ENABLE_PROP.key -> "false")
HoodieMetadataConfig.GLOBAL_RECORD_LEVEL_INDEX_ENABLE_PROP.key -> "false")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could we also add a test with the old config naming to validate it works and prevents any breaking change in the future?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I fixed one of the test to make it parametrized for this.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@nsivabalan nsivabalan force-pushed the partitionedRLI_Rename branch from 9f5caa2 to ba483f0 Compare November 3, 2025 19:57
"RECORD_INDEX",
"GLOBAL_RECORD_LEVEL_INDEX")
)
// tests b/w compatibility of configs for global RLI
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tests added for b/w compatibility

"hoodie.metadata.record.index.enable",
"hoodie.metadata.global.record.level.index.enable")
)
// tests b/w compatibility of configs for global RLI
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tests added for b/w compatibility

@hudi-bot
Copy link
Collaborator

hudi-bot commented Nov 4, 2025

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

@bhasudha bhasudha self-requested a review November 5, 2025 17:01
Copy link
Contributor

@bhasudha bhasudha left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@bhasudha bhasudha merged commit 8038180 into apache:master Nov 5, 2025
132 of 137 checks passed
yihua pushed a commit to yihua/hudi that referenced this pull request Nov 10, 2025
* Fixing record index related configs and enums

* Fixing compilation after rebase

* Addressing feedback from Ethan

* fixing compilation error
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

release-1.1.0 size:XL PR with lines of changes > 1000

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants