Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HIVE-28102: Iceberg: Invoke validateDataFilesExist for RowDelta operations. #5111

Merged
merged 2 commits into from Mar 4, 2024

Conversation

ayushtkn
Copy link
Member

@ayushtkn ayushtkn commented Mar 1, 2024

What changes were proposed in this pull request?

Validate Data Files Exist for RowDelta Operations

Why are the changes needed?

RewriteFiles Operations called by compaction jobs(Spark), leads to data corruption when running concurrently with MERGE?UPDATE/DELETE Queries

Does this PR introduce any user-facing change?

Prevents Data corruption

Is the change a dependency upgrade?

No

How was this patch tested?

CI

@ayushtkn
Copy link
Member Author

ayushtkn commented Mar 1, 2024

converted to draft: we need to referenced data files during in delete files, will push a new commit

@ayushtkn
Copy link
Member Author

ayushtkn commented Mar 1, 2024

Changed to validate the referenced data files in the delete files, will see how the build reacts, if all calm will do some touchups & call for review

Copy link
Member

@deniskuzZ deniskuzZ left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM +1

Copy link

sonarcloud bot commented Mar 4, 2024

Quality Gate Passed Quality Gate passed

Issues
2 New issues
0 Accepted issues

Measures
0 Security Hotspots
No data about Coverage
No data about Duplication

See analysis details on SonarCloud

@ayushtkn ayushtkn merged commit 41bf5d5 into apache:master Mar 4, 2024
5 checks passed
deniskuzZ pushed a commit that referenced this pull request Mar 5, 2024
…tions. (#5111). (Ayush Saxena, reviewed by Denys Kuzmenko)

(cherry picked from commit 41bf5d5)
dengzhhu653 pushed a commit to dengzhhu653/hive that referenced this pull request Mar 7, 2024
…tions. (apache#5111). (Ayush Saxena, reviewed by Denys Kuzmenko)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
3 participants