Skip to content

Core: Fix thread conflict when deleting duplicate files in manifest#16686

Open
HuaHuaY wants to merge 1 commit into
apache:mainfrom
HuaHuaY:fix_thread_safe
Open

Core: Fix thread conflict when deleting duplicate files in manifest#16686
HuaHuaY wants to merge 1 commit into
apache:mainfrom
HuaHuaY:fix_thread_safe

Conversation

@HuaHuaY
Copy link
Copy Markdown

@HuaHuaY HuaHuaY commented Jun 5, 2026

deleteFiles and duplicateDeleteCount are not thread-safe and they are modified concurrently in filterManifests. In addition, duplicateDeleteCount will be counted repeatedly during the retry process after a commit failure.

Add two tests:
deleteDuplicateFilesWithinMultipleManifests ensures that the count of duplicate files is accurate under concurrent conditions.
deleteDuplicateFilesWithConcurrentDeleteRetry ensures that retrying is not affected by previous unsuccessful commits.

@github-actions github-actions Bot added the core label Jun 5, 2026
@HuaHuaY HuaHuaY force-pushed the fix_thread_safe branch from 128725f to c828a8b Compare June 5, 2026 06:43
@wgtmac
Copy link
Copy Markdown
Member

wgtmac commented Jun 5, 2026

@nastra @Fokko Could you help review this? @HuaHuaY found this issue when developing similar features in the iceberg-cpp.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants