
Make Remote Compaction Failures fall back to local in Stress Test #13945

Closed
jaykorean wants to merge 1 commit into facebook:main from jaykorean:fix_remote_compaction_stress_test_5



@jaykorean
Contributor

Summary

This PR enables Stress Test to fall back to local compaction when a remote compaction fails, allowing the compaction to be retried on the main thread.

If the local compaction succeeds, the stress test will continue without failing. The main thread will log that the remote compaction failed and was retried locally, while detailed failure logs from the remote compaction attempt will still be printed by the worker thread for further investigation.

This approach allows us to keep collecting useful logs for diagnosing remote compaction failures in Stress Test, while ensuring the test continues to run with remote compaction enabled.
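The fallback flow described above can be sketched roughly as follows. This is an illustrative Python sketch only, not the RocksDB C++ change in this PR: `compact_with_fallback`, `CompactionResult`, `run_remote`, and `run_local` are hypothetical names introduced here, and the two callables stand in for whatever the stress test actually does to run a compaction remotely or locally.

```python
import logging
from dataclasses import dataclass
from typing import Callable

logger = logging.getLogger("stress_test_sketch")


@dataclass
class CompactionResult:
    ok: bool            # did the compaction ultimately succeed?
    ran_remotely: bool  # True only if the remote attempt succeeded
    detail: str = ""    # why the remote attempt failed, if it did


def compact_with_fallback(
    run_remote: Callable[[], bool],
    run_local: Callable[[], bool],
) -> CompactionResult:
    """Try a remote compaction; if it fails, retry locally on the caller's thread.

    Both callables are hypothetical stand-ins that return True on success.
    """
    try:
        if run_remote():
            return CompactionResult(ok=True, ran_remotely=True)
        reason = "remote worker reported failure"
    except Exception as exc:  # the remote worker is assumed to log its own details
        reason = f"remote worker raised: {exc!r}"

    # The caller only records that a fallback happened; the detailed
    # diagnostics are assumed to come from the remote worker's own logs.
    logger.warning("remote compaction failed (%s); retrying locally", reason)

    if run_local():
        return CompactionResult(ok=True, ran_remotely=False, detail=reason)
    return CompactionResult(ok=False, ran_remotely=False, detail=reason)
```

In this sketch the test would only fail when `ok` is false, that is, when the local retry also fails; a failed remote attempt followed by a successful local retry is recorded in `detail` and the run continues.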

Test Plan

python3 -u tools/db_crashtest.py --cleanup_cmd='' --simple blackbox --remote_compaction_worker_threads=8 --interval=10

Also run in Meta internal CI.

@meta-cla meta-cla bot added the CLA Signed label Sep 12, 2025
@jaykorean jaykorean requested a review from hx235 September 12, 2025 02:46
@jaykorean jaykorean marked this pull request as ready for review September 12, 2025 03:12
@facebook-github-bot
Contributor

@jaykorean has imported this pull request. If you are a Meta employee, you can view this in D82279337.

@facebook-github-bot
Contributor

@jaykorean merged this pull request in 4f12c55.

@jaykorean jaykorean deleted the fix_remote_compaction_stress_test_5 branch September 12, 2025 21:28

3 participants