fix: reuse Inflater/Deflater in BitCaskDiskMap to avoid JDK8 finalizer contention by vamsikarnika · Pull Request #18818 · apache/hudi

vamsikarnika · 2026-05-22T12:45:41Z

CompressionHandler currently allocates a new Deflater on every compressBytes() and a new Inflater on every decompressBytes(). On JDK 8 both classes register a Finalizer on construction. Under sustained, multi-threaded disk-map traffic (observed during MDT/RLI compaction merging millions of records across several Spark task threads on the same executor), the rate of zlib allocations exceeds the rate at which the single Finalizer thread can drain its queue. Native ZStreamRef handles pile up in old gen, the heap saturates, and G1 enters a mixed-GC death spiral while application threads make no progress.

CompressionHandler is already held in a ThreadLocal, so a single Deflater/ Inflater pair per worker thread is sufficient. This change:

Adds transient Deflater/Inflater fields and lazy accessors (transient so the class remains Serializable; lazy so deserialized instances rebuild the codecs on first use).
Calls reset() on the cached codecs at the start of each call.
Passes the user-supplied codecs to DeflaterOutputStream(out, def) and InflaterInputStream(in, inf), which sets usesDefaultDeflater/Inflater to false so close() does not call end() on the codec — the codec survives the try-with-resources for reuse on the next call.

On-disk format, compression level, and error semantics are unchanged. Allocation rate drops from O(records) to O(threads). On JDK 9+ this also removes per-call Cleaner registration overhead.

Describe the issue this Pull Request addresses

Summary and Changelog

Impact

Risk Level

Documentation Update

Contributor's checklist

Read through contributor's guide
Enough context is provided in the sections above
Adequate tests were added if applicable

…finalizer contention CompressionHandler currently allocates a new Deflater on every compressBytes() and a new Inflater on every decompressBytes(). On JDK 8 both classes register a Finalizer on construction. Under sustained, multi-threaded disk-map traffic (observed during MDT/RLI compaction merging millions of records across several Spark task threads on the same executor), the rate of zlib allocations exceeds the rate at which the single Finalizer thread can drain its queue. Native ZStreamRef handles pile up in old gen, the heap saturates, and G1 enters a mixed-GC death spiral while application threads make no progress. CompressionHandler is already held in a ThreadLocal, so a single Deflater/ Inflater pair per worker thread is sufficient. This change: - Adds transient Deflater/Inflater fields and lazy accessors (transient so the class remains Serializable; lazy so deserialized instances rebuild the codecs on first use). - Calls reset() on the cached codecs at the start of each call. - Passes the user-supplied codecs to DeflaterOutputStream(out, def) and InflaterInputStream(in, inf), which sets usesDefaultDeflater/Inflater to false so close() does not call end() on the codec — the codec survives the try-with-resources for reuse on the next call. On-disk format, compression level, and error semantics are unchanged. Allocation rate drops from O(records) to O(threads). On JDK 9+ this also removes per-call Cleaner registration overhead. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

hudi-agent

🤖 This review was generated by an AI agent and may contain mistakes. Please verify any suggestions before applying.

Thanks for the contribution! This PR caches a Deflater/Inflater pair per worker thread in BitCaskDiskMap.CompressionHandler to avoid per-call zlib allocations and the JDK 8 finalizer pile-up they cause under sustained disk-map traffic. The reuse pattern correctly relies on the documented usesDefaultDeflater/Inflater=false behavior when passing user-supplied codecs to DeflaterOutputStream/InflaterInputStream, calls reset() at the start of each call, and uses transient fields with lazy accessors so deserialized instances rebuild the codecs. No correctness issues found. A few style/readability suggestions in the inline comments. Please take a look, and this should be ready for a Hudi committer or PMC member to take it from here. One minor naming nit; otherwise the change is clean and the comments are appropriately non-obvious.

cc @yihua

hudi-agent · 2026-05-22T12:53:00Z


+    // Lazy accessors so the handler stays usable after Java deserialization
+    // (transient fields come back null).
+    private Deflater deflater() {


🤖 nit: having deflater() / inflater() share names with their backing fields forces the callers to reach for def and inf as local variable names (line 445 / 458) — and def is a keyword in Groovy, Kotlin, and Python, which can cause a mental stumble. Could you rename the accessors to getDeflater() / getInflater()? That would let callers write Deflater deflater = getDeflater() cleanly.

_{- AI-generated; verify before applying. React 👍/👎 to flag quality.}

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

hudi-agent

🤖 This review was generated by an AI agent and may contain mistakes. Please verify any suggestions before applying.

Thanks for the contribution! This PR caches the Deflater/Inflater in the thread-local CompressionHandler to avoid per-call construction and finalizer-queue contention on JDK 8. The transient/lazy pattern preserves serialization safety, reset() defends against leftover error state, and passing the explicit codecs into DeflaterOutputStream/InflaterInputStream correctly suppresses end() on close() so the codecs survive for reuse. No issues flagged from this automated pass — a Hudi committer or PMC member can take it from here for a final review.

cc @yihua

danny0405

+1, nice catch~

hudi-agent

🤖 This review was generated by an AI agent and may contain mistakes. Please verify any suggestions before applying.

Thanks for the contribution! This change caches per-thread Deflater/Inflater instances in CompressionHandler (held via ThreadLocal) instead of allocating fresh ones per call, which addresses JDK 8 finalizer-thread contention under heavy disk-map traffic. The reuse contract (reset() before use, external codec passed to streams so close() won't call end(), transient fields rebuilt lazily after deserialization) looks correct. No issues flagged from this automated pass — a Hudi committer or PMC member can take it from here for a final review.

cc @yihua

codecov-commenter · 2026-05-25T07:28:48Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 68.91%. Comparing base (f64c93e) to head (709a39b).
⚠️ Report is 227 commits behind head on master.

Additional details and impacted files

@@              Coverage Diff              @@
##             master   #18818       +/-   ##
=============================================
+ Coverage     57.27%   68.91%   +11.63%     
- Complexity    18650    29092    +10442     
=============================================
  Files          1956     2509      +553     
  Lines        107076   139477    +32401     
  Branches      13255    17116     +3861     
=============================================
+ Hits          61329    96117    +34788     
+ Misses        39943    35609     -4334     
- Partials       5804     7751     +1947

Flag	Coverage Δ
common-and-other-modules	`44.43% <100.00%> (?)`
hadoop-mr-java-client	`44.91% <100.00%> (-0.30%)`	⬇️
spark-client-hadoop-common	`48.24% <100.00%> (?)`
spark-java-tests	`49.34% <100.00%> (+1.87%)`	⬆️
spark-scala-tests	`45.26% <100.00%> (-0.30%)`	⬇️
utilities	`37.46% <100.00%> (?)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines	Coverage Δ
...he/hudi/common/util/collection/BitCaskDiskMap.java	`85.80% <100.00%> (+15.53%)`	⬆️

... and 1481 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

hudi-bot · 2026-05-25T07:38:09Z

CI report:

709a39b Azure: SUCCESS

Bot commands

@hudi-bot supports the following commands:

@hudi-bot run azure re-run the last Azure build

vamsikarnika changed the title ~~fix(common): reuse Inflater/Deflater in BitCaskDiskMap to avoid finalizer contention~~ Reuse Inflater/Deflater in BitCaskDiskMap to avoid JDK8 finalizer contention May 22, 2026

hudi-agent reviewed May 22, 2026

View reviewed changes

github-actions Bot added the size:S PR with lines of changes in (10, 100] label May 22, 2026

remove extra comments

1b8a581

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

hudi-agent reviewed May 22, 2026

View reviewed changes

danny0405 approved these changes May 25, 2026

View reviewed changes

vamsikarnika added 2 commits May 25, 2026 11:36

address comments

89c34d3

address nits

709a39b

vamsikarnika changed the title ~~Reuse Inflater/Deflater in BitCaskDiskMap to avoid JDK8 finalizer contention~~ fix: reuse Inflater/Deflater in BitCaskDiskMap to avoid JDK8 finalizer contention May 25, 2026

hudi-agent reviewed May 25, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: reuse Inflater/Deflater in BitCaskDiskMap to avoid JDK8 finalizer contention#18818

fix: reuse Inflater/Deflater in BitCaskDiskMap to avoid JDK8 finalizer contention#18818
vamsikarnika wants to merge 4 commits into
apache:masterfrom
vamsikarnika:vamsi/bitcask-inflater-reuse

vamsikarnika commented May 22, 2026

Uh oh!

hudi-agent left a comment

Uh oh!

hudi-agent May 22, 2026

Uh oh!

hudi-agent left a comment

Uh oh!

danny0405 left a comment

Uh oh!

hudi-agent left a comment

Uh oh!

codecov-commenter commented May 25, 2026

Uh oh!

hudi-bot commented May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

vamsikarnika commented May 22, 2026

Describe the issue this Pull Request addresses

Summary and Changelog

Impact

Risk Level

Documentation Update

Contributor's checklist

Uh oh!

hudi-agent left a comment

Choose a reason for hiding this comment

Uh oh!

hudi-agent May 22, 2026

Choose a reason for hiding this comment

Uh oh!

hudi-agent left a comment

Choose a reason for hiding this comment

Uh oh!

danny0405 left a comment

Choose a reason for hiding this comment

Uh oh!

hudi-agent left a comment

Choose a reason for hiding this comment

Uh oh!

codecov-commenter commented May 25, 2026

Codecov Report

Uh oh!

hudi-bot commented May 25, 2026

CI report:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants