Fix L2ARC reads when compressed ARC disabled #10693

allanjude · 2020-08-09T16:15:00Z

Signed-off-by: Allan Jude <allanjude@freebsd.org>
Sponsored-by: The FreeBSD Foundation

Motivation and Context

When reading compressed blocks from the L2ARC, with compressed ARC disabled, arc_hdr_size() returns LSIZE rather than PSIZE, but the actual read is PSIZE. This causes l2arc_read_done() to compare the checksum against the wrong size, resulting in checksum failure.

This manifests as an increase in the kstat l2_cksum_bad and the read being retried from the main pool, making the L2ARC ineffective.
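The size mismatch can be illustrated with a toy model. This is only a sketch, not the ARC code: SHA-256 and zlib stand in for the pool's checksum and compression algorithms, and `checksum`, `verify`, and the zero-padded buffer are assumptions made for illustration.

```python
import hashlib
import zlib


def checksum(buf: bytes) -> bytes:
    # Stand-in checksum (SHA-256); hypothetical helper, not a ZFS function.
    return hashlib.sha256(buf).digest()


def verify(read_buf: bytes, size: int, expected: bytes) -> bool:
    # Checksum only the first `size` bytes of the buffer.
    return checksum(read_buf[:size]) == expected


logical = b"A" * 4096              # uncompressed data, LSIZE bytes
physical = zlib.compress(logical)  # compressed data, PSIZE bytes
lsize, psize = len(logical), len(physical)

# The recorded checksum covers the compressed (PSIZE) bytes.
expected = checksum(physical)

# Assumption: the read fills PSIZE bytes of an LSIZE-sized buffer,
# and the remainder is modeled here as zero padding.
read_buf = physical + bytes(lsize - psize)

print(verify(read_buf, psize, expected))  # checksum over PSIZE bytes
print(verify(read_buf, lsize, expected))  # checksum over LSIZE bytes
```

In this model the first check succeeds and the second fails, even though the bytes that were read are intact. That mirrors the symptom described above: a checksum error caused by the size used, not by the data.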

Description

This bug was discovered while creating the tests introduced in #10692.

How Has This Been Tested?

With this change, the new test now passes.

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Performance enhancement (non-breaking change which improves efficiency)
Code cleanup (non-breaking change which makes code smaller or more readable)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (a change to man pages or other documentation)

Checklist:

My code follows the ZFS on Linux code style requirements.
I have updated the documentation accordingly.
I have read the contributing document.
I have added tests to cover my changes.
I have run the ZFS Test Suite with this change applied.
All commit messages are properly formatted and contain Signed-off-by.

codecov · 2020-08-09T23:20:34Z

Codecov Report

Merging #10693 into master will decrease coverage by 0.17%.
The diff coverage is 50.00%.

@@            Coverage Diff             @@
##           master   #10693      +/-   ##
==========================================
- Coverage   79.92%   79.75%   -0.18%     
==========================================
  Files         394      394              
  Lines      124657   124661       +4     
==========================================
- Hits        99636    99419     -217     
- Misses      25021    25242     +221

Flag	Coverage Δ
#kernel	`80.42% <50.00%> (+0.02%)`	⬆️
#user	`65.45% <0.00%> (-0.66%)`	⬇️

Flags with carried forward coverage won't be shown.

Impacted Files	Coverage Δ
module/zfs/arc.c	`89.89% <50.00%> (-0.29%)`	⬇️
cmd/zdb/zdb_il.c	`30.86% <0.00%> (-24.08%)`	⬇️
module/zfs/vdev_indirect.c	`73.50% <0.00%> (-11.00%)`	⬇️
module/zfs/vdev_rebuild.c	`93.26% <0.00%> (-3.92%)`	⬇️
module/zcommon/zfs_fletcher.c	`75.65% <0.00%> (-2.64%)`	⬇️
module/zfs/vdev_raidz.c	`89.54% <0.00%> (-2.62%)`	⬇️
module/zfs/btree.c	`81.61% <0.00%> (-2.00%)`	⬇️
module/zfs/lzjb.c	`98.14% <0.00%> (-1.86%)`	⬇️
cmd/ztest/ztest.c	`79.17% <0.00%> (-1.63%)`	⬇️
module/zcommon/zfs_uio.c	`87.75% <0.00%> (-1.03%)`	⬇️
... and 58 more

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update d64c6a2...b24a773.

When reading compressed blocks from the L2ARC, with compressed ARC disabled, arc_hdr_size() returns LSIZE rather than PSIZE, but the actual read is PSIZE. This causes l2arc_read_done() to compare the checksum against the wrong size, resulting in checksum failure.

This manifests as an increase in the kstat l2_cksum_bad and the read being retried from the main pool, making the L2ARC ineffective.

Add new L2ARC tests with Compressed ARC enabled/disabled

Blocks are handled differently depending on the state of the zfs_compressed_arc_enabled tunable.

If a block is compressed on-disk, and compressed_arc is enabled:
- the block is read from disk
- It is NOT decompressed
- It is added to the ARC in its compressed form
- l2arc_write_buffers() may write it to the L2ARC (as is)
- l2arc_read_done() compares the checksum to the BP (compressed)

However, if compressed_arc is disabled:
- the block is read from disk
- It is decompressed
- It is added to the ARC (uncompressed)
- l2arc_write_buffers() will use l2arc_apply_transforms() to recompress the block, before writing it to the L2ARC
- l2arc_read_done() compares the checksum to the BP (compressed)
- l2arc_read_done() will use l2arc_untransform() to uncompress it

This test writes out a test file to a pool consisting of one disk and one cache device, then randomly reads from it. Since the arc_max in the tests is low, this will feed the L2ARC, and result in reads from the L2ARC. We compare the value of the kstat l2_cksum_bad before and after to determine if any blocks failed to survive the trip through the L2ARC.

Sponsored-by: The FreeBSD Foundation
Signed-off-by: Allan Jude <allanjude@freebsd.org>
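The two paths described in the commit message can be modeled roughly as follows. This is a simplified sketch under stated assumptions, not the ZFS implementation: zlib stands in for the dataset's compression algorithm, SHA-256 for the block pointer checksum, and `cksum` and `l2arc_round_trip` are hypothetical names.

```python
import hashlib
import zlib


def cksum(buf: bytes) -> bytes:
    # Stand-in for the checksum stored in the block pointer (BP).
    return hashlib.sha256(buf).digest()


def l2arc_round_trip(on_disk: bytes, bp_cksum: bytes, compressed_arc: bool) -> bytes:
    """Model one block going disk -> ARC -> L2ARC -> ARC."""
    if compressed_arc:
        arc_buf = on_disk                   # kept compressed in the ARC
        l2_buf = arc_buf                    # written to the L2ARC as is
    else:
        arc_buf = zlib.decompress(on_disk)  # ARC holds uncompressed data
        l2_buf = zlib.compress(arc_buf)     # models l2arc_apply_transforms()

    # On read, the checksum is always compared against the compressed bytes.
    if cksum(l2_buf) != bp_cksum:
        raise ValueError("L2ARC checksum mismatch")

    # With compressed ARC disabled, the block is uncompressed again,
    # which models l2arc_untransform().
    return l2_buf if compressed_arc else zlib.decompress(l2_buf)


data = b"example block contents " * 100
on_disk = zlib.compress(data)
bp = cksum(on_disk)

assert l2arc_round_trip(on_disk, bp, compressed_arc=True) == on_disk
assert l2arc_round_trip(on_disk, bp, compressed_arc=False) == data
```

In both modes the checksum is taken over the compressed bytes, so the size used for verification has to be the compressed size regardless of how the block is held in the ARC.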

behlendorf

You're also going to need to add these two new tests to the tests/function/compression/Makefile.am. This ensures they're included in the make dist tarball, which is what effectively gets built and tested by the CI.

Signed-off-by: Allan Jude <allanjude@freebsd.org>

When reading compressed blocks from the L2ARC, with compressed ARC disabled, arc_hdr_size() returns LSIZE rather than PSIZE, but the actual read is PSIZE. This causes l2arc_read_done() to compare the checksum against the wrong size, resulting in checksum failure.

This manifests as an increase in the kstat l2_cksum_bad and the read being retried from the main pool, making the L2ARC ineffective.

Add new L2ARC tests with Compressed ARC enabled/disabled

Blocks are handled differently depending on the state of the zfs_compressed_arc_enabled tunable.

If a block is compressed on-disk, and compressed_arc is enabled:
- the block is read from disk
- It is NOT decompressed
- It is added to the ARC in its compressed form
- l2arc_write_buffers() may write it to the L2ARC (as is)
- l2arc_read_done() compares the checksum to the BP (compressed)

However, if compressed_arc is disabled:
- the block is read from disk
- It is decompressed
- It is added to the ARC (uncompressed)
- l2arc_write_buffers() will use l2arc_apply_transforms() to recompress the block, before writing it to the L2ARC
- l2arc_read_done() compares the checksum to the BP (compressed)
- l2arc_read_done() will use l2arc_untransform() to uncompress it

This test writes out a test file to a pool consisting of one disk and one cache device, then randomly reads from it. Since the arc_max in the tests is low, this will feed the L2ARC, and result in reads from the L2ARC. We compare the value of the kstat l2_cksum_bad before and after to determine if any blocks failed to survive the trip through the L2ARC.

Sponsored-by: The FreeBSD Foundation
Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Signed-off-by: Allan Jude <allanjude@freebsd.org>
Closes openzfs#10693
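The before/after comparison of l2_cksum_bad can be sketched like this. It is not the test added by the PR: `read_kstat` and `l2arc_checksums_ok` are hypothetical helpers, and the sample text is invented to resemble the name/type/data layout of /proc/spl/kstat/zfs/arcstats on Linux.

```python
def read_kstat(arcstats_text: str, name: str) -> int:
    """Return the value of one counter from arcstats-style text."""
    for line in arcstats_text.splitlines():
        fields = line.split()
        # Data rows have three columns: name, type, data.
        if len(fields) == 3 and fields[0] == name:
            return int(fields[2])
    raise KeyError(name)


def l2arc_checksums_ok(before_text: str, after_text: str) -> bool:
    # The counter must not grow while the workload reads from the L2ARC.
    before = read_kstat(before_text, "l2_cksum_bad")
    after = read_kstat(after_text, "l2_cksum_bad")
    return after == before


# Illustrative snapshots only; the values are made up for this sketch.
BEFORE = """name type data
l2_hits 4 10
l2_cksum_bad 4 0
"""
AFTER_GOOD = """name type data
l2_hits 4 250
l2_cksum_bad 4 0
"""
AFTER_BAD = """name type data
l2_hits 4 10
l2_cksum_bad 4 240
"""

print(l2arc_checksums_ok(BEFORE, AFTER_GOOD))
print(l2arc_checksums_ok(BEFORE, AFTER_BAD))
```

On a live Linux system the two snapshots would be read from the arcstats file before and after the random-read workload. Taking the text as a parameter keeps the sketch runnable without a pool.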

allanjude mentioned this pull request Aug 11, 2020

Add new L2ARC tests with Compressed ARC enabled/disabled #10692

Closed

12 tasks

behlendorf approved these changes Aug 11, 2020


allanjude force-pushed the l2arc_compressed_arc_fix branch from f9202ae to 7e9d24c Compare August 12, 2020 17:17

allanjude mentioned this pull request Aug 13, 2020

Introduce ZSTD compression to ZFS #10278

Closed

17 tasks

behlendorf reviewed Aug 13, 2020


Add tests to Makefile

b24a773

Signed-off-by: Allan Jude <allanjude@freebsd.org>

behlendorf added the Status: Accepted Ready to integrate (reviewed, tested) label Aug 13, 2020

behlendorf merged commit fc34dfb into openzfs:master Aug 14, 2020
