Fix sync issue where data writes could appear before metadata writes #948
Merged
Conversation
Some forms of storage, mainly anything with an FTL (eMMC, SD, etc.), do not guarantee a strict write order for writes to different blocks. It would be good to test that this doesn't break littlefs.

This adds LFS_EMUBD_POWERLOSS_OOO to lfs_emubd, which tells lfs_emubd to try to break any order-dependent code on powerloss. The behavior right now is a bit simple, but does result in test breakage:

1. Save the state of the block on the first write (erase, really) after sync/init.
2. On powerloss, revert the first write to its original state.

This might be a bit confusing when debugging, since the block will appear to time-travel, but doing anything fancier would make emubd quite a bit more complicated. You could also get fancier with which/how many blocks to revert, but this should at least be sufficient to make sure bd sync calls are in the right place.
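The two steps above can be sketched as a toy block device model. Everything here (names, block sizes, the single-snapshot policy) is illustrative, not emubd's real internals; it just shows the save-on-first-write / revert-on-powerloss mechanism:

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define BLOCK_SIZE  16
#define BLOCK_COUNT 4

typedef struct {
    uint8_t blocks[BLOCK_COUNT][BLOCK_SIZE];
    uint8_t saved[BLOCK_SIZE]; // snapshot of the first block written after sync
    int saved_block;           // which block the snapshot is for, -1 if none
} ooo_bd;

void ooo_init(ooo_bd *bd) {
    memset(bd->blocks, 0, sizeof(bd->blocks));
    bd->saved_block = -1;
}

void ooo_write(ooo_bd *bd, int block, const uint8_t *data) {
    // save the state of the block on the first write after sync/init
    if (bd->saved_block == -1) {
        memcpy(bd->saved, bd->blocks[block], BLOCK_SIZE);
        bd->saved_block = block;
    }
    memcpy(bd->blocks[block], data, BLOCK_SIZE);
}

void ooo_sync(ooo_bd *bd) {
    // sync is an ordering barrier: drop the snapshot
    bd->saved_block = -1;
}

void ooo_powerloss(ooo_bd *bd) {
    // revert the first post-sync write, making it appear that a later
    // write landed first (the block "time-travels")
    if (bd->saved_block != -1) {
        memcpy(bd->blocks[bd->saved_block], bd->saved, BLOCK_SIZE);
        bd->saved_block = -1;
    }
}
```

Reverting only the first write is the simplest policy that still catches missing sync barriers: any code that depends on the first post-sync write surviving before later writes will break under this model.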
Long story short, we aren't calling sync correctly in littlefs. This fixes that.

Some forms of storage, mainly anything with an FTL (eMMC, SD, etc.), do not guarantee a strict write order for writes to different blocks. In theory this is what bd sync is for: to tell the bd when it is important for the writes to be ordered.

Currently, littlefs calls bd sync after committing metadata. This is useful as it ensures that user code can rely on lfs_file_sync for ordering external side-effects. But this is insufficient for handling storage with out-of-order writes.

Consider the simple case of a file with one data block:

1. lfs_file_write(blablabla) => writes data into a new data block
2. lfs_file_sync() => commits metadata to point to the new data block

But with out-of-order writes, the bd is free to reorder things such that the metadata is updated _before_ the data is written. If we lose power, that would be bad.

The solution is to call bd sync twice: once before we commit the metadata, to tell the bd that these writes must be ordered, and once after, to allow ordering with user code.

As a small optimization, we only call bd sync if the current file is not inlined and has actually been modified (LFS_F_DIRTY). It's possible for inlined files to be interleaved with writes to other files.

Found by MFaehling and alex31
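A minimal sketch of where the two sync barriers land relative to the metadata commit. toy_bd and file_sync are illustrative stand-ins, not littlefs's real types or code paths:

```c
#include <assert.h>

typedef struct {
    int syncs;          // number of bd sync barriers issued
    int meta_committed; // has the metadata commit happened?
} toy_bd;

static void bd_sync(toy_bd *bd) {
    bd->syncs++;
}

static void file_sync(toy_bd *bd, int dirty, int inlined) {
    // first barrier: only needed if there are out-of-line data writes
    // that must land before the metadata commit (the LFS_F_DIRTY-style
    // check is the small optimization described above)
    if (dirty && !inlined) {
        bd_sync(bd);
    }

    // commit metadata pointing at the new data block
    bd->meta_committed = 1;

    // second barrier: lets user code rely on lfs_file_sync for
    // ordering external side-effects
    bd_sync(bd);
}
```

A dirty, non-inlined file pays for both barriers; an inlined file only pays for the trailing one, since its data rides along in the metadata commit itself.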
Unlike the heuristic-based testing, exhaustive powerloss testing effectively forks the current test and runs both the interrupted and uninterrupted test states to completion. But emubd wasn't expecting bd->cfg->powerloss_cb to return. The fix here is to keep track of both the old+new out-of-order block states and unrevert them if bd->cfg->powerloss_cb returns. This may leak the temporary copy, but powerloss testing is already inherently leaky.
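The unrevert idea can be sketched as follows, with hypothetical names: keep both the old (pre-write) and new (current) block contents around the callback, and restore the new contents if the callback returns:

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define OOO_BLOCK_SIZE 16

typedef void (*powerloss_cb_t)(void *ctx);

void ooo_emulate_powerloss(uint8_t *block, const uint8_t *old_state,
        powerloss_cb_t cb, void *ctx) {
    // remember the new state as well as the old one
    uint8_t new_state[OOO_BLOCK_SIZE];
    memcpy(new_state, block, OOO_BLOCK_SIZE);

    // revert the first post-sync write to emulate out-of-order loss
    memcpy(block, old_state, OOO_BLOCK_SIZE);

    // under heuristic testing the callback never returns; under
    // exhaustive testing it does, and both branches run to completion
    if (cb) {
        cb(ctx);
    }

    // unrevert so the uninterrupted branch sees the new state again
    memcpy(block, new_state, OOO_BLOCK_SIZE);
}
```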
We need to decrement the saved block state on sync, when we reset out-of-order emulation. Otherwise we leak blocks out the wazoo.
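A sketch of the leak fix, assuming refcounted block snapshots (all names here are hypothetical): the saved out-of-order snapshot holds a reference, and sync must release it when it resets out-of-order emulation, or every sync leaks a copy:

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

typedef struct {
    int refs;
    uint8_t data[16];
} rc_block;

rc_block *rc_block_alloc(void) {
    rc_block *b = malloc(sizeof(rc_block));
    b->refs = 1;
    return b;
}

// decrement the refcount; returns 1 if the block was actually freed
int rc_block_release(rc_block *b) {
    if (b && --b->refs == 0) {
        free(b);
        return 1;
    }
    return 0;
}

typedef struct {
    rc_block *ooo_saved; // snapshot held for powerloss emulation
    int freed;           // count of freed snapshots, for illustration
} rc_bd;

void rc_bd_sync(rc_bd *bd) {
    // the fix: drop the snapshot's reference when out-of-order
    // emulation resets on sync, instead of leaking it
    bd->freed += rc_block_release(bd->ooo_saved);
    bd->ooo_saved = NULL;
}
```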
Tests passed ✓, Code: 17024 B (+0.3%), Stack: 1432 B (+0.0%), Structs: 812 B (+0.0%)
This PR also adds LFS_EMUBD_POWERLOSS_OOO to lfs_emubd, which tells lfs_emubd to try to break any order-dependent code on powerloss. This should prevent a regression in the future.
Related issues: #822, #936