dm-cache for zram #71

Open
Enelar opened this issue Jan 19, 2019 · 4 comments

Enelar commented Jan 19, 2019

I've been a huge fan of memory swap for several years and have tried different setups with different configs. IMO zram works better than anything else, but it's limited. And when you use it together with a disk swap device it has a huge issue: once it's full, it becomes dead weight in your physical memory space, and you hit LAAAGS because of disk IOPS.

That's why I had the idea of pushing the LRU pages to disk once zram is full. I know there is zswap, but it uses write-through (which is even worse: under load your system is starved of file cache and free memory while using both RAM and the drive; anyway, I didn't see any improvement for desktop systems).

Instead of write-through, it should be write-back in the background. dm-cache seems to be a good candidate for it. If I implement it, is there any chance it would get merged?

Summary:

  • The system swaps to zram
  • In the background, it pushes all pages not used within X minutes to the disk
  • When zram is nearly full (configurable percentage), it swaps to the disk
  • When zram is nearly empty, it does nothing

Vision:
There should be several levels of swap (a rough setup sketch follows the list):

  • zram lz4
  • zram deflate
  • disk drive
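As a stopgap, something close to these tiers can already be approximated with plain swap priorities. A minimal, untested sketch, assuming two zram devices and an example disk partition (all names and sizes are placeholders); note this only controls which device fills first, it does not migrate cold pages down the tiers the way the proposal would:

```sh
modprobe zram num_devices=2

echo lz4     > /sys/block/zram0/comp_algorithm   # fast tier (set before disksize)
echo 4G      > /sys/block/zram0/disksize
echo deflate > /sys/block/zram1/comp_algorithm   # better ratio, slower; needs kernel deflate support
echo 2G      > /sys/block/zram1/disksize

mkswap /dev/zram0 && swapon -p 100 /dev/zram0    # highest priority, filled first
mkswap /dev/zram1 && swapon -p 50  /dev/zram1
swapon -p 10 /dev/sda5                           # disk drive, lowest priority
```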
@SSalekin

@Enelar Hi, I like your idea. However, I'm interested in this:

There should be several levels of swap:

- zram lz4
- zram deflate
- disk drive

Can you please elaborate on why? I'm interested in its possible benefits.

nefelim4ag (Owner) commented Apr 17, 2019

@Enelar, zswap is write-back; it uses write-through only for incompressible data.


Yep, that can be merged.


polarathene commented Jan 15, 2021

Just so you know, zram does have a write-back feature as well:

With CONFIG_ZRAM_WRITEBACK, zram can write idle/incompressible page to backing storage rather than keeping it in memory. To use the feature, admin should set up backing device before disksize setting via:
echo /dev/sda5 > /sys/block/zramX/backing_dev
It supports only partition at this moment.

The feature appears to have arrived around Dec 2018. Besides your kernel needing the config option enabled and a partition being the only supported backing storage, the writeback doesn't seem to be handled automatically. You can mark all pages as idle, then check back periodically to invoke writeback; any pages accessed in the meantime lose the idle flag and stay in RAM.
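For reference, the manual workflow looks roughly like this. An untested sketch, assuming CONFIG_ZRAM_WRITEBACK (and CONFIG_ZRAM_MEMORY_TRACKING for the idle marking); /dev/sda5 and zram0 are just example names:

```sh
# the backing device must be set before disksize
echo /dev/sda5 > /sys/block/zram0/backing_dev
echo 4G > /sys/block/zram0/disksize
mkswap /dev/zram0 && swapon /dev/zram0

# periodically: mark everything idle, wait, then write back the still-idle pages.
# Pages touched in the meantime lose the idle flag and stay in RAM.
echo all  > /sys/block/zram0/idle
sleep 600
echo idle > /sys/block/zram0/writeback
```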

As for the dm-cache approach, someone describes doing this back in late 2013.

This article also seems to suggest the effectiveness of dm-cache will vary: if data is written to the slower disk storage and only cached within zram when dm-cache decides it's cache-worthy, you may find it performs worse, especially as pages are updated/modified (pulled from swap, then invalidated when swapped back in due to modification, IIRC).

It excels as a read cache where frequently accessed files (hot-spots) can be promoted to the cache over a period of multiple accesses (slow fill cache).

Seems like it has the same drawback as zram writeback, requiring a block device as backing storage (technically the storage being cached), in addition to needing a 2nd storage device for metadata. You just don't have to manually manage/automate the LRU caching.
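For anyone who wants to experiment with that route, a rough, untested dm-cache sketch under those constraints might look like the following (all device names, sizes and the 512-sector block size are placeholders; the metadata lives on a separate small device):

```sh
# zram block device used as the dm-cache "fast" device (not as swap directly)
echo 2G > /sys/block/zram0/disksize

ORIGIN=/dev/sda5   # slow disk swap partition being cached (origin)
CACHE=/dev/zram0   # fast cache device
META=/dev/sdb1     # small separate device for dm-cache metadata
SIZE=$(blockdev --getsz "$ORIGIN")   # origin size in 512-byte sectors

# table: <start> <len> cache <metadata dev> <cache dev> <origin dev> <block size> <#features> <feature> <policy> <#policy args>
dmsetup create swap-cached --table \
  "0 $SIZE cache $META $CACHE $ORIGIN 512 1 writeback default 0"

mkswap /dev/mapper/swap-cached && swapon /dev/mapper/swap-cached
```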

@polarathene

Additional notes.

  • zram writeback to backing storage keeps its pages compressed. zram's capacity (disksize) is sized by uncompressed input, so it is "full" regardless of how little RAM it actually uses in compressed state.
  • zswap writeback decompresses pages migrated to backing storage. Its memory cache is sized by a percentage of RAM and represents how much RAM it can use compressed; there is no uncompressed input limit (due to the backing storage, I guess?).
  • zswap uses the zbud/z3fold allocators, which may not compress as well as the zsmalloc allocator zram uses. zswap can use zsmalloc too, but then loses the main benefit of being able to evict pages to a backing store.

With all that in mind, you could presumably also achieve the desired goal of avoiding LRU inversion of zram by adding zswap into the mix with a smaller memory cache (since zram will compress better). Once the zswap cache fills up, it'll decompress the zbud/z3fold pages and compress them again in zram, or if that's full, send them to the disk-based backing swap.
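A minimal, untested sketch of that combination, using the standard zswap module parameters (the 20% pool size, compressor choices and device names are just examples):

```sh
# zswap as a small compressed write-back cache in front of the swap devices
echo 1      > /sys/module/zswap/parameters/enabled
echo lz4    > /sys/module/zswap/parameters/compressor
echo z3fold > /sys/module/zswap/parameters/zpool
echo 20     > /sys/module/zswap/parameters/max_pool_percent

# zram (and a disk partition) as the actual swap devices behind it
echo zstd > /sys/block/zram0/comp_algorithm
echo 4G   > /sys/block/zram0/disksize
mkswap /dev/zram0 && swapon -p 100 /dev/zram0
swapon -p 10 /dev/sda5
```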

LRU inversion is avoided in the sense that zswap will cache the frequently used pages, at the expense of extra copies and added compression/decompression overhead. A dm-cache approach would similarly have duplicates in RAM, or require a write to disk before the cache on zram holds the data, which is probably not desirable.

If you could have zram prioritized over zswap, it'd probably give the same result minus the drawback of using zram as a backing store for zswap? Granted, zram swap still never evicts its stale pages, so if you're using an excessive amount of disk swap, the dm-cache + zram route might work out better.

Or just use zswap alone if the additional compression ratio potential isn't worthwhile. You may find you get more out of it, since its cache capacity is sized on compressed size in RAM rather than the uncompressed size that zram uses.
