defragger improvements around large bins #12963

oranagra · 2024-01-18T12:23:28Z

jemalloc/jemalloc#1098 (comment)
few things that can be improved:

- don't attempt to defrag (wasting CPU cycles) if the fragmentation in small bins doesn't cross the threshold (ignore the active pages used by large bins)
- consider adding INFO metrics for retained and muzzy memory (stats.arenas.<i>.pmuzzy)
- and used memory breakdown of large vs small bins.
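
A minimal sketch of the first point, assuming the large-bin share of `allocated` and `active` is available separately. The `alloc_snapshot` struct, its field names, and both helper functions are hypothetical and only illustrate the idea; they are not Redis or jemalloc APIs.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical snapshot of allocator counters, all in bytes. */
typedef struct {
    size_t allocated;       /* bytes handed out to the application */
    size_t active;          /* bytes in pages backing those allocations */
    size_t large_allocated; /* part of `allocated` that lives in large bins */
    size_t large_active;    /* part of `active` used by large bins */
} alloc_snapshot;

/* Fragmentation of the small bins only, as a percentage of the
 * small-bin allocated bytes. Large bins are subtracted out first. */
double small_bin_frag_pct(const alloc_snapshot *s) {
    assert(s->allocated >= s->large_allocated);
    assert(s->active >= s->large_active);
    size_t small_allocated = s->allocated - s->large_allocated;
    size_t small_active = s->active - s->large_active;
    if (small_allocated == 0) return 0.0; /* nothing defraggable */
    return (double)small_active * 100.0 / (double)small_allocated - 100.0;
}

/* Skip the defrag cycle unless small-bin fragmentation crosses the threshold. */
bool should_defrag(const alloc_snapshot *s, double threshold_pct) {
    return small_bin_frag_pct(s) >= threshold_pct;
}
```

With `allocated = 1100`, `active = 1200` and 1000 bytes of each in large bins, the small-bin fragmentation is 100% even though the overall ratio is only about 9%.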


zuiderkwast · 2024-01-20T08:54:37Z

This looks like your notes to yourself. :) Do you need the rest of us to get involved?

I see #12315. Didn't that already get rid of fragmentation on large bins?

oranagra · 2024-01-21T06:21:48Z

right, this was a "draft" note in the 8.0 project and i promoted it to an issue so that it can be assigned or discussed.

what's fixed in #12315 does reduce some cases of "fragmentation" in large bins, but i was worried that there could be others, and wanted to somehow measure only the fragmentation inside small bins to trigger defrag.
i don't remember how i wanted to gather that metric, i see that it doesn't exist in mallctl, but we do have it (per bin) in malloc_stats (so in theory we can add an API to expose it to redis)

sundb · 2024-01-23T06:15:50Z

@zuiderkwast did you start? if not, i'd like to start.

zuiderkwast · 2024-01-23T08:45:40Z

@sundb please go ahead! I didn't start.

sundb · 2024-02-01T04:31:23Z

@oranagra eliminating large bins actually results in a higher frag rate.
when there is far more memory in large bins than in small bins, the overall frag rate will approach zero.
I inserted 1000 strings of 100k into Redis, and the fragmentation rate was 0% before eliminating the large bins, but 130% after eliminating them.

oranagra · 2024-02-01T10:47:19Z

right. but that's indeed the fragmentation ratio in the area we can defrag.
on the other hand, if that memory consumption is negligible compared to what we consume, maybe we shouldn't bother to defrag it.

the reason i suggested this change was because i wanted to hide any memory overheads in large bins (or other non-defraggable overheads), i.e. anything that's not defraggable.
so maybe we can use a different formula that achieves that, without suffering from the amplification caused by the fact that most memory is in large bins.

e.g. we can sum all the memory wasted in small bins (i.e. explicit calculation of small bin fragmentation), and then divide that by the total memory usage, rather than the small-bin memory usage.
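
A rough sketch of that calculation, assuming per-bin counters are available in some form. The `small_bin` struct and its field names are hypothetical, loosely modeled on the per-bin numbers printed by malloc_stats, and the sketch assumes a slab holds exactly `nregs` regions of `reg_size` bytes.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical per-bin counters for one small size class. */
typedef struct {
    size_t reg_size; /* bytes per region (the size class) */
    size_t nregs;    /* regions per slab */
    size_t curslabs; /* slabs currently held by this bin */
    size_t curregs;  /* regions currently allocated */
} small_bin;

/* Sum the bytes sitting in free regions of partially used slabs. */
size_t small_bin_waste(const small_bin *bins, size_t nbins) {
    size_t waste = 0;
    for (size_t i = 0; i < nbins; i++) {
        size_t capacity = bins[i].curslabs * bins[i].nregs;
        assert(bins[i].curregs <= capacity);
        waste += (capacity - bins[i].curregs) * bins[i].reg_size;
    }
    return waste;
}

/* Express the small-bin waste as a percentage of ALL allocated memory
 * (small and large), so a tiny small-bin footprint is not amplified. */
double frag_pct_of_total(size_t waste, size_t total_allocated) {
    if (total_allocated == 0) return 0.0;
    return (double)waste * 100.0 / (double)total_allocated;
}
```

Dividing by the total rather than by the small-bin usage keeps the ratio low when the wasted bytes are negligible next to the memory held in large bins.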

WDYT?

sundb · 2024-02-02T07:51:29Z

@oranagra why is the formula for fragmentation percent ((float)resident / allocated)*100 - 100, instead of (active - allocated) / active?

oranagra · 2024-02-03T13:25:28Z

you mean why isn't it (active-allocated)*100/allocated (which gives the same result as the one in the code)?
the formula you quoted (the one that uses resident) isn't actually used (it's just a print).
and also, yours results in a scale of 0..1, not 0..100

the other difference is that the one in the code measures the fragmentation overhead as a portion of the allocated memory (dividing by allocated), and the one you gave would give the fragmentation overhead as a portion of the total active memory.
e.g. if we have 200gb active, and 150gb allocated, is the fragmentation 33% or 25%? (current code considers it to be 33%, i.e. 50 out of 150).

am i missing anything?
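
The two readings of the 200gb / 150gb example can be checked with a small sketch. Both function names are made up for illustration; only the arithmetic comes from the discussion above.

```c
#include <assert.h>
#include <stddef.h>

/* Overhead as a portion of allocated memory (same result as
 * active / allocated * 100 - 100). */
double frag_pct_of_allocated(size_t active, size_t allocated) {
    assert(allocated > 0 && active >= allocated);
    return (double)(active - allocated) * 100.0 / (double)allocated;
}

/* Overhead as a portion of the total active memory. */
double frag_pct_of_active(size_t active, size_t allocated) {
    assert(active > 0 && active >= allocated);
    return (double)(active - allocated) * 100.0 / (double)active;
}
```

For `active = 200` and `allocated = 150`, the first function gives roughly 33.3 (50 out of 150) and the second gives 25 (50 out of 200).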

Implement #12963

## Changes

1. large bins don't have external fragmentation or are at least non-defraggable, so we should ignore the effect of large bins when measuring fragmentation, and only measure fragmentation of small bins. this affects both the allocator_frag* metrics and also the active-defrag trigger
2. Adding INFO metrics for `muzzy` memory, which is memory returned to the OS but still shows as RSS until the OS reclaims it.

---------

Co-authored-by: Oran Agra <oran@redislabs.com>

sundb self-assigned this Jan 23, 2024

sundb mentioned this issue Jan 26, 2024

Defragger improvements around large bins #12996

Merged

oranagra linked a pull request Feb 3, 2024 that will close this issue

Defragger improvements around large bins #12996

Merged

oranagra closed this as completed in #12996 Feb 20, 2024
