
read: add Dwarf::populate_abbreviations_cache #679

Merged
philipc merged 2 commits into gimli-rs:master from abbrev_cache on Sep 14, 2023

Conversation

philipc (Collaborator) commented Sep 12, 2023

This gives additional options for caching duplicate entries or all entries.

@al13n321 Can you test if this has similar performance to the caching you implemented?
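
A minimal usage sketch of the new option (not code from this PR; it uses the method's final name from the retitled PR, populate_abbreviations_cache, assumes it is called before iterating the units, and the exact signature may differ):

```rust
// Sketch only: populate the cache once, then walk the units.
fn dump_units<R: gimli::Reader>(dwarf: &mut gimli::Dwarf<R>) -> gimli::Result<()> {
    // Duplicates caches only abbreviations shared by more than one unit;
    // AbbreviationsCacheStrategy::All caches every table instead.
    dwarf.populate_abbreviations_cache(gimli::AbbreviationsCacheStrategy::Duplicates);

    let mut headers = dwarf.units();
    while let Some(header) = headers.next()? {
        // Dwarf::unit can now take the abbreviations from the cache instead
        // of re-parsing .debug_abbrev for every unit.
        let unit = dwarf.unit(header)?;
        let _ = unit.name;
    }
    Ok(())
}
```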

al13n321 (Contributor) commented Sep 12, 2023

Yep, about the same performance. The set_abbreviations_cache_strategy() call adds ~20 ms. That seems fine next to the 370 ms loop parsing all the units (in one thread). (And the overall time likely increased by less than 20 ms; I didn't measure carefully enough to tell.)

I'd document that the set_abbreviations_cache_strategy() call may iterate over all units, i.e. it may do a bunch of random reads from .debug_info (which is likely mmapped, so reads may be slow and may in principle have side effects).

Thanks for working on this, and for making this library!


Some theoretical thoughts, optional reading:

The LazyArc's behavior under contention is not ideal: if multiple threads try to get the same abbreviations at the same time, they'll all do the parsing, and all but one will discard the result. It would be better (?) if one thread did the parsing while other threads wait.

A busy-wait seems good enough and easy to implement by making the LazyArc's atomic have 3 states: empty, busy, ready. Busy-wait seems better than duplicate work because: (a) it will complete sooner because it only waits for the remaining fraction of the work, (b) it can do something like thread::yield_now(), (c) it doesn't read from the Reader.

The downside is that it's not lock-free, i.e. we'd be in trouble if the thread dies or stalls in the middle of parsing abbreviations. Threads dying or stalling is not a real concern in practice (or is it?) (except for panics, but they can be handled by just resetting the LazyArc state to 'empty'), so busy-wait seems better overall.
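
A rough sketch of that three-state idea (not gimli's LazyArc; panic recovery and the no_std details of the real implementation are left out):

```rust
use std::cell::UnsafeCell;
use std::sync::atomic::{AtomicU8, Ordering};
use std::sync::Arc;

const EMPTY: u8 = 0;
const BUSY: u8 = 1;
const READY: u8 = 2;

// Sketch only: one thread claims BUSY and parses; the rest busy-wait on READY.
struct BusyWaitLazyArc<T> {
    state: AtomicU8,
    value: UnsafeCell<Option<Arc<T>>>,
}

// Safe to share: `value` is written only by the single thread that moved the
// state from EMPTY to BUSY, and read only after READY is observed.
unsafe impl<T: Send + Sync> Sync for BusyWaitLazyArc<T> {}

impl<T> BusyWaitLazyArc<T> {
    fn new() -> Self {
        Self {
            state: AtomicU8::new(EMPTY),
            value: UnsafeCell::new(None),
        }
    }

    fn get(&self, parse: impl FnOnce() -> T) -> Arc<T> {
        if self
            .state
            .compare_exchange(EMPTY, BUSY, Ordering::Acquire, Ordering::Acquire)
            .is_ok()
        {
            // We won the race: parse once and publish the result.
            // (A panic in `parse` should reset the state to EMPTY; not shown.)
            let arc = Arc::new(parse());
            unsafe { *self.value.get() = Some(arc.clone()) };
            self.state.store(READY, Ordering::Release);
            return arc;
        }
        // Another thread is parsing (BUSY) or has already finished (READY).
        while self.state.load(Ordering::Acquire) != READY {
            std::thread::yield_now();
        }
        unsafe { (*self.value.get()).clone().unwrap() }
    }
}
```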

Since the set_abbreviations_cache_strategy() call is already doing some work inline, why not also have a mode where it would go ahead and load all abbreviations right there, AbbreviationsCacheStrategy::Eager or whatever. It can be faster because (a) it'll go in order of increasing offset (more-sequential file reads), (b) it'd be done in one thread, so less LazyArc contention.

philipc (Collaborator, Author) commented Sep 12, 2023

Thanks for the feedback.

> The LazyArc's behavior under contention is not ideal: if multiple threads try to get the same abbreviations at the same time, they'll all do the parsing, and all but one will discard the result.

Note that this only happens for no_std. With std we use a Mutex.

> Since the set_abbreviations_cache_strategy() call is already doing some work inline, why not also have a mode where it would go ahead and load all abbreviations right there, AbbreviationsCacheStrategy::Eager or whatever. It can be faster because (a) it'll go in order of increasing offset (more-sequential file reads), (b) it'd be done in one thread, so less LazyArc contention.

I'm starting to question the value of using LazyArc at all. Maybe we should always parse them all up front. It would greatly reduce the complexity of the cache.

The original reason for LazyArc is so that we could improve performance for users without them needing to change anything, but if they need to call something anyway for cases such as yours, then that is no longer a reason.

al13n321 (Contributor) commented

> Note that this only happens for no_std. With std we use a Mutex.

Ah, I didn't notice that.

> I'm starting to question the value of using LazyArc at all. Maybe we should always parse them all up front. It would greatly reduce the complexity of the cache.

I agree. Maybe the behavior should be:

  • By default, there's no caching. Useful if the user just wants a few units and doesn't want to pay 500 ms.
  • The user may call dwarf.preload_abbreviations() (or whatever), which will populate the abbreviations map right there. There can still be Duplicates and All modes: Duplicates is useful when iterating over units only once (especially from multiple threads), All is useful when iterating multiple times.
  • Or the user may do their own management of abbreviations and use Unit::new_with_abbreviations(); a sketch of this follows after the list. (E.g. group the units by their abbreviations offsets and assign groups to threads, then have a separate abbreviations cache in each thread. I mean it's unlikely that anyone would need this particular scheme in practice, but it seems good to have this sort of freedom in a library.)
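
To make the third option concrete, here is a rough sketch: Unit::new_with_abbreviations is the hypothetical constructor proposed above (not necessarily the API that was merged), while the rest assumes gimli's existing UnitHeader::debug_abbrev_offset and DebugAbbrev::abbreviations methods:

```rust
use std::collections::HashMap;
use std::sync::Arc;

use gimli::ReaderOffset as _; // for into_u64()

// Sketch of manual abbreviations management, keyed by .debug_abbrev offset.
fn iter_units_manual_cache<R: gimli::Reader>(dwarf: &gimli::Dwarf<R>) -> gimli::Result<()> {
    let mut cache: HashMap<u64, Arc<gimli::Abbreviations>> = HashMap::new();
    let mut headers = dwarf.units();
    while let Some(header) = headers.next()? {
        let offset = header.debug_abbrev_offset();
        let abbrevs = match cache.get(&offset.0.into_u64()) {
            Some(a) => a.clone(),
            None => {
                // Parse each abbreviations table once per distinct offset.
                let a = Arc::new(dwarf.debug_abbrev.abbreviations(offset)?);
                cache.insert(offset.0.into_u64(), a.clone());
                a
            }
        };
        // Hypothetical constructor from the bullet above; not (necessarily)
        // what was merged.
        let _unit = gimli::Unit::new_with_abbreviations(dwarf, header, abbrevs)?;
    }
    Ok(())
}
```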

Fwiw, for me the abbreviations cache stood out as surprising behavior: in a mostly straightforward low-level library that just parses bytes when requested, there's this one place that does nontrivial things behind the scenes (caching that imposes memory usage, reference counting, thread synchronization), none of which is mentioned in the documentation for dwarf.unit(). So making it explicit and less magic seems really good.

philipc (Collaborator, Author) commented Sep 13, 2023

Sounds good. I've added a commit to do that.

al13n321 (Contributor) commented

The code looks good to me, and performance on my test is still good. Surprisingly, All seems to be ~5-10% faster than Duplicates (in total populate + iterate time). Maybe it's from the sequential access order, I don't know.

philipc changed the title from "read: add Dwarf::set_abbreviations_cache_strategy" to "read: add Dwarf::populate_abbreviations_cache" on Sep 14, 2023
philipc merged commit 23ebfc8 into gimli-rs:master on Sep 14, 2023
20 checks passed
philipc deleted the abbrev_cache branch on September 14, 2023 at 02:13
philipc (Collaborator, Author) commented Sep 14, 2023

@Swatinem FYI, this removes the automatic caching that was added in #628, so you'll probably want to add a call to populate_abbreviations_cache once this is released.
