Eliminate heap allocations in event dispatching #14543

edrumwri · 2021-01-19T19:51:02Z

With some care, heap allocations can generally be avoided (except during a "warm-up" phase) as a System's State is updated over time, i.e., through integration, discrete updates, and unrestricted updates, at least with regard to the Systems framework code.

Given the following code:

Simulator sim(my_system);
sim.Initialize();

in which any necessary heap allocations can be performed in Initialize(), and assuming that my_system performs no heap allocations outside of that warm-up period, calls to sim.AdvanceTo(t_final) would ideally perform no heap allocations.
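One way to check the "allocate only during warm-up" property is to count allocations before and after the stepping loop. The following is a minimal sketch, not Drake code: `CountingAllocator`, `ToySimulator`, `AdvanceOneStep()`, and `CountAllocationsAfterWarmUp()` are hypothetical names invented for illustration, and the counter only sees allocations made through that allocator rather than every heap transaction in the process.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <vector>

// Hypothetical counter: incremented on every allocation that goes through
// CountingAllocator. It does not observe other heap use in the process.
static std::size_t g_num_allocations = 0;

// Minimal standard-conforming allocator that counts calls to allocate().
template <typename T>
struct CountingAllocator {
  using value_type = T;
  CountingAllocator() = default;
  template <typename U>
  CountingAllocator(const CountingAllocator<U>&) noexcept {}
  T* allocate(std::size_t n) {
    ++g_num_allocations;
    return std::allocator<T>().allocate(n);
  }
  void deallocate(T* p, std::size_t n) noexcept {
    std::allocator<T>().deallocate(p, n);
  }
};
template <typename T, typename U>
bool operator==(const CountingAllocator<T>&,
                const CountingAllocator<U>&) noexcept { return true; }
template <typename T, typename U>
bool operator!=(const CountingAllocator<T>&,
                const CountingAllocator<U>&) noexcept { return false; }

// Toy stand-in for a simulator: all storage is acquired in Initialize(),
// and each step only reads and writes that existing storage.
class ToySimulator {
 public:
  void Initialize() { scratch_.resize(64); }  // Warm-up: allocates once.
  void AdvanceOneStep() {
    for (double& x : scratch_) x += 1.0;      // Steady state: no allocation.
  }

 private:
  std::vector<double, CountingAllocator<double>> scratch_;
};

// Returns the number of counted allocations that occur after warm-up.
std::size_t CountAllocationsAfterWarmUp(int num_steps) {
  ToySimulator sim;
  sim.Initialize();
  const std::size_t before = g_num_allocations;
  for (int i = 0; i < num_steps; ++i) sim.AdvanceOneStep();
  return g_num_allocations - before;
}
```

Under these assumptions, `CountAllocationsAfterWarmUp(100)` returns zero, which is the behavior the issue asks of `AdvanceTo()`.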

Heap allocations encountered by AdvanceTo() are presently found in the following places (not a comprehensive list):

drake/systems/analysis/simulator.cc

Line 211 in f20d676

auto merged_events = system_.AllocateCompositeEventCollection();

drake/systems/framework/diagram.cc

Line 857 in f20d676

std::vector<T> times(num_subsystems());

drake/systems/framework/event.h

Line 770 in f20d676

auto event = std::unique_ptr<UnrestrictedUpdateEvent<T>>(this->DoClone());

drake/systems/framework/event_collection.h

Line 380 in f20d676

this->add_event(static_pointer_cast<EventType>(other_event->Clone()));

I'm willing to help eliminate these. Fixing the last two would seem to require a redesign.

sherm1 · 2021-01-20T00:51:11Z

Thanks, @edrumwri !

* simulator: Add test to track heap hygiene
  Relevant to: #14543, #14802
  This is the beginning of a PR train to make heapless simulation possible, with careful system construction. All the good ideas are inspired by @edrumwri's PR #14707; all the sketchy ones are mine.

Relevant to: RobotLocomotion#14543
This is the second of a long PR train to make heapless simulation possible, with careful system construction. Inspired by @edrumwri's PR
This patch just moves some method-level heap use into longer-lived object data; the data flows are the same, but storage gets reused over successive AdvanceTo() steps.
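The idea of moving method-level heap use into longer-lived object data can be sketched as follows. This is an illustrative assumption, not the actual patch: `ToyDiagram`, `EarliestEventTime()`, and `scratch_data()` are hypothetical names, and the event times are made up. The scratch vector is sized once at construction and overwritten on each call instead of being created inside the method.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Toy stand-in for a diagram that needs a per-subsystem scratch buffer.
class ToyDiagram {
 public:
  // The buffer is allocated once here. Assumes num_subsystems >= 1.
  explicit ToyDiagram(std::size_t num_subsystems) : times_(num_subsystems) {}

  // Overwrites the existing buffer with each subsystem's next (made-up)
  // event time and returns the earliest one. No allocation happens here
  // because the vector's size never changes.
  double EarliestEventTime(double now) {
    for (std::size_t i = 0; i < times_.size(); ++i) {
      times_[i] = now + 0.1 * static_cast<double>(i + 1);
    }
    return *std::min_element(times_.begin(), times_.end());
  }

  // Exposes the buffer address so a caller can confirm storage is reused.
  const double* scratch_data() const { return times_.data(); }

 private:
  std::vector<double> times_;  // Longer-lived storage, reused every call.
};
```

The data flow is the same as with a function-scoped `std::vector`; only the lifetime of the storage changes, so repeated calls see the same buffer address.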

Relevant to: RobotLocomotion#14543
This is the third of a long PR train to make heapless simulation possible, with careful system construction. Inspired by @edrumwri's draft PR RobotLocomotion#14707.
This patch, like the last, just moves some method-level heap use into longer-lived object data; the data flows are the same, but storage gets reused over successive simulation steps.

* diagram: Refactor allocation for analyzing events by time
  Relevant to: #14543
  This is the third of a long PR train to make heapless simulation possible, with careful system construction. Inspired by @edrumwri's draft PR #14707.
  This patch, similar to the last, moves some method-level heap use into longer-lived data managed as a cache entry; the data flows are the same, but storage gets reused over successive simulation steps.

Relevant to: RobotLocomotion#14543
This is the fourth of a long PR train to make heapless simulation possible, with careful system construction. Inspired by @edrumwri's draft PR RobotLocomotion#14707.
This patch characterizes the heap allocations induced by all forms of PublishEvents. It also improves naming and commentary. Subsequent patches will reduce the measured heap usage from simulator run-time.

rpoyner-tri · 2021-04-27T20:33:20Z

While fiddling with writing a test for simulator/event heap usage, I discovered a fun (but in retrospect obvious) additional problem: std::vectors will reallocate when they overflow. See #14950 (review) for some discussion from the test case PR.

It might be possible to count or estimate the event population at init time, and do the necessary pre-allocations. At worst (yuck), it might be possible to add API for some event collection size hint that a user could supply. Something along these lines would be necessary to completely eliminate the possibility of heap transactions during AdvanceTo().

sherm1 · 2021-04-27T21:09:06Z

I don't think the goal has to be "no heap allocations in AdvanceTo()". We could make it "no heap allocations in AdvanceTo() for typical uses" (which would include Evan's). For that we could reserve (say) 128 entries in the vectors that hold simultaneous events. I believe it would be a rare simulation that would need more, but all that would happen is a single heap allocation and some copying upon the 129th event, after which nothing would happen until there were 257 simultaneous events!
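The effect of reserving a fixed number of entries can be sketched with a plain `std::vector`. This is a minimal illustration, not Drake code: `CountReallocations()` is a hypothetical helper, and `int` stands in for an event type. Note that the growth factor after the reserved capacity is exceeded is implementation-defined, so the "257" figure above holds only for implementations that double the capacity.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Returns how many times the vector's capacity changes while `count`
// elements are appended after reserving room for `reserved` elements.
// A capacity change implies a reallocation (heap allocation plus copying).
std::size_t CountReallocations(std::size_t reserved, std::size_t count) {
  std::vector<int> events;
  events.reserve(reserved);  // Guarantees capacity() >= reserved.
  std::size_t reallocations = 0;
  std::size_t capacity = events.capacity();
  for (std::size_t i = 0; i < count; ++i) {
    events.push_back(static_cast<int>(i));
    if (events.capacity() != capacity) {
      ++reallocations;
      capacity = events.capacity();
    }
  }
  return reallocations;
}
```

With 128 entries reserved, appending up to 128 elements never reallocates. Appending a 129th triggers at most one reallocation, after which the new capacity depends on the standard library in use.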

rpoyner-tri · 2021-04-27T21:15:48Z

Agreed that we could do a fixed pre-allocation that would cover most cases. I'm content to have that as a fallback, and explore the possibility of system-aware estimates when I get to that step.

* framework: Avoid functor allocation for PublishEvents (breaking change)
  Relevant to: #14543
  This is the fifth of a long PR train to make heapless simulation possible, with careful system construction. Inspired by @edrumwri's draft PR #14707.
  This patch introduces a new callback type for PublishEvents that avoids heap allocations induced by lambda captures. Subsequent patches will expand this technique (and test coverage) to the other event types.
  This is a breaking change because it removes the one-argument handle() method and replaces it with a two-argument form. The known uses inside Drake are inside the system framework, or in tests. All of those are updated. There are no known uses outside of Drake.
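A general way to avoid allocations caused by lambda captures is to pass the needed objects to the callback as arguments, so the callable itself captures nothing. The sketch below illustrates that technique only; it does not reproduce Drake's actual callback type or signatures. `ToySystem`, `ToyContext`, `ToyPublishEvent`, `PublishCallback`, and `MakeCountingEvent()` are all hypothetical names. A `std::function` may heap-allocate when its stored callable is too large for the implementation's small-object buffer, whereas a captureless lambda converts to a plain function pointer that needs no storage beyond the pointer.

```cpp
#include <cassert>

// Toy stand-ins for the framework types (assumptions, not Drake classes).
struct ToySystem { int publish_count = 0; };
struct ToyContext { double time = 0.0; };
struct ToyPublishEvent;

// The callback receives everything it needs as arguments, so it does not
// have to capture the system or the event.
using PublishCallback = void (*)(ToySystem&, const ToyContext&,
                                 const ToyPublishEvent&);

struct ToyPublishEvent {
  PublishCallback callback = nullptr;

  // Two-argument dispatch: the caller supplies the system and the context,
  // and the event forwards itself as the third callback argument.
  void handle(ToySystem& system, const ToyContext& context) const {
    if (callback != nullptr) callback(system, context, *this);
  }
};

// Builds an event whose callback is a captureless lambda. Such a lambda
// converts implicitly to a function pointer.
ToyPublishEvent MakeCountingEvent() {
  ToyPublishEvent event;
  event.callback = [](ToySystem& system, const ToyContext&,
                      const ToyPublishEvent&) { ++system.publish_count; };
  return event;
}
```

The trade-off is that every caller of `handle()` must now supply the system, which matches the kind of signature change the commit message describes as breaking.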

Relevant to: RobotLocomotion#14543
This is the sixth of a long PR train to make heapless simulation possible, with careful system construction. Inspired by @edrumwri's draft PR RobotLocomotion#14707.
This patch expands heap-allocation testing to cover all event types and schedules. Subsequent patches will remove the heap allocations tracked here.

* framework: Avoid functor allocation for all event types (breaking change)
  Relevant to: #14543
  This is the seventh of a long PR train to make heapless simulation possible, with careful system construction. Inspired by @edrumwri's draft PR #14707.
  This patch extends the techniques of PR #14969 to all of the event types. Subsequent patches will address other sources of heap allocation.
  This is a breaking change because it removes the two-argument handle() methods and replaces them with a three-argument form. The known uses inside Drake are inside the system framework, or in tests. All of those are updated. There are no known uses outside of Drake.

* LeafSystem: remove allocations during simulation
  Relevant to: #14543
  Replace a function-scoped vector with a cache entry. Decline all invalidation services and use manual methods to ensure no data migrates between uses.

* framework: Remove some allocations from event collections
  Relevant to: #14543
  Rewrite the storage of event collections to avoid most allocations, while maintaining most of the pre-existing public API. To support this change, allow fully-derived-type events to be copied and assigned. The Clone() mechanism is still supported, primarily for interfacing with Python. This patch also deprecates EventCollection<E>::add_event() and any overrides.
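Storing copyable events by value, rather than as individually cloned heap objects, can be sketched like this. It is a simplified illustration under stated assumptions, not the rewritten Drake storage: `ToyEvent`, `ToyEventCollection`, `AddEvent()`, and `Clear()` are hypothetical names. Each added event is copied into a contiguous vector, and clearing the collection keeps the vector's capacity, so refilling it up to its earlier size needs no new allocation.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy event that is cheap to copy and assign (an assumption for the sketch).
struct ToyEvent {
  int trigger_type = 0;
  double time = 0.0;
};

// Toy collection that owns its events by value in one contiguous buffer.
class ToyEventCollection {
 public:
  // Copies the event into the buffer; no per-event heap object is created.
  // The buffer itself grows only when size() would exceed its capacity.
  void AddEvent(const ToyEvent& event) { events_.push_back(event); }

  // Removes all events but keeps the buffer for reuse on the next step.
  void Clear() { events_.clear(); }

  std::size_t size() const { return events_.size(); }
  const ToyEvent* data() const { return events_.data(); }

 private:
  std::vector<ToyEvent> events_;
};
```

After one warm-up pass that fills the collection, later passes that add no more events than before reuse the same storage.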

Relevant to: RobotLocomotion#14543
Remove unnecessary heap allocations from the rest of the event system.

rpoyner-tri · 2021-06-02T19:13:09Z

Looks like all of the heap fixes I had are now merged. Closing.

sherm1 self-assigned this Jan 20, 2021

sherm1 added the component: system framework, priority: medium, team: dynamics, and type: performance labels Jan 20, 2021

sherm1 mentioned this issue Feb 26, 2021

[DRAFT] Removes most heap allocations from the Event system #14707

Closed

rpoyner-tri mentioned this issue Apr 13, 2021

simulator: Add test to track heap hygiene #14900

Merged

rpoyner-tri mentioned this issue Apr 16, 2021

simulator: Remove allocations in AdvanceTo() #14912

Merged

rpoyner-tri self-assigned this Apr 19, 2021

rpoyner-tri mentioned this issue Apr 20, 2021

diagram: Refactor allocation for analyzing events by time #14929

Merged

rpoyner-tri mentioned this issue Apr 26, 2021

simulator_limit_malloc_test: Expand coverage of PublishEvents #14950

Merged

rpoyner-tri mentioned this issue May 10, 2021

framework: Avoid functor allocation for all event types #15032

Merged

rpoyner-tri mentioned this issue May 21, 2021

LeafSystem: remove allocations during simulation #15067

Merged

rpoyner-tri mentioned this issue May 24, 2021

framework: Remove some allocations from event collections #15081

Merged

rpoyner-tri added a commit to rpoyner-tri/drake that referenced this issue May 28, 2021

framework: Remove more heap allocations from events

b129342

Relevant to: RobotLocomotion#14543 Remove unnecessary heap allocations from the rest of the event system.

rpoyner-tri mentioned this issue May 28, 2021

framework: Remove more heap allocations from events #15101

Merged

rpoyner-tri added a commit to rpoyner-tri/drake that referenced this issue Jun 2, 2021

framework: Remove more heap allocations from events

662c28b

Relevant to: RobotLocomotion#14543 Remove unnecessary heap allocations from the rest of the event system.

sammy-tri pushed a commit that referenced this issue Jun 2, 2021

framework: Remove more heap allocations from events (#15101)

9731d9a

Relevant to: #14543 Remove unnecessary heap allocations from the rest of the event system.

rpoyner-tri closed this as completed Jun 2, 2021
