
Dynamic tracing TODO #407

Open · 7 of 27 tasks
magnatelee opened this issue Jul 9, 2018 · 6 comments
Assignees: magnatelee
Labels: best effort · enhancement · Legion

Comments

@magnatelee (Contributor) commented Jul 9, 2018

Here is the list of features that are missing from the current dynamic tracing implementation and will likely be added. I'll check off the boxes as I add them to the code. (A minimal sketch of the tracing API these items exercise follows the lists below.)

  • Integration with dynamic control replication
  • Remotely mapped tasks (for now the runtime will raise an error if you have them in a trace)
  • Index operations (e.g. IndexCopyOp and IndexFillOp)
  • TimingOp
  • Fill operations issued on restricted instances
  • Execution fences
  • Close operations mapped to physical instances
  • Gather/scatter copies
  • Predication (at the very least, the runtime should reject a trace that contains predicated operations); see the test case here: https://gitlab.com/StanfordLegion/legion/-/blob/master/language/tests/regent/run_pass/optimize_tracing_invalidate2.rg
  • AttachOp/DetachOp
  • Holding a reference to the region node in each instruction object (to handle the case where the region is deleted in the middle of a trace)
  • Checking restrictions on physical instances in the precondition check (to avoid replaying a recording from an execution with restrictions in a non-restricted context)
  • Precondition checks that use only physical states
  • Supporting non-replayable templates (and their dynamic extensions)
  • Tracing for atomic coherence: the current tracing does not capture Realm reservation acquires/releases
  • Future arguments passed between operations, both within a trace and across traces. I think these are handled "correctly" right now because they aren't traced at all and Legion is still managing them, but that should change if we want to trace them.
  • Restricted coherence on index space tasks is terribly broken (really, really broken); we will silently get wrong answers if people try to use it.
  • Dead code elimination for user event triggers with no preconditions.
  • Safety checks similar to dynamic control replication that will hash all the arguments passed to everything in the trace and ensure that the user isn't violating the conditions of tracing.
  • Compute preimage/image to compute more precise sets of source/destination instances on indirect gather/scatter copies.
  • Force copy aggregation across all equivalence sets on all nodes instead of just across all equivalence sets on a local node when performing trace capture.
  • Add ability to "forget" a trace if the application knows it's never going to use it again (from @opensdh)
  • Detect when two operations are independent and have different priorities, and issue them to Realm in the appropriate order to ensure execution consistent with their priorities (@bandokihiro). @streichler also thinks we might be able to get this functionality automatically by lowering to Realm graphs.
  • The tracing code is not currently general in its capture mechanism. We need to check all the find_event calls because some of them are overly strict. Preconditions can come from outside the trace anywhere, not just in calls to merge events.

These features need some discussion before we decide to add them to the code.

  • Map operations (just to handle the case when the runtime issues them on behalf of the user)
  • Traces that end with reduction tasks (Regent PENNANT generates them when static control replication is turned off)
  • Must epoch tasks
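
For reference, here is a minimal sketch of the tracing API that the items above concern, assuming the standard Legion C++ begin_trace/end_trace calls; `launch_one_timestep` is a hypothetical helper that issues the same sequence of tasks/copies/fills every iteration.

```cpp
// Minimal sketch (assumed, not taken from this issue) of Legion C++ tracing.
#include "legion.h"
using namespace Legion;

void launch_one_timestep(Context ctx, Runtime *runtime);  // hypothetical helper

void time_step_loop(Context ctx, Runtime *runtime, int num_steps) {
  const TraceID TRACE_ID = 42;  // hypothetical application-chosen trace ID
  for (int step = 0; step < num_steps; step++) {
    // Operations issued between begin_trace and end_trace are captured on the
    // first iteration and replayed on later iterations when the captured
    // template's preconditions hold.
    runtime->begin_trace(ctx, TRACE_ID);
    launch_one_timestep(ctx, runtime);
    runtime->end_trace(ctx, TRACE_ID);
  }
}
```
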
@magnatelee self-assigned this Jul 9, 2018
@lightsighter (Contributor) commented

This is not actually dead code:
https://gitlab.com/StanfordLegion/legion/blob/master/runtime/legion/legion_views.cc#L7933
It happens when we do an explicit copy operation, virtual-map the source region to construct a composite view, and that composite view has reduction instances in it. My guess is that we don't have a test case for this scenario, although it shouldn't be too hard to construct one.

@lightsighter (Contributor) commented

I had to disable the memoize tests for circuit_sparse.rg and pennant_fast.rg in the nopaint branch because they both hit the "dead code" assertion from the previous comment.

@magnatelee added the planned label Oct 3, 2019
@magnatelee (Contributor, Author) commented

I checked off the items that I know are handled by either me or Mike. I propose we revisit the list and create a new issue for each outstanding item.

@magnatelee added this to the 20.06 milestone Oct 3, 2019
@magnatelee added the Legion label Oct 3, 2019
@magnatelee modified the milestones: 20.06, 20.09 Jun 8, 2020
@magnatelee added the best effort label and removed the planned label Jun 8, 2020
@streichler modified the milestones: 20.09, 20.12 Oct 1, 2020
@streichler modified the milestones: 20.12, 21.03 Dec 27, 2020
@alexaiken commented

Per Mike's suggestion, I'm adding a comment. We run into non-idempotent traces regularly with S3D when making small extensions (e.g., adding a new boundary condition). I don't think we can expect users to understand and debug this, so I'd suggest a mode under mapper control that allows non-replayable traces to be replayed by issuing copies to satisfy the preconditions when necessary. There is a question whether the system would just pick some instances to move, or whether we should have mapper calls for establishing preconditions that pick which of multiple instances to use.
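
A purely hypothetical sketch of the kind of mapper control being suggested; none of these names exist in the current Legion mapper interface. The idea is that when a captured template's preconditions do not hold, the runtime would ask the mapper whether to issue copies to re-establish them, and which of several candidate instances to use, instead of falling back to a full re-capture.

```cpp
// Hypothetical mapper-facing structures; nothing here is real Legion API.
#include <vector>
#include "legion.h"

struct TraceRepairRequest {
  // hypothetical: one entry per unsatisfied trace precondition, listing the
  // candidate instances that could be copied from to satisfy it
  std::vector<std::vector<Legion::Mapping::PhysicalInstance>> candidate_sources;
};

struct TraceRepairResponse {
  bool issue_repair_copies = false;      // let the runtime move data to satisfy preconditions
  std::vector<unsigned> chosen_sources;  // index of the chosen source for each precondition
};

// Hypothetical mapper callback, analogous in shape to existing map_* calls:
//   virtual void repair_trace_preconditions(const Legion::Mapping::MapperContext ctx,
//                                           const TraceRepairRequest &input,
//                                           TraceRepairResponse &output) = 0;
```
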

@rohany (Contributor) commented Nov 8, 2023

@lightsighter and I have been thinking about the composability of programs that use tracing (especially in a high-level context, such as when user programs in cuNumeric might try to use tracing). There seem to be two main problems around tracing in this area:

  1. If tracing annotations are added by the user, then when composing code it is possible for a user to trace a loop that calls another function which is itself tracing some internal part of its code behind an API call.
  2. End users are generally not knowledgeable enough to put traces around code / understand when tracing is possible / understand when changes inside their loop structure may invalidate traces.

We've been thinking so far about two solutions to these problems.

The first solution is supporting nested traces, which fixes problem 1 but doesn't address problem 2. Supporting nested traces would allow arbitrary composition of code that uses tracing, since composing two pieces of traced code corresponds to simply nesting one trace record/replay inside an existing record/replay. I don't know that much about the implementation of tracing, but Mike says that something like this would not be the hardest thing to do.
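
To illustrate, here is a sketch of the composition scenario from problem 1, assuming the standard Legion C++ begin_trace/end_trace calls. `library_solve` is a hypothetical library routine (e.g., something a cuNumeric-backed library might call) that already traces its own internal loop with its own TraceID; making this nesting legal is what nested-trace support would provide.

```cpp
#include "legion.h"
using namespace Legion;

void library_solve(Context ctx, Runtime *runtime);  // hypothetical: calls begin_trace/end_trace internally

void user_program(Context ctx, Runtime *runtime, int num_steps) {
  const TraceID OUTER_TRACE = 1;  // hypothetical user-level trace ID
  for (int step = 0; step < num_steps; step++) {
    runtime->begin_trace(ctx, OUTER_TRACE);
    // The user cannot see that library_solve traces behind its API; with
    // nested-trace support the inner record/replay would simply nest inside
    // this outer one.
    library_solve(ctx, runtime);
    runtime->end_trace(ctx, OUTER_TRACE);
  }
}
```
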

The second solution is to move towards more automation inside Legion, where we automatically detect when programs are replaying the same sequence of operations and replay traces when we identify memoizable operation sequences. This solution solves both problems 1 and 2. Mike is already planning to build infrastructure that would help with identifying when repeated sequences of operations occur. The main difficulty is deciding what to do when the runtime has decided that a trace should be replayed, but the application's operation stream then diverges from what the runtime predicted would happen. A potential solution, inspired by JIT compilers, is the following:

  1. Given an operation stream $O_1, \ldots, O_n$, annotate the memoized trace graph with frontiers for each operation $O_i$.
  2. When the application issues an operation $O_j$ that diverges from the memoized operation stream, replay the memoized graph up to operation $O_{j-1}$, which we know how to do from the frontiers created in the prior step (see the sketch after this list).
    Maintaining this information through optimizations on the trace graph seems possible, except for one optimization (I forget the name of this one; do you remember, @lightsighter?).
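
A toy model (plain C++, not Legion code) of the frontier idea above, under the assumption that each captured operation records a cutoff into the replayable graph marking how much must be replayed to reach the state just after that operation:

```cpp
#include <cstddef>
#include <string>
#include <vector>
#include <iostream>

struct MemoizedTrace {
  std::vector<std::string> ops;      // the operation stream that was captured
  std::vector<std::size_t> frontier; // frontier[i]: replay cutoff after ops[i]
};

// Returns how far into the captured graph we can replay before handing the
// rest of `incoming` back to the normal dynamic analysis pipeline.
std::size_t replay_prefix(const MemoizedTrace &trace,
                          const std::vector<std::string> &incoming) {
  std::size_t i = 0;
  while (i < trace.ops.size() && i < incoming.size() && trace.ops[i] == incoming[i])
    ++i;                              // longest matching prefix O_1..O_{j-1}
  return (i == 0) ? 0 : trace.frontier[i - 1];
}

int main() {
  MemoizedTrace t{{"task_A", "copy_B", "task_C"}, {3, 7, 12}};
  std::vector<std::string> incoming{"task_A", "copy_B", "fill_D"};  // diverges at O_3
  std::cout << "replay up to graph node " << replay_prefix(t, incoming) << "\n";
  return 0;
}
```
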

There is the potential to take this further and push the granularity of memoization down to the operation level: if the preconditions for an individual operation are satisfied, we could skip/replay the physical analysis for just that operation. Since checking preconditions is the expensive part, when we see a prefix of operations that we have seen before, followed by operations that we haven't, we could replay the analysis of the operations in the prefix and just have the physical analysis effects (equivalence set updates, etc.) replayed for the final operation in the stream, so that everything after the prefix goes through the pipeline normally. Something like this is reminiscent of what @magnatelee wanted to see in Legate, where if the Legate runtime is consistently making the same decisions, analysis costs should go down.

The second solution (and the extension to it) is more forward-looking than the first, but at the same time we don't currently have any programs that would not be handled by nested tracing but would be handled by automatic tracing.

@lightsighter (Contributor) commented

> Maintaining this information through optimizations on the trace graph seems possible, except for one optimization (I forget the name of this one, do you remember @lightsighter).

Dead code elimination. Just because something is dead code in the context of an entire trace doesn't mean it is actually dead when the trace is replayed with different downstream operations.

I think we don't necessarily need to pick between the two approaches. The important thing is to create a framework for tracing that allows us to explore the trade-offs; the current implementation is too rigid for that. I think we can make the current implementation work just as efficiently with a more "operation-based" implementation that looks backwards at the operations that came before each operation and infers whether it can be replayed or needs to redo its analysis. If we do that, then I think we can explore both nested tracing and dynamic discovery of traces.
