memory planning: add offset to planning output and respect it in graph executor #8134

rafzi · 2021-05-25T21:48:05Z

This adds the ability to specify an offset in the results of memory planning, so that buffers can be placed at positions other than the base address. It also lets buffers partially overlap, which enables optimal buffer placements.
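As a rough illustration of the idea (not the code in this PR), the sketch below models a planning result in which every buffer carries a storage id plus a byte offset, and an executor-like helper hands out views at those offsets. All names here (`PlannedBuffer`, `conflicts`, `storage_sizes`, `materialize`, the lifetime fields) are hypothetical and chosen only for this example; they are not TVM APIs.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PlannedBuffer:
    """Hypothetical planning record; field names are assumptions."""
    name: str
    storage_id: int  # which backing allocation the buffer lives in
    offset: int      # byte offset from that allocation's base address
    size: int        # size in bytes
    first_use: int   # index of the first op that touches the buffer
    last_use: int    # index of the last op that touches the buffer


def conflicts(a, b):
    """Return True if two buffers share bytes while both are live."""
    if a.storage_id != b.storage_id:
        return False
    bytes_overlap = a.offset < b.offset + b.size and b.offset < a.offset + a.size
    lives_overlap = a.first_use <= b.last_use and b.first_use <= a.last_use
    return bytes_overlap and lives_overlap


def storage_sizes(plan):
    """Size each storage so that it covers the furthest byte any buffer uses."""
    sizes = {}
    for buf in plan:
        end = buf.offset + buf.size
        sizes[buf.storage_id] = max(sizes.get(buf.storage_id, 0), end)
    return sizes


def materialize(plan):
    """Allocate each storage once and return views at the planned offsets."""
    pools = {sid: bytearray(n) for sid, n in storage_sizes(plan).items()}
    return {
        b.name: memoryview(pools[b.storage_id])[b.offset:b.offset + b.size]
        for b in plan
    }


# "c" partially overlaps both "a" and "b" in memory, but is only live
# after both of them are dead, so the placement is valid.
plan = [
    PlannedBuffer("a", storage_id=0, offset=0, size=64, first_use=0, last_use=1),
    PlannedBuffer("b", storage_id=0, offset=64, size=32, first_use=1, last_use=2),
    PlannedBuffer("c", storage_id=0, offset=32, size=48, first_use=3, last_use=4),
]
```

Without an offset, a buffer could only reuse a storage starting at its base address; with it, a planner is free to place `c` across the boundary between `a` and `b`, as long as no two buffers that are live at the same time share bytes.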

More details in this discussion: https://discuss.tvm.apache.org/t/discussion-alignment-memory-planning/9730

rafzi · 2021-06-12T21:20:22Z

It seems that TVM's long-term plans conflict with this approach, in that memory planning is meant to happen in TIR.

Is this something that is useful to TVM right now? Should I continue work on this or drop it in favor of the upcoming approach?

areusch · 2021-06-14T01:36:25Z

@rafzi apologies for the delay in reviewing this one. I'm not sure there is broad alignment yet on the way we intend to do full-graph memory planning in TVM. And even when we do come to agreement on a model for memory (which I think may look similar to the one you're working towards here), we still need to implement support for it in both the Graph and AOT executors. Also, the Graph executor invokes TIR PrimFuncs, so it's likely something similar to this PR will be useful. My thinking is that what you have here is fairly close, and we'll just need to rename fields or add additional ones (e.g. pool_id) to give more context to the offset.

So I'm not convinced we should drop this PR; however, before proceeding, I'd like to get everyone aligned around a single memory planning proposal. There are a couple of theoretically orthogonal pieces of such a proposal as well: a) the interface between the TVM graph and the memory planner; b) the algorithm(s) used in planning; c) the interface between TVM and the executors. At present there are two suggestions for (a): a TIR-level interface and a Relay-level planner. I think the TIR-based planner offers more flexibility, but the Relay one is easier to implement (e.g. it's nearly complete in the tree today).

Would you be interested in reviewing the TIR-level interface proposed in the USMP RFC? It would be great to get your thoughts on whether it's possible to implement the algorithms you've proposed using that interface as well.

Given there is some interest from the community in doing whole-program TIR optimization, plus the AOT top-level function is in TIR, it may be slightly more impactful to adopt that interface. However, I'd like to understand whether that precludes including the algorithms you've proposed here. Finally, this PR could serve as a basis to implement the Graph executor changes required to support (c).

Let me know your thoughts!

jroesch · 2022-01-19T20:16:13Z

This PR appears to be out of date; please feel free to reopen it if this is not the case.

As part of the new year we are attempting to triage the project's open pull requests to ensure that code which is ready for review and/or merging receives adequate attention.

Thanks again for your contribution, and feel free to reach out to discuss these changes.

rafzi force-pushed the storage_offset branch from af0565d to 09652e2 Compare May 26, 2021 12:21

memory planning: add offset to planning output and respect it in graph executor

245396e

rafzi force-pushed the storage_offset branch from 09652e2 to 245396e Compare May 26, 2021 16:31

fix tests

ff240c0

areusch self-assigned this Jun 3, 2021

areusch added the status: need review label Jun 3, 2021

jroesch closed this Jan 19, 2022
