[TIR] Use IndexMap to transform NDArray #12949

masahi · 2022-09-30T08:01:19Z

I've hit a weird use case where I want to manually transform runtime::NDArray (attached to AllocateConst node) according to the index map used in transform_layout. This is needed to support AllocateConst node in Metaschedule RewriteLayout postproc.

I can define it as a free function in the file where it is actually used. Having it available as part of the IndexMap interface makes it convenient to expose this to python and unit-test it. Let me know if this is a reasonable API addition.

cc @vinx13 @Lunderberg @junrushao

Lunderberg

This looks like a fantastic addition to the API, and very useful functionality. My one question is whether we want to use DeviceAPI::CopyDataFromTo to avoid the round trip from device to host and back.

In addition to the AllocateConst usecase you mentioned, it would also be very useful in preparing input/output/expected buffers for unit tests, rather than needing to specify the transformation logic in both IndexMap and np.transpose calls.

masahi · 2022-09-30T20:04:17Z

My one question is whether we want to use DeviceAPI::CopyDataFromTo

@Lunderberg I've just looked into this possibility, I wish I could use DeviceAPI::CopyDataFromTo but it's a protected method so I cannot use it from IndexMap

tvm/include/tvm/runtime/device_api.h

Line 230 in 8d60b3c

    
           virtual void CopyDataFromTo(const void* from, size_t from_offset, void* to, size_t to_offset,

Other variants of CopyDataFromTo all seem to take DLTensor as argument, so I cannot copy byte slices (element-by-element copy).

Lunderberg · 2022-09-30T20:51:38Z

Rats. I figured it was worth a shot. Two additional questions coming to mind:

Would it work to construct a DLTensor of shape [1], then iterate over DLTensor::byte_offset for each element? I think that would work within the public methods using DLTensor, but the per-element overhead might be large.
If there the per-element overhead of a virtual function call is large, would it be worth using DetectIterMap to attempt to regions that are contiguous in both source and destination layout, in order to copy those entire regions?

masahi · 2022-09-30T23:19:32Z

I agree that (1) is possible, but making such small copies on the device side sounds very slow (e.g. GPU), which would defeat the purpose of removing the host - device round trip.

On (2), it sounds like it would add too much complexity to the otherwise very simple code. I'm not sure how such "contiguous region detection" is effective in practice, but if @vinx13 and @junrushao also think it's a good idea, I'm happy to explore this approach.

junrushao · 2022-09-30T23:31:05Z

This is definitely cool Masa! Similar functionality existed in Relay's FoldConstant where we are explicitly doing post-hoc layout rewriting, where layout is inferred from TIR (ugly admittedly), and the difference is that it's actually compiling the transformation into multi-threaded TIR which is potentially faster

junrushao · 2022-09-30T23:34:49Z

I'm not sure how such "contiguous region detection" is effective in practice

It sounds a bit more complicated than the PR is supposed to be. In light that we have FoldConstant in Relay, shall we consider it as future work and potentially merge those two functionalities together? I'm not super sure

Lunderberg · 2022-10-03T15:52:48Z

Good points. I'd been thinking in terms of a small number of discontiguous breaks between largely contiguous regions, but that probably would be rather rare. Agreed that it isn't worth the extra complexity.

masahi · 2022-10-05T08:57:23Z

@Lunderberg @vinx13 @junrushao Can we merge this? I have another PR ready to be sent that depends on this.

junrushao

Sure. LGTM!

I've hit a weird use case where I want to manually transform `runtime::NDArray` (attached to `AllocateConst` node) according to the index map used in `transform_layout`. This is needed to support `AllocateConst` node in Metaschedule `RewriteLayout` postproc. I can define it as a free function in the file where it is actually used. Having it available as part of the `IndexMap` interface makes it convenient to expose this to python and unit-test it. Let me know if this is a reasonable API addition.

masahi added 5 commits September 30, 2022 16:16

[TIR] Transform NDArray by IndexMap

283b149

add more tests

2385ba6

clean

133a320

add doc

7cb0494

clang format

0bc49f4

github-actions bot requested review from Lunderberg, junrushao and vinx13 September 30, 2022 08:01

masahi added 4 commits September 30, 2022 17:07

add rank check

c779448

cpplint

4e36b82

fix compile warning

e05ef0f

add test for inverse

745acc1

Lunderberg reviewed Sep 30, 2022

View reviewed changes

junrushao approved these changes Oct 5, 2022

View reviewed changes

junrushao merged commit 9618e6a into apache:main Oct 5, 2022

masahi mentioned this pull request Oct 12, 2022

[MetaSchedule] Allow skipping exact NDArray rewrite in RemoveWeightLayoutRewriteBlock #13052

Merged

leandron mentioned this pull request Feb 1, 2023

TVM v0.11.0 Release Candidate Notes #13899

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TIR] Use IndexMap to transform NDArray #12949

[TIR] Use IndexMap to transform NDArray #12949

masahi commented Sep 30, 2022

Lunderberg left a comment

masahi commented Sep 30, 2022 •

edited

Lunderberg commented Sep 30, 2022

masahi commented Sep 30, 2022

junrushao commented Sep 30, 2022

junrushao commented Sep 30, 2022

Lunderberg commented Oct 3, 2022

masahi commented Oct 5, 2022

junrushao left a comment

[TIR] Use IndexMap to transform NDArray #12949

[TIR] Use IndexMap to transform NDArray #12949

Conversation

masahi commented Sep 30, 2022

Lunderberg left a comment

Choose a reason for hiding this comment

masahi commented Sep 30, 2022 • edited

Lunderberg commented Sep 30, 2022

masahi commented Sep 30, 2022

junrushao commented Sep 30, 2022

junrushao commented Sep 30, 2022

Lunderberg commented Oct 3, 2022

masahi commented Oct 5, 2022

junrushao left a comment

Choose a reason for hiding this comment

masahi commented Sep 30, 2022 •

edited