1. Rebased concatenation optimization. by shtinsa · Pull Request #11087 · apache/tvm

shtinsa · 2022-04-21T08:14:26Z

Workaround to resolve issue with tvm.tir.expr.SizeVar added

Add the `configs` directory to the installed version of TVM in the setuptools configuration, and introduce a new function that loads the `configs` directory from the right path both when TVM is installed locally for development and when it is installed as a package.

* Fix a bug so that microTVM can be used with Arduino version v0.20 and above (see changes to _parse_connected_boards), and add relevant unit tests.
* Only perform the version check when calling build or flash (the operations that actually require arduino-cli), and add relevant unit tests.
* Only raise a warning if the arduino-cli version present is below the minimum version (previously, any version other than v0.18 would cause an error).
* Change version comparison to use version.check, like the rest of TVM.
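The "warn instead of raise" behavior described above can be sketched as follows. This is a minimal illustration, not the microTVM implementation: `MIN_ARDUINO_CLI_VERSION`, `parse_version`, and `check_arduino_cli_version` are hypothetical names, and the minimum of 0.18.0 is an assumption taken from the v0.18 mentioned in the commit message.

```python
import warnings

# Assumption: the minimum supported version is 0.18.0 (not stated in the source).
MIN_ARDUINO_CLI_VERSION = (0, 18, 0)


def parse_version(text: str) -> tuple:
    """Turn a string such as 'v0.20.2' or '0.20.2' into a tuple of ints."""
    return tuple(int(part) for part in text.lstrip("v").split("."))


def check_arduino_cli_version(found: str) -> bool:
    """Hypothetical helper: warn, rather than raise, when the version is too old."""
    if parse_version(found) < MIN_ARDUINO_CLI_VERSION:
        warnings.warn(
            f"arduino-cli {found} is older than the minimum supported version"
        )
        return False
    return True


# A newer release than the minimum is accepted without any warning.
ok = check_arduino_cli_version("0.20.2")
```

Comparing tuples of integers avoids the string-comparison pitfall where "0.9" would sort after "0.20". A check like this would only need to run from the build and flash entry points, as the commit message describes.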

Co-authored-by: Siyuan Feng <Hzfengsy@sjtu.edu.cn>
Co-authored-by: Bohan Hou <32121147+spectrometerHBH@users.noreply.github.com>
Co-authored-by: Hongyi Jin <3231950289@qq.com>
Co-authored-by: Ruihang Lai <lairuihangdongdong@qq.com>
Co-authored-by: Wuwei Lin <wuwei@apache.org>

Previously, the amount of memory to allocate was calculated as width multiplied by height. This does not work well when one texture is large in height and the next one is large in width. We tried to reuse the allocated memory, and every time the next large texture was used, the previous one was reallocated. This had a large impact on performance. Now the two dimensions are checked independently, which avoids cyclic memory reallocation.
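The difference between the two checks can be sketched in a few lines. This is a hypothetical model, not TVM's texture pool: `Texture2D`, `fits_by_area`, `fits_by_dims`, and `simulate` are names invented for illustration, and growing the pooled texture to the per-dimension maximum is an assumption about how reuse could work.

```python
from dataclasses import dataclass


@dataclass
class Texture2D:
    """Hypothetical stand-in for an allocated 2D texture."""
    width: int
    height: int


def fits_by_area(allocated: Texture2D, width: int, height: int) -> bool:
    # Area-only check: cannot tell a tall texture from a wide one.
    return allocated.width * allocated.height >= width * height


def fits_by_dims(allocated: Texture2D, width: int, height: int) -> bool:
    # Per-dimension check: both width and height must be large enough.
    return allocated.width >= width and allocated.height >= height


def simulate(requests):
    """Count reallocations when the pool grows to the per-dimension maximum."""
    pool = Texture2D(0, 0)
    reallocations = 0
    for width, height in requests:
        if not fits_by_dims(pool, width, height):
            pool = Texture2D(max(pool.width, width), max(pool.height, height))
            reallocations += 1
    return pool, reallocations


# Alternate between a tall and a wide texture of equal area.
requests = [(4, 1024), (1024, 4)] * 3
pool, reallocations = simulate(requests)
```

In this model the tall and wide requests have the same area, so an area-only comparison reports that one fits in the other even though a dimension is too small. With the per-dimension check the pool settles at 1024 x 1024 after two allocations and is reused for the remaining requests.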

…11060)

* [TVMScript] Allow `val = buf[index]` without type annotation

  Other instances of `var = expr` were previously allowed without requiring a type annotation, by using the dtype of the expression as the dtype of `var`. This behavior didn't work for `buf[index]` expressions, which are internally represented as `BufferSlice` Python objects and only converted to `BufferLoad` PrimExprs when used as an expression. This commit adds a `dtype` property to `BufferSlice`, allowing `buf[index]` to be used in a let statement without a type annotation.

* Reverted a wider change

  Automatically adding a type annotation to Var if it could be determined from the dtype let the unit test directly compare the annotated and unannotated versions of buffer load. Unfortunately, it also broke 54 unrelated tests, so that change is removed from this PR.

* [TIR] StmtFunctor RenewDefs

  In this PR, I introduce a StmtFunctor `RenewDefs` that deep-copies all definition nodes in a PrimFunc (including Var, Buffer, and IterVar). This functor can create a new PrimFunc with the same behavior as the old one but with different nodes. It may help when fusing or inlining multiple PrimFuncs.

* add ut
* address comments
* address comments
* lint
* lint

* [ROOFLINE] Roofline analysis over RPC

  Run roofline analysis on remote devices if requested. Peak flops and peak bandwidth estimation are done on the remote device.

* allocate testing arrays directly on device and randomly fill
* forgot to include remote
* lower flops ratio, machine may be using multiple threads
* forgot fill

* add relay.f.frontend.fm_oneflow support cnns
* support cuda
* fix mobilenetv2 and reviews
* fix: model without meta info
* support eager and yolo, add test
* fix: license
* add: tutorials
* fix: support new graph
* fix some comments
* refine
* fix concat op convert bug
* refine
* refine
* change cuda to cpu
* fix bug
* fix ci error in tvm
* fix pylint check
* delete useless file
* add skimage package in docker
* fix ci error
* fix bug
* add oneflow frontend test in ci
* merge conflict
* fix tutorial
* try to find error in ci
* revert
* merge conflict
* black oneflow
* Delete from_oneflow.py
* restructure oneflow frontend
* support vision-transformer
* black format
* update black version and reformat
* fix ci error
* fix doc error
* fix gpu frontend test failed

Co-authored-by: hhhfccz <hjk1938927583@163.com>

This PR allows users to set the logging level without providing a logger config. The previous implementation hard-coded `logging.INFO` as the default logging level and required a logger config to change it. Now the logging level and handlers can be inherited from the current `tvm.meta_schedule` logger settings.
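The inheritance described above matches how Python's standard `logging` hierarchy behaves. The sketch below uses only the standard library. The child logger name and `ListHandler` are made up for illustration, and whether TVM relies on exactly this mechanism is an assumption.

```python
import logging


class ListHandler(logging.Handler):
    """Illustrative handler that stores formatted messages in a list."""

    def __init__(self):
        super().__init__()
        self.messages = []

    def emit(self, record):
        self.messages.append(record.getMessage())


# Configure only the parent logger named in the commit message.
parent = logging.getLogger("tvm.meta_schedule")
parent.setLevel(logging.DEBUG)
handler = ListHandler()
parent.addHandler(handler)

# Hypothetical child logger: it sets no level and no handlers of its own.
child = logging.getLogger("tvm.meta_schedule.example_task")
inherited_level = child.getEffectiveLevel()

# The record passes the inherited DEBUG level and propagates to the parent's handler.
child.debug("tuning started")
```

Because the child logger leaves its level at `NOTSET`, `getEffectiveLevel()` walks up to the parent, and records propagate to the parent's handlers without any per-logger configuration.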

masahi self-assigned this Apr 21, 2022

Sergey Shtin and others added 29 commits May 17, 2022 15:31

1. Rebased concatenation optimization.

afd73c4

2. Workaround to resolve issue with tvm.tir.expr.SizeVar added

Workaround that allows the concat layer to be fused with others.

a7aa581

Lint fix.

07a03e4

More lint fixes.

55ff48c

One more lint fix.

0029426

lint fix

62ac41d

lint fix.

f32d6d8

lint fix.

8274d69

Fix to avoid const_vector object inlining.

3e55247

lint fix

da0ee4d

another lint fix.

e155b5f

Workaround to create injectable extern op.

3bf122e

utils.py code restored.

11ae4f1

Comments added and some code cleanup.

f74425f

[QNN] Support input scale and zp of 1-element vector in qnn.conv2d_tr…

fcb16dd

…anspose (apache#10952)

* Support input scale and zp of 1-element vector in qnn.conv2d_transpose
* Lint

[Frontend][ONNX]support Pool2D layout is CHW (apache#11034)

d3a9c21

* support Pool layout is CHW
* fix lint test
* change the if condition

Add FlattenAtrousConv pass into the default optimize pipeline. (apach…

e1396ad

…e#11077)

Add two possible missing visit of let stmt in lowering (apache#11079)

821f00e

Refer to the issue in apache#10831 (comment)

[RPC] Don't use existence of USE_HEXAGON_SDK as enablement check (apa…

7823673

…che#11080)

* [RPC] Don't use existence of USE_HEXAGON_SDK as enablement check

  Use USE_HEXAGON to check if Hexagon support is enabled or not. This fixes apache#11059.

* Restart CI

Restart popen pool. (apache#11074)

13c44b8

Retrigger CI. Address issues. Retrigger CI.

[CI] Update GPU image for oneflow v0.7 (apache#11085)

7c10dfb

[FQ2I] Add log op to FQ2I (apache#10924)

a9114c9

* unary op for resize2d and test
* renamed test
* added log in quantized form
* black'd some files
* changed suggested commentary

[microNPU] Integrate rolling buffers in Arm(R) Ethos(TM)-U (apache#10344

863e01e

)

* [microNPU] Integrate rolling buffers in Arm(R) Ethos(TM)-U

  Change-Id: Iede5e68981a063f6eb1e118433cc2c92e175af52

* Add documentation for create_tiles
* Fix linter issues
* Fix integration tests

csullivan and others added 21 commits May 17, 2022 15:31

Avoid use of MemoryInfo when undefined in StorageRewrite (apache#11254)

da85040

* Check if the requested memory info is defined before using it.
* Address review comment to add warning when MemoryInfo for scope is undefined.

[Hexagon][Docker]Add HEXAGON_SDK_ROOT ENV variable (apache#11291)

6757826

[build][hexagon] Respect x86 C/C++ compiler choice (apache#11312)

e346808

- Fix issue where `CMAKE_C[XX]_COMPILER` isn't propagated into the build configuration for `x86_tvm_runtime_rpc`.

[Hexagon] Add USMP tests (apache#11279)

2ab9be0

* Add USMP tests
* Address Chris comments
* Address Chris comment on assert
* trigger

[ci][docker] Update images to include sccache changes (apache#11314)

1112c9d

[PYTORCH] [FRONTEND] torch.bool support for data type conversion (apa…

ca92911

…che#11290)

* [FRONTEND][PYTORCH] Support for nn.SiLU added
* torch.bool added to torch convert_torch_dtype_map

[TVMScript] Represent ramp as index slice (apache#11308)

9ad8e17

* support represent ramp as index slice in tvmscript
* fix testcase's comment, check slice lanes instead of extent
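A ramp with a given base, stride, and number of lanes selects the indices base, base + stride, and so on, one per lane. The sketch below shows, in plain Python, how such a ramp corresponds to an index slice and how the lane count can be recovered from the slice. The function names are hypothetical, and the exact slice form that TVMScript prints is an assumption; only the arithmetic is illustrated, for a positive stride.

```python
def ramp_indices(base: int, stride: int, lanes: int) -> list:
    """Indices selected by a ramp: base, base + stride, ... (lanes entries)."""
    return [base + i * stride for i in range(lanes)]


def ramp_to_slice(base: int, stride: int, lanes: int) -> slice:
    """Equivalent Python slice, valid for a positive stride."""
    return slice(base, base + stride * lanes, stride)


def slice_lanes(index: slice, length: int) -> int:
    """Number of elements the slice selects from a sequence of this length."""
    return len(range(*index.indices(length)))


data = list(range(100, 120))
picked_by_ramp = [data[i] for i in ramp_indices(2, 3, 4)]
picked_by_slice = data[ramp_to_slice(2, 3, 4)]
lanes = slice_lanes(ramp_to_slice(2, 3, 4), len(data))
```

Both selections pick the same four elements, and the lane count recovered from the slice equals the ramp's lane count.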

[ONNX] Fix cast op to/from bfloat16 (apache#11171)

a005676

* fix cast from bfloat16
* fix cast to bfloat16 test as well
* clean up comments
* lint
* add comment

Co-authored-by: Margaret Qian <mqian@octoml.ai>

[BugFix][Topi] Fix 'duplicated iterator names in the compute definiti…

1faca66

…on' bug of roi_align (apache#11322)

adding ramana to reviewers list (apache#11311)

65921bb

[frontend][ONNX]support ConvTranspose explicitly specified output_sha…

c27d933

…pe (apache#11076)

* support ConvTranspose explicitly specified output_shape
* fix unit test case
* fix lint test
* retest
* fix code error
* fix lint test
* update test
* retest
* fix test onnx official tests

Missed out_layout field of conv1d attrs (apache#11325)

f1737f1

Signed-off-by: Alexander Peskov <peskovnn@gmail.com>

[TIR] Propagate storage scope of undefined vars in SplitHostDevice. (a…

d94bfbf

…pache#11255)

* [TIR] Propagate storage scope of undefined vars in SplitHostDevice.
* Test global.texture for input, output, and intermediate buffers.

Add vlogging for type-table registration. (apache#11041)

db7c9f0

[build][hexagon] fix several compiler warnings (apache#11245)

188b8eb

[TIR] Support affine expressions as indices in reverse compute inline (…

4dba7cc

…apache#11317)

* [TIR] Support affine expressions as indices in reverse compute inline
* fix trivial iterators

[TIR] Simplify indices in layout transform (apache#11330)

2c50631

Co-authored-by: Yuanjing Shi <yuanjing@octoml.ai>

[CMSIS-NN] Align CMSIS-NN in TVM to TFLu SHA (apache#11273)

5a2cfea

shtinsa force-pushed the sshtin/concat_opt_rebased branch from 705f65e to 5a2cfea Compare May 17, 2022 12:51

Sergey Shtin added 5 commits May 17, 2022 15:59

linter fix.

4efcdf6

another lint fix.

965e32b

Comment fix.

0a85689

merge conflict fix.

9bf4acb

Lint fix.

5db2d6e

areusch added and removed the needs-triage label Oct 19, 2022

masahi closed this Dec 8, 2022

shtinsa wants to merge 204 commits into apache:main from Deelvin:sshtin/concat_opt_rebased

20 participants
