
1. Rebased concatenation optimization. #11087

Closed
shtinsa wants to merge 204 commits into apache:main from Deelvin:sshtin/concat_opt_rebased


shtinsa (Contributor) commented Apr 21, 2022

  1. Added a workaround to resolve an issue with tvm.tir.expr.SizeVar

Thanks for contributing to TVM! Please refer to guideline https://tvm.apache.org/docs/contribute/ for useful information and tips. After the pull request is submitted, please request code reviews from Reviewers by @ them in the pull request thread.

@masahi masahi self-assigned this Apr 21, 2022
Sergey Shtin and others added 29 commits May 17, 2022 15:31
2. Added a workaround to resolve an issue with tvm.tir.expr.SizeVar
Add the `configs` directory to the installed version of TVM in the
setuptools configuration, and introduce a new function that loads the
`configs` directory from the right path both when TVM is installed
locally for development and when it is installed as a package.
* Fix a bug so that microTVM can be used with Arduino version v0.20 and
above (see changes to _parse_connected_boards), and add relevant unit
tests.

* Only perform the version check when calling build or flash (things that
actually require arduino-cli), and add relevant unit tests.

* Only raise a warning if the arduino-cli version present is below the
min version (previously any version other than v0.18 would cause an
error).

* Change version comparison to use version.check, like the rest of TVM
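The "warn instead of error" behavior described in this commit message can be illustrated with a minimal sketch. This is not the microTVM code: the function names and the minimum version constant are hypothetical, and the minimum of 0.18.0 is only an assumption based on the "v0.18" mentioned above. It uses plain tuple comparison rather than whatever helper TVM actually uses.

```python
import warnings

# Assumed minimum version, inferred from the "v0.18" mentioned in the commit message.
MIN_ARDUINO_CLI_VERSION = (0, 18, 0)


def parse_version(text):
    """Turn 'v0.20.2' or '0.20' into a 3-element tuple of ints (hypothetical helper)."""
    parts = [int(part) for part in text.lstrip("v").split(".")]
    # Pad short versions such as '0.20' so tuple comparison behaves as expected.
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts[:3])


def check_arduino_cli_version(found, minimum=MIN_ARDUINO_CLI_VERSION):
    """Warn, rather than raise, when the installed version is below the minimum."""
    if parse_version(found) < minimum:
        warnings.warn(f"arduino-cli {found} is older than the minimum supported version")
        return False
    return True
```

In this sketch a newer release such as `v0.20.2` passes silently, while an older one only produces a warning and lets the caller decide what to do.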
…anspose (apache#10952)

* Support input scale and zp of 1-element vector in qnn.conv2d_transpose

* Lint
* support Pool with CHW layout

* fix lint test

* change the if condition
Co-authored-by: Siyuan Feng <Hzfengsy@sjtu.edu.cn>
Co-authored-by: Bohan Hou <32121147+spectrometerHBH@users.noreply.github.com>
Co-authored-by: Hongyi Jin <3231950289@qq.com>
Co-authored-by: Ruihang Lai <lairuihangdongdong@qq.com>
Co-authored-by: Wuwei Lin <wuwei@apache.org>

…che#11080)

* [RPC] Don't use existence of USE_HEXAGON_SDK as enablement check

Use USE_HEXAGON to check if Hexagon support is enabled or not.

This fixes apache#11059.

* Restart CI
Retrigger CI.

Address issues.

Retrigger CI.
Previously, the size of the memory to allocate was calculated as
width multiplied by height. This does not work well when one texture
is large in height and the next one is large in width. We tried to
reuse the allocated memory, so every time the next large texture was
used, the previous one was reallocated. This had a huge impact on
performance.
Now the two dimensions are checked independently, which avoids the
cyclic memory reallocation.
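To make the idea of checking the two dimensions independently concrete, here is a small Python sketch. It is an illustration only, not the TVM runtime code: the `TexturePool` class and its fields are hypothetical, and it only counts when a reallocation would be needed.

```python
class TexturePool:
    """Hypothetical pool that tracks the allocated width and height separately."""

    def __init__(self):
        self.width = 0
        self.height = 0
        self.reallocations = 0

    def request(self, width, height):
        """Grow the allocation only when a dimension exceeds what is already held."""
        if width > self.width or height > self.height:
            # Keep the maximum seen in each dimension so later requests fit.
            self.width = max(self.width, width)
            self.height = max(self.height, height)
            self.reallocations += 1
        return (self.width, self.height)


pool = TexturePool()
for _ in range(5):
    pool.request(1024, 8)   # wide texture
    pool.request(8, 1024)   # tall texture
```

With alternating wide and tall requests, the sketch reallocates only for the first two requests; after that the allocation covers both shapes and is reused.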
* unary op for resize2d and test

* renamed test

* added log in quantized form

* black'd some files

* changed suggested commentary
…11060)

* [TVMScript] Allow `val = buf[index]` without type annotation

Other instances of `var = expr` were previously allowed without
requiring a type annotation, by using the dtype of the expression as
the dtype of `var`.  This behavior didn't work for `buf[index]`
expressions, which are internally represented as `BufferSlice` python
objects, and only converted to `BufferLoad` primexprs when used as an
expression.

This commit adds a `dtype` property to `BufferSlice`, allowing
`buf[index]` to be used in a let statement without a type annotation.

* Reverted a wider change

Automatically adding a type annotation to Var if it could be
determined from the dtype let the unit test directly compare the
annotated and unannotated versions of buffer load.  Unfortunately, it
also broke 54 unrelated tests, so that change is removed from this PR.
* [TIR] StmtFunctor RenewDefs

In this PR, I introduce a StmtFunctor `RenewDefs` that deep-copies all definition nodes in a PrimFunc (including Var, Buffer, and IterVar). This functor can create a new PrimFunc with the same behavior as the old one but containing different nodes.

This functor may help with TIR fusion or with inlining multiple PrimFuncs

* add ut

* address comments

* address comments

* lint

* lint
)

* [microNPU] Integrate rolling buffers in Arm(R) Ethos(TM)-U

Change-Id: Iede5e68981a063f6eb1e118433cc2c92e175af52

* Add documentation for create_tiles

* Fix linter issues

* Fix integration tests
csullivan and others added 21 commits May 17, 2022 15:31
* Check if the requested memory info is defined before using it.

* Address review comment to add warning when MemoryInfo
for scope is undefined.
* [ROOFLINE] Roofline analysis over RPC

Run roofline analysis on remote devices if requested. Peak flops and
peak bandwidth estimation are done on the remote device.

* allocate testing arrays directly on device and randomly fill

* forgot to include remote

* lower flops ratio, machine may be using multiple threads

* forgot fill
- Fix issue where `CMAKE_C[XX]_COMPILER` isn't propagated
  into the build configuration for `x86_tvm_runtime_rpc`.
* Add USMP tests

* Address Chris comments

* Address Chris comment on assert

* trigger
…che#11290)

* [FRONTEND][PYTORCH] Support for nn.SiLU added

* torch.bool added to torch convert_torch_dtype_map
* add relay.f.frontend.fm_oneflow support cnns

* support cuda

* fix mobilenetv2 and reviews

* fix: model without meta info

* support eager and yolo, add test

* fix: license

* add: tutorials

* fix: support new graph

* fix some comments

* refine

* fix concat op convert bug

* refine

* refine

* change cuda to cpu

* fix bug

* fix ci error in tvm

* fix pylint check

* delete useless file

* add skimage package in docker

* fix ci error

* fix bug

* add oneflow frontend test in ci

* merge conflict

* fix tutorial

* try to find error in ci

* revert

* merge conflict

* black oneflow

* Delete from_oneflow.py

* restructure oneflow frontend

* support vision-transformer

* black format

* update black version and reformat

* fix ci error

* fix doc error

* fix failed gpu frontend test

Co-authored-by: hhhfccz <hjk1938927583@163.com>
This PR allows users to set the logging level without giving a logger config. The previous implementation hard-coded `logging.INFO` as the default logging level and required a logger config to change it. Now the logging level and handlers can be inherited from the current `tvm.meta_schedule` logger setting.
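The inheritance described here matches how Python's standard `logging` module treats child loggers. The sketch below shows only that standard-library behavior, not the meta_schedule implementation; the child logger name `tvm.meta_schedule.task_0` is a hypothetical example.

```python
import logging

# Configure the parent logger once; no separate logger config is passed around.
parent = logging.getLogger("tvm.meta_schedule")
parent.setLevel(logging.DEBUG)

# Hypothetical child logger name. Its own level stays NOTSET, so the
# effective level is looked up from the nearest configured ancestor.
child = logging.getLogger("tvm.meta_schedule.task_0")
effective = child.getEffectiveLevel()
```

Because the child logger leaves its level unset, changing the level on `tvm.meta_schedule` is enough to change what the child emits, and records still propagate to the parent's handlers.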
* support represent ramp as index slice in tvmscript

* fix testcase's comment, check slice lanes instead of extent
* fix cast from bfloat16

* fix cast to bfloat16 test as well

* clean up comments

* lint

* add comment

Co-authored-by: Margaret Qian <mqian@octoml.ai>
…pe (apache#11076)

* support ConvTranspose explicitly specified output_shape

* fix unit test case

* fix lint test

* retest

* fix code error

* fix lint test

* update test

* retest

* fix test onnx official tests
Signed-off-by: Alexander Peskov <peskovnn@gmail.com>
…pache#11255)

* [TIR] Propagate storage scope of undefined vars in SplitHostDevice.

* Test global.texture for input, output, and intermediate buffers.
…apache#11317)

* [TIR] Support affine expressions as indices in reverse compute inline

* fix trivial iterators
Co-authored-by: Yuanjing Shi <yuanjing@octoml.ai>

@shtinsa shtinsa force-pushed the sshtin/concat_opt_rebased branch from 705f65e to 5a2cfea Compare May 17, 2022 12:51
@areusch areusch added needs-triage PRs or issues that need to be investigated by maintainers to find the right assignees to address it and removed needs-triage PRs or issues that need to be investigated by maintainers to find the right assignees to address it labels Oct 19, 2022
@masahi masahi closed this Dec 8, 2022