-
Notifications
You must be signed in to change notification settings - Fork 760
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Feature opencl kernel opt merge (#672)
* [OPENCL][INIT] init for winograd * [OPENCL][WINOGRAD] add simple kernel for winograd * [OPENCL][WINOGRAD] fix opencl compile error * [OPENCL][WINOGRAD] fix opencl winograd enqueue error * [OPENCL][WINOGRAD] fix opencl winograd some result error * [OPENCL][WINOGRAD] fix opencl kernel error * [ARM][WINOGRAD] fix arm winograd compile error * [OPENCL][WINOGRAD] fix opencl logic error * [OPENCL][WINOGRAD] fix winograd error * [OPENCL][WINOGRAD] opencl winograd debug * [OPENCL][WINOGRAD] support tensorform inner * Fix typo. (#590) Co-authored-by: E00216 <hcxiong@birentech.com> * [BUG] fix layer resource count error when packing model (#592) Co-authored-by: lucasktian <lucasktian@tencent.com> * [OPENCL][WINOGRAD] filter for some case, need opt later * Fix typo. (#594) * [OPENCL][WINOGRAD] support innerproduct 4*4 * [OPENCL][WINOGRAD] opt for winograd some case * [OPENCL][WINOGRAD] add transform outer * [OPENCL][WINOGRAD] opt for winograd * [CORE][TUNE] support tune kernel and tune opencl * [OPENCL][WINOGRAD] refactor for winograd * [OPENCL][WINOGRAD] fix compile error * [OPENCL][WINOGRAD] add double check * [OPENCL][WINOGRAD] support more cases * [BENCHMARK][MODEL] add densenet and inception v4 model * [OPENCL][CACHE] store opencl tune to cache * [OPENCL][CACHE] add mutex for multithread * [CACHE][OPT] add cache prefix * [CACHE][REFACTOR] generate cache file name * [ANDROID] TNN Benchmark Tools support APP * [ANDROID] add CMakeLists in benchmark app * [ANDROID] update gradle version for linux support * [ANDROID] benchmark app docs add * [ANDROID] modify doc * [ANDROID] remove redundant lib in benchmark app * [benchmark][model] rename shufflenet_v2 * [ANDROID] modify benchmark model script && update doc * [OPENCL][TUNE] support tune softmax and pooling layer * [TUNE][OPT] opt local tune sleep * [BENCHMARK][TEST] prevent overheating and frequency reduction * [ANDROID] remove redundant debug log * [BNECHMARK][APP]解决benchmark 输出异常 * [DOC][BENCHMARK] update benchmark doc * remove redundant log output * [BENCHMARK][UNINSTALL] unisntall benchmark app * [ANDROID] update android sdk download link * fix xcode build error * [DEVICE][ARM] fix include error * [BENCHMARK][APP] support load so from external * [BENCHMARK][FIX] fix benchmark app error * [BENCHMARK][APP] 支持传递参数 * [BENCHMARK][APP] support huawei_npu * [BENCHMARK][APP] 模型通过参数传入 * [BENCHMARK][APP] remove unused * [BENCHMARK][APP] remove log and set target sdk version * [BENCHMARK][APP] remove unused * [BENCHMARK][APP] fix benchmark backgroud error * update benchmark model script * [OPENCL] optimize conv perf with local mem && depthwise mix * [OPENCL] fix conv activation fuse && fix conv wb1 kernel * [BENCHMARK][APP] fix export singed app error * [BENCHMARK][APP] catch launch benchmark app from unexpected intent exception * [BENCHMARK][APP] support benchmark in app or shell * [BENCHMARK][APP] remove unused * [OPENCL] fix conv opt for opencl * [BENCHMARK][APP] profiling only in shell for print * [BENCHMARK][APP] update doc and change benchmark shell logic * [BENCHMARK][APP] fix logcat latency * [BENCHMARK][APP] fix benchmark app some case * fix benchmark result overlap * fix merge bug * [OPENCL] remove buffer part in depthwise conv * [OPENCL] remove logic with buffer in depthwise conv Co-authored-by: neiltian <neiltian@tencent.com> Co-authored-by: Xionghc <xionghcx@gmail.com> Co-authored-by: E00216 <hcxiong@birentech.com> Co-authored-by: ShaunDai <66760945+shaundai-tencent@users.noreply.github.com> Co-authored-by: lucasktian <lucasktian@tencent.com> Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
- Loading branch information
1 parent
0003ac9
commit aedc6c8
Showing
9 changed files
with
270 additions
and
35 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.