Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[OPENCL] optimize pooling with fine-grained #553

Merged
merged 1 commit into from
Nov 14, 2020

Conversation

lnmdlong
Copy link
Collaborator

@lnmdlong lnmdlong commented Nov 14, 2020

使用local memory优化OpenCL Pooing

速度测试结果:https://docs.qq.com/sheet/DVXFndE5uek10bkVx

@codecov-io
Copy link

codecov-io commented Nov 14, 2020

Codecov Report

Merging #553 (cbc364e) into master (8a8e2f3) will not change coverage.
The diff coverage is n/a.

Impacted file tree graph

@@           Coverage Diff           @@
##           master     #553   +/-   ##
=======================================
  Coverage   24.56%   24.56%           
=======================================
  Files         287      287           
  Lines        9047     9047           
=======================================
  Hits         2222     2222           
  Misses       6825     6825           

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 8a8e2f3...cbc364e. Read the comment docs.

@neiltian-tencent neiltian-tencent merged commit fb12c8c into master Nov 14, 2020
@neiltian-tencent neiltian-tencent deleted the feature_opencl_pooling_opt branch November 14, 2020 10:24
seanxcwang added a commit that referenced this pull request Nov 14, 2020
* [OPTIMIZER][OPT] fuse conv sigmoid mul

* [ARM][OPT] 1.conv add swish post act 2.modify deconv condition 3.opt sigmoid 4.unfold binary op

* [OPENCL][OPT] support fuse sigmoid mul

* [Metal] support sigmoid_mul activation
[Test] test different activation types in conv and deconv unit test

* [OPTIMIZE][OPT] refactor for fuse conv sigmoid mul

* [ARM][OPT] fix arm conv1x1 swish fuse error

* [OPENCL] optimize pooling with fine-grained parallelsim (#553)

* [iOS] fix tnn xcodeproj

Co-authored-by: seanxcwang <seanxcwang@tencent.com>
Co-authored-by: devandong <devandong@tencent.com>
Co-authored-by: lnmdlong <lnmdlong@hotmail.com>
darrenyao87 added a commit that referenced this pull request Nov 16, 2020
* master:
  [ADD]support ShareCommandQueue between instances (#556)
  Hotfix opencl select (#555)
  Eff opt (#554)
  [OPENCL] optimize pooling with fine-grained parallelsim (#553)

# Conflicts:
#	platforms/ios/tnn.xcodeproj/project.pbxproj
lnmdlong pushed a commit that referenced this pull request Sep 2, 2021
* master: (55 commits)
  [OPT] optimize memory useage of instance
  [ATLAS][OPT] reduce model string copy
  fix spell error (#564)
  [OPENCL][FIX] fix share opencl context (#563)
  add missing parameter in CreateContext (#551)
  [ADD]support ShareCommandQueue between instances (#556)
  Hotfix opencl select (#555)
  Eff opt (#554)
  [OPENCL] optimize pooling with fine-grained parallelsim (#553)
  create issue templates (#549)
  [OPENCL] fix fp16 select with short condition (#544)
  [Metal] enable CPU N8UC4 Mat in metal ConvertFromMat (#543)
  Add fuse SpaceToDepth and DepthToSpace (#542)
  [TOOL][ADD] add output name param support (#537)
  [ATLAS][BUG] allocate context for each weight memory avoid device mismatch
  [ATLAS][BUG] fix weights buffer free error
  [CONVERTER][BUG] fix the bug of fuse conv (#529)
  [ATLAS][BUG] fix compile error due to api change
  [UPD] update iOS&macos building scripts, check building errors (#531)
  [Metal] fix reshape out-of-bound access bug (#496)
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants