create layer decoupled #5258

Merged: 19 commits, Jan 5, 2024


@nihui nihui commented Jan 3, 2024

  • vulkan layer lightmode
  • net decouple layer creation
  • binary size / memory usage
| libncnn.a binary size (with vulkan) | baseline | pr5258 | diff (avg = -13.89%) |
|---|---:|---:|---:|
| x86-64 | 21944262 | 18745200 | -14.58% |
| android arm64 | 11724106 | 10175458 | -13.21% |


| cpu rss (kb) | baseline | pr5258 | diff (avg = -3.53%) |
|---|---:|---:|---:|
| squeezenet | 34992.00 | 34416.00 | -1.65% |
| squeezenet_int8 | 32592.00 | 31840.00 | -2.31% |
| mobilenet | 43388.00 | 43352.00 | -0.08% |
| mobilenet_int8 | 21868.00 | 19852.00 | -9.22% |
| mobilenet_v2 | 51580.00 | 50800.00 | -1.51% |
| mobilenet_v3 | 39704.00 | 38792.00 | -2.30% |
| shufflenet | 26824.00 | 26112.00 | -2.65% |
| shufflenet_v2 | 26668.00 | 25952.00 | -2.68% |
| mnasnet | 46096.00 | 44528.00 | -3.40% |
| proxylessnasnet | 41124.00 | 40360.00 | -1.86% |
| efficientnet_b0 | 57440.00 | 56044.00 | -2.43% |
| efficientnetv2_b0 | 66924.00 | 65204.00 | -2.57% |
| regnety_400m | 46948.00 | 44244.00 | -5.76% |
| blazeface | 16720.00 | 15644.00 | -6.44% |
| googlenet | 114136.00 | 112572.00 | -1.37% |
| googlenet_int8 | 87696.00 | 86916.00 | -0.89% |
| resnet18 | 155212.00 | 149688.00 | -3.56% |
| resnet18_int8 | 118564.00 | 115276.00 | -2.77% |
| alexnet | 182660.00 | 182284.00 | -0.21% |
| vgg16 | 572444.00 | 567908.00 | -0.79% |
| vgg16_int8 | 473256.00 | 382072.00 | -19.27% |
| resnet50 | 241888.00 | 229904.00 | -4.95% |
| resnet50_int8 | 160424.00 | 156684.00 | -2.33% |
| squeezenet_ssd | 101176.00 | 100044.00 | -1.12% |
| squeezenet_ssd_int8 | 112880.00 | 112984.00 | 0.09% |
| mobilenet_ssd | 78540.00 | 69328.00 | -11.73% |
| mobilenet_ssd_int8 | 34236.00 | 32484.00 | -5.12% |
| mobilenet_yolo | 99036.00 | 89332.00 | -9.80% |
| mobilenetv2_yolov3 | 80652.00 | 80284.00 | -0.46% |
| yolov4-tiny | 186128.00 | 182508.00 | -1.94% |
| nanodet_m | 30632.00 | 30432.00 | -0.65% |
| yolo-fastest-1.1 | 24564.00 | 23400.00 | -4.74% |
| yolo-fastestv2 | 26900.00 | 25804.00 | -4.07% |
| vision_transformer | 299852.00 | 303508.00 | 1.22% |
| FastestDet | 34124.00 | 32688.00 | -4.21% |


| gpu rss (kb) | baseline | pr5258 | diff (avg = -22.23%) |
|---|---:|---:|---:|
| squeezenet | 109164.00 | 89248.00 | -18.24% |
| mobilenet | 117636.00 | 88984.00 | -24.36% |
| mobilenet_v2 | 111852.00 | 94752.00 | -15.29% |
| mobilenet_v3 | 125496.00 | 96564.00 | -23.05% |
| shufflenet | 98732.00 | 95128.00 | -3.65% |
| shufflenet_v2 | 106896.00 | 97336.00 | -8.94% |
| mnasnet | 135364.00 | 97796.00 | -27.75% |
| proxylessnasnet | 128104.00 | 95652.00 | -25.33% |
| efficientnet_b0 | 139084.00 | 100968.00 | -27.41% |
| efficientnetv2_b0 | 168192.00 | 129652.00 | -22.91% |
| regnety_400m | 126904.00 | 104784.00 | -17.43% |
| blazeface | 85328.00 | 83480.00 | -2.17% |
| googlenet | 221656.00 | 146292.00 | -34.00% |
| resnet18 | 323740.00 | 175576.00 | -45.77% |
| alexnet | 377900.00 | 241324.00 | -36.14% |
| vgg16 | 976664.00 | 377824.00 | -61.31% |
| resnet50 | 541896.00 | 337712.00 | -37.68% |
| squeezenet_ssd | 269428.00 | 213320.00 | -20.82% |
| mobilenet_ssd | 159888.00 | 120412.00 | -24.69% |
| mobilenet_yolo | 150884.00 | 116480.00 | -22.80% |
| mobilenetv2_yolov3 | 130956.00 | 113940.00 | -12.99% |
| yolov4-tiny | 334868.00 | 215836.00 | -35.55% |
| nanodet_m | 101112.00 | 95156.00 | -5.89% |
| yolo-fastest-1.1 | 96280.00 | 93792.00 | -2.58% |
| yolo-fastestv2 | 95436.00 | 92888.00 | -2.67% |
| vision_transformer | 839344.00 | 520232.00 | -38.02% |
| FastestDet | 95528.00 | 92832.00 | -2.82% |
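
For readers who want to check the figures, here is a minimal sketch of how the diff column appears to be derived. It assumes each diff is the relative change from baseline to pr5258, and that the "avg" in each header is the arithmetic mean of the per-row diffs. Both assumptions reproduce the binary size numbers, but the PR does not state the method. The function name `pct_diff` and the `binary_size` dictionary are illustrative only.

```python
def pct_diff(baseline: float, new: float) -> float:
    """Relative change from baseline to new, in percent (negative = smaller)."""
    return (new - baseline) / baseline * 100.0


# libncnn.a binary size rows copied from the PR description
binary_size = {
    "x86-64": (21944262, 18745200),
    "android arm64": (11724106, 10175458),
}

# Per-platform relative change, then the unweighted mean of those changes
diffs = {name: pct_diff(base, new) for name, (base, new) in binary_size.items()}
average = sum(diffs.values()) / len(diffs)

for name, value in diffs.items():
    print(f"{name}: {value:.2f}%")
print(f"avg: {average:.2f}%")
```

The same function can be applied to any row of the rss figures, for example `pct_diff(976664.00, 377824.00)` for vgg16 on gpu.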

codecov-commenter commented Jan 3, 2024

Codecov Report

Attention: 20 lines in your changes are missing coverage. Please review.

Comparison is base (92d49e1) 94.38% compared to head (8813cb7) 94.35%.
Report is 2 commits behind head on master.

| Files | Patch % | Lines |
|---|---:|---|
| src/layer.cpp | 86.48% | 20 Missing ⚠️ |

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #5258      +/-   ##
==========================================
- Coverage   94.38%   94.35%   -0.03%     
==========================================
  Files         774      777       +3     
  Lines      243593   243749     +156     
==========================================
+ Hits       229916   229997      +81     
- Misses      13677    13752      +75     


@nihui changed the title from "[WIP] create layer decoupled" to "create layer decoupled" on Jan 5, 2024
@nihui merged commit 556b79c into Tencent:master on Jan 5, 2024
108 checks passed