
[WIP] New int8 implementation, better accuracy #749

Merged: 24 commits merged into Tencent:master on Mar 5, 2019

Conversation


@BUG1989 (Contributor) commented Jan 10, 2019

It's a WIP.
I find that quantizing the weight data per output channel (split by outch) gives better accuracy, so it needs some changes.
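As an illustration of the idea, here is a minimal sketch of per-output-channel (per-outch) symmetric weight quantization, assuming each output channel gets its own scale of 127 / max|w|. The flat layout (one contiguous block of inch*kh*kw floats per output channel) and the function name are made up for the example, not the actual ncnn code.

```cpp
// Hypothetical sketch: quantize conv weights with one scale per output channel.
// Assumes symmetric quantization (scale = 127 / absmax) and a flat layout
// where each output channel owns a contiguous block of size_per_outch floats.
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <vector>

void quantize_weight_per_outch(const float* weight, int outch, int size_per_outch,
                               std::vector<int8_t>& weight_int8,
                               std::vector<float>& scales)
{
    weight_int8.resize((size_t)outch * size_per_outch);
    scales.resize(outch);

    for (int oc = 0; oc < outch; oc++)
    {
        const float* w = weight + (size_t)oc * size_per_outch;

        // per-channel scale from that channel's absolute maximum
        float absmax = 0.f;
        for (int i = 0; i < size_per_outch; i++)
            absmax = std::max(absmax, std::fabs(w[i]));

        const float scale = (absmax > 0.f) ? 127.f / absmax : 1.f;
        scales[oc] = scale;

        // quantize and clamp to the int8 range
        for (int i = 0; i < size_per_outch; i++)
        {
            int q = (int)std::lround(w[i] * scale);
            q = std::min(127, std::max(-127, q));
            weight_int8[(size_t)oc * size_per_outch + i] = (int8_t)q;
        }
    }
}
```

A single scale for the whole weight blob is dominated by the largest weight anywhere in the layer; splitting by outch lets channels with small weights keep more of the 8-bit range, which is where the accuracy gain comes from.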

Better Accuracy

| Models | fp32 | int8 | diff |
| --- | --- | --- | --- |
| squeezenet_v1_1 (Top1) | 57.78 | 57.82 | +0.04 |
| mobilenet_v1 (Top1) | 67.26 | 66.74 | -0.52 |
| resnet18 (Top1) | 65.49 | 65.30 | -0.19 |
| googlenet_v1 (Top1) | 68.50 | 68.62 | +0.12 |
| resnet50 (Top1) | 71.80 | 71.76 | -0.04 |
| mobilenet_v1_ssd (mAP) | 70.23 | 68.68 | -1.55 |
| squeezenet_v1_ssd (mAP) | 61.80 | 61.27 | -0.53 |

I have implemented int8 winograd F(2,3); it has the same accuracy as the original int8 conv3x3s1 : )
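For reference, here is a minimal 1D sketch of the F(2,3) arithmetic in integer form: two outputs of a 3-tap convolution from four inputs with four multiplications. It assumes the common trick of scaling the kernel transform by 2 so the 1/2 entries of G stay integral, then dividing back at the end; it only illustrates the math, not this PR's NEON kernels.

```cpp
// 1D Winograd F(2,3): two outputs of a 3-tap convolution from four inputs
// with four multiplications instead of six. The kernel transform below is
// 2*G so the usual 1/2 entries of G stay integral; the final division by 2
// is exact because the true outputs are integers.
#include <cstdint>

void winograd_f23_int8_1d(const int8_t d[4], const int8_t g[3], int32_t y[2])
{
    // input transform B^T * d
    int16_t t0 = d[0] - d[2];
    int16_t t1 = d[1] + d[2];
    int16_t t2 = d[2] - d[1];
    int16_t t3 = d[1] - d[3];

    // kernel transform (2*G) * g, kept in integers
    int16_t k0 = 2 * g[0];
    int16_t k1 = g[0] + g[1] + g[2];
    int16_t k2 = g[0] - g[1] + g[2];
    int16_t k3 = 2 * g[2];

    // element-wise products in the transformed domain
    int32_t m0 = t0 * k0;
    int32_t m1 = t1 * k1;
    int32_t m2 = t2 * k2;
    int32_t m3 = t3 * k3;

    // output transform A^T * m, then undo the 2x kernel scaling
    y[0] = (m0 + m1 + m2) / 2;  // equals d0*g0 + d1*g1 + d2*g2
    y[1] = (m1 - m2 - m3) / 2;  // equals d1*g0 + d2*g1 + d3*g2
}
```

The 2D F(2,3) used for conv3x3s1 nests this transform over the rows and columns of 4x4 input tiles, producing a 2x2 output tile per 16 multiplications.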

Faster Inference

Platform: Hisi3519 (Cortex-A17 @ 880 MHz)

Unit: ms

| Models | fp32 | int8 |
| --- | --- | --- |
| squeezenet_v1_1 | 282 | 204 |
| mobilenet_v1 | 490 | 369 |
| mobilenet_v1_ssd | 970 | 618 |
| squeezenet_v1_ssd | 610 | 560 |
| resnet18 | 985 | 648 |
| googlenet_v1 | 1107 | 785 |

Runtime Memory (MB)

| Models | fp32 | int8 |
| --- | --- | --- |
| squeezenet_v1_1 | 50 | 30 |
| mobilenet_v1 | 61 | 35 |
| mobilenet_v1_ssd | 90 | 45 |
| squeezenet_v1_ssd | 210 | 70 |
| resnet18 | 335 | 77 |
| googlenet_v1 | 154 | 72 |

Storage Memory (MB)

| Models | fp32 | int8 |
| --- | --- | --- |
| squeezenet_v1_1 | 4.71 | 1.20 |
| mobilenet_v1 | 16.3 | 4.31 |
| mobilenet_v1_ssd | 22.0 | 5.60 |
| squeezenet_v1_ssd | 21.1 | 5.37 |
| resnet18 | 44.6 | 11.2 |
| googlenet_v1 | 26.6 | 6.72 |

New convert tool
x86-simulator
  • squeezenet_v1_1
  • mobilenet_v1
  • resnet18
  • googlenet_v1
  • mobilenet_v1_ssd
  • squeezenet_v1_ssd
arm
  • squeezenet_v1_1
  • mobilenet_v1
  • resnet18
  • googlenet_v1
  • mobilenet_v1_ssd
  • squeezenet_v1_ssd

New Features

x86 simulator
  • conv3x3s1 fp32 winograd F(2,3)
  • conv3x3s1 int8 winograd F(2,3)
armv7a (fix overflow)
  • conv3x3s1 int8 winograd F(2,3)
  • conv3x3s2 int8
  • conv1x1s1 int8 sgemm
  • dwconv3x3s1/s2
arm64-v8a (fix overflow)
  • conv3x3s1 int8 winograd F(2,3)
  • conv3x3s2 int8
  • conv1x1s1 int8 sgemm
  • dwconv3x3s1/s2
Other int8 layers (see the requantize sketch after the lists below)
x86 simulator
  • requantize layer
  • int8 relu
  • int8 conv1x1s2 graph optimize
  • int8 im2col
  • int8 sgemm
armv7a
  • requantize layer
  • int8 relu
  • int8 conv1x1s2 graph optimize
  • int8 im2col
  • int8 sgemm
arm64-v8a
  • requantize layer
  • int8 relu
  • int8 conv1x1s2 graph optimize
  • int8 im2col
  • int8 sgemm
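
For reference, here is a minimal sketch of what the requantize layer listed above conceptually does, assuming the usual symmetric scales (bottom blob scale and weight scale on the way in, top blob scale on the way out) and an optional fused ReLU. The name and signature are illustrative, not the actual ncnn layer API.

```cpp
// Hypothetical sketch of requantization: the int32 accumulator of an int8
// convolution lives in units of (bottom_scale * weight_scale); dividing by
// that product recovers fp32, and multiplying by top_scale moves the value
// into the next layer's int8 domain. ReLU can be fused before rounding.
#include <cmath>
#include <cstdint>

int8_t requantize_one(int32_t acc, float bottom_scale, float weight_scale,
                      float top_scale, bool fuse_relu)
{
    float v = (float)acc / (bottom_scale * weight_scale); // back to fp32
    if (fuse_relu && v < 0.f)
        v = 0.f;

    int q = (int)std::lround(v * top_scale); // into the next layer's scale
    if (q > 127) q = 127;                    // clamp to int8 range
    if (q < -127) q = -127;
    return (int8_t)q;
}
```

The point of such a step is that consecutive int8 convolutions can pass int8 data directly instead of going back through fp32 blobs in between.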


BUG1989 commented Mar 4, 2019

rk3288 int8 benchmark

@nihui merged commit df3d224 into Tencent:master on Mar 5, 2019

spaul13 commented Apr 29, 2019

Can anyone please tell me how to get the accuracy for a particular model (say mobilenet-yolov3) while running the benchmark?

nihui pushed a commit to nihui/ncnn that referenced this pull request Jul 3, 2019
* add the armv7a conv3x3s1 implement without overflow, remove old codes

* fix the bug of conv3x3s2 packed int8

* new int8 implement, weight quant per channel, better accuracy~

* fix the bug of conv3x3s1 packed int8 neon

* add the naive c fp32 and int8 winograd F(2,3)

* add the neon intrinsic int8 winograd F(2,3)

* optimize the armv7a int8 winograd F(2,3) with neon assembly

* optimize the armv7a int8 winograd F(2,3) input transform with assembly.

* add the requantize layer and int8 relu implement.

* add graph optimize conv1x1s2 -> conv1x1s1, begin optimize int8 aarch64.

* fix int8 bugs

* add the c naive im2col with sgemm

* add aarch64 int8 winograd f23, conv3x3s2 naive implement

* add the int8 sgemm conv7x7s2 on x86/armv7a platform

* optimize the int8 sgemm by neon intrinsic and packed kernel

* optimize the int8 sgemm with packed data

* optimize the int8 sgemm with armv7a neon assembly

* add the int8 sgemm on arm64-v8a platform

* prepare to merge latest codes from master

* add the int8 param files

* In the Class Net, add the fuse_network method
nihui added a commit to nihui/ncnn that referenced this pull request Jul 3, 2019