Dev x86 openvino #516

Maosquerade · 2020-11-03T03:05:32Z

Update X86 Operators
Updata X86 Openvino Custom Operators

* [CONVERTER][BUG]1. fix complie failed on centos (gcc 4.9);

fix codecc warnning Co-authored-by: lnmdlong <lnmdlong@hotmail.com> Co-authored-by: devandong <devandong@tencent.com> Co-authored-by: quinnrong94 <quinnrong@tencent.com>

* interpret func use reference param * ncnn interpret use reference param

Co-authored-by: nihui <shuizhuyuanluo@126.com>

* [EXAMPLES][PATCH] add face align demo and refactor for some case * [EXAMPLES][FIX] fix align opencl error * [EXAMPLES][FIX] fix arm linux demo * [EXAMPLE][FIX] fix android preview size error * [UPD]update youtu face alignment mean pts logic (Tencent#385) * [BUG]fix YouTu face alignment model * [UPD]update mean pts file logic * [UPD]draw face points green * [UPD]unify example controller list * [UPD]unify example controller list Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com> * [ARM][BUG] fix sqrt layer with zero input (Tencent#392) * pull tflite2tnn tools (Tencent#378) * add tflite2tnn tools Co-authored-by: lucasktian <lucasktian@tencent.com> * [UPD]move blaze anchor file to resource; fix blazeface error; (Tencent#390) * [BUG]fix YouTu face alignment model * [UPD]update mean pts file logic * [UPD]draw face points green * [UPD]unify example controller list * [UPD]unify example controller list * [UPD]move blaze anchor file to resource Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com> * [OPENCL]support google pixel phone opencl mode (Tencent#399) Co-authored-by: janchen <janchen@tencent.com> Co-authored-by: ShaunDai <66760945+shaundai-tencent@users.noreply.github.com> Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com> * [UPD] update readme (Tencent#404) * [UPD] update readme * [UPD] fix newline in README_en.md Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com> Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com> * [OPENCL][BUG] fix gflops calculate bug in conv (Tencent#412) * [OPENCL][BUG] fix gflops calculate bug in conv * [OPENCL][FIX] fix deconv calculate flops Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com> Co-authored-by: neiltian <neiltian@tencent.com> * Hotfix issue 400 (Tencent#410) * [CPU] fix bfp16 blob converter * [CPU] fix cpu device allocate * [CPU] skip blob converter to yuv mat Co-authored-by: lucasktian <lucasktian@tencent.com> * [ARM][BUG] fix armv7 gemm_float_n4 error Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com> Co-authored-by: quinnrong94 <67782915+quinnrong94@users.noreply.github.com> Co-authored-by: stephehuang <69882565+stephehuang@users.noreply.github.com> Co-authored-by: lucasktian <lucasktian@tencent.com> Co-authored-by: Bbean <j850447553@icloud.com> Co-authored-by: janchen <janchen@tencent.com> Co-authored-by: ShaunDai <66760945+shaundai-tencent@users.noreply.github.com> Co-authored-by: devandong <67893313+devandong@users.noreply.github.com> Co-authored-by: seanxcwang <66675860+seanxcwang@users.noreply.github.com>

issue, Tencent#463

… dev_x86_openvino

Update reshape 's conversion in TFLite Co-authored-by: lucasktian <lucasktian@tencent.com>

… dev_x86_openvino

Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com>

* [OPENCL][BUG] skip NNV21/NNV12 blob converter test case, not supported for now * [OPENCL][OPT] optimize reduce perf with fine-grained parallelism when parallelism is low, intensity is high * [OPENCL][BUG] fix workgroup size init * [OPENCL][BUG] fix work group size init, ensure size to be power of 2 * [OPENCL][OPT] optimize softmax perf with fine-grained parallelism when parallelism is low, intensity is high * [OPENCL] refine code for pull request * update opencl program * [OPENCL][FIX] use fp16 for local memory when enable && fix global work items filter && set threshold based on experiments Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com> Co-authored-by: neiltian <neiltian@tencent.com>

* [OPT] rename int8 reformat; change SupportDevice to IsSupported * [OPT] add GetEnabledPrecision in abstract device; implement RegisterLayerPrecision in arm device * [OPT] update global_device_map only once * [OPT] set fp16 blob in network initlayers * [OPT] support fp16 reformat in net_optimizer * [OPT] get cpu fp16 capability; refactor update blob precision * [FIX] update fp16 blob with cpu support * [FIX] fix typo * [CHG] only update precision for cpu; rename to ImplementedPrecision

* [NPU][UPD] add test android * [NPU][UPD] add fp16 * [NPU][UPD] modify build test script * [NPU][UPD]add permute op Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com> Co-authored-by: ShaunDai <66760945+shaundai-tencent@users.noreply.github.com>

[ARM][OPT] 1. add dw tail process 2. add qgemm asm kernel(big hw and small c) 3. add conv impl factory Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>

* [CHG] enhance mat converter param check * [CPU] implement cpu copy make border * [TEST] add copy make border unit test * [ARM] support copy make border * [METAL] support copy make border * [CHG] reset dst mat only when its data is nullptr * [OPENCL][ADD] support copy make border * [ARM] optimize mat copy make border * [Metal] disable interpolation-related unit_tests on Metal Co-authored-by: devandong <devandong@tencent.com> Co-authored-by: lnmdlong <lnmdlong@hotmail.com> Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>

* [OPENCL] fix chinese comments * [DEVICE][OPENCL] change comment Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com> Co-authored-by: neiltian <neiltian@tencent.com>

* [CPU] support nearest warpaffine * [CPU] fix nearest choose error * [ARM] support nearest warpaffine * [Metal] support nearest warpaffine * [Metal] fix bilinear warpaffine border access error * [ARM] optmize channel equals 4 * [OPENCL] support nearest warpaffine Co-authored-by: devandong <devandong@tencent.com> Co-authored-by: lnmdlong <lnmdlong@hotmail.com>

fix pool fusion bug

bluaxe and others added 30 commits September 30, 2020 14:50

minor update

d0235f2

Hotfix linux compile (Tencent#446)

dd02b38

* [CONVERTER][BUG]1. fix complie failed on centos (gcc 4.9);

fix codecc warning (Tencent#441)

c817b97

fix codecc warnning Co-authored-by: lnmdlong <lnmdlong@hotmail.com> Co-authored-by: devandong <devandong@tencent.com> Co-authored-by: quinnrong94 <quinnrong@tencent.com>

interpret func use reference param (Tencent#451)

ad42100

* interpret func use reference param * ncnn interpret use reference param

fix enum value error (Tencent#454)

15cae1b

add missing letter (Tencent#455)

c06fa23

Co-authored-by: nihui <shuizhuyuanluo@126.com>

x86 demo minor change

380ccd5

x86 demo add resize

6e63a9a

add null check for MatUtils

ff3cad0

[RKNPU][CHG] add leakly relu convert (Tencent#465)

f84c7e4

[NPU][ADD] add Huawei NPU profiling support

005ee29

issue, Tencent#463

[OPENCL][BUG] fix profiling summary incorrect when loop count > 1

53dc3da

Merge branch 'master' into master

60b7fb0

add webcam based demo

351651f

add null check for MatUtils (Tencent#466)

32da450

[Fix] Fix pad layer inconsistent problem

0947877

Merge remote-tracking branch 'origin' into dev_x86_openvino

c46230a

Merge branch 'dev_x86_openvino' of https://github.com/bluaxe/TNN into…

545bc99

… dev_x86_openvino

[X86][OPENVINO] increase x86 unary layer operator

414ea3c

add x86 demo: blaze face detector & aligner

726d4ea

x86 demo change to UltraFaceDetecotr

f693990

Update reshape 's conversion in TFLite (Tencent#469)

5980d9c

Update reshape 's conversion in TFLite Co-authored-by: lucasktian <lucasktian@tencent.com>

x86 demo msvc ok

4a92aef

fix cmake versioning & macos build scripts

0d557db

[X86][OPENVINO] Add Binary Op Frame

f2dcfa7

Merge branch 'dev_x86_openvino' of https://github.com/bluaxe/TNN into…

344b11d

… dev_x86_openvino

[COMPILE][FIX] fix gnustl_static compile error and warning

0376745

fix xcode compile error

80f03fc

Merge branch 'dev_openvino' into dev_x86_openvino

190e322

bluaxe and others added 25 commits October 21, 2020 18:41

build metal on macos

d30cd6b

Merge branch 'dev_openvino' into dev_x86_openvino

4207565

Merge branch 'dev_openvino' into dev_x86_openvino

33624e7

[X86][OPENVINO] add splitv layer for openvino

c0d0f7b

Merge branch 'dev_x86_openvino' of https://github.com/bluaxe/TNN into…

a88c266

… dev_x86_openvino

fix display of README_EN (Tencent#484)

4f911a8

Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com>

[x86] add all reduce layer operations

09eb25f

[FIX] fix inner_product_layer_builder error

ec60190

[FIX][X86][OPENVINO] fix openvino deconvolution shape unaligned

974f277

[NPU][BUG] fix comiple error due to api change

d70e594

Enhance arm int8 (Tencent#486)

f452c41

[ARM][OPT] 1. add dw tail process 2. add qgemm asm kernel(big hw and small c) 3. add conv impl factory Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>

[OPENCL] fix chinese comments (Tencent#493)

d81574b

* [OPENCL] fix chinese comments * [DEVICE][OPENCL] change comment Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com> Co-authored-by: neiltian <neiltian@tencent.com>

[X86] Add HardSwish layer acc

ae659df

[X86] Add Optimized HardSwish Layer ACC, fix custom_implmentation issue

ae1afbc

[ONNX][BUG] fix pool fusion bug (Tencent#500)

cc51ad5

fix pool fusion bug

[DEV][UPD] 1. Int8Reformat -> Reformat;

09f21c3

[X86] add concat layer

6793e98

[X86] Merge tencent/master

8ec4a18

[X86] resolve conflicts

3ea237a

[X86][OPENVINO] fix splitv layer builder

ef98891

bluaxe self-requested a review November 9, 2020 03:02

bluaxe approved these changes Nov 9, 2020

View reviewed changes

Merge branch 'dev_openvino' into dev_x86_openvino

10ba2a9

bluaxe merged commit 3413cac into Tencent:dev_openvino Nov 9, 2020

bluaxe deleted the dev_x86_openvino branch December 30, 2020 08:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dev x86 openvino #516

Dev x86 openvino #516

Maosquerade commented Nov 3, 2020

Dev x86 openvino #516

Dev x86 openvino #516

Conversation

Maosquerade commented Nov 3, 2020