Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dev x86 openvino #516

Merged
merged 68 commits into from
Nov 9, 2020
Merged

Dev x86 openvino #516

merged 68 commits into from
Nov 9, 2020

Conversation

Maosquerade
Copy link
Collaborator

Update X86 Operators
Updata X86 Openvino Custom Operators

bluaxe and others added 30 commits September 30, 2020 14:50
* [CONVERTER][BUG]1. fix complie failed on centos (gcc 4.9);
fix codecc warnning

Co-authored-by: lnmdlong <lnmdlong@hotmail.com>
Co-authored-by: devandong <devandong@tencent.com>
Co-authored-by: quinnrong94 <quinnrong@tencent.com>
* interpret func use reference param

* ncnn interpret use reference param
Co-authored-by: nihui <shuizhuyuanluo@126.com>
* [EXAMPLES][PATCH] add face align demo and refactor for some case

* [EXAMPLES][FIX] fix align opencl error

* [EXAMPLES][FIX] fix arm linux demo

* [EXAMPLE][FIX] fix android preview size error

* [UPD]update youtu face alignment mean pts logic (Tencent#385)

* [BUG]fix YouTu face alignment model

* [UPD]update mean pts file logic

* [UPD]draw face points green

* [UPD]unify example controller list

* [UPD]unify example controller list

Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>

* [ARM][BUG] fix sqrt layer with zero input (Tencent#392)

* pull tflite2tnn tools (Tencent#378)

* add tflite2tnn tools

Co-authored-by: lucasktian <lucasktian@tencent.com>

* [UPD]move blaze anchor file to resource; fix blazeface error; (Tencent#390)

* [BUG]fix YouTu face alignment model

* [UPD]update mean pts file logic

* [UPD]draw face points green

* [UPD]unify example controller list

* [UPD]unify example controller list

* [UPD]move blaze anchor file to resource

Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>

* [OPENCL]support google pixel phone opencl mode (Tencent#399)

Co-authored-by: janchen <janchen@tencent.com>
Co-authored-by: ShaunDai <66760945+shaundai-tencent@users.noreply.github.com>
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>

* [UPD] update readme (Tencent#404)

* [UPD] update readme

* [UPD] fix newline in README_en.md

Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com>

* [OPENCL][BUG] fix gflops calculate bug in conv (Tencent#412)

* [OPENCL][BUG] fix gflops calculate bug in conv

* [OPENCL][FIX] fix deconv calculate flops

Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: neiltian <neiltian@tencent.com>

* Hotfix issue 400 (Tencent#410)

* [CPU] fix bfp16 blob converter

* [CPU] fix cpu device allocate

* [CPU] skip blob converter to yuv mat

Co-authored-by: lucasktian <lucasktian@tencent.com>

* [ARM][BUG] fix armv7 gemm_float_n4 error

Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com>
Co-authored-by: quinnrong94 <67782915+quinnrong94@users.noreply.github.com>
Co-authored-by: stephehuang <69882565+stephehuang@users.noreply.github.com>
Co-authored-by: lucasktian <lucasktian@tencent.com>
Co-authored-by: Bbean <j850447553@icloud.com>
Co-authored-by: janchen <janchen@tencent.com>
Co-authored-by: ShaunDai <66760945+shaundai-tencent@users.noreply.github.com>
Co-authored-by: devandong <67893313+devandong@users.noreply.github.com>
Co-authored-by: seanxcwang <66675860+seanxcwang@users.noreply.github.com>
Update reshape 's conversion in TFLite

Co-authored-by: lucasktian <lucasktian@tencent.com>
bluaxe and others added 25 commits October 21, 2020 18:41
Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com>
* [OPENCL][BUG] skip NNV21/NNV12 blob converter test case, not supported for now

* [OPENCL][OPT] optimize reduce perf with fine-grained parallelism when parallelism is low, intensity is high

* [OPENCL][BUG] fix workgroup size init

* [OPENCL][BUG] fix work group size init, ensure size to be power of 2

* [OPENCL][OPT] optimize softmax perf with fine-grained parallelism when parallelism is low, intensity is high

* [OPENCL] refine code for pull request

* update opencl program

* [OPENCL][FIX] use fp16 for local memory when enable && fix global work items filter && set threshold based on experiments

Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: neiltian <neiltian@tencent.com>
* [OPT] rename int8 reformat; change SupportDevice to IsSupported

* [OPT] add GetEnabledPrecision in abstract device; implement RegisterLayerPrecision in arm device

* [OPT] update global_device_map only once

* [OPT] set fp16 blob in network initlayers

* [OPT] support fp16 reformat in net_optimizer

* [OPT] get cpu fp16 capability; refactor update blob precision

* [FIX] update fp16 blob with cpu support

* [FIX] fix typo

* [CHG] only update precision for cpu; rename to ImplementedPrecision
* [NPU][UPD] add test android

* [NPU][UPD] add fp16

* [NPU][UPD] modify build test script

* [NPU][UPD]add permute op

Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: ShaunDai <66760945+shaundai-tencent@users.noreply.github.com>
[ARM][OPT] 1. add dw tail process 2. add qgemm asm kernel(big hw and small c) 3. add conv impl factory

Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
* [CHG] enhance mat converter param check

* [CPU] implement cpu copy make border

* [TEST] add copy make border unit test

* [ARM] support copy make border

* [METAL] support copy make border

* [CHG] reset dst mat only when its data is nullptr

* [OPENCL][ADD] support copy make border

* [ARM] optimize mat copy make border

* [Metal] disable interpolation-related unit_tests on Metal

Co-authored-by: devandong <devandong@tencent.com>
Co-authored-by: lnmdlong <lnmdlong@hotmail.com>
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
* [OPENCL] fix chinese comments

* [DEVICE][OPENCL] change comment

Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: neiltian <neiltian@tencent.com>
* [CPU] support nearest warpaffine

* [CPU] fix nearest choose error

* [ARM] support nearest warpaffine

* [Metal] support nearest warpaffine

* [Metal] fix bilinear warpaffine border access error

* [ARM] optmize channel equals 4

* [OPENCL] support nearest warpaffine

Co-authored-by: devandong <devandong@tencent.com>
Co-authored-by: lnmdlong <lnmdlong@hotmail.com>
@bluaxe bluaxe self-requested a review November 9, 2020 03:02
@bluaxe bluaxe merged commit 3413cac into Tencent:dev_openvino Nov 9, 2020
@bluaxe bluaxe deleted the dev_x86_openvino branch December 30, 2020 08:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.