Dev x86 openvino #516
Merged
* [CONVERTER][BUG] 1. fix compile failure on CentOS (gcc 4.9); fix codecc warning
Co-authored-by: lnmdlong <lnmdlong@hotmail.com>
Co-authored-by: devandong <devandong@tencent.com>
Co-authored-by: quinnrong94 <quinnrong@tencent.com>
* interpret func use reference param
* ncnn interpret use reference param
Co-authored-by: nihui <shuizhuyuanluo@126.com>
* [EXAMPLES][PATCH] add face align demo and refactor for some case
* [EXAMPLES][FIX] fix align opencl error
* [EXAMPLES][FIX] fix arm linux demo
* [EXAMPLE][FIX] fix android preview size error
* [UPD]update youtu face alignment mean pts logic (Tencent#385)
* [BUG]fix YouTu face alignment model
* [UPD]update mean pts file logic
* [UPD]draw face points green
* [UPD]unify example controller list
* [UPD]unify example controller list
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
* [ARM][BUG] fix sqrt layer with zero input (Tencent#392)
* pull tflite2tnn tools (Tencent#378)
* add tflite2tnn tools
Co-authored-by: lucasktian <lucasktian@tencent.com>
* [UPD]move blaze anchor file to resource; fix blazeface error; (Tencent#390)
* [BUG]fix YouTu face alignment model
* [UPD]update mean pts file logic
* [UPD]draw face points green
* [UPD]unify example controller list
* [UPD]unify example controller list
* [UPD]move blaze anchor file to resource
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
* [OPENCL]support google pixel phone opencl mode (Tencent#399)
Co-authored-by: janchen <janchen@tencent.com>
Co-authored-by: ShaunDai <66760945+shaundai-tencent@users.noreply.github.com>
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
* [UPD] update readme (Tencent#404)
* [UPD] update readme
* [UPD] fix newline in README_en.md
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com>
* [OPENCL][BUG] fix gflops calculate bug in conv (Tencent#412)
* [OPENCL][BUG] fix gflops calculate bug in conv
* [OPENCL][FIX] fix deconv calculate flops
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: neiltian <neiltian@tencent.com>
* Hotfix issue 400 (Tencent#410)
* [CPU] fix bfp16 blob converter
* [CPU] fix cpu device allocate
* [CPU] skip blob converter to yuv mat
Co-authored-by: lucasktian <lucasktian@tencent.com>
* [ARM][BUG] fix armv7 gemm_float_n4 error
Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com>
Co-authored-by: quinnrong94 <67782915+quinnrong94@users.noreply.github.com>
Co-authored-by: stephehuang <69882565+stephehuang@users.noreply.github.com>
Co-authored-by: lucasktian <lucasktian@tencent.com>
Co-authored-by: Bbean <j850447553@icloud.com>
Co-authored-by: janchen <janchen@tencent.com>
Co-authored-by: ShaunDai <66760945+shaundai-tencent@users.noreply.github.com>
Co-authored-by: devandong <67893313+devandong@users.noreply.github.com>
Co-authored-by: seanxcwang <66675860+seanxcwang@users.noreply.github.com>
… dev_x86_openvino
Update reshape's conversion in TFLite
Co-authored-by: lucasktian <lucasktian@tencent.com>
… dev_x86_openvino
… dev_x86_openvino
Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com>
* [OPENCL][BUG] skip NNV21/NNV12 blob converter test case, not supported for now
* [OPENCL][OPT] optimize reduce perf with fine-grained parallelism when parallelism is low, intensity is high
* [OPENCL][BUG] fix workgroup size init
* [OPENCL][BUG] fix work group size init, ensure size to be power of 2
* [OPENCL][OPT] optimize softmax perf with fine-grained parallelism when parallelism is low, intensity is high
* [OPENCL] refine code for pull request
* update opencl program
* [OPENCL][FIX] use fp16 for local memory when enable && fix global work items filter && set threshold based on experiments
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: neiltian <neiltian@tencent.com>
* [OPT] rename int8 reformat; change SupportDevice to IsSupported
* [OPT] add GetEnabledPrecision in abstract device; implement RegisterLayerPrecision in arm device
* [OPT] update global_device_map only once
* [OPT] set fp16 blob in network initlayers
* [OPT] support fp16 reformat in net_optimizer
* [OPT] get cpu fp16 capability; refactor update blob precision
* [FIX] update fp16 blob with cpu support
* [FIX] fix typo
* [CHG] only update precision for cpu; rename to ImplementedPrecision
* [NPU][UPD] add test android
* [NPU][UPD] add fp16
* [NPU][UPD] modify build test script
* [NPU][UPD]add permute op
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: ShaunDai <66760945+shaundai-tencent@users.noreply.github.com>
[ARM][OPT]
1. add dw tail process
2. add qgemm asm kernel (big hw and small c)
3. add conv impl factory
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
* [CHG] enhance mat converter param check
* [CPU] implement cpu copy make border
* [TEST] add copy make border unit test
* [ARM] support copy make border
* [METAL] support copy make border
* [CHG] reset dst mat only when its data is nullptr
* [OPENCL][ADD] support copy make border
* [ARM] optimize mat copy make border
* [Metal] disable interpolation-related unit_tests on Metal
Co-authored-by: devandong <devandong@tencent.com>
Co-authored-by: lnmdlong <lnmdlong@hotmail.com>
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
* [OPENCL] fix chinese comments
* [DEVICE][OPENCL] change comment
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: neiltian <neiltian@tencent.com>
* [CPU] support nearest warpaffine
* [CPU] fix nearest choose error
* [ARM] support nearest warpaffine
* [Metal] support nearest warpaffine
* [Metal] fix bilinear warpaffine border access error
* [ARM] optimize channel equals 4
* [OPENCL] support nearest warpaffine
Co-authored-by: devandong <devandong@tencent.com>
Co-authored-by: lnmdlong <lnmdlong@hotmail.com>
fix pool fusion bug
bluaxe approved these changes on Nov 9, 2020
Update X86 Operators
Update X86 OpenVINO Custom Operators