
Efficient Conformer implementation #1636

Merged · 5 commits merged into wenet-e2e:main on Jan 4, 2023

Conversation

zwglory
Contributor

@zwglory zwglory commented Dec 26, 2022

This PR is about our implementation of Efficient Conformer for WeNet encoder structure and runtime.

At 58.com Inc, Efficient Conformer reduces CER by a relative 6% compared to Conformer and speeds up inference by 10% (CPU JIT runtime). Combined with int8 quantization, inference speed improves by 50–70%. More details on our work: https://mp.weixin.qq.com/s/7T1gnNrVmKIDvQ03etltGQ
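As a rough illustration of the int8 quantization step mentioned above (a minimal sketch, not the actual 58.com deployment: the toy `model` below is a stand-in, not the WeNet encoder), PyTorch's dynamic quantization converts `Linear` layers to int8 before export:

```python
import torch
import torch.nn as nn

# Toy stand-in for an encoder stack; the real model would be the
# WeNet/Efficient Conformer encoder before JIT export.
model = nn.Sequential(nn.Linear(80, 256), nn.ReLU(), nn.Linear(256, 256))

# Dynamic int8 quantization of all Linear layers: weights are stored
# as int8, activations are quantized on the fly at inference time.
qmodel = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8)

x = torch.randn(1, 10, 80)   # (batch, time, feature)
y = qmodel(x)
print(y.shape)               # torch.Size([1, 10, 256])
```

Dynamic quantization needs no calibration data, which makes it a common choice for CPU runtime deployment of speech models.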

Added features

  • Efficient Conformer Encoder structure
    • StrideConformerEncoderLayer for "Progressive Downsampling to the Conformer encoder"
    • GroupedRelPositionMultiHeadedAttention for "Grouped Attention"
    • Conv2dSubsampling2 for 1/2 Convolution Downsampling
  • Recognize and JIT export
    • forward_chunk and forward_chunk_by_chunk in wenet/efficient_conformer/encoder.py
  • Streaming inference at JIT runtime
    • TorchAsrModelEfficient in runtime/core/decoder for Progressive Downsampling
  • Configuration file of Aishell-1
    • train_u2++_efficonformer_v1.yaml for our online deployment
    • train_u2++_efficonformer_v2.yaml for Original paper
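The 1/2 convolutional downsampling behind Conv2dSubsampling2 can be sketched as below. This is a hypothetical simplification, not the WeNet implementation, which additionally subsamples the padding mask and applies positional encoding:

```python
import torch
import torch.nn as nn

class Conv2dSubsampling2(nn.Module):
    """Sketch: halve the time axis with one strided 2-D convolution."""

    def __init__(self, idim: int, odim: int):
        super().__init__()
        # stride=2 halves both the time and feature axes.
        self.conv = nn.Sequential(
            nn.Conv2d(1, odim, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
        )
        # Flatten (channels x reduced feature dim) back to the model dim.
        self.out = nn.Linear(odim * ((idim + 1) // 2), odim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, feat) -> (batch, 1, time, feat)
        x = self.conv(x.unsqueeze(1))   # (batch, odim, time/2, feat/2)
        b, c, t, f = x.size()
        return self.out(x.transpose(1, 2).contiguous().view(b, t, c * f))

x = torch.randn(2, 100, 80)             # 100 frames of 80-dim features
y = Conv2dSubsampling2(80, 256)(x)
print(y.shape)                          # torch.Size([2, 50, 256])
```

A 1/2 rate (instead of the usual 1/4 of Conv2dSubsampling4) leaves more frames for the later progressive downsampling stages inside the encoder.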

Developers

  • Efficient Conformer Encoder structure: ( Yaru Wang & Wei Zhou )
  • Recognize and JIT export: ( Wei Zhou )
  • Streaming inference at JIT runtime: ( Yongze Li )
  • Configuration file of Aishell-1: ( Wei Zhou )

TODO

  • ONNX export and runtime
  • Aishell-1 experiment

@xingchensong
Member

  1. At a first pass, it looks like a lot of this could reuse existing code. In particular, if the "att_cache_shape" / "cnn_cache_shape" handling were moved inside the model, all of the runtime modifications could be dropped. The squeezeformer PR kept the existing forward_chunk interface, and squeezeformer also has to handle different cache shapes across layers internally, so it may be a useful reference.
  2. In the wenet/efficient_conformer folder, MultiHeadedAttention and RelPositionMultiHeadedAttention in attention.py should not need to be re-implemented (they are nearly identical to wenet.transformer.MultiHeadAttention); the small changes (mainly some padding in forward_attention) can be made by overriding in GroupedRelPositionMultiHeadedAttention.
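The second suggestion can be sketched as follows (a hypothetical, simplified stand-in for the actual WeNet classes): keep the base attention class and override only the padding-sensitive `forward_attention`:

```python
import torch
from torch import nn

# Simplified stand-in for wenet.transformer.attention.MultiHeadedAttention.
class MultiHeadedAttention(nn.Module):
    def forward_attention(self, value, scores, mask=None):
        # Standard softmax weighting over precomputed attention scores.
        if mask is not None:
            scores = scores.masked_fill(mask, float("-inf"))
        attn = torch.softmax(scores, dim=-1)
        return torch.matmul(attn, value)

# Reuse the base class; override only the part that needs the
# grouped-attention-specific padding (sketch only).
class GroupedRelPositionMultiHeadedAttention(MultiHeadedAttention):
    def forward_attention(self, value, scores, mask=None):
        # Group-size padding/trimming of scores would happen here.
        return super().forward_attention(value, scores, mask)

v = torch.randn(2, 4, 10, 16)   # (batch, head, time, d_k)
s = torch.randn(2, 4, 10, 10)   # attention scores
out = GroupedRelPositionMultiHeadedAttention().forward_attention(v, s)
print(out.shape)                # torch.Size([2, 4, 10, 16])
```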

@zwglory
Contributor Author

zwglory commented Dec 26, 2022

> 1. At a first pass, it looks like a lot of this could reuse existing code. In particular, if the "att_cache_shape" / "cnn_cache_shape" handling were moved inside the model, all of the runtime modifications could be dropped. The squeezeformer PR kept the existing forward_chunk interface, and squeezeformer also has to handle different cache shapes across layers internally, so it may be a useful reference.
> 2. In the wenet/efficient_conformer folder, MultiHeadedAttention and RelPositionMultiHeadedAttention in attention.py should not need to be re-implemented (they are nearly identical to wenet.transformer.MultiHeadAttention); the small changes (mainly some padding in forward_attention) can be made by overriding in GroupedRelPositionMultiHeadedAttention.

@xingchensong Thanks for the suggestions. I will take a closer look at the cache shape issue.

zwglory and others added 3 commits December 30, 2022 10:51
…e changes. Completed the causal and non-causal convolution model tests for the Efficient Conformer, as well as JIT runtime tests. Modified the yaml files for Aishell-1.
@xingchensong xingchensong merged commit 7427258 into wenet-e2e:main Jan 4, 2023
@xingchensong
Member

THX!

@KakayaLin

@zwglory Is there a pretrained model available for download and testing? Thank you.

@zwglory
Contributor Author

zwglory commented Mar 14, 2023

> @zwglory Is there a pretrained model available for download and testing? Thank you.

@KakayaLin Yes. We will post an update here once it is uploaded.

@KakayaLin

@zwglory Sorry to bother you, is there any update on the upload? Thank you.

@zwglory
Contributor Author

zwglory commented Mar 20, 2023

> @zwglory Is there a pretrained model available for download and testing? Thank you.
>
> @KakayaLin Yes. We will post an update here once it is uploaded.

@KakayaLin The AISHELL-1 model link is as follows; it will also be added to the relevant README later.

@KakayaLin

> @zwglory Is there a pretrained model available for download and testing? Thank you.
>
> @KakayaLin Yes. We will post an update here once it is uploaded.
>
> @KakayaLin The AISHELL-1 model link is as follows; it will also be added to the relevant README later.

Thank you!!

@dipeshhoncho07

Can we incorporate LM in efficient conformer?

@zwglory
Contributor Author

zwglory commented May 9, 2023

> Can we incorporate LM in efficient conformer?

@dipeshhoncho07 Yes, Efficient Conformer supports an LM in the runtime.

@bourne979

Hi, @zwglory, do you have an update on onnx cpu export? Thanks.

@zwglory
Contributor Author

zwglory commented Oct 26, 2023

> Hi, @zwglory, do you have an update on onnx cpu export? Thanks.

You can refer to this description to try it out: #1918 (comment). We will follow up on this part of the feature when we have time.
