Hello, would you consider adding silero-vad to the project? #49

Open
TszSimLaw opened this issue Apr 25, 2023 · 2 comments
Comments

@TszSimLaw

No description provided.

@Z-yq
Owner

Z-yq commented Apr 25, 2023

I haven't used that project yet, so I haven't worked out how to integrate it. I'll plan for it later.

@StuartIanNaylor

I am not all that sure about silero-vad, as the Number Detector and Language Classifier make it a bit 'fat' for just VAD.
Maybe there are simpler and easier ways to chunk incoming realtime spoken audio so it fits the beam search lengths?
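
As one illustration of a "simpler" approach, here is a minimal sketch of an energy-based chunker in plain Python. It is not silero-vad and not anything from this project: the function names, the RMS threshold, the 30 ms frame size, and the 10 s cap are all assumptions chosen for the example, and a fixed threshold like this will struggle in noisy rooms.

```python
import math


def frame_rms(samples, frame_len):
    """Return the RMS energy of each non-overlapping frame."""
    rms = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        rms.append(math.sqrt(sum(x * x for x in frame) / frame_len))
    return rms


def chunk_speech(samples, sample_rate=16000, frame_ms=30, threshold=0.02,
                 min_silence_frames=10, max_chunk_s=10.0):
    """Split audio into (start, end) sample ranges at pauses.

    A chunk is closed after `min_silence_frames` consecutive quiet frames,
    or once it reaches `max_chunk_s` seconds, so the decoder never receives
    an unbounded utterance. All defaults are illustrative assumptions.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    max_frames = int(max_chunk_s * 1000 / frame_ms)
    energies = frame_rms(samples, frame_len)

    chunks = []
    start = None   # frame index where the current chunk began
    silence = 0    # consecutive quiet frames seen inside the chunk
    for i, energy in enumerate(energies):
        if energy >= threshold:
            if start is None:
                start = i
            silence = 0
        elif start is not None:
            silence += 1

        if start is not None:
            too_long = (i - start + 1) >= max_frames
            if silence >= min_silence_frames or too_long:
                end = i + 1 - silence  # trim the trailing quiet frames
                chunks.append((start * frame_len, end * frame_len))
                start, silence = None, 0

    if start is not None:  # flush a chunk still open at end of input
        end = len(energies) - silence
        chunks.append((start * frame_len, end * frame_len))
    return chunks
```

Each returned pair is a sample range that could be sliced out and handed to the recognizer. The length cap is what keeps chunks within whatever the beam search can handle; the right value depends on the model.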

@Z-yq I haven't looked much, but a simpler, lower-parameter model than silero could likely be used.

Also, I think far-field and BSS/beamforming are likely to end up as wireless distributed arrays with central ASR, given how varied the uses of zonal systems could be.

https://github.com/breizhn/DTLN is a pretty good filter, but the dataset needs to be mixed with noise and then processed by DTLN (or whichever filter is used) so that the filter's artefacts are trained in.
https://github.com/Rikorose/DeepFilterNet is truly outstanding but carries more load, and it's a shame the LADSPA plugin uses Tract as its ML framework, as it's single-threaded only.
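
The noise-mixing step mentioned for DTLN can be sketched as below. This is only an assumed illustration in plain Python: `mix_at_snr` is a hypothetical helper, not part of DTLN or DeepFilterNet, and the filtering pass itself is left out because it depends on the chosen filter's own API.

```python
import math


def mix_at_snr(speech, noise, snr_db):
    """Add `noise` to `speech`, scaled so the mix has the requested SNR in dB.

    Hypothetical helper for illustration; both inputs are sequences of floats.
    """
    speech = list(speech)
    noise = list(noise)
    if not speech or not noise:
        return speech

    # Loop the noise clip if it is shorter than the utterance, then trim.
    if len(noise) < len(speech):
        noise = noise * (len(speech) // len(noise) + 1)
    noise = noise[:len(speech)]

    speech_power = sum(x * x for x in speech) / len(speech)
    noise_power = sum(x * x for x in noise) / len(noise)
    if speech_power == 0 or noise_power == 0:
        return speech  # nothing meaningful to scale against

    # SNR_dB = 10*log10(Ps / (g^2 * Pn))  =>  g = sqrt(Ps / (Pn * 10^(SNR_dB/10)))
    gain = math.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return [s + gain * n for s, n in zip(speech, noise)]
```

The mixed audio would then be run through the filter, and the filtered output used as training input, so the recognizer sees the same artefacts at training time as it will at inference time.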
