Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VAD on short audio packet #1

Closed
eziolotta opened this issue Jan 28, 2021 · 2 comments
Closed

VAD on short audio packet #1

eziolotta opened this issue Jan 28, 2021 · 2 comments

Comments

@eziolotta
Copy link

eziolotta commented Jan 28, 2021

Nice project !!
From my test, rVADfast performs good segmentation of a medium sized wav file ( > 10/15 seconds )
However, If we cut the input audio (ex: to 5 sec. ), the results are not the same, why is a posteriori analysis done?

It would be nice to apply this library in realtime systems, where server analyzes audio stream, that is short packets of bit-audio in sequence.

@eziolotta
Copy link
Author

This is my experiment with rVADFast:
MozillaItalia/DeepSpeech-Italian-Model#107 (comment)

@HolgerBovbjerg
Copy link
Collaborator

Hi,

For now, we are not considering streaming VAD use.
Not sure what is meant by, "why is a posteriori analysis done?". However, you can find explanations for the various design choices in rVAD in the corresponding research paper: https://arxiv.org/abs/1906.03588

@HolgerBovbjerg HolgerBovbjerg closed this as not planned Won't fix, can't repro, duplicate, stale Aug 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants