Can I automatically deactivate voice transcription if the user is silent for more than N seconds? #85

Open
ByMedion opened this issue May 21, 2023 · 1 comment

Comments

@ByMedion

ByMedion commented May 21, 2023

Hello!
I want to make Wit deactivate if the user is silent for more than N seconds after the last spoken phrase.

However, I cannot rely on the mic level because of possible background noise. So I made a timer that, while it is still running, resets every time the OnPartialTranscription or OnFullTranscription event fires. When the timer expires, it deactivates the service.
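For reference, the timer described above can be sketched roughly as follows. This is a minimal, SDK-independent illustration in Python, not Wit code: `SilenceWatchdog`, `start`, `on_transcription`, and `tick` are hypothetical names, and `tick` stands in for whatever per-frame update the app already has. The only assumption taken from the description is that both partial and full transcription handlers reset the same timer.

```python
from typing import Callable, Optional


class SilenceWatchdog:
    """Calls `deactivate` once `timeout` seconds pass with no transcription event.

    Hypothetical helper for illustration; time is passed in explicitly so the
    logic does not depend on any particular engine or clock.
    """

    def __init__(self, timeout: float, deactivate: Callable[[], None]) -> None:
        self.timeout = timeout
        self.deactivate = deactivate
        self.deadline: Optional[float] = None  # None means the timer is not running

    def start(self, now: float) -> None:
        # Begin counting when the voice service is activated.
        self.deadline = now + self.timeout

    def on_transcription(self, now: float) -> None:
        # Wire this to both the partial and the full transcription handlers.
        # Events that arrive after expiry are ignored ("if it's still active").
        if self.deadline is not None:
            self.deadline = now + self.timeout

    def tick(self, now: float) -> None:
        # Poll once per frame; fire the callback a single time on expiry.
        if self.deadline is not None and now >= self.deadline:
            self.deadline = None
            self.deactivate()
```

Note that the timer only ever sees the moment an event arrives on the client, not the moment the user actually spoke.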

But this solution is not suitable either: network or server delays sometimes make a response arrive too late, after the timer has already expired, so Wit is deactivated before the user has even stopped talking.
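To make the failure mode concrete, here is a small simulation in Python. All numbers are made up for illustration, and `deactivation_time` is a hypothetical helper rather than anything from the SDK. It assumes a timer that resets on each event arrival and compares a short response delay with one that exceeds the timeout.

```python
from typing import Iterable


def deactivation_time(arrivals: Iterable[float], timeout: float, start: float = 0.0) -> float:
    """Return the moment a reset-on-event timer expires.

    `arrivals` are the times at which transcription events reach the client.
    """
    deadline = start + timeout
    for t in sorted(arrivals):
        if t >= deadline:
            break  # the timer expired before this event arrived
        deadline = t + timeout
    return deadline


spoken = [0.5, 1.5, 2.5, 3.5]  # hypothetical times at which the user says a word
timeout = 2.0                  # the "N seconds" of allowed silence

# Short delay: every event arrives in time to reset the timer.
on_time = deactivation_time([t + 0.2 for t in spoken], timeout)

# Delay longer than the timeout: the first event arrives after expiry.
delayed = deactivation_time([t + 2.5 for t in spoken], timeout)

print(f"short delay: deactivates at {on_time:.1f}s, after the last word at {max(spoken)}s")
print(f"long delay:  deactivates at {delayed:.1f}s, while the user is still talking")
```

In the delayed case the timer expires before the first response arrives, even though the user keeps speaking for another second and a half.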

Can you advise how I can achieve my goal?

@yolanother
Contributor

Server-side endpointing should be occurring. Unfortunately, noisy environments can be tricky. Are you receiving transcriptions at all in that environment? It should also endpoint once no new word is detected, but maybe your noisy environment is causing new words to be picked up as well?

Deactivating after N seconds of silence is a fallback mechanism for the case where the server doesn't respond or endpoint.
