-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add delay until STT start media finishes playing #11
Comments
StreamAssist uses default Assist Pipeline component. It has some settings, but I don't really understand them :) Also realise that the player has finished playing for all kinds of media player can be a problem. |
Assist Microphone addon and wyoming satellite on raspberry py do not have this problem. Wait for awake response to finish playing then start listening. So there is something like that in the code, but we have to figure out where. And on the satellite on the Esp32 it has three levels of end-of-speech detection (Default, Relaxed and Aggressive). |
|
I get the idea. I don't know if I'll have time to implement this. |
I found a possible solution to this problem:
|
Block loop is very bad idea. You are blocking whole Hass. I know what can be done. I can stop forwarding audio stream from source to pipeline for some time |
I didn't think that it blocks whole Hass. Anyway, it doesn't really work because, I don't know why it starts recording as soon as the wake word is detected, then blocks and delays the VAD and doesn't recognize the commands. Stopping audio stream forwarding would be a much better solution. |
@AlexxIT please can you find a solution to this problem because I want to add visual responses instead of beeps in this integration and if I don't solve the problem with activate mute or delay listening I can't use such responses because it records them and no longer recognize commands. |
I don't have time for this in near future |
I added a browser mod popup with a gif and I need the player status to close the popup when the response finishes playing , but I'm not getting the "player_entity_id" from the args. @AlexxIT can you tell me how I could do it.
If I use the name of the player directly, it works, but not when I want to take it from args |
I'm not sure what args you talking about. I have never used browser mod. Don't understand your code. |
I just need to import the name of the player that is selected in the gui that the responses are playing on to set the popup to close when the response is done playing. |
I don't understand from what place your trying to get |
Did this ever get taken care of? I noticed teh VAD is way too aggressive as well and depending on how quickly the mp3 you play during start media vad is already over and the conversation agent cancels the request. |
Hello. Great job. I was waiting for the wake word for Stream Assist and I'm glad you managed to do it. My problem is that for "STT start media" I want to use personalized random answers like ”yes, i m listening”, ”how can I assist you” etc. and, because VAD is too aggressive, it also records part of the answer ”yes , i m listening” reason for which it gives an error response, that it did not understand the request. I tried an automation so that when it detects the wake word it turns off the microphone switch for a second and then turns it on again, but it doesn't start listening again. Can you make it possible to set a delay between wake word detection and STT listening?
The text was updated successfully, but these errors were encountered: