Voice Agent API (V1) — intermittent STT stall: UserStartedSpeaking fires but no ConversationText/UtteranceEnd returned for delivered audio (no error, no close) #1624
Replies: 3 comments 2 replies
Thanks for asking your question. Please be sure to reply with as much detail as possible so the community can assist you efficiently.
Hey there! It looks like you haven't connected your GitHub account to your Deepgram account. You can do this at https://community.deepgram.com. Being verified through this process will allow our team to help you in a much more streamlined fashion.
It looks like we're missing some important information to help debug your issue. Would you mind providing us with the following details in a reply?
Summary
On the Voice Agent API, a subset of our sessions intermittently stall on speech-to-text. The agent greets normally and the caller speaks. We receive UserStartedSpeaking, but then no user ConversationText and no UtteranceEnd is ever emitted, even though we keep streaming caller audio. There is no error event and no Close from the server. The session produces nothing further until the caller hangs up. Most sessions transcribe fine; this hits a minority.
Configuration
- AgentV1* API.
- STT: nova-3, smart_format=true ← the stage that stalls
- LLM: gpt-4o-mini (AgentV1OpenAiThinkProvider)
- TTS: aura voice
Specific failing session
request_id: 019eadbd-4218-7b40-aa5d-ccd219ed60b7
Timeline (UTC, 2026-06-09):
Expected vs actual
Expected: after UserStartedSpeaking, with audio still flowing, a user ConversationText (and/or UtteranceEnd) so the agent can respond.
Frequency
Intermittent; not load-correlated (reproduced on a single isolated call).
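To put a number on how often this happens, a client-side watchdog along the following lines might help. This is only a minimal sketch: the event names (UserStartedSpeaking, ConversationText, UtteranceEnd) come from this report, while the StallWatchdog class, its method names, the `role` argument, and the 10-second threshold are hypothetical and not part of any Deepgram SDK. The caller is assumed to pass in its own timestamps (for example from `time.monotonic()`).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class StallWatchdog:
    """Flags a session where UserStartedSpeaking was never followed by a result.

    Hypothetical helper for illustration; not part of any Deepgram SDK.
    """

    timeout_s: float = 10.0                 # assumed threshold, tune as needed
    speaking_since: Optional[float] = None  # time of the unanswered UserStartedSpeaking
    last_audio_at: Optional[float] = None   # time the last caller audio chunk was sent

    def on_audio_sent(self, now: float) -> None:
        # Call each time a chunk of caller audio is written to the socket.
        self.last_audio_at = now

    def on_event(self, event_type: str, now: float, role: Optional[str] = None) -> None:
        # Call for every server event; `role` is assumed to be "user" or
        # "assistant" for ConversationText and None otherwise.
        if event_type == "UserStartedSpeaking":
            if self.speaking_since is None:
                self.speaking_since = now
        elif event_type == "UtteranceEnd":
            self.speaking_since = None
        elif event_type == "ConversationText" and role == "user":
            self.speaking_since = None

    def is_stalled(self, now: float) -> bool:
        # Stalled means: speech started, audio kept flowing afterwards,
        # and no user transcript or utterance end arrived within the timeout.
        if self.speaking_since is None or self.last_audio_at is None:
            return False
        waited_too_long = now - self.speaking_since >= self.timeout_s
        audio_still_flowing = self.last_audio_at > self.speaking_since
        return waited_too_long and audio_still_flowing
```

Polling `is_stalled()` on a timer and logging the request_id whenever it returns true would give a per-session stall rate, and the same check could trigger a reconnect as a workaround.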
Questions
- What would cause nova-3 STT in Voice Agent to emit UserStartedSpeaking but never emit ConversationText/UtteranceEnd for audio that is still being delivered, with no error and no Close?
- For request_id 019eadbd-4218-7b40-aa5d-ccd219ed60b7, what happened to that session's transcription?
Any help will be appreciated, thanks.