
the on_realtime_transcription_update text issue #69

Closed
jacobtang opened this issue Jun 15, 2024 · 3 comments

Comments

@jacobtang

Hi @KoljaB,
I use the on_realtime_transcription_update text (I also tried on_realtime_transcription_stabilized) and send it to the web client, but it always shows the previous text again.
Is there another method that avoids showing the repeated previous words? Thanks!
I am trying to implement a live transcription feature in our product, similar to Zoom Meeting's.

@KoljaB
Owner

KoljaB commented Jun 15, 2024

No, that is due to the way Whisper works and how it is therefore implemented. You'd need to program this on top of it.
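Not part of the original exchange, but a minimal sketch of the kind of deduplication one could program on top of the callback, assuming each update delivers the full current transcription text (the helper name and word-level comparison are illustrative, not part of RealtimeSTT's API):

```python
def new_suffix(previous: str, current: str) -> str:
    """Return the part of `current` that was not already shown.

    Compares word by word, so a revised trailing word is emitted
    as a replacement instead of being appended a second time.
    """
    prev_words = previous.split()
    curr_words = current.split()
    # Length of the common word-level prefix between the two updates.
    i = 0
    while (i < len(prev_words) and i < len(curr_words)
           and prev_words[i] == curr_words[i]):
        i += 1
    return " ".join(curr_words[i:])
```

The web client would then replace its unstable tail with the returned suffix on each update, rather than blindly appending every callback payload.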

@jacobtang
Author

Ok, thanks a lot!
I use the process_text callback data, and it takes about 3 s to get the text. Is that normal? How can I reduce the latency?
I use the recorder_config and the recorder.feed_audio(audio_chunk) method in a GPU environment.

@KoljaB
Owner

KoljaB commented Jun 16, 2024

Absolutely not normal. Even with the largest model, or on CPU, transcription time should be well below 1 s.
I have no real idea why this happens.
Could you please try another model, like small.en or medium? Also, a separate test with only the faster_whisper library could tell us whether it's the transcription itself or maybe something related to the VAD models.
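A minimal standalone timing test along the lines suggested above, using only the faster_whisper library (the audio file name is a placeholder; swap `device="cuda"` for `"cpu"` if no GPU is available):

```python
import time

from faster_whisper import WhisperModel

# Load a small English model on the GPU with half-precision weights.
model = WhisperModel("small.en", device="cuda", compute_type="float16")

start = time.time()
# transcribe() returns a lazy generator of segments plus metadata.
segments, info = model.transcribe("audio.wav")  # placeholder file
# Decoding actually happens while consuming the generator.
text = " ".join(segment.text.strip() for segment in segments)
print(f"Transcribed in {time.time() - start:.2f}s: {text}")
```

If this alone is fast, the 3 s delay would point at something around the recorder pipeline (e.g. the VAD stage) rather than the transcription itself.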
