I set up a local HLS stream playing a long video of someone talking.
Everything works fine until exactly 2 minutes in, at which point the transcription stops completely.
INFO:faster_whisper:Processing audio with duration 00:07.936
INFO:faster_whisper:Processing audio with duration 00:02.984
INFO:faster_whisper:Processing audio with duration 00:03.032
INFO:faster_whisper:Processing audio with duration 00:01.432
INFO:faster_whisper:Processing audio with duration 00:03.480
INFO:faster_whisper:Processing audio with duration 00:05.272
INFO:faster_whisper:Processing audio with duration 00:01.152
INFO:faster_whisper:Processing audio with duration 00:03.200
INFO:faster_whisper:Processing audio with duration 00:02.548
INFO:faster_whisper:Processing audio with duration 00:04.596
INFO:faster_whisper:Processing audio with duration 00:01.796
INFO:faster_whisper:Processing audio with duration 00:03.844
INFO:faster_whisper:Processing audio with duration 00:02.484
INFO:faster_whisper:Processing audio with duration 00:02.484
INFO:faster_whisper:Processing audio with duration 00:02.484
INFO:faster_whisper:Processing audio with duration 00:02.484
INFO:faster_whisper:Processing audio with duration 00:02.484
INFO:faster_whisper:Processing audio with duration 00:02.484
INFO:faster_whisper:Processing audio with duration 00:02.484
In the server logs I can see that the server processes chunks of variable length. The problem starts when the "00:02.484" chunks begin repeating. I'm unsure whether the client keeps sending the same chunk and the server keeps processing it, which would make the client appear "stuck", or whether something is stuck in a different loop.
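One way to pin down where the stall begins would be to scan the server log for the first run of identical durations. The following is only a rough sketch: `find_stall` and `min_repeats` are hypothetical names made up for illustration, not part of faster_whisper or the server, and it assumes the log lines look exactly like the ones above.

```python
import re

# Assumes the log format shown above: "... Processing audio with duration 00:02.484"
DURATION_RE = re.compile(r"Processing audio with duration (\S+)")


def find_stall(log_lines, min_repeats=3):
    """Return (index, duration) for the first run of at least `min_repeats`
    identical durations, or None if no such run exists.

    `index` counts only the lines that match the duration pattern, from 0.
    """
    durations = []
    for line in log_lines:
        match = DURATION_RE.search(line)
        if match:
            durations.append(match.group(1))

    run_start = 0
    for i in range(1, len(durations) + 1):
        # A run ends at the end of the list or when the duration changes.
        if i == len(durations) or durations[i] != durations[run_start]:
            if i - run_start >= min_repeats:
                return run_start, durations[run_start]
            run_start = i
    return None
```

Note that a repeated duration only shows that the server keeps receiving audio buffers of the same length. It does not by itself prove that the audio content is identical, so it narrows down when the stall starts rather than why.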
Setting use_vad to True doesn't seem to make a difference.
I have tried this on both macOS (M3 Max chip) and Windows 10, with both the Docker server and the Python server. All combinations produce the same result.
This is the client code: