You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I tried to get transcriptions for a video of David Silver's reinforcement learning playlist from YouTube .
The model was able to generate very good transcriptions at some timestamps , but at many timestamps , it generates transcriptions of some other language which apart from English. I haven't changed any settings or anything , just copy pasted the url of the video and clicked on transcribe . The result was out in 23.4 seconds but wasn't accurate .
For more information , please have a look at this image I'm attaching below :
!
In the image , you can clearly observe that the model is generating transcriptions of other language , even though english is asked for . Some part of it was in English , and the other part in some other language . #
The text was updated successfully, but these errors were encountered:
Hey @Bharadwajsai-121! Looks like it went into Welsh! This is because the Whisper model makes a prediction for the most likely language for each batch, and then predicts in that language for that batch.
We can actually pass the language to the Flax Whisper Pipeline:
If you pass the language argument as detailed above, you should be able to get perfect transcriptions for the David Silver lectures in nearly the same transcription time.
I tried to get transcriptions for a video of David Silver's reinforcement learning playlist from YouTube .
The model was able to generate very good transcriptions at some timestamps , but at many timestamps , it generates transcriptions of some other language which apart from English. I haven't changed any settings or anything , just copy pasted the url of the video and clicked on transcribe . The result was out in 23.4 seconds but wasn't accurate .
For more information , please have a look at this image I'm attaching below :
!
In the image , you can clearly observe that the model is generating transcriptions of other language , even though english is asked for . Some part of it was in English , and the other part in some other language . #
The text was updated successfully, but these errors were encountered: