You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Due to the implementation of the transcribe_file method and specially of the sub method load_audio, it is impossible to transcribe audio file with the same name despite being located in different directories since the method copy the audio file locally at savedir. The new file with the same name doesn't overwrite the file, since it already exist and make the transcription impossible.
This relates to #1303 - this topic touches on data handling in general:
for researchers, can a training recipe run through?
for industry, are the right audios loaded through the pretrained interfaces?
for curious users, does the demo code provided on HuggingFace run?
Dropping a file does not help if there's DDP and multiple nodes are having fun with the same file name.
Hope we can get to it soon. As you mentioned, it's internal data handling and there's an expectation of this just running well (as it is also stated in the mentioned PR).
Due to the implementation of the transcribe_file method and specially of the sub method load_audio, it is impossible to transcribe audio file with the same name despite being located in different directories since the method copy the audio file locally at
savedir
. The new file with the same name doesn't overwrite the file, since it already exist and make the transcription impossible.Example of file paths :
It will be cool to get rid of this local file to fix this issue and improve overall performances of the transcription method.
One easy but dirty way of fixing it is to remove the file at the end of the transcription to allow further ones.
The text was updated successfully, but these errors were encountered: