Multiple Language output #11

snehas89 · 2024-04-12T13:09:25Z

It seems that when uploading an audio or video in Kannada, only the initial portion gets transcribed accurately, while the subsequent part is transcribed in Tamil, as depicted in the provided screenshot. This likely arises due to a language detection error or a system glitch.

ronald0098 · 2024-04-12T14:32:14Z

It seems that when uploading an audio or video in Kannada, only the initial portion gets transcribed accurately, while the subsequent part is transcribed in Tamil, as depicted in the provided screenshot. This likely arises due to a language detection error or a system glitch.

can u pls tell me step by step how to run this project in my machine

kurianbenoy · 2024-04-13T08:39:17Z

@snehas89 can you give us more details about the error. I have also noticed this issue. It's not new to us to be honest.

But if you provide more details like:

Input Youtube video
Input video language:
Target video language:
Did you use advanced options to use any of our 4 model's other that faster-whisper.

@snehas89 it will be helpful for us. Also @snehas89 did you want to help us with issue #9 ?

snehas89 · 2024-04-15T07:27:39Z

@kurianbenoy

It was a local audio file which I uploaded
Input video language: Kannada
Target video language:Kannada
Yes I did use 3 of the models provided i.e, SeamlessM4T, Faster-Whisper, WhisperX

Out of the 3 models Faster-Whisper gave a result better than the other two.
My primary aim was to transcribe the audio file and later look into translation, but was not able to proceed with it.

snehas89 · 2024-04-15T07:33:04Z

@ronald0098
I'm not sure if I found any documentation on how to run the model locally, I used the Indic subtitler web app
https://indicsubtitler.in/ @kurianbenoy can confirm if this is right

kurianbenoy · 2024-04-15T12:33:09Z

Can you share the local audio file here if possible? @snehas89

We haven't added the documentation on how to run model locally, but yeah we can do that when we are free. Created an issue #13 for this.

snehas89 · 2024-04-16T05:03:27Z

@kurianbenoy
doc.zip

Please find the attached zip file, as github doesn't support audio formats uploading

kurianbenoy · 2024-04-16T12:04:01Z

Thanks @snehas89 for sharing the files via zip files. We can't do much for the time being to be honest.

Yet in the future, we might work on improving accuracy with LLMs, so these multiple language outputs doesn't happen.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multiple Language output #11

Multiple Language output #11

snehas89 commented Apr 12, 2024 •

edited

ronald0098 commented Apr 12, 2024

kurianbenoy commented Apr 13, 2024

snehas89 commented Apr 15, 2024

snehas89 commented Apr 15, 2024 •

edited

kurianbenoy commented Apr 15, 2024 •

edited

snehas89 commented Apr 16, 2024 •

edited

kurianbenoy commented Apr 16, 2024

Multiple Language output #11

Multiple Language output #11

Comments

snehas89 commented Apr 12, 2024 • edited

ronald0098 commented Apr 12, 2024

kurianbenoy commented Apr 13, 2024

snehas89 commented Apr 15, 2024

snehas89 commented Apr 15, 2024 • edited

kurianbenoy commented Apr 15, 2024 • edited

snehas89 commented Apr 16, 2024 • edited

kurianbenoy commented Apr 16, 2024

snehas89 commented Apr 12, 2024 •

edited

snehas89 commented Apr 15, 2024 •

edited

kurianbenoy commented Apr 15, 2024 •

edited

snehas89 commented Apr 16, 2024 •

edited