-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Incorrect translation while using demo app from the example #237
Comments
The illustration shown on the webpage for the translation demo appears to excessively exaggerate reality. In actuality, the translation does not seem to function as anticipated, wouldn't you agree? Additionally, it would be beneficial if you could include some examples for running seamless streaming. |
I thought I was the only one having this issue, why aren't enough people talking about this? Just like @iamshreeram I get some random text that has nothing to do with the original content. My input was an audio clip of 40 seconds, the result was some random sentence (like 3 words) that has nothing to do with the original content. I first thought this was my fault, thought maybe I messed up something with installation or messed up with running the code in a specific way. But I can confirm the exact same problem happens on the Huggingface space deployed by Meta. You can try yourself here: https://huggingface.co/spaces/facebook/seamless-m4t-v2-large If even the official demo deployed by the team is not working, I'm sure it's not our fault. There's something seriously wrong with the demo. ps. For the record, the broken ones are |
@Vaibhavs10 / @ggerganov , I would appreciate it if you could investigate this issue. The primary features of the model provided as an example do not appear to be functioning correctly, and there is a possibility that many similar issues may be reported. |
I think I am facing the same problem. I was the whole day trying to debug it and even left a comment at their space: |
can the devs do something?... |
Hey there, from transformers import pipeline
pipeline_generator = pipeline(
"automatic-speech-recognition",
"facebook/seamless-m4t-v2-large",
chunk_length_s=30,
device=3
)
transcript = pipeline_generator("https://www2.cs.uic.edu/~i101/SoundFiles/preamble10.wav",
generate_kwargs={
"tgt_lang": "spa", },
) Output: |
@ylacombe , Thank you, I appreciate it, your code snippet is working as intended. However, the question still remains as to why the ui would produce incorrect/irrelevant translations. Additionally, I have two minor questions:
|
I'm utilizing the demo app on my Mac M1 to try the model and the translation is not accurate. When attempting to translate the audio from the provided example, the model generates predictions, but the translated content is entirely unrelated. Below is the snapshot after translating the audio to spanish -
I'm seeking clarity on whether this behavior is expected or if there might be an underlying issue causing this discrepancy.
The text was updated successfully, but these errors were encountered: