large audio file language processing #5

Open
MostafaAlaviyan opened this issue Feb 15, 2023 · 1 comment

Comments

@MostafaAlaviyan

Hi,
First of all, thanks for this valuable repo.
I have some audio files, about 15 minutes long on average, in which several people speak different languages.
How can I use your pretrained model to handle such audio files?
Best regards
@bytosaur
@danomatika
@loelkes

@danomatika
Member

Howdy Mostafa,

> How can I use your pretrained model to handle the aforementioned audio file?

Good question. The readme says: "All models expect 5 seconds of normalized audio sampled at 16kHz...", so a 15-minute file would have to be split into 5-second chunks, with each chunk fed in to see what you get out. What is missing from the readme is how to load a file. @bytosaur I assume this is possible but just not documented?
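
Until that is documented, here is a minimal sketch of the preprocessing the readme quote implies. It is not the repo's own loader. It assumes the input is a 16-bit PCM WAV file, that "normalized" means peak normalization to [-1, 1], and that a crude linear-interpolation resampler is good enough for a first try (a proper resampler with an anti-aliasing filter would be better). The function names `load_wav_mono`, `resample_linear` and `split_into_chunks` are hypothetical helpers made up for this example.

```python
import wave

import numpy as np

TARGET_RATE = 16_000   # 16 kHz, per the readme quote
CHUNK_SECONDS = 5      # 5-second windows, per the readme quote


def load_wav_mono(path):
    """Read a 16-bit PCM WAV file; return (float32 mono samples, sample rate)."""
    with wave.open(path, "rb") as wf:
        if wf.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM WAV files")
        rate = wf.getframerate()
        channels = wf.getnchannels()
        raw = wf.readframes(wf.getnframes())
    # WAV stores little-endian signed 16-bit integers; scale to [-1, 1).
    samples = np.frombuffer(raw, dtype="<i2").astype(np.float32) / 32768.0
    if channels > 1:
        # Frames are interleaved: average the channels down to mono.
        samples = samples.reshape(-1, channels).mean(axis=1)
    return samples, rate


def resample_linear(samples, rate, target_rate=TARGET_RATE):
    """Crude linear-interpolation resampler (assumption: no anti-aliasing)."""
    if rate == target_rate or len(samples) == 0:
        return samples.astype(np.float32)
    n_out = int(round(len(samples) * target_rate / rate))
    old_t = np.arange(len(samples)) / rate
    new_t = np.arange(n_out) / target_rate
    return np.interp(new_t, old_t, samples).astype(np.float32)


def split_into_chunks(samples, rate=TARGET_RATE, seconds=CHUNK_SECONDS):
    """Cut audio into fixed-length windows, zero-padding the last one.

    Each window is peak-normalized on its own (an assumption about what
    "normalized" means). Returns an array of shape (n_chunks, rate * seconds).
    """
    size = rate * seconds
    n_chunks = max(1, -(-len(samples) // size))  # ceiling division
    padded = np.zeros(n_chunks * size, dtype=np.float32)
    padded[: len(samples)] = samples
    chunks = padded.reshape(n_chunks, size)
    peaks = np.abs(chunks).max(axis=1, keepdims=True)
    # Leave all-silent windows untouched instead of dividing by zero.
    return chunks / np.where(peaks > 0, peaks, 1.0)
```

With these helpers, `samples, rate = load_wav_mono("recording.wav")` followed by `chunks = split_into_chunks(resample_linear(samples, rate))` gives one row per 5-second window, where row `i` covers seconds `5*i` to `5*i + 5` of the file. Each row can then be passed to the model on its own, so a multi-speaker recording yields a language prediction per window rather than one for the whole file. How the model itself is loaded and called is not covered here, since the readme quote does not say.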
