Supported audio/index length? #5

Closed
ghost opened this issue Nov 9, 2021 · 2 comments

Comments

ghost commented Nov 9, 2021

Hello,

I am building a dataset to train with and need to ask a few questions before proceeding.

What is the maximum supported or suggested audio length? Is several minutes all right, or should the audio be limited to about 20 seconds? Likewise, is there a reasonable limit to the length of the index?

Thank you.

ghost changed the title from "Supported audio/index length? Nested subfolders?" to "Supported audio/index length?" on Jan 1, 2022
Sleepwalking (Owner) commented

Hello, this is a bit case-dependent. In my experience, long files are tricky to handle. For speech data, it works best with audio under 20 seconds. The memory usage is O(N^2), where N is the file length, so it will quickly run out of memory for anything lasting longer than a minute. Quality also degrades when there are long silences.
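To put the quadratic growth in perspective, here is a rough back-of-the-envelope sketch. Only the O(N^2) scaling comes from the comment above; the frame rate (100 frames per second), the 4 bytes per cell, and the function name are illustrative assumptions, not values taken from this project.

```python
def quadratic_memory_bytes(duration_s, frames_per_s=100, bytes_per_cell=4):
    """Bytes needed for one N x N table, where N is the number of frames.

    frames_per_s and bytes_per_cell are assumed values for illustration only.
    """
    n_frames = round(duration_s * frames_per_s)
    return n_frames * n_frames * bytes_per_cell


if __name__ == "__main__":
    for seconds in (20, 60, 300):
        mib = quadratic_memory_bytes(seconds) / 2**20
        print(f"{seconds:>4} s of audio -> about {mib:,.0f} MiB")
```

Whatever the real constants are, the shape of the curve is the same: a file three times longer needs nine times the memory.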

There are tricks to tame long audio files. For example, you can do the alignment in two steps. First, drop frames from the acoustic features and do a rough alignment. Then, using this initial alignment, cut the file into smaller pieces and do a full-scale alignment on each of them.
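The two-step idea above could be sketched roughly as follows. This is not this project's API: `drop_frames`, `plan_chunks`, the decimation factor, the greedy grouping, and the padding margin are all hypothetical choices made for illustration, and the rough alignment itself is assumed to come from whatever aligner you run on the decimated features.

```python
def drop_frames(features, keep_every):
    """Step 1a: keep every `keep_every`-th frame (works on a list or an array)."""
    return features[::keep_every]


def plan_chunks(rough_boundaries, keep_every, total_frames, max_frames, margin=0):
    """Step 1b: turn rough boundaries into full-rate (start, end) frame ranges.

    rough_boundaries are frame indices in the decimated sequence, as produced
    by a rough alignment run on the output of drop_frames (assumed input).
    """
    # Map decimated indices back to the full frame rate; drop out-of-range cuts.
    cuts = sorted({int(b) * keep_every for b in rough_boundaries})
    cuts = [c for c in cuts if 0 < c < total_frames]

    chunks, start, last = [], 0, 0
    for cut in cuts + [total_frames]:
        # Close the current chunk at the last cut that still fit the budget.
        if cut - start > max_frames and last > start:
            chunks.append((start, last))
            start = last
        last = cut
    chunks.append((start, total_frames))

    # Pad each chunk to absorb errors in the rough alignment.
    return [(max(0, s - margin), min(total_frames, e + margin)) for s, e in chunks]


if __name__ == "__main__":
    features = [[0.0] * 13 for _ in range(1000)]   # placeholder acoustic features
    coarse = drop_frames(features, keep_every=4)   # input for the rough alignment
    rough = [50, 100, 150, 200]                    # made-up rough boundaries
    for s, e in plan_chunks(rough, 4, len(features), max_frames=450, margin=20):
        piece = features[s:e]                      # Step 2: align each piece fully
        print(s, e, len(piece))
```

A chunk can still exceed `max_frames` if the rough alignment offers no boundary inside it; the sketch leaves such a chunk whole rather than cutting at an arbitrary frame.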

ghost (Author) commented Jan 11, 2022

I see. I should have clarified that I was asking about training and not just alignment.
The audio files were far too long, several minutes each, and although I wasn't running out of memory, the training wasn't progressing. After I roughly segmented the audio with another tool, training worked as expected.

Thank you very much for the advice, especially the suggested tricks.

ghost closed this as completed Jan 11, 2022