voxceleb dataset #5

Closed
yy835055664 opened this issue May 11, 2022 · 1 comment
@yy835055664
Hi, thank you so much for sharing this code.
I have recently been reproducing and studying your project. Both datasets in the project come from torchaudio's built-in datasets. Over the past few days I have tried to use custom data instead, but I ran into many problems and could not get it to work. Could you give me some guidance on using non-torchaudio data, such as the VoxCeleb dataset? Thank you very much!

@Wadaboa
Owner

Wadaboa commented May 15, 2022

Hi there,

as explained in issue #3, you just need to subclass SpeakerDataset; the data you use doesn't need to come from torchaudio. For VoxCeleb, you could download the dataset locally, work out how the data is organized (in terms of folder structure), and implement the abstract methods required by SpeakerDataset.
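
As a rough illustration of the "understand the folder structure" step, here is a minimal sketch that indexes a locally downloaded copy. It assumes the layout `<root>/<speaker_id>/<video_id>/<utterance>.wav`; check this against your own download. The function name `index_voxceleb` is hypothetical and not part of this project, and the abstract methods of SpeakerDataset are not listed in this thread, so the sketch only builds the list of samples that a subclass could then serve.

```python
from pathlib import Path


def index_voxceleb(root):
    """Index a VoxCeleb-style directory tree (hypothetical helper).

    Assumed layout: <root>/<speaker_id>/<video_id>/<utterance>.wav
    Returns a list of (wav_path, integer_label) pairs and the
    speaker_id -> integer_label mapping.
    """
    root = Path(root)

    # Each first-level directory is treated as one speaker.
    speakers = sorted(p.name for p in root.iterdir() if p.is_dir())
    speaker_to_label = {spk: i for i, spk in enumerate(speakers)}

    samples = []
    for spk in speakers:
        # Utterances sit one level below the speaker, inside a video folder.
        for wav in sorted((root / spk).glob("*/*.wav")):
            samples.append((wav, speaker_to_label[spk]))

    return samples, speaker_to_label
```

A SpeakerDataset subclass could call such a helper once in its constructor and load each file on demand (for example with `torchaudio.load(path)`) inside whatever methods the base class requires.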

Otherwise, you could wrap the dataset using some other library such as this one.

If you need more guidance, please be more specific with the "failures" you mentioned.

Thanks,
Alessio.
