Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add VoxConverse recipe #1142

Merged
merged 3 commits into from
Sep 14, 2023
Merged

Conversation

flyingleafe
Copy link
Contributor

From here:

"VoxConverse is an audio-visual diarisation dataset consisting of multispeaker clips of human speech, extracted from YouTube videos.
Updates and additional information about the dataset can be found at our website (https://www.robots.ox.ac.uk/~vgg/data/voxconverse/index.html)."

Note: The default dev/test split is quite weird - namely, the test set is larger than the dev set... Hence, there is an option to use "dev" set as "train", and split "test" set in half into "dev" and "test", which can be disabled in the recipe.

@flyingleafe
Copy link
Contributor Author

@pzelasko ^ quite weird that a random test for a particular python version, totally irrelevant to the changes in the PR, fails

@pzelasko
Copy link
Collaborator

@pzelasko ^ quite weird that a random test for a particular python version, totally irrelevant to the changes in the PR, fails

Some tests depend on RNG and once in a while they get flaky. It's happening a bit too often though, so I'm trying to fix that in #1143

def prepare_voxconverse(
corpus_dir: Pathlike,
output_dir: Optional[Pathlike] = None,
split_test: bool = True, # test part is larger than dev part - split it into dev and test by default
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this a standard thing to do with this dataset? If not, it would be better to return the splits as defined by the creators by default.

@flyingleafe
Copy link
Contributor Author

@pzelasko yes, you're right, probably should not do the resplit by default, changed that

@pzelasko pzelasko added this to the v1.17 milestone Sep 14, 2023
Copy link
Collaborator

@pzelasko pzelasko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

@pzelasko pzelasko merged commit 1389de4 into lhotse-speech:master Sep 14, 2023
10 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants