updating the voxpopuli recipe #1243

Merged

Conversation

KarelVesely84
Contributor

  • allow using pre-downloaded transcripts (so data preparation can be run without Internet access)
    • transcripts .tgz is downloaded into /manifests, and not into tmp
  • search the audio data in folder: corpus_dir / "raw_audios" / lang

- allow using pre-downloaded transcripts (so data preparation can be
  run without Internet access)
- search the audio data in folder: `corpus_dir / "raw_audios" / lang`
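
For illustration, the two behaviors described above could be sketched roughly as follows. This is a minimal sketch and not the code from this PR: the function names, the `archive_name` parameter, and the `download` callback are hypothetical, and only the `corpus_dir / "raw_audios" / lang` layout and the idea of reusing a transcripts archive from the manifests directory come from the description.

```python
from pathlib import Path
from typing import Callable


def find_audio_dir(corpus_dir: Path, lang: str) -> Path:
    """Return the folder searched for audio: corpus_dir / "raw_audios" / lang."""
    return Path(corpus_dir) / "raw_audios" / lang


def get_transcripts_archive(
    manifests_dir: Path,
    archive_name: str,  # hypothetical: name of the transcripts .tgz
    download: Callable[[Path], None],  # hypothetical: fetches the archive to the given path
) -> Path:
    """Reuse a pre-downloaded archive if present; otherwise download it."""
    target = Path(manifests_dir) / archive_name
    if not target.is_file():
        # Only this branch needs Internet access.
        target.parent.mkdir(parents=True, exist_ok=True)
        download(target)
    return target
```

With this shape, copying the archive into the manifests directory ahead of time means the `download` callback is never invoked, so the preparation step can run offline.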
Collaborator

@pzelasko pzelasko left a comment


Thanks, Karel!

@pzelasko pzelasko merged commit d1ae9c0 into lhotse-speech:master Dec 24, 2023
11 checks passed
@pzelasko pzelasko added this to the 1.19 milestone Dec 24, 2023

2 participants