-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add forced alignment #14
Comments
btw, stan pointed to voicecraft which seems to have a pipeline that can do transcription and alignment - may want to check differences between the current model and that one. also does text to speech, and perhaps we can avoid the older dependencies that TTS brings to the b2aiprep package. |
voicecraft is cool, I have attended one talks from them recently. But from what I know, they don't have a package on pypi yet |
but they have models on huggingface that we could use right? in fact, i played with their spaces, which means all the code for that is also on huggingface. so technically we should be able to create that pipeline. |
Do you want to include their source code in our repo? |
for things that do not have releases but have git repos and we plan to use their code directly, we can include the repo as a git submodule in an externals directory under source. however, if it's a matter of copying a script or a workflow with huggingface models, we should just create the workflow ourselves. it depends on the complexity implemented. |
Description
Create a forced alignment task in audio folder
Tasks
Freeform Notes
No response
The text was updated successfully, but these errors were encountered: