Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add forced alignment #14

Open
2 tasks
ibevers opened this issue May 9, 2024 · 5 comments · May be fixed by #69
Open
2 tasks

Add forced alignment #14

ibevers opened this issue May 9, 2024 · 5 comments · May be fixed by #69
Assignees
Labels
enhancement New feature or request
Milestone

Comments

@ibevers
Copy link
Collaborator

ibevers commented May 9, 2024

Description

Create a forced alignment task in audio folder

Tasks

  • create forced alignment task
  • create pydra version of alignment task

Freeform Notes

No response

@ibevers ibevers self-assigned this May 9, 2024
@ibevers ibevers changed the title Task: add forced alignment Add forced alignment May 14, 2024
@satra
Copy link
Collaborator

satra commented May 24, 2024

btw, stan pointed to voicecraft which seems to have a pipeline that can do transcription and alignment - may want to check differences between the current model and that one. also does text to speech, and perhaps we can avoid the older dependencies that TTS brings to the b2aiprep package.

@fabiocat93
Copy link
Collaborator

fabiocat93 commented May 24, 2024

btw, stan pointed to voicecraft which seems to have a pipeline that can do transcription and alignment - may want to check differences between the current model and that one. also does text to speech, and perhaps we can avoid the older dependencies that TTS brings to the b2aiprep package.

voicecraft is cool, I have attended one talks from them recently. But from what I know, they don't have a package on pypi yet

@satra
Copy link
Collaborator

satra commented May 24, 2024

but they have models on huggingface that we could use right? in fact, i played with their spaces, which means all the code for that is also on huggingface. so technically we should be able to create that pipeline.

@fabiocat93
Copy link
Collaborator

but they have models on huggingface that we could use right? in fact, i played with their spaces, which means all the code for that is also on huggingface. so technically we should be able to create that pipeline.

Do you want to include their source code in our repo?

@satra
Copy link
Collaborator

satra commented May 24, 2024

for things that do not have releases but have git repos and we plan to use their code directly, we can include the repo as a git submodule in an externals directory under source.

however, if it's a matter of copying a script or a workflow with huggingface models, we should just create the workflow ourselves. it depends on the complexity implemented.

@fabiocat93 fabiocat93 added the enhancement New feature or request label May 28, 2024
@fabiocat93 fabiocat93 added this to the Release 0.1.4 milestone May 31, 2024
@ibevers ibevers linked a pull request Jun 17, 2024 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
Status: Todo
Development

Successfully merging a pull request may close this issue.

3 participants