Unofficial implementation of Voicebox

Description

Unofficial implementation of Voicebox.

Core codes from lucidrains/voicebox-pytorch. I do not use voicebox-pytorch pypi release, instead put it in this repository just for convenient.

I did not trained duration model. It's on the TODO list.

Demo

see demo

LJSpeech:

Original Text: Field agents supplement those on the detail, particularly when the President is traveling.

Edited Text: Field agents supplement those on the detail, particularly when the Prime Minister is traveling.

AIShell3:

Original Text: 夺得队史第五座中超冠军

Edited Text: 夺得队史第五座英超冠军

Note: aishell3's edited.wav is not good enough, because vocoder i used need more steps to converge.

Checkpoint

see LJSpeech

How to run

First, install dependencies

# clone project   
git clone https://github.com/chenht2010/Voicebox.git

# install dependeces
pip install lightning[extra] torch torchaudio tgt vocos torchdiffeq torchode einops beartype naturalspeech2-pytorch audiolm-pytorch

Next, navigate to examples, check README and run it.

TODO

[] try other universal vocoder
[] try other alignment tools
[] train duration model

Citation

@article{YourName,
  title={Your Title},
  author={Your team},
  journal={Location},
  year={Year}
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.github/workflows		.github/workflows
demo		demo
examples		examples
tests		tests
voicebox		voicebox
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Unofficial implementation of Voicebox

Description

Demo

Checkpoint

How to run

TODO

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Unofficial implementation of Voicebox

Description

Demo

Checkpoint

How to run

TODO

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages