An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
-
Updated
Feb 11, 2024 - Python
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
ChatGPT, GenerativeAI and LLMs Timeline
OpenMusic: SOTA Text-to-music (TTM) Generation
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
An unofficial PyTorch implementation of VALL-E
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
Applying deep learning to translate animation and re-generate audio.
Add a description, image, and links to the vall-e topic page so that developers can more easily learn about it.
To associate your repository with the vall-e topic, visit your repo's landing page and select "manage topics."