Colab notebooks for text-to-audio generators

User-friendly Colab notebooks for various synthetic audio generators steered by text prompts.

Available notebooks:

AudioLDM – text-to-audio
TorToiSe TTS – text-to-speech w/ voice-cloning
MubertAI Text-to-Music – text-to-music
TTS Voice Cloning – text-to-speech w/ voice-cloning

AudioLDM: Text-to-Audio Generation with Latent Diffusion Models

Paper: Text-to-Audio Generation with Latent Diffusion Models

Colab for AudioLDM. Generates audio from a text description. This is probably the beginning of the "Stable Diffusion of audio". Currently capable of producing 16 kHz audio only.
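
To make the 16 kHz limit concrete, here is a minimal sketch using only the Python standard library. It does not call AudioLDM; the helper names (`nyquist_limit`, `num_samples`, `write_mono_wav`) are made up for illustration, and the 16-bit mono PCM layout is an assumption rather than something the notebook guarantees.

```python
import struct
import wave

SAMPLE_RATE = 16_000  # Hz, the output rate mentioned above


def nyquist_limit(sample_rate: int) -> float:
    """Highest frequency that audio at this sample rate can represent."""
    return sample_rate / 2


def num_samples(duration_s: float, sample_rate: int = SAMPLE_RATE) -> int:
    """Number of samples in a mono clip of the given duration."""
    return int(round(duration_s * sample_rate))


def write_mono_wav(path, samples, sample_rate: int = SAMPLE_RATE) -> None:
    """Write floats in [-1, 1] as a 16-bit PCM mono WAV file (assumed layout)."""
    frames = b"".join(
        # Clamp each sample, then scale it to the signed 16-bit range.
        struct.pack("<h", int(max(-1.0, min(1.0, s)) * 32767))
        for s in samples
    )
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)        # mono
        wav.setsampwidth(2)        # 2 bytes = 16 bits per sample
        wav.setframerate(sample_rate)
        wav.writeframes(frames)
```

At 16 kHz, a 10-second clip holds 160,000 samples and nothing above 8 kHz can be represented, which is why the output may sound duller than 44.1 kHz material.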

TorToiSe: Text-to-speech

Paper: TorToiSe - Spending Compute for High Quality TTS

Colab for TorToiSe text-to-speech voice-cloning. This notebook takes a text string and an audio file (or files) of a speaker's voice, and attempts to synthesize the text using the given voice. Currently works with English text only.
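
The inputs described above (one text string plus one or more reference clips) can be sketched as a small validation step. This is only an illustration: `select_voice_clips`, `build_request`, the accepted extensions, and the returned dict shape are all hypothetical and are not part of TorToiSe or the notebook.

```python
from pathlib import PurePath

# Assumed set of accepted extensions; the notebook may accept others.
AUDIO_EXTENSIONS = {".wav", ".mp3", ".flac"}


def select_voice_clips(filenames):
    """Return only the audio files from `filenames`, sorted for reproducibility."""
    return sorted(
        name for name in filenames
        if PurePath(name).suffix.lower() in AUDIO_EXTENSIONS
    )


def build_request(text, filenames):
    """Validate the inputs and bundle them into a plain dict (hypothetical shape)."""
    text = text.strip()
    if not text:
        raise ValueError("text must not be empty")
    clips = select_voice_clips(filenames)
    if not clips:
        raise ValueError("at least one reference audio clip is required")
    return {"text": text, "voice_clips": clips}
```

For example, `build_request("Hello there", ["speaker_1.wav", "notes.txt"])` keeps only the `.wav` file as a voice clip and raises `ValueError` if no audio file is given.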

MubertAI Text-to-Music

UPDATE: the Mubert API now appears to require a (paid) API key.

Colab for MubertAI Text-to-Music. Generates music from a text description by combining predefined blocks that are, as far as I know, created by the community. See the source repository for details such as licensing.

TTS Voice Cloning

Paper: Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis

Colab for Real-Time-Voice-Cloning text-to-speech voice-cloning. This notebook takes a text string and an audio file of a speaker's voice, and attempts to synthesize the text using the given voice. Fair warning: results are not great.

olaviinha/NeuralTextToAudio

Folders and files

Latest commit

History

Repository files navigation

Colab notebooks for text-to-audio generators

AudioLDM: Text-to-Audio Generation with Latent Diffusion Models

TorToiSe: Text-to-speech

MubertAI Text-to-Music

TTS Voice Cloning

About

Topics

Resources

Stars

Watchers

Forks

Languages