speech-synthesis

Here are 441 public repositories matching this topic...

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation large-language-models speaker-diariazation generative-ai

Updated Jun 11, 2024
Python

leon-ai / leon

Star

🧠 Leon is your open-source personal assistant.

Updated Jun 11, 2024
Python

X-LANCE / UniCATS-CTX-vec2wav

Star

[AAAI 2024] Code for CTX-vec2wav in UniCATS

speech-synthesis vocoder semantic-token unicats vocoding self-supervised-speech

Updated Jun 11, 2024
Python

DigitalPhonetics / IMS-Toucan

Star

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

text-to-speech deep-learning toolkit speech pytorch tts speech-synthesis speech-processing

Updated Jun 11, 2024
Python

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated Jun 11, 2024
Python

Camb-ai / MARS5-TTS

Star

MARS5 speech model (TTS) from CAMB.AI

text-to-speech speech speech-synthesis prosody voice-cloning voice-cloneai

Updated Jun 11, 2024
Python

espnet / espnet

Star

End-to-End Speech Processing Toolkit

deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Jun 11, 2024
Python

ictnlp / StreamSpeech

Star

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Updated Jun 11, 2024
Python

cronelab / delayed-speech-synthesis

Star

Source code for the paper "Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS" by Angrick et al.

speech-synthesis bci

Updated Jun 10, 2024
Python

yoyololicon / torchlpc

Sponsor

Star

speech-synthesis linear-predictive-coding time-varying-systems ddsp time-varying-filter

Updated Jun 10, 2024
Python

voicepaw / so-vits-svc-fork

Star

so-vits-svc fork with realtime support, improved interface and more features.

lightning deep-learning realtime pytorch speech-synthesis gan hacktoberfest voice-conversion voice-changer pytorch-lightning hubert vits sovits so-vits-svc softvc contentvec

Updated Jun 10, 2024
Python

ssb22 / gradint

Star

Graduated Interval Recall program

language-learning spaced-repetition speech-synthesis multiplatform riscos cantonese-language mandarin-chinese visual-impairment-aid

Updated Jun 9, 2024
Python

yl4579 / StyleTTS2

Star

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

text-to-speech deep-learning pytorch tts speech-synthesis gan speaker-adaptation adversarial-training diffusion-models wavlm latent-diffusion latent-diffusion-models

Updated Jun 9, 2024
Python

metavoiceio / metavoice-src

Star

Foundational model for human-like, expressive TTS

text-to-speech ai deep-learning speech pytorch tts speech-synthesis voice-clone zero-shot-tts

Updated Jun 8, 2024
Python

NaomiProject / Naomi

Sponsor

Star

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

linux home-automation raspberry-pi iot text-to-speech voice speech-synthesis assistant speech-recognition personal-assistant jasper speech-to-text jarvis hacktoberfest naomi vocal-assistant

Updated Jun 7, 2024
Python

tensorflow / lingvo

Star

Lingvo

nlp research translation tensorflow machine-translation speech distributed tts speech-synthesis mnist speech-recognition lm seq2seq speech-to-text gpu-computing language-model asr

Updated Jun 6, 2024
Python

KoljaB / RealtimeTTS

Star

Converts text to speech in realtime

python text-to-speech realtime speech-synthesis

Updated Jun 6, 2024
Python

csun22 / Synthetic-Voice-Detection-Vocoder-Artifacts

Star

This repository is related to our Dataset and Detection code from the paper: AI-Synthesized Voice Detection Using Neural Vocoder Artifacts accepted in CVPR Workshop on Media Forensic 2023.

speech-synthesis neural-vocoder deepfake-detection audio-deepfake-detection

Updated Jun 4, 2024
Python

MahtaFetrat / Persian-MultiSpeaker-Tacotron2

Star

Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.

text-to-speech tts speech-synthesis persian tacotron2 multi-speaker-tts persian-tts mana-tts

Updated Jun 3, 2024
Python

stefantaubert / pinyin-to-ipa

Star

Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.

linguistics tts speech-synthesis pinyin speech-recognition cyrillic chinese phonetics transcription bopomofo zhuyin international-phonetic-alphabet

Updated Jun 3, 2024
Python

Improve this page

Add a description, image, and links to the speech-synthesis topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-synthesis topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-synthesis

Here are 441 public repositories matching this topic...

NVIDIA / NeMo

leon-ai / leon

X-LANCE / UniCATS-CTX-vec2wav

DigitalPhonetics / IMS-Toucan

PaddlePaddle / PaddleSpeech

Camb-ai / MARS5-TTS

espnet / espnet

ictnlp / StreamSpeech

cronelab / delayed-speech-synthesis

yoyololicon / torchlpc

voicepaw / so-vits-svc-fork

ssb22 / gradint

yl4579 / StyleTTS2

metavoiceio / metavoice-src

NaomiProject / Naomi

tensorflow / lingvo

KoljaB / RealtimeTTS

csun22 / Synthetic-Voice-Detection-Vocoder-Artifacts

MahtaFetrat / Persian-MultiSpeaker-Tacotron2

stefantaubert / pinyin-to-ipa

Improve this page

Add this topic to your repo