#

speech

Here are 593 public repositories matching this topic...

MockingBird

babysor / MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

text-to-speech ai deep-learning speech pytorch tts

Updated Jul 6, 2024
Python

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

python text-to-speech deep-learning speech pytorch tts speech-synthesis voice-conversion vocoder voice-synthesis tacotron voice-cloning speaker-encodings melgan speaker-encoder multi-speaker-tts glow-tts hifigan tts-model

Updated Jul 8, 2024
Python

svc-develop-team / so-vits-svc

SoftVC VITS Singing Voice Conversion

flow ai deep-learning voice speech pytorch audio-analysis generative-adversarial-network variational-inference voice-conversion vc voice-changer vits singing-voice-conversion voiceconversion sovits so-vits-svc

Updated Nov 11, 2023
Python

datasets

huggingface / datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

nlp machine-learning natural-language-processing computer-vision deep-learning tensorflow numpy speech pandas pytorch datasets hacktoberfest

Updated Jul 10, 2024
Python

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recognition speech-to-text whisper asr

Updated Jul 10, 2024
Python

AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

audio music speech sound gpt talking-head

Updated Jul 6, 2024
Python

PaddlePaddle / models

Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

nlp natural-language-processing computer-vision deep-learning neural-network models cv speech recommendation paddlepaddle

Updated Sep 5, 2023
Python

netease-youdao / EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

python text-to-speech ai deep-learning style prompt speech emotion pytorch tts speech-synthesis multi-speaker emotivoice

Updated Jun 20, 2024
Python

modelscope

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

python nlp science machine-learning deep-learning cv speech multi-modal

Updated Jul 9, 2024
Python

metavoiceio / metavoice-src

Foundational model for human-like, expressive TTS

text-to-speech ai deep-learning speech pytorch tts speech-synthesis voice-clone zero-shot-tts

Updated Jul 1, 2024
Python

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands speech pytorch voice-recognition vad voice-control speech-processing voice-detection voice-activity-detection onnx onnxruntime onnx-runtime

Updated Jul 9, 2024
Python

tensorflow / lingvo

Lingvo

nlp research translation tensorflow machine-translation speech distributed tts speech-synthesis mnist speech-recognition lm seq2seq speech-to-text gpu-computing language-model asr

Updated Jul 9, 2024
Python

readbeyond / aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Updated Jun 22, 2024
Python

audio

pytorch / audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

audio python machine-learning speech pytorch io audio-processing

Updated Jul 9, 2024
Python

pytorch-kaldi

mravanelli / pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

deep-neural-networks deep-learning speech dnn pytorch recurrent-neural-networks lstm gru speech-recognition rnn kaldi rnn-model asr lstm-neural-networks multilayer-perceptron-network timit dnn-hmm

Updated Mar 14, 2022
Python

r9y9 / wavenet_vocoder

WaveNet vocoder

python speech pytorch speech-synthesis wavenet speech-processing wavenet-vocoder neural-vocoder

Updated Jul 29, 2023
Python

pndurette / gTTS

Python library and CLI tool to interface with Google Translate's text-to-speech API

python cli text-to-speech python-library pypi speech tts gtts speech-api

Updated Jun 17, 2024
Python

Camb-ai / MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

text-to-speech speech speech-synthesis prosody voice-cloning voice-cloneai

Updated Jul 5, 2024
Python

Rikorose / DeepFilterNet

Noise supression using deep filtering

audio rust deep-learning speech pytorch speech-enhancement noise-suppression

Updated Jul 9, 2024
Python

ahmetoner / whisper-asr-webservice

OpenAI Whisper ASR Webservice API

docker speech speech-recognition automatic-speech-recognition speech-to-text asr openai-whisper

Updated Jul 8, 2024
Python

Improve this page

Add a description, image, and links to the speech topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."