speech-processing

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

text-to-speech deep-learning toolkit speech pytorch tts speech-synthesis speech-processing

Updated May 14, 2024
Python

santi-pdp / pase

Star

Problem Agnostic Speech Encoder

deep-learning pytorch unsupervised-learning speech-processing multi-task-learning waveform-analysis self-supervised-learning

Updated Jul 6, 2023
Python

r9y9 / pysptk

Sponsor

Star

A python wrapper for Speech Signal Processing Toolkit (SPTK).

python dsp speech speech-synthesis python-wrapper digital-signal-processing speech-processing sptk

Updated Oct 17, 2023
Python

SuperKogito / spafe

Sponsor

Star

🔉 spafe: Simplified Python Audio Features Extraction

Updated May 13, 2024
Python

novoic / surfboard

Star

Novoic's audio feature extraction library

audio python machine-learning signal-processing healthcare feature-extraction speech-processing audio-processing alzheimers-disease parkinsons-disease

Updated Mar 4, 2022
Python

SforAiDl / Neural-Voice-Cloning-With-Few-Samples

Star

This repository has implementation for "Neural Voice Cloning With Few Samples"

deep-learning voice tts speech-processing voice-synthesis saidl speaker-adaptation voice-cloning speaker-encodings mel-spectogram

Updated Feb 23, 2021
Python

microsoft / UniSpeech

Star

UniSpeech - Large Scale Self-Supervised Learning for Speech

speech pytorch speech-recognition speaker-verification speech-processing speech-separation diarization speech-diarization

Updated Apr 5, 2024
Python

r9y9 / nnmnkwii

Sponsor

Star

Library to build speech synthesis systems designed for easy and fast prototyping.

python machine-learning text-to-speech speech-synthesis voice-conversion speech-processing

Updated Feb 3, 2023
Python

rishikksh20 / VocGAN

Sponsor

Star

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

text-to-speech speech-synthesis gan speech-processing vocoder melgan vocgan

Updated Oct 8, 2022
Python

Improve this page

Add a description, image, and links to the speech-processing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-processing topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-processing

Here are 236 public repositories matching this topic...

speechbrain / speechbrain

microsoft / torchscale

r9y9 / wavenet_vocoder

r9y9 / deepvoice3_pytorch

linto-ai / whisper-timestamped

mravanelli / SincNet

resemble-ai / resemble-enhance

haoheliu / voicefixer

drethage / speech-denoising-wavenet

breizhn / DTLN

Audio-WestlakeU / FullSubNet

DigitalPhonetics / IMS-Toucan

santi-pdp / pase

r9y9 / pysptk

SuperKogito / spafe

novoic / surfboard

SforAiDl / Neural-Voice-Cloning-With-Few-Samples

microsoft / UniSpeech

r9y9 / nnmnkwii

rishikksh20 / VocGAN

Improve this page

Add this topic to your repo