speech

Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

text-to-speech speech language-detection speech-synthesis speech-recognition speech-to-text source-separation language-identification forced-alignment speech-translation speech-alignment

Updated May 12, 2024
TypeScript

voidful / Codec-SUPERB

Sponsor

Star

Audio Codec Speech processing Universal PERformance Benchmark

audio speech codec audio-codec superb

Updated May 11, 2024
Python

interactiveaudiolab / ppgs

Star

High-Fidelity Neural Phonetic Posteriorgrams

distance speech pronunciation phonemes intelligibility posteriorgram

Updated May 13, 2024
Python

pytorch / audio

Star

Data manipulation and transformation for audio signal processing, powered by PyTorch

audio python machine-learning speech pytorch io audio-processing

Updated May 13, 2024
Python

sergiozc / signal-processing-scripts

Star

Some simulation macros related to signal processing

signal-processing speech image-processing dvb-t kalman-filter equalizer beamforming

Updated May 11, 2024
MATLAB

jarikomppa / soloud

Star

Free, easy, portable audio engine for games

Updated May 11, 2024
C

balisujohn / tortoise.cpp

Star

A ggml (C++) re-implementation of tortoise-tts. Under construction and seeking contributors.

text-to-speech text speech tts to tortoise-tts ggml

Updated May 11, 2024
C++

dusty-nv / NanoLLM

Star

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.

speech multimodal rag edge-ai vector-database vision-transformer llm-inference

Updated May 11, 2024
Python

Improve this page

Add a description, image, and links to the speech topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech

Here are 1,616 public repositories matching this topic...

huggingface / datasets

felixbur / nkululeko

LitoMore / mac-say

modelscope / modelscope

weirongxu / auditory-reader

mskian / pronounce-and-speech

Mohamad-Hussein / speech-assistant

mishra-ankit / modi-speeches

OvidijusParsiunas / deep-chat

IAHispano / Applio

jim60105 / docker-whisperX

SWHL / AI-Competition-Collections

echogarden-project / echogarden

voidful / Codec-SUPERB

interactiveaudiolab / ppgs

pytorch / audio

sergiozc / signal-processing-scripts

jarikomppa / soloud

balisujohn / tortoise.cpp

dusty-nv / NanoLLM

Improve this page

Add this topic to your repo