speech

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.

speech multimodal rag edge-ai vector-database vision-transformer llm-inference

Updated Jul 9, 2024
Python

snakers4 / silero-vad

Star

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands speech pytorch voice-recognition vad voice-control speech-processing voice-detection voice-activity-detection onnx onnxruntime onnx-runtime

Updated Jul 9, 2024
Python

DigitalPhonetics / IMS-Toucan

Star

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

text-to-speech deep-learning toolkit speech pytorch tts speech-synthesis speech-processing

Updated Jul 9, 2024
Python

AudioLLMs / AudioBench

Star

AudioBench: A Universal Benchmark for Audio Large Language Models

speech speech-recognition speech-question-answering audio-scene-understanding

Updated Jul 9, 2024
Python

modelscope / modelscope

Star

ModelScope: bring the notion of Model-as-a-Service to life.

python nlp science machine-learning deep-learning cv speech multi-modal

Updated Jul 9, 2024
Python

SWHL / AI-Competition-Collections

Star

AI比赛经验帖子 & 训练和测试技巧帖子集锦（收集整理各种人工智能比赛经验帖）

nlp competition cv speech knowledge-graph data-discovery recommender-system graph-neural-networks

Updated Jul 9, 2024
Python

coqui-ai / TTS

Star

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

python text-to-speech deep-learning speech pytorch tts speech-synthesis voice-conversion vocoder voice-synthesis tacotron voice-cloning speaker-encodings melgan speaker-encoder multi-speaker-tts glow-tts hifigan tts-model

Updated Jul 8, 2024
Python

omine-me / LaughterSegmentation

Star

Latest laughter detection & segmentaion model.

speech laugh-detection sound-synthesis laughter sound-event-detection laughter-detection laughter-segmentaion

Updated Jul 8, 2024
Python

ahmetoner / whisper-asr-webservice

Sponsor

Star

OpenAI Whisper ASR Webservice API

docker speech speech-recognition automatic-speech-recognition speech-to-text asr openai-whisper

Updated Jul 8, 2024
Python

zerospeech / benchmarks

Star

A command line tool that helps use the "Zero Ressource Challenge" benchmarks

machine-learning speech challenges research-tool zerospeech benchmarking-speech

Updated Jul 8, 2024
Python

CommanderCRM / AudioApp

Star

Серверная часть приложения для оценки качества речи

python server backend speech fastapi sqlmodel

Updated Jul 7, 2024
Python

AIGC-Audio / AudioGPT

Star

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

audio music speech sound gpt talking-head

Updated Jul 6, 2024
Python

Improve this page

Add a description, image, and links to the speech topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech

Here are 593 public repositories matching this topic...

egorsmkv / speech-recognition-uk

nafiuny / ICRCycleGAN-VC

huggingface / datasets

m-bain / whisperX

IAHispano / Applio

maxrmorrison / promonet

pytorch / audio

EveryVoiceTTS / EveryVoice

dusty-nv / NanoLLM

snakers4 / silero-vad

DigitalPhonetics / IMS-Toucan

AudioLLMs / AudioBench

modelscope / modelscope

SWHL / AI-Competition-Collections

coqui-ai / TTS

omine-me / LaughterSegmentation

ahmetoner / whisper-asr-webservice

zerospeech / benchmarks

CommanderCRM / AudioApp

AIGC-Audio / AudioGPT

Improve this page

Add this topic to your repo