Speech Recognition for Ukrainian
Updated Jul 10, 2024 - Python
ICRCycleGAN-VC: non-parallel voice conversion based on CycleGAN and the Inception-ResNet module, by Afiuny
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
VITS-based Voice Conversion focused on simplicity, quality and performance.
Prosody and Pronunciation Modification Network
Data manipulation and transformation for audio signal processing, powered by PyTorch
The EveryVoice TTS Toolkit - Text To Speech for your language
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
AudioBench: A Universal Benchmark for Audio Large Language Models
ModelScope: bring the notion of Model-as-a-Service to life.
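A collection of AI competition experience posts and training and testing tips posts (gathering and organizing experience write-ups from various AI competitions)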
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Latest laughter detection & segmentation model.
OpenAI Whisper ASR Webservice API
A command-line tool that helps you use the "Zero Resource Challenge" benchmarks