🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
-
Updated
May 13, 2024 - Python
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Machine learning speaker characteristics
The macOS built-in `say` CLI for JavaScript
ModelScope: bring the notion of Model-as-a-Service to life.
Pronounce and Speech Text - Enter Word and Get the Pronunciation and Speech Text.
Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.
VITS-based Voice Conversion focused on simplicity, quality and performance.
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
AI比赛经验帖子 & 训练和测试技巧帖子 集锦(收集整理各种人工智能比赛经验帖)
Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
Audio Codec Speech processing Universal PERformance Benchmark
High-Fidelity Neural Phonetic Posteriorgrams
Data manipulation and transformation for audio signal processing, powered by PyTorch
Some simulation macros related to signal processing
Free, easy, portable audio engine for games
A ggml (C++) re-implementation of tortoise-tts. Under construction and seeking contributors.
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
Add a description, image, and links to the speech topic page so that developers can more easily learn about it.
To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."