- Jeju, South Korea
- 21:10 (UTC +09:00)
- https://www.linkedin.com/in/sanghwakim/
Stars
Sound
9 repositories
Robust Speech Recognition via Large-Scale Weak Supervision
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
A TTS model capable of generating ultra-realistic dialogue in one pass.
Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages
Functional programming language for signal processing and sound synthesis
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
Free and open-source alternative to Wispr Flow, Superwhisper, Monologue, etc.