一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.

Python 7,520 907 Updated Dec 5, 2025

BytedanceSpeech / seed-tts-eval

Python 1,531 142 Updated Jun 14, 2024

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 8,235 730 Updated Feb 24, 2026

open-mmlab / FoleyCrafter

[IJCV] FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师，给你的无声视频添加生动而且同步的音效 😝

Python 642 65 Updated Jul 26, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 19,698 2,225 Updated Feb 11, 2026

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 7,551 704 Updated Dec 30, 2025

ariesssxu / vta-ldm

Python 62 5 Updated Jun 15, 2025

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 2,058 161 Updated Apr 21, 2025

NVIDIA / BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,187 142 Updated Sep 5, 2024

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python 34,549 4,907 Updated Nov 24, 2024

kwatcharasupat / source-separation-landing

Landing Page for All Things Source Separation

36 1 Updated Sep 12, 2025

microsoft / muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,905 494 Updated Oct 12, 2024

jishengpeng / WavTokenizer

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,270 110 Updated Mar 2, 2025

FireRedTeam / FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Python 902 83 Updated Sep 28, 2025

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 3,125 223 Updated May 19, 2025

JusperLee / Apollo

Music repair method to convert lossy MP3 compressed music to lossless music.

Python 357 34 Updated Aug 12, 2025

haidog-yaqub / EzAudio

High-quality Text-to-Audio Generation with Efficient Diffusion Transformer

Python 328 25 Updated Dec 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

叶夜靥 ZZfive

Achievements

Achievements

Block or report ZZfive

audio

espnet / espnet

m-bain / whisperX

coqui-ai / TTS

RVC-Boss / GPT-SoVITS

collabora / WhisperFusion

myshell-ai / MeloTTS

jasonppy / VoiceCraft

myshell-ai / OpenVoice

SunoAI-API / Suno-API

microsoft / SpeechT5

fishaudio / fish-speech

X-LANCE / AniTalker

2noise / ChatTTS

jianchang512 / ChatTTS-ui