TTS API

支持三种模型的统一 TTS 服务接口，通过 config.yaml 切换模型，对外 API 保持一致。

支持的模型

active 值	模型	特点
`custom_voice`	Qwen3-TTS CustomVoice	预设音色（9个），甜美女声等
`base`	Qwen3-TTS Base	Voice Clone，需先注册自定义音色
`kokoro`	Kokoro-82M	轻量级，中英双语，18个音色

快速开始

1. 配置

cp config.yaml config.local.yaml
# 修改 config.local.yaml 中的 model_path 等配置

2. 启动

docker compose up -d

3. 验证

curl http://localhost:8420/tts/health

API 接口

所有接口的 base URL 默认为 http://localhost:8420

健康检查

GET /tts/health

返回：{"status": "ok", "model": "custom_voice"}

提交任务

POST /tts/submit
Content-Type: application/json

{
  "text": "你好世界",
  "language": "Chinese",
  "speaker": "Dylan",
  "instruct": "愉快，轻松",
  "temperature": 0.3,
  "top_k": 20,
  "top_p": 0.85,
  "repetition_penalty": 1.1
}

返回：{"task_id": "20260509_abc123", "position": 1} (HTTP 202)

查询状态

GET /tts/status/<task_id>

返回：{"task_id": "...", "status": "success", "download_url": "/tts/download/..."}

下载音频

GET /tts/download/<task_id>

返回：WAV 音频文件

音色列表

GET /tts/speakers

返回：{"speakers": [{"name": "Dylan", "description": "..."}]}

队列状态

GET /tts/queue

返回：{"queue_size": 0, "tasks_tracked": 5}

Voice Clone（Base 模型）

当 active: base 时，需先注册音色：

POST /tts/clones
Content-Type: multipart/form-data

name: my_voice
instruct: 参考音频对应的文字
audio: <wav文件>

已注册音色列表：

GET /tts/clones

采样参数说明

参数	默认值	范围	说明
`temperature`	0.3	0.1~1.0	越低越稳定
`do_sample`	true	true/false	true=采样，false=贪心解码
`top_k`	20	10~100	保留 top-k token，越小越集中
`top_p`	0.85	0.5~1.0	核采样阈值
`repetition_penalty`	1.1	1.0~1.5	>1.0 抑制重复

不传任何采样参数时，服务端使用上述保守默认值。

模型切换

修改 config.yaml 中的 model.active：

model:
  active: custom_voice  # 改为 base 或 kokoro

重启服务即可：

docker compose restart

目录结构

tts-api/
├── server.py              # 主入口，动态加载 handler
├── config.yaml            # 配置文件
├── tts_client.py          # Python 客户端 SDK
├── handlers/
│   ├── handler_custom_voice.py  # CustomVoice 模型
│   ├── handler_base.py          # Base 模型（Voice Clone）
│   └── handler_kokoro.py        # Kokoro-82M
├── tests/
│   ├── test_api.py        # API 接口测试（unittest）
│   ├── test_tts.py        # 广播剧生成脚本
│   └── test_download_base.py    # Base 模型下载验证
├── Dockerfile
├── docker-compose.yaml
└── requirements.txt

测试

# 启动服务后运行
cd tests
python -m pytest test_api.py -v

# 或直接运行
python test_api.py

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
docs		docs
handlers		handlers
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
docker-compose.yaml		docker-compose.yaml
requirements.txt		requirements.txt
server.py		server.py
tts_client.py		tts_client.py
部署文档.md		部署文档.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TTS API

支持的模型

快速开始

1. 配置

2. 启动

3. 验证

API 接口

健康检查

提交任务

查询状态

下载音频

音色列表

队列状态

Voice Clone（Base 模型）

采样参数说明

模型切换

目录结构

测试

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TTS API

支持的模型

快速开始

1. 配置

2. 启动

3. 验证

API 接口

健康检查

提交任务

查询状态

下载音频

音色列表

队列状态

Voice Clone（Base 模型）

采样参数说明

模型切换

目录结构

测试

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages