feat(tts): Add support for the Alibaba Cloud TTS providers CosyVoice TTS (API) and Qwen TTS Realtime (API), and add filtering of TTS text content by yuxwd · Pull Request #7651 · AstrBotDevs/AstrBot

yuxwd · 2026-04-18T15:04:32Z

Modifications / 改动点

TTS provider additions

The project's built-in support for Alibaba Cloud TTS providers is incomplete. This change adds support for CosyVoice TTS (API) and Qwen TTS Realtime (API).

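Filtering content from TTS text

Improves the bot's TTS output by adding a filter for TTS text, so that TTS does not read content inside （）. Regex-based filtering is also supported.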

Code testing

Tested on macOS and Linux (Alibaba Cloud Linux 3.2104 LTS 64-bit) with no issues.

This is NOT a breaking change. / 这不是一个破坏性变更。

Screenshots or Test Results / 运行截图或测试结果

Checklist / 检查清单

[✅ ] 😊 If there are new features added in the PR, I have discussed it with the authors through issues/emails, etc.
/ 如果 PR 中有新加入的功能，已经通过 Issue / 邮件等方式和作者讨论过。
[✅ ] 👀 My changes have been well-tested, and "Verification Steps" and "Screenshots" have been provided above.
/ 我的更改经过了良好的测试，并已在上方提供了“验证步骤”和“运行截图”。
[ ✅ ] 🤓 I have ensured that no new dependencies are introduced, OR if new dependencies are introduced, they have been added to the appropriate locations in requirements.txt and pyproject.toml.
/ 我确保没有引入新依赖库，或者引入了新依赖库的同时将其添加到 requirements.txt 和 pyproject.toml 文件相应位置。
[ ✅ ] 😮 My changes do not introduce malicious code.
/ 我的更改没有引入恶意代码。

Add two new TTS providers using Alibaba Cloud DashScope SDK:
- Qwen TTS Realtime: WebSocket streaming TTS with low latency, supports qwen3-tts-flash-realtime and qwen3-tts-instruct-flash-realtime models
- CosyVoice TTS: Non-streaming TTS with multiple voice options, supports cosyvoice-v3.5/v3/v2 models

Includes config templates, provider manager integration, and i18n translations (zh-CN, en-US, ru-RU).

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

sourcery-ai

Sorry, we are unable to review this pull request

The GitHub API does not allow us to fetch diffs exceeding 300 files, and this pull request has 600

gemini-code-assist

Code Review

This pull request introduces a TTS text filtering mechanism to strip markers like brackets and asterisks from text before synthesis, and adds support for Qwen TTS Realtime and CosyVoice TTS providers. The review feedback points out several critical issues: a logic error in the Qwen streaming implementation that causes audio duplication, a blocking call in an asynchronous function that could impact responsiveness, and incorrect usage of the DashScope SDK in the CosyVoice provider. Additionally, the FilteredQueue implementation requires a call to the base class constructor to ensure all inherited methods function correctly.

gemini-code-assist · 2026-04-18T15:07:39Z

+                    if accumulated_text:
+                        await loop.run_in_executor(
+                            None,
+                            qwen_tts.append_text,
+                            accumulated_text,
+                        )


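In get_audio_stream, each text_part has already been sent to the API in real time inside the loop via qwen_tts.append_text(text_part). When text_part is None, sending accumulated_text again causes the entire text to be sent to the TTS engine a second time, so the generated audio contains duplicated content. Consider removing this logic.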
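A minimal sketch of the loop shape this comment points toward: each part is forwarded exactly once, and the `None` sentinel only closes the session. `FakeRealtimeTTS` and its `finish` method are hypothetical stand-ins used for illustration, not the DashScope client.

```python
import asyncio


class FakeRealtimeTTS:
    """Hypothetical stand-in for a realtime TTS client; records what it is sent."""

    def __init__(self) -> None:
        self.sent: list[str] = []
        self.finished = False

    def append_text(self, text: str) -> None:
        self.sent.append(text)

    def finish(self) -> None:
        self.finished = True


async def stream_text(queue: asyncio.Queue, tts: FakeRealtimeTTS) -> None:
    loop = asyncio.get_running_loop()
    while True:
        text_part = await queue.get()
        if text_part is None:
            # End of input: close the session without resending earlier text.
            await loop.run_in_executor(None, tts.finish)
            break
        # Each part is forwarded exactly once, as soon as it arrives.
        await loop.run_in_executor(None, tts.append_text, text_part)


async def demo() -> tuple[list[str], bool]:
    queue: asyncio.Queue = asyncio.Queue()
    for item in ("Hello, ", "world.", None):
        queue.put_nowait(item)
    tts = FakeRealtimeTTS()
    await stream_text(queue, tts)
    return tts.sent, tts.finished
```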

gemini-code-assist · 2026-04-18T15:07:39Z

+                callback.complete_event.set()
+
+        await loop.run_in_executor(None, _connect_and_send)
+        finished = callback.wait_for_finished(timeout=self.timeout)


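callback.wait_for_finished(timeout=self.timeout) is a blocking synchronous call (it uses threading.Event.wait internally). Calling it directly in an async function blocks the asyncio event loop and slows the bot's responses. Consider using loop.run_in_executor to run it in the thread pool.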

Suggested change

finished = callback.wait_for_finished(timeout=self.timeout)

finished = await loop.run_in_executor(None, callback.wait_for_finished, self.timeout)
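To illustrate why the executor matters, here is a small self-contained sketch. The `Callback` class below is a hypothetical stand-in built on `threading.Event`, mirroring the names in the diff. A ticker task keeps running while the wait is in progress, which would not happen if the wait ran directly on the event loop.

```python
import asyncio
import threading


class Callback:
    """Hypothetical stand-in for the provider's callback object."""

    def __init__(self) -> None:
        self.complete_event = threading.Event()

    def wait_for_finished(self, timeout: float) -> bool:
        # Blocking call: returns True if the event was set before the timeout.
        return self.complete_event.wait(timeout)


async def wait_without_blocking(callback: Callback, timeout: float) -> bool:
    loop = asyncio.get_running_loop()
    # Run the blocking wait in the default thread pool.
    return await loop.run_in_executor(None, callback.wait_for_finished, timeout)


async def demo() -> tuple[bool, int]:
    callback = Callback()
    ticks = 0

    async def ticker() -> None:
        nonlocal ticks
        while not callback.complete_event.is_set():
            ticks += 1
            await asyncio.sleep(0.01)

    ticker_task = asyncio.create_task(ticker())
    # Simulate the SDK thread signalling completion after 0.1 s.
    threading.Timer(0.1, callback.complete_event.set).start()
    finished = await wait_without_blocking(callback, 2.0)
    await ticker_task
    return finished, ticks
```

Here `finished` reports whether completion was signalled before the timeout, and `ticks` counts how often the other task ran during the wait.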

gemini-code-assist · 2026-04-18T15:07:39Z

+        audio_bytes = await loop.run_in_executor(
+            None,
+            synthesizer.call,
+            text,
+            self.timeout_ms,
+        )


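dashscope.audio.tts_v2.SpeechSynthesizer.call returns a SpeechSynthesisResult object; writing it to a file directly as audio bytes raises a TypeError. In addition, the signature of call generally does not accept timeout as a positional argument. Consider fixing the call and using result.get_audio_data() to obtain the audio bytes.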

        result = await loop.run_in_executor(
            None,
            synthesizer.call,
            text,
        )
        audio_bytes = result.get_audio_data()

gemini-code-assist · 2026-04-18T15:07:39Z

+        self,
+        real_queue: asyncio.Queue[T | None],
+        custom_rules: list[str] | None = None,
+    ) -> None:
+        self._real_queue = real_queue
+        self._custom_rules = custom_rules
+


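FilteredQueue inherits from asyncio.Queue but does not call super().__init__(). Although the class currently overrides the main methods using a proxy pattern, the uninitialized base class means that non-overridden methods such as get_nowait and put_nowait fail when called, because internal state (such as _getters and _putters) is missing. Consider calling super().__init__() explicitly so that the object's state is complete.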

    def __init__(
        self,
        real_queue: asyncio.Queue[T | None],
        custom_rules: list[str] | None = None,
    ) -> None:
        super().__init__(maxsize=real_queue.maxsize)
        self._real_queue = real_queue
        self._custom_rules = custom_rules

yuxwd · 2026-04-18T15:39:45Z

Server test results

yuxwd and others added 6 commits April 18, 2026 22:08

feat: add removal of content enclosed in symbols such as ** and （） from TTS requests

09c71b2

feat: add removal of content enclosed in symbols such as ** and （） from TTS requests

d76715d

feat: add removal of content enclosed in symbols such as ** and （） from TTS requests, with translations added to the three language files

c030c3b

fix: update frontend build files

630517e

fix: fix the text_queue variable in astrbot/core/astr_agent_run_util.py

1f6df28

auto-assign bot requested review from Raven95676 and Soulter April 18, 2026 15:04

sourcery-ai bot reviewed Apr 18, 2026


dosubot bot added the size:XL (this PR changes 500-999 lines, ignoring generated files) and area:provider labels Apr 18, 2026

gemini-code-assist bot reviewed Apr 18, 2026


yuxwd closed this Apr 18, 2026

fix: fix the inconsistent text_queue variable name in astr_agent_run_util.py, unifying it as tts_text_queue

4894177

yuxwd reopened this Apr 18, 2026

Soulter force-pushed the master branch 2 times, most recently from faf411f to 0068960 April 19, 2026 09:50

yuxwd wants to merge 7 commits into AstrBotDevs:master from yuxwd:master
