语音实时交互

1. 介绍

通过3个开源模型 + pyaduio模块实现语音实时交互“类豆包”功能。3个模型为：

Faster Whisper语音转文字模型。https://github.com/SYSTRAN/faster-whisper
Qween3:14B通义千问大模型。
ChatTTS文字转语音模型。https://github.com/2noise/ChatTTS

2.环境配置

1. 安装Faster Whisper模型所需要的权重文件

链接：https://huggingface.co/Systran/faster-whisper-large-v3/tree/main

2. 安装ChatTTS模型所需的权重文件

安装到当前项目目录即可

链接：https://huggingface.co/2Noise/ChatTTS/tree/main

3. 通过ollama本地部署Qween大模型

curl -fsSL https://ollama.com/install.sh | sh
ollama -v

# 拉取代码
ollama pull qwen2.5:14b

# 本地运行测试
ollama run qwen2.5:14b

3.安装配置环境

git clone https://github.com/Novbo/realtime_dialog.git
cd realtime_dialog

安装python

conda create -n realtime_dialog python=3.11
conda activate realtime_dialog
pip install -r requirements.txt

3. 运行

注意：运行之前请先修改配置文件信息config.py

python main.py

4. 代码配置文件

manager/config.py

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
ChatTTS		ChatTTS
docs		docs
examples		examples
manager		manager
tests		tests
tools		tools
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
main.py		main.py
openai_api.ipynb		openai_api.ipynb
requirements.txt		requirements.txt
setup.py		setup.py
speech_to_txt.py		speech_to_txt.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

语音实时交互

1. 介绍

2.环境配置

1. 安装Faster Whisper模型所需要的权重文件

2. 安装ChatTTS模型所需的权重文件

3. 通过ollama本地部署Qween大模型

3.安装配置环境

3. 运行

4. 代码配置文件

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

语音实时交互

1. 介绍

2.环境配置

1. 安装Faster Whisper模型所需要的权重文件

2. 安装ChatTTS模型所需的权重文件

3. 通过ollama本地部署Qween大模型

3.安装配置环境

3. 运行

4. 代码配置文件

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages