【开源推荐】SenseVoice — 一个模型同时做语音识别+情感检测+声音事件分类

## 项目介绍

[SenseVoice](https://github.com/FunAudioLLM/SenseVoice)（8.2K Stars）是阿里通义实验室的多任务语音理解模型。

**一个 234MB 模型同时完成：**
- 语音识别（ASR）
- 语种检测（中/英/日/韩/粤）
- 情感识别（开心/悲伤/愤怒/中性）
- 声音事件检测（笑声/掌声/音乐/哭泣）

## 速度

- GPU: **170 倍实时**（10 秒音频只需 70ms）
- CPU: **17 倍实时**（无需 GPU 也能用）

## 使用

```python
pip install funasr

from funasr import AutoModel
model = AutoModel(model="iic/SenseVoiceSmall", vad_model="fsmn-vad", device="cuda")
result = model.generate(input="audio.wav")
# 输出带情感标签: <|HAPPY|>今天天气真好<|NEUTRAL|>我们出去走走吧
```

## 应用场景

- 客服质检（自动检测客户情绪）
- 会议分析（识别氛围变化）
- 内容审核（检测异常声音事件）
- 语音助手（根据情绪调整回复）

## 链接

- GitHub: https://github.com/FunAudioLLM/SenseVoice
- FunASR 工具包: https://github.com/modelscope/FunASR
- 在线体验: https://www.funasr.com

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

【开源推荐】SenseVoice — 一个模型同时做语音识别+情感检测+声音事件分类 #10058

项目介绍

速度

使用

应用场景

链接

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

【开源推荐】SenseVoice — 一个模型同时做语音识别+情感检测+声音事件分类 #10058

Description

项目介绍

速度

使用

应用场景

链接

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions