Speech recognition and image feature description extraction #32

stonexer · 2016-04-06T13:56:37Z

An issue isn't quite the right place for this, but it can serve as an example for the voice-message and image-message events, showing what they make possible.

Rough notes on the approach:

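Speech recognition: I haven't yet found a speech recognition module that handles MP3-encoded audio directly. The rough plan is to convert the audio to WAV or another raw encoding, then call a speech recognition API to get the transcript. The conversion can be done with ffmpeg or sox, and both should have node modules. As for the API, Baidu's and Google's each have pros and cons for now, so I'll keep looking.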
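The conversion step could look roughly like the sketch below, which shells out to the ffmpeg CLI from node instead of using a wrapper module. The function names `buildFfmpegArgs` and `convertMp3ToWav` are hypothetical, and the 16 kHz mono 16-bit PCM target is an assumption; check what the chosen recognition API actually expects. It also assumes `ffmpeg` is installed and on the PATH.

```javascript
const { spawn } = require('child_process');

// Build the ffmpeg argument list for converting an MP3 file to
// mono 16-bit PCM WAV. The 16 kHz default is an assumption, not a
// requirement taken from any particular recognition API.
function buildFfmpegArgs(inputPath, outputPath, sampleRate = 16000) {
  return [
    '-y',                       // overwrite the output file if it exists
    '-i', inputPath,            // input file (the MP3 voice message)
    '-ac', '1',                 // downmix to a single channel
    '-ar', String(sampleRate),  // resample to the target rate
    '-acodec', 'pcm_s16le',     // 16-bit little-endian PCM
    outputPath,
  ];
}

// Run ffmpeg and resolve with the output path once it exits cleanly.
function convertMp3ToWav(inputPath, outputPath, sampleRate = 16000) {
  return new Promise((resolve, reject) => {
    const args = buildFfmpegArgs(inputPath, outputPath, sampleRate);
    // stdio is ignored so ffmpeg's log output cannot fill a pipe and stall.
    const child = spawn('ffmpeg', args, { stdio: 'ignore' });
    child.on('error', reject); // e.g. ffmpeg is not installed
    child.on('close', (code) => {
      if (code === 0) resolve(outputPath);
      else reject(new Error(`ffmpeg exited with code ${code}`));
    });
  });
}
```

Something like `convertMp3ToWav('voice.mp3', 'voice.wav').then(upload)` would then hand the WAV file to whichever API is chosen, where `upload` stands in for that API call.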

Image feature description extraction: for now I'm just considering node-tesseract and Baidu image search (百度识图).
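For the OCR half, a minimal sketch is below. It calls the tesseract CLI directly rather than going through node-tesseract, so it does not show that module's API. The function names `buildTesseractArgs` and `extractText` are hypothetical, and it assumes `tesseract` is installed with the requested language data available.

```javascript
const { execFile } = require('child_process');

// Build the tesseract CLI arguments. Using "stdout" as the output base
// makes tesseract print the recognized text instead of writing a file.
function buildTesseractArgs(imagePath, lang = 'eng') {
  return [imagePath, 'stdout', '-l', lang];
}

// Run tesseract on an image and resolve with the recognized text.
function extractText(imagePath, lang = 'eng') {
  return new Promise((resolve, reject) => {
    execFile('tesseract', buildTesseractArgs(imagePath, lang), (err, stdout) => {
      if (err) reject(err); // e.g. tesseract missing or unreadable image
      else resolve(stdout.trim());
    });
  });
}
```

Passing `'chi_sim'` as `lang` would select Simplified Chinese, provided that language pack is installed.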

spacelan · 2016-04-07T11:22:45Z

stonexer · 2016-04-07T11:47:07Z

66666

reverland · 2016-04-07T17:19:05Z

6666666

willin · 2017-07-28T00:52:06Z

@stonexer Would you consider a job opportunity in Nanjing?

stonexer self-assigned this Apr 6, 2016

stonexer added a commit that referenced this issue Apr 9, 2016

#32 Speech recognition

8fcbe64

stonexer added a commit that referenced this issue Apr 9, 2016

#32 Speech recognition

d16b4bb

stonexer added a commit that referenced this issue Apr 13, 2016

#32 Speech recognition

0624b22

ak5 mentioned this issue Nov 3, 2016

Support Message Type of Image/Video wechaty/wechaty#4

Closed

stonexer added the feature label Dec 10, 2016
