Speech recognition and image feature description extraction #32

Open
stonexer opened this issue Apr 6, 2016 · 4 comments

stonexer (Member) commented Apr 6, 2016

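An issue is not really the right place for this, but it can serve as an example for the voice-message and image-message events and show what they make possible.

Rough notes on the approach: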

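Speech recognition: I have not yet found a speech recognition module that handles MP3-encoded audio directly. The rough plan is to convert the audio to WAV or another raw encoding, then call a recognition API to get the recognized text. The conversion can be done with ffmpeg or sox, and both should have node modules. As for the API, Baidu's and Google's each have pros and cons for now, so I will look into it further.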
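The conversion step could look roughly like the sketch below. It is only a sketch under assumptions not stated above: it calls the `ffmpeg` binary directly through Node's `child_process` rather than a wrapper module, it assumes `ffmpeg` is on the `PATH`, and the 16 kHz mono default is a guess at what a recognition API would want, so check the chosen API's requirements. The function names `buildFfmpegArgs` and `convertToWav` are hypothetical.

```javascript
const { spawn } = require('node:child_process');

// Build the ffmpeg argument list for MP3 -> 16-bit PCM WAV.
// The 16 kHz mono defaults are an assumption, not a requirement from this thread.
function buildFfmpegArgs(inputPath, outputPath, { sampleRate = 16000, channels = 1 } = {}) {
  return [
    '-y',                      // overwrite the output file if it already exists
    '-i', inputPath,           // input file, e.g. the MP3 voice message
    '-ac', String(channels),   // number of audio channels
    '-ar', String(sampleRate), // sample rate in Hz
    '-acodec', 'pcm_s16le',    // uncompressed 16-bit little-endian PCM
    outputPath,
  ];
}

// Run ffmpeg (assumed to be installed and on PATH).
// Resolves with the output path on success, rejects otherwise.
function convertToWav(inputPath, outputPath, options) {
  return new Promise((resolve, reject) => {
    const args = buildFfmpegArgs(inputPath, outputPath, options);
    // stdio is ignored so ffmpeg's log output cannot fill a pipe and stall the process.
    const proc = spawn('ffmpeg', args, { stdio: 'ignore' });
    proc.on('error', reject); // e.g. ffmpeg is not installed
    proc.on('close', (code) => {
      if (code === 0) resolve(outputPath);
      else reject(new Error(`ffmpeg exited with code ${code}`));
    });
  });
}
```

Usage would be something like `convertToWav('voice.mp3', 'voice.wav').then((wav) => { /* send wav to the recognition API */ })`. The call to the recognition API itself is left out, since which API to use is still undecided.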

Image feature description extraction: for now, I am just planning to use node-tesseract and Baidu Shitu (百度识图).

stonexer self-assigned this Apr 6, 2016

spacelan (Member) commented Apr 7, 2016

http://cloudsightapi.com/api
This API is pretty awesome.

stonexer (Member, Author) commented Apr 7, 2016

66666

reverland commented

6666666

stonexer added a commit that referenced this issue Apr 9, 2016
stonexer added a commit that referenced this issue Apr 9, 2016
stonexer added a commit that referenced this issue Apr 13, 2016
willin commented Jul 28, 2017

@stonexer Would you consider a job opportunity in Nanjing?
