Speech recognition and image feature description extraction #32
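Posting this as an issue here is a bit out of place, but it can also serve as an example for the voice-message and image-message events, to show what can be done with them.
A rough record of my thinking so far: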
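Speech recognition: I haven't yet found a speech recognition module that can handle MP3-encoded audio directly. The rough idea is to convert the audio to WAV or another raw encoding first, then call a recognition API to get the recognized text. The conversion can be done with ffmpeg or sox, and both should have node modules. As for the API, Baidu's and Google's each have their pros and cons; I'll keep looking.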
Image feature description extraction: for now I'm just considering node-tesseract and Baidu image search (百度识图).
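The conversion step in the speech recognition idea above could look roughly like the sketch below. It assumes the `ffmpeg` binary is installed and on `PATH`, and calls it through Node's built-in `child_process` rather than any particular wrapper module. The function names `buildFfmpegArgs` and `convertToWav` are made up for illustration, and the 16 kHz mono default is only a guess at what a recognition API might want, so check it against whichever API ends up being used.

```javascript
// Sketch only: assumes the `ffmpeg` binary is installed and on PATH.
const { spawn } = require("child_process");

// Hypothetical helper: build the ffmpeg argument list for converting an
// input file (e.g. MP3) to mono 16-bit PCM WAV at the given sample rate.
// The 16000 Hz default is an assumption, not a requirement of any API.
function buildFfmpegArgs(inputPath, outputPath, sampleRate = 16000) {
  return [
    "-y",                      // overwrite the output file if it exists
    "-i", inputPath,           // input file
    "-ar", String(sampleRate), // output sample rate in Hz
    "-ac", "1",                // one audio channel (mono)
    "-acodec", "pcm_s16le",    // 16-bit little-endian PCM
    outputPath,                // output file; ".wav" selects the WAV container
  ];
}

// Hypothetical helper: run ffmpeg and resolve with the output path on success.
function convertToWav(inputPath, outputPath, sampleRate = 16000) {
  return new Promise((resolve, reject) => {
    const args = buildFfmpegArgs(inputPath, outputPath, sampleRate);
    const proc = spawn("ffmpeg", args, { stdio: "ignore" });
    proc.on("error", reject); // e.g. ffmpeg is not installed
    proc.on("close", (code) => {
      if (code === 0) {
        resolve(outputPath);
      } else {
        reject(new Error(`ffmpeg exited with code ${code}`));
      }
    });
  });
}
```

Inside a voice-message handler this might be used as `convertToWav("voice.mp3", "voice.wav").then((wav) => { /* send wav to the recognition API */ })`. The recognition call itself is left out, since the choice between Baidu and Google is still open.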