We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
请问项目中有包括图片、文本、视频、音频等信息的modal fusion相关的模块吗?感谢。
The text was updated successfully, but these errors were encountered:
您好,感谢使用框架。 可以使用一下 examples/frame_title_fusion_embedding这个例子,将这个data_tfrecord.zip里的两个文件放到对应的data文件夹下。这个数据量很小,只是用来保证框架测试通过。 具体的例子可以参考我们两个月前比赛(https://algo.browser.qq.com/) 中的赛道一,数据有60+G,其中涉及到视频标题,视频抽帧和音频转文本的多模态融合。
Sorry, something went wrong.
收到,感谢您的回复,我找到对应的部分了,问题先关了。
shawnx-w
No branches or pull requests
请问项目中有包括图片、文本、视频、音频等信息的modal fusion相关的模块吗?感谢。
The text was updated successfully, but these errors were encountered: