-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Paddle demo中文数据集 #981
Comments
uci数据集:http://archive.ics.uci.edu/ml/index.html @beckett1124 贡献的两个数据集参考 |
上面@reyoung提的:中文的看图说话数据,是没有中文数据的;但看图问话是有的,见http://idl.baidu.com/FM-IQA.html 此外还需要 |
THUOCL:清华大学开放中文词库 近日开源,供参考。 |
如果类似THUOCL这种语料能用的话,那http://thunlp.org/site2/index.php/en/resources 这里还有几个 |
发现一个古诗的数据集。 最全中华古诗数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. |
Close this inactivate issue, please feel free to reopen. |
* add the taskflow doc for some task * update the ddparaser doc about taskflow * remove the unused code for the paddlenlp * update the taskflow docs * add the input check for the tasks * add the document for the taskflow Co-authored-by: Zeyu Chen <chenzeyu01@baidu.com>
Related #176
为了更好的做Paddle的demo、教程,需要有中文的数据集。数据集的获取方法可以是自己标注,也可以是找公开的数据集。
可能的中文数据集有:
The text was updated successfully, but these errors were encountered: