Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

几个问题 #8

Open
moving-on opened this issue Jan 6, 2017 · 1 comment
Open

几个问题 #8

moving-on opened this issue Jan 6, 2017 · 1 comment

Comments

@moving-on
Copy link

你好,想问几个问题,训练语料每一行的第一列表示什么?比如_*23134。是每一个文档的语料作为一行吗?那相似文档的输出怎么是没有分过词的?

@hiyijian
Copy link
Owner

hiyijian commented Jan 9, 2017

一行一个文档, _*23134是文档ID
相似文档的的输出只是把分词的空格去掉了而已,方便人看

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants