Is there a problem with how the code reads in words? #1

Closed
muye5 opened this issue May 8, 2015 · 3 comments

Comments

@muye5

muye5 commented May 8, 2015

The vocabulary produced for the test case contains only one entry.

@Leonard-Xu
Owner

Could you describe the problem more specifically?

@muye5
Author

muye5 commented May 9, 2015

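I ran the example from the README with the input: 你好 世界
As I understand it, the output word.txt should contain a vector representation for each word, but it contains only a single token that marks the start of a sentence. Am I misunderstanding something? This is the command I ran: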

./cwe -train corpus.txt -output-word word.txt -output-char char.txt

@Leonard-Xu
Owner

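Here is what is happening: the default -min-count is 5, which means every word that occurs fewer than 5 times is ignored. You can change this parameter for testing. I recommend training on a larger corpus.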
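
To make the effect of the threshold concrete, here is a minimal sketch of a minimum-count vocabulary filter. It is not the actual cwe code: `build_vocab` is a hypothetical helper, splitting on whitespace is an assumption, and the sentence-marker token that still appears in word.txt is not modeled.

```python
from collections import Counter


def build_vocab(corpus_text, min_count=5):
    """Hypothetical helper: keep words that occur at least min_count times."""
    # Assumption: words are separated by whitespace, as in the README example.
    counts = Counter(corpus_text.split())
    return {word: n for word, n in counts.items() if n >= min_count}


corpus = "你好 世界"

# With the default threshold of 5, both words occur only once and are dropped.
default_vocab = build_vocab(corpus)

# Lowering the threshold to 1 keeps every word that appears in the corpus.
relaxed_vocab = build_vocab(corpus, min_count=1)

print(default_vocab)
print(relaxed_vocab)
```

With the two-word corpus from the question, the default threshold leaves no ordinary words in the vocabulary, which matches the nearly empty word.txt. Passing a lower value for -min-count on the command line (assuming the flag takes the value as its next argument, as the other options in the command above do) should let both words through.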

@muye5 muye5 closed this as completed May 10, 2015