Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

请问Context Features中的Word + Character + Ngram是什么意思? #22

Closed
aluminumbox opened this issue Jun 7, 2018 · 3 comments
Closed

Comments

@aluminumbox
Copy link

这个ngram,是指考虑了word的ngram,还是character的ngram呢?可是常出现的character的ngram,不就是word了吗?不是很理解这个context features是怎么考虑的。求解答,谢谢!

@shenshen-hungry
Copy link
Collaborator

是word的unigram和bigram以及character的unigram,因为中文的词一般就是两三个字所以用character的ngram和词没什么区别了。

@aluminumbox
Copy link
Author

@shenshen-hungry 好的谢谢,那么再问一下。context feature中word和word+character的区别,是在分词后,前者去掉了分词结果为单个字的情况嘛?

@shenshen-hungry
Copy link
Collaborator

word就是分词意义上的词,word+character是说不但有词还有字。也就是在Skip-gram模型中,中心词不但预测上下文中的词,同时也预测里面的字。
40212943-60c1db42-5a85-11e8-9c3b-1258e193a270

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants