Do you provide word vectors for out-of-vocabulary words? #23

Closed
lujiaying opened this issue Jun 8, 2018 · 3 comments

Comments

@lujiaying

Which token in the word vectors does an OOV (out-of-vocabulary) word map to?

@shenshen-hungry
Collaborator

For OOV words, you can use the average of all the word vectors, or randomly initialize a vector and fine-tune it on your downstream task.
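A minimal sketch of the two options above, assuming the pretrained vectors have already been loaded into a NumPy array with one row per word. The toy `vectors` matrix is made-up data, and scaling the random vector to the per-dimension spread of the pretrained vectors is an assumption for illustration, not something specified in this thread.

```python
import numpy as np

# Toy stand-in for the pretrained embedding matrix (hypothetical data):
# one row per in-vocabulary word, shape (vocab_size, dim).
rng = np.random.default_rng(0)
vectors = rng.normal(size=(5, 4)).astype(np.float32)

# Option 1: use the average of all word vectors as the OOV vector.
oov_mean = vectors.mean(axis=0)

# Option 2: random initialization, meant to be fine-tuned on the downstream task.
# Assumption: match the per-dimension standard deviation of the pretrained vectors.
oov_random = rng.normal(
    loc=0.0, scale=vectors.std(axis=0), size=vectors.shape[1]
).astype(np.float32)
```

The averaged vector works without any further training; the random vector is only useful if the embedding layer stays trainable during fine-tuning.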

@rudaoshi

Shouldn't it be the vector for the UNKNOWN entry in the vocabulary?
Didn't you introduce an UNK token during training?

@shenshen-hungry
Collaborator

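@rudaoshi We followed the word vectors trained on large corpora, such as Google-news-300 and the official GloVe releases, and like them we did not introduce an UNK token. You can generate an UNK vector with the method from the first reply.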
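One way to wire such a generated vector into a lookup is sketched below, assuming the vocabulary is a plain word-to-row dictionary. The toy `vocab` and `vectors`, the `<UNK>` name, and the `lookup` helper are all hypothetical and chosen only for this illustration; the released vectors contain no such entry.

```python
import numpy as np

# Hypothetical toy vocabulary and embedding matrix standing in for the released vectors.
vocab = {"apple": 0, "banana": 1, "orange": 2}
vectors = np.array([[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]], dtype=np.float32)

UNK = "<UNK>"  # name chosen for this sketch only

# Append the mean of all vectors as an extra row and register it under UNK.
unk_vector = vectors.mean(axis=0, keepdims=True)
vectors = np.vstack([vectors, unk_vector])
vocab[UNK] = len(vocab)

def lookup(word):
    """Return the vector for `word`, falling back to the UNK row for OOV words."""
    return vectors[vocab.get(word, vocab[UNK])]
```

With this in place, `lookup("apple")` returns the pretrained row, while any word missing from `vocab` returns the averaged UNK row.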
