Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the vocabulary constructed on snli #14

Open
Howardqlz opened this issue Jan 7, 2020 · 2 comments
Open

About the vocabulary constructed on snli #14

Howardqlz opened this issue Jan 7, 2020 · 2 comments

Comments

@Howardqlz
Copy link

I see this code:
self.TEXT.build_vocab(self.train, self.dev, self.test, vectors=GloVe(...))
As i know, we should constrcuct vocabulary only on trainset?

@galsang
Copy link
Owner

galsang commented Jan 8, 2020

The code line means that we build an embedding matrix that can map any word in datasets (including dev and test in addition to training) to the corresponding word representation initialized with the pre-trained GloVe vector.
We can, of course, utilize the pre-trained vector for a word that is not included in the training set but appears in the test set, even though the vector would not be fine-tuned during training.

@daitianxie
Copy link

Hello, why does the code stop after running an epoch
Uploading 截屏2020-11-24 下午7.17.30.png…

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants