Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixed [CLS] token during inference? #5

Closed
dwang-sflscientific opened this issue Aug 24, 2021 · 3 comments
Closed

Fixed [CLS] token during inference? #5

dwang-sflscientific opened this issue Aug 24, 2021 · 3 comments

Comments

@dwang-sflscientific
Copy link

Hi

For inference, the CLS token(L157 and L160 in train.py) is still basing on ground-truth label, should they be static CLS token instead?

@dwang-sflscientific
Copy link
Author

For pretraining&fine-tuning, don't understand why ground truth labels are used as [CLS] token as well.

@somepago
Copy link
Owner

Hi, you don't need to base the CLS token on the actual label. It's an artifact from another project, the static token is generated in the embed_data_mask function (L321).

@dwang-sflscientific
Copy link
Author

Ah I see. Thanks for the explaianation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants