Fixed [CLS] token during inference? #5

dwang-sflscientific · 2021-08-24T10:18:41Z

Hi

For inference, the CLS token(L157 and L160 in train.py) is still basing on ground-truth label, should they be static CLS token instead?

dwang-sflscientific · 2021-08-24T10:27:11Z

For pretraining&fine-tuning, don't understand why ground truth labels are used as [CLS] token as well.

somepago · 2021-08-24T13:50:55Z

Hi, you don't need to base the CLS token on the actual label. It's an artifact from another project, the static token is generated in the embed_data_mask function (L321).

dwang-sflscientific · 2021-08-26T06:33:15Z

Ah I see. Thanks for the explaianation.

dwang-sflscientific closed this as completed Aug 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixed [CLS] token during inference? #5

Fixed [CLS] token during inference? #5

dwang-sflscientific commented Aug 24, 2021

dwang-sflscientific commented Aug 24, 2021

somepago commented Aug 24, 2021

dwang-sflscientific commented Aug 26, 2021

Fixed [CLS] token during inference? #5

Fixed [CLS] token during inference? #5

Comments

dwang-sflscientific commented Aug 24, 2021

dwang-sflscientific commented Aug 24, 2021

somepago commented Aug 24, 2021

dwang-sflscientific commented Aug 26, 2021