@Manojbhat09 it's a common practice in NLP where you have one token that pools information from the rest of the tokens through rounds of attention, usually to classify the sentence at the end. Whether it is strictly necessary for ViT to work is up for debate. My take is that it isn't that important: you could mean-pool all the embeddings from the last layer instead and probably still get great results at scale.
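To make the two options concrete, here is a minimal sketch (not the repo's actual code) contrasting classification from a [CLS] token against mean-pooling the patch tokens, assuming a last-layer output of shape `(batch, num_tokens, dim)` with the [CLS] token at index 0:

```python
import torch
import torch.nn as nn

# Hypothetical last-layer ViT output: (batch, num_tokens, dim),
# where token 0 is the [CLS] token and the rest are patch tokens.
batch, num_tokens, dim, num_classes = 2, 65, 128, 10
tokens = torch.randn(batch, num_tokens, dim)

head = nn.Linear(dim, num_classes)

# Option 1: classify from the [CLS] token alone.
cls_logits = head(tokens[:, 0])

# Option 2: mean-pool all patch tokens, then classify the pooled vector.
pooled = tokens[:, 1:].mean(dim=1)
pool_logits = head(pooled)

print(cls_logits.shape, pool_logits.shape)  # both: torch.Size([2, 10])
```

Both heads produce logits of the same shape; the difference is only in how the sequence of token embeddings is reduced to a single vector before the linear classifier.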
Really appreciate your work.
Question: as in the issue title.