Resize embeddings and vocab #335

Closed
ambiSk opened this issue Jan 4, 2023 · 2 comments

Comments


ambiSk commented Jan 4, 2023

I fine-tuned the text encoder of CLIP and added some additional tokens to it. Is there a way to load the checkpoint with a larger embedding size?

mitchellnw (Contributor) commented

Not totally sure if this will work, but it's worth trying to make a new model config and change vocab_size there. E.g., make a copy of https://github.com/mlfoundations/open_clip/blob/main/src/open_clip/model_configs/ViT-B-16.json called ViT-B-16-bigvocab, then use --model ViT-B-16-bigvocab.
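For illustration, the copied config might look roughly like the sketch below. The values mirror the ViT-B-16.json linked above at the time of writing (double-check against the actual file); the only intended change is text_cfg.vocab_size, and 49410 is just a placeholder for whatever your enlarged vocabulary actually is (default is 49408).

```json
{
    "embed_dim": 512,
    "vision_cfg": {
        "image_size": 224,
        "layers": 12,
        "width": 768,
        "patch_size": 16
    },
    "text_cfg": {
        "context_length": 77,
        "vocab_size": 49410,
        "heads": 8,
        "layers": 12,
        "width": 512
    }
}
```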

rwightman (Collaborator) commented

I don't think resizing existing models is practical, but it's definitely possible to make new ones, or to use HF pretrained models that have a larger vocab / context length.
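A minimal sketch of the new-model route, not an official open_clip utility: it assumes the ViT-B-16-bigvocab config from the previous comment has been added under src/open_clip/model_configs/, that the fine-tuned checkpoint stores a plain state_dict with the usual open_clip key names (e.g. token_embedding.weight), and that finetuned_clip.pt is a placeholder path.

```python
import torch
import open_clip

# Build an untrained model from the enlarged config; its token embedding
# table is sized by the new text_cfg.vocab_size rather than the default 49408.
model = open_clip.create_model('ViT-B-16-bigvocab')

# Load the fine-tuned weights (placeholder path). Some training scripts
# wrap the weights in a 'state_dict' entry, so unwrap if present.
ckpt = torch.load('finetuned_clip.pt', map_location='cpu')
state_dict = ckpt.get('state_dict', ckpt)

# Sanity check: the checkpoint's embedding table must match the new config,
# otherwise load_state_dict will fail with a shape mismatch.
ckpt_vocab = state_dict['token_embedding.weight'].shape[0]
model_vocab = model.token_embedding.weight.shape[0]
assert ckpt_vocab == model_vocab, (
    f'set text_cfg.vocab_size to {ckpt_vocab} in the config (model has {model_vocab})')

model.load_state_dict(state_dict)
```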
