We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base_coco
large_coco
blip_caption
No description provided.
The text was updated successfully, but these errors were encountered:
Hi @gschurck, thanks for your interest.
base_coco is the BLIP_base finetuned on COCO; large_coco is the BLIP_large finetuned on COCO.
BLIP_base uses ViT_base, BLIP_large uses ViT_large.
Thanks.
Sorry, something went wrong.
Okay, are they directly related to Beam search or Nucleus Sampling algorithms ?
Hi @gschurck ,
No, base_coco and large_coco are related to model size. Both base_coco and large_coco support beam search and nucleus sampling.
Practically we found large_coco achieves better captioning metrics (higher quality captions)
Ok thanks.
No branches or pull requests
No description provided.
The text was updated successfully, but these errors were encountered: