adding vision caption evals #160

michaelharrisonmai · 2025-06-12T22:02:48Z

Adding Flickr30K and NoCaps evals. Each of these datasets contains images and 5-10 sample captions per image. The eval asks LLM judge to score a new caption from 0-5, given the sample captions.

Michael Harrison added 3 commits June 12, 2025 17:49

flickr30k onboard

e748623

onboard nocaps

bfd8a19

improvements to caption evals

1c8a01b

michaelharrisonmai requested review from vibhav-vineet and neelsj June 12, 2025 22:02

neelsj approved these changes Jun 24, 2025

View reviewed changes

michaelharrisonmai merged commit f948f57 into main Jun 25, 2025
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

adding vision caption evals #160

adding vision caption evals #160

Uh oh!

michaelharrisonmai commented Jun 12, 2025

Uh oh!

Uh oh!

Uh oh!

adding vision caption evals #160

adding vision caption evals #160

Uh oh!

Conversation

michaelharrisonmai commented Jun 12, 2025

Uh oh!

Uh oh!

Uh oh!