Evaluation on computer vision benchmarks #235

finitearth · 2023-03-16T10:08:53Z

Are there plans to evaluate the vision modality of GPT-4? I am interested to know how GPT-4 could perform on classification tasks with 0- and few-shot-learning and how it compares to vision-only models. If the few-shot-learning capabilities of LLMs translate to other modalities, this would be a real game changer.

Question out of curiosity: How was the vision-modality incorperated? Maybe similar approaches can be taken for other modalities, such as audio or video? Would be an interessting Open-Source project for sure :)

MoreTore · 2023-04-05T04:59:20Z

I have an engineering exam bank of about 1000 questions with simple illustrations. I have the questions already in JSONL format but some of them rely on the image to answer correctly.

jwang47 · 2023-04-13T18:37:17Z

Currently our API doesn't support vision, but if it does we'll definitely add support for that to this framework!

jwang47 added the Idea for Eval These issues keep track of requests for different kinds of eval PRs label Apr 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Evaluation on computer vision benchmarks #235

Evaluation on computer vision benchmarks #235

finitearth commented Mar 16, 2023

MoreTore commented Apr 5, 2023

jwang47 commented Apr 13, 2023

Evaluation on computer vision benchmarks #235

Evaluation on computer vision benchmarks #235

Comments

finitearth commented Mar 16, 2023

MoreTore commented Apr 5, 2023

jwang47 commented Apr 13, 2023