Open
Labels
question: Further information is requested
Description
Great work indeed!
From the description in the paper, I could not find any dedicated OCR module. I am curious how LLaVA acquires the ability to read text in images (e.g., the well-known chicken nuggets example). Is there any magic in the training dataset?