Open
Description
The ImageDescriptionGenerator
is missing an option to make a prompt to guide the response to what sort of description is requested. The prompt could be to ignore objects in the background, or ask specific questions about the subject in the image.
When I was using the Phi3Vision model directly this was a huge benefit and opened up quite a few scenarios for analyzing images.