You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Nov 21, 2025. It is now read-only.
I want to use the ImageInferenceEngine from hi-ml-multimodal with an image model that can't cleanly be fit into the existing ImageModel class.
I'd like to propose something like:
We define a more abstract class for ImageModels that just requires the methods actually used by ImageInferenceEngine / ImageTextInferenceEngine (from what I can see, this is forward, get_patchwise_projected_embeddings, and the training attribute). The inference engine can then just expect something from this class as an input.
The existing ImageModel class can subclass this, so no expected change to current behaviour.
I can write a custom subclass for my special model and be able to use it in the inference engine.