Added OpenVINO vision model support #33

szeyu · 2024-09-10T09:40:48Z

Support OpenVINO vision model #32

This feature request aims to integrate support for vision language models into our existing framework. Currently, our framework supports non-vision models, but there is a need to extend this support to vision models, which are loaded and processed differently.

To achieve this, I have implemented the following:

Separated Initialization for Vision and Non-Vision Models:
- Vision models are initialized using the ov_phi3_vision.py script provided by OpenVINO.
- Non-vision models continue to be initialized using the existing methods.
Quantized Vision Model Support:
- We have added support for the quantized vision model Phi-3.5-vision-instruct-int4-ov.
- The model can be found at the following link: Phi-3.5-vision-instruct-int4-ov

Implementation Details

Vision Model Initialization:
- The OpenVinoEngine class now checks if the model is a vision model during initialization.
- If the model is a vision model, it uses the ov_phi3_vision.py script to load and initialize the model.
- The vision model is then processed using the AutoProcessor class from the transformers library.
Non-Vision Model Initialization:
- Non-vision models are initialized using the existing methods, ensuring backward compatibility.
Streamlined Generation Process:
- Both vision and non-vision models support streaming output, with timing information logged for performance analysis.
- The generate_vision method has been updated to log prompt length, new tokens generated, time to first token, prompt tokens per second, and new tokens per second.

References

…stment

* added record timing function * added streaming feature

tjtanaa · 2024-09-25T21:19:15Z

src/embeddedllm/backend/openvino_engine.py

+                self.model_path = snapshot_path
+
+            # it is case sensitive, only receive all char captilized only
+            self.model = OvPhi3Vision(


Can you help me to understand the behaviour, what if you pass in a model that is not phi3vision model, what happens? (not limited to what error it throws)

Add a try-catch block, if it fails, then print our a message for the user telling them that embeddedllm engine only support Phi3Vision model, then exit the program gracefully.

If I pass in a model that is not phi3vision model, it will show error of language_model.xml not found. Hence yes, a try catch block is needed there.

… pass to OvPhi3Vision

Related to openvinotoolkit/openvino_notebooks#2374

szeyu and others added 4 commits September 2, 2024 11:49

added openvino vision generation

c4518ad

added generation vision without streaming version

91b3808

change max_new_tokens to max_tokens so to follow the webui token adju…

7fe7f74

…stment

Update openvino_engine.py

aff0ae2

* added record timing function * added streaming feature

szeyu added the type: enhancement / feature New feature or request label Sep 10, 2024

tjtanaa self-requested a review September 25, 2024 21:04

tjtanaa assigned szeyu Sep 25, 2024

tjtanaa reviewed Sep 25, 2024

View reviewed changes

tjtanaa linked an issue Sep 25, 2024 that may be closed by this pull request

[FEAT] Support OpenVINO vision model #32

Closed

szeyu added 2 commits September 26, 2024 14:30

Add try-catch block in openvino_genine to handle non phi3vision model…

ee5d5e9

… pass to OvPhi3Vision

Update ov_phi3_vision.py

86b59d6

Related to openvinotoolkit/openvino_notebooks#2374

szeyu merged commit aeca16a into main Sep 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added OpenVINO vision model support #33

Added OpenVINO vision model support #33

Uh oh!

szeyu commented Sep 10, 2024

Uh oh!

tjtanaa Sep 25, 2024

Uh oh!

szeyu Sep 26, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Added OpenVINO vision model support #33

Added OpenVINO vision model support #33

Uh oh!

Conversation

szeyu commented Sep 10, 2024

Support OpenVINO vision model #32

Implementation Details

References

Uh oh!

tjtanaa Sep 25, 2024

Choose a reason for hiding this comment

Uh oh!

szeyu Sep 26, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants