Autoload + onnx model and ipexllm in the model ui #25

szeyu · 2024-08-13T09:48:42Z

Auto-download the model for the Onnxruntime engine when a model repo ID is provided instead of a local model path.
Update the lists of Onnxruntime and IpexLLM models in modelui.py:
Onnxruntime Model (DirectML):

EmbeddedLLM/01-ai_Yi-1.5-6b-Chat-int4-onnx-directml
EmbeddedLLM/Phi-3-medium-128k-instruct-onnx-directml
EmbeddedLLM/Phi-3-medium-4k-instruct-onnx-directml
EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-directml
EmbeddedLLM/Phi-3-mini-4k-instruct-062024-int4-onnx-directml
EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-directml
EmbeddedLLM/Starling-LM-7b-beta-int4-onnx-directml
EmbeddedLLM/gemma-2b-it-int4-onnx-directml
EmbeddedLLM/gemma-7b-it-int4-onnx-directml
EmbeddedLLM/llama-2-7b-chat-int4-onnx-directml
EmbeddedLLM/mistral_Mistral-7b-instruct-v0.3-int4-onnx-directml
EmbeddedLLM/openchat-3.6-8b-20240522-int4-onnx-directml

Onnxruntime Model (CPU):

EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-cpu-int4-rtn-block-32
EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-cpu-int4-rtn-block-32-acc-level-4
EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-cpu-int4-rtn-block-32
EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-cpu-int4-rtn-block-32-acc-level-4
EmbeddedLLM/mistral-7b-instruct-v0.3-onnx-cpu-int4-rtn-block32
EmbeddedLLM/mistral-7b-instruct-v0.3-onnx-cpu-int4-rtn-block32-acc-level-4
EmbeddedLLM/openchat-3.6-8b-20240522-onnx-cpu-int4-rtn-block32
EmbeddedLLM/openchat-3.6-8b-20240522-onnx-cpu-int4-rtn-block32-acc-level-4

IpexLLM Model (Ipex and CPU):

microsoft/Phi-3-medium-128k-instruct
microsoft/Phi-3-medium-4k-instruct
microsoft/Phi-3-mini-128k-instruct
microsoft/Phi-3-mini-4k-instruct
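
The auto-download behaviour described at the top of this PR could look roughly like the sketch below. It is not the code merged here. The function name `resolve_model_path` and the injectable `download` parameter are hypothetical, and the sketch assumes the default download goes through `huggingface_hub.snapshot_download`, which this page does not confirm.

```python
import os
from typing import Callable, Optional


def resolve_model_path(
    model_path: str,
    download: Optional[Callable[[str], str]] = None,
) -> str:
    """Return a local directory for `model_path`.

    If `model_path` is an existing directory it is returned unchanged.
    Otherwise it is treated as a Hugging Face repo ID and downloaded.
    """
    if os.path.isdir(model_path):
        # Already a local model directory: nothing to download.
        return model_path

    if download is None:
        # Assumption: huggingface_hub is installed. Imported lazily so that
        # purely local paths work without it.
        from huggingface_hub import snapshot_download

        def download(repo_id: str) -> str:
            # snapshot_download returns the local folder of the snapshot.
            return snapshot_download(repo_id=repo_id)

    return download(model_path)
```

Passing a repo ID such as `EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-directml` would trigger a download on first use, while an existing local directory is returned as-is. The `download` hook exists only to keep the sketch testable without network access.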

Hot-fixed the description of the OpenVINO backend and the GPU device.
Show only the dropdown menu options that the user can actually use, based on their environment (Ipex, DirectML, or CPU).
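
As a rough illustration of the environment-based dropdown, the sketch below selects a model list from the backend alone. The dictionary name `MODEL_LISTS`, the function `dropdown_choices`, and the keys `directml` and `ipex` are hypothetical (only `onnx_cpu` appears in this PR's commit messages). The lists are shortened excerpts of the ones above, and the real structure in modelui.py may differ.

```python
from typing import Dict, List

# Hypothetical backend keys; shortened excerpts of the lists in this PR.
MODEL_LISTS: Dict[str, List[str]] = {
    "directml": [
        "EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-directml",
        "EmbeddedLLM/gemma-2b-it-int4-onnx-directml",
    ],
    "onnx_cpu": [
        "EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-cpu-int4-rtn-block-32",
    ],
    # Ipex uses one list for both CPU and XPU devices, so the choice
    # depends only on the backend, not on the engine type.
    "ipex": [
        "microsoft/Phi-3-mini-4k-instruct",
        "microsoft/Phi-3-mini-128k-instruct",
    ],
}


def dropdown_choices(backend: str) -> List[str]:
    """Return the models to show in the dropdown for the given backend."""
    try:
        # Copy so callers cannot mutate the shared list.
        return list(MODEL_LISTS[backend])
    except KeyError:
        raise ValueError(f"Unsupported backend: {backend!r}") from None
```

Keying the lookup on the backend only keeps a single Ipex list shared by both device types, which matches the intent stated in the commit messages.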


tjtanaa

LGTM

szeyu added 7 commits August 6, 2024 16:15

update new model list with new reuploaded model and ipex option in modelui

50f4c34

fix the typo of mistral repo id

ec2f421

edit to the latest version of models available

dbdefa0

change the context length of 128k to 131072

0965d51

Merge branch 'main' into szeyu-autoloader-1

b905aa9

onnx auto download model if repo id is provided as model path

5dbf495

formated with black

fb2c63e

szeyu requested a review from tjtanaa August 13, 2024 09:48

szeyu added 7 commits August 14, 2024 11:13

fixed with flake8

f8c8f27

add openvino description and the device gpu

d54b4d8

showing dropdown option where the user can use only

c82cc7b

fix mistake so Ipex can run the same model card with 2 different engine type cpu and ipex

02006ac

served_model_name uncommented

a2528c7

add ipex cpu handler

14c155d

change the cpu model list to onnx_cpu as it is used by onnxruntime only, then ipex use the same model list for cpu and xpu hence depend on backend only and not the engine_type for the model list

8f5dd38

This was linked to issues Aug 16, 2024

[FEAT] [UI] Add IpexLLM to Model UI #17

Closed

[FEAT] [ONNX Engine] Add Auto-Download of Model Weights from HF Repo #24

Closed

tjtanaa assigned szeyu Aug 16, 2024

tjtanaa approved these changes Aug 16, 2024


tjtanaa added the priority: high, type: documentation, and type: enhancement / feature labels Aug 16, 2024

tjtanaa merged commit 25fcb76 into main Aug 16, 2024

tjtanaa deleted the szeyu-autoloader-1 branch August 16, 2024 07:58
