@szeyu szeyu commented Aug 13, 2024

  1. Auto-download the model for the Onnxruntime engine when a model repo ID is provided instead of a model path.

  2. Update the Onnxruntime and IpexLLM model lists in modelui.py:
    Onnxruntime Model (DirectML):

  • EmbeddedLLM/01-ai_Yi-1.5-6b-Chat-int4-onnx-directml
  • EmbeddedLLM/Phi-3-medium-128k-instruct-onnx-directml
  • EmbeddedLLM/Phi-3-medium-4k-instruct-onnx-directml
  • EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-directml
  • EmbeddedLLM/Phi-3-mini-4k-instruct-062024-int4-onnx-directml
  • EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-directml
  • EmbeddedLLM/Starling-LM-7b-beta-int4-onnx-directml
  • EmbeddedLLM/gemma-2b-it-int4-onnx-directml
  • EmbeddedLLM/gemma-7b-it-int4-onnx-directml
  • EmbeddedLLM/llama-2-7b-chat-int4-onnx-directml
  • EmbeddedLLM/mistral_Mistral-7b-instruct-v0.3-int4-onnx-directml
  • EmbeddedLLM/openchat-3.6-8b-20240522-int4-onnx-directml

Onnxruntime Model (CPU):

  • EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-cpu-int4-rtn-block-32
  • EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-cpu-int4-rtn-block-32-acc-level-4
  • EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-cpu-int4-rtn-block-32
  • EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-cpu-int4-rtn-block-32-acc-level-4
  • EmbeddedLLM/mistral-7b-instruct-v0.3-onnx-cpu-int4-rtn-block32
  • EmbeddedLLM/mistral-7b-instruct-v0.3-onnx-cpu-int4-rtn-block32-acc-level-4
  • EmbeddedLLM/openchat-3.6-8b-20240522-onnx-cpu-int4-rtn-block32
  • EmbeddedLLM/openchat-3.6-8b-20240522-onnx-cpu-int4-rtn-block32-acc-level-4

IpexLLM Model (Ipex and CPU):

  • microsoft/Phi-3-medium-128k-instruct
  • microsoft/Phi-3-medium-4k-instruct
  • microsoft/Phi-3-mini-128k-instruct
  • microsoft/Phi-3-mini-4k-instruct
  3. Hot-fixed the description of the openvino backend and gpu device.

  4. Show only the dropdown menu options that the user can use based on their environment, such as Ipex, DirectML, or CPU.
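
The auto-download behaviour in the first item can be sketched roughly as follows. This is a minimal illustration, not the code merged in this PR: `resolve_model_path` and `_hf_download` are hypothetical names, and the sketch assumes that any string that is not an existing local path should be treated as a Hugging Face repo ID and fetched with `huggingface_hub.snapshot_download`.

```python
from pathlib import Path
from typing import Callable


def _hf_download(repo_id: str) -> str:
    """Download a full model repo from the Hugging Face Hub and return its local path."""
    # Imported lazily so the local-path case works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id)


def resolve_model_path(model: str, download: Callable[[str], str] = _hf_download) -> str:
    """Return a local directory for `model` (hypothetical helper, not the PR's code).

    An existing local path is returned unchanged. Anything else is assumed to be
    a repo ID such as "EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-directml" and is
    passed to `download`.
    """
    if Path(model).exists():
        return str(Path(model))
    return download(model)
```

Passing the downloader as a parameter keeps the path check testable offline; a caller would normally just use `resolve_model_path("EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-directml")` and hand the result to the engine.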
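
The environment-based dropdown in the last item could work along these lines. This is only a sketch under stated assumptions: the names `BACKEND_PACKAGE`, `MODELS`, `detect_backends`, and `dropdown_choices` are hypothetical, the package names used as detection signals are placeholders that may not match how modelui.py detects the environment, and the model lists are shortened to one entry per backend.

```python
from importlib.util import find_spec
from typing import Callable, Dict, List

# Assumption: an importable package signals each backend. These package names
# are placeholders; substitute whatever check fits the actual installation.
BACKEND_PACKAGE: Dict[str, str] = {
    "Ipex": "ipex_llm",
    "DirectML": "onnxruntime_genai",
}

# Shortened model lists, one entry per backend, taken from the lists above.
MODELS: Dict[str, List[str]] = {
    "Ipex": ["microsoft/Phi-3-mini-4k-instruct"],
    "DirectML": ["EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-directml"],
    "CPU": ["EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-cpu-int4-rtn-block-32"],
}


def _has_package(name: str) -> bool:
    """Return True if a top-level package can be imported in this environment."""
    return find_spec(name) is not None


def detect_backends(has_package: Callable[[str], bool] = _has_package) -> List[str]:
    """List the usable backends; CPU is assumed to always be available."""
    found = [name for name, pkg in BACKEND_PACKAGE.items() if has_package(pkg)]
    return found + ["CPU"]


def dropdown_choices(backends: List[str], models: Dict[str, List[str]] = MODELS) -> List[str]:
    """Flatten the model lists of the given backends into dropdown options."""
    return [model for backend in backends for model in models.get(backend, [])]
```

A UI would then build its dropdown from `dropdown_choices(detect_backends())`, so a machine without the Ipex or DirectML packages only sees the CPU models.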

@szeyu szeyu requested a review from tjtanaa August 13, 2024 09:48

@tjtanaa tjtanaa left a comment

LGTM

@tjtanaa tjtanaa added the labels "priority: high", "type: documentation", and "type: enhancement / feature" Aug 16, 2024
@tjtanaa tjtanaa merged commit 25fcb76 into main Aug 16, 2024
@tjtanaa tjtanaa deleted the szeyu-autoloader-1 branch August 16, 2024 07:58

Development

Successfully merging this pull request may close these issues.

[FEAT] [ONNX Engine] Add Auto-Download of Model Weights from HF Repo
[FEAT] [UI] Add IpexLLM to Model UI
