Guide: Using Local ONNX Models
Thio's Universal Agent supports running fully local AI models, including ONNX Runtime GenAI models. It loads ONNX models directly, with no other apps required.
ONNX models are structured slightly differently from what you might be used to (like a single `.gguf` file), so this guide shows you how to find and use them.
- You might have seen that other local LLM tools (like Ollama or llama.cpp) typically use models packaged as a single file.
- However, ONNX Runtime GenAI models are distributed as a complete folder of several files, so in the config you enter a directory path, not a specific file.
Because the agent relies strictly on visual perception to operate, pure text-only ONNX models (such as standard Phi-3-mini or Llama-3) will not work. You must use a multimodal vision-language model (VLM). These typically have `-Vision-` or `VL` in the name; if not, vision support may be mentioned on the "model card" (basically the readme page of the model).
Example Vision Model Families:
- Phi-3 / Phi-3.5 / Phi-4 Vision (e.g., `Phi-3-vision-128k-instruct`, `Phi-4-multimodal-instruct`)
- Gemma-3 Vision (e.g., `gemma-3-4b-it`)
- Qwen-2.5-VL (e.g., `Qwen2.5-VL-7B-Instruct`)
IMPORTANT: Not every model has an ONNX version available.
Many popular models are pre-converted and optimized specifically for ONNX Runtime GenAI.
- Search Hugging Face for models containing `-onnx` or "ONNX Runtime GenAI".
- Example repositories:
- If there are multiple folders with names like `cpu` and `gpu`, open the one that matches your hardware (for example, whether you have a dedicated GPU or not).
  - Note: If there's a nested folder inside, keep going until you reach the one containing `genai_config.json`, the `.onnx` files, and the other model files.
- Download the entire contents of that folder into a folder on your machine.
  - You don't need to download the entire repository, just the single folder with the model files (the folder with `genai_config.json`).
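If you have already downloaded more than you needed and are unsure which nested folder to point the agent at, a short script can locate it. The following is a minimal sketch, not part of Thio's Universal Agent: the function name `find_model_dirs` is made up for illustration, and it only assumes what this guide states, namely that the right folder contains `genai_config.json` alongside at least one `.onnx` file.

```python
from pathlib import Path


def find_model_dirs(root):
    """Return every folder under root that holds genai_config.json and at least one .onnx file.

    Hypothetical helper for illustration; it is not part of the agent.
    """
    root = Path(root)
    found = []
    # rglob also matches a genai_config.json sitting directly in root.
    for config in sorted(root.rglob("genai_config.json")):
        folder = config.parent
        # Require at least one .onnx file next to the config file.
        if any(folder.glob("*.onnx")):
            found.append(folder)
    return found
```

Calling `find_model_dirs` on your download location returns a list of candidate folders; any one of them is a path you could enter as the model directory.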
🧐 Tip: You can use this browser tool I made to automatically download all the files in a Hugging Face folder
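If you prefer a script to a browser tool, the `huggingface_hub` Python package can fetch a single folder of a repository. This is a rough sketch under assumptions: it requires `pip install huggingface_hub`, the helper names `folder_pattern` and `download_model_folder` are invented here, and the repository ID and subfolder you pass in depend entirely on the model you picked.

```python
def folder_pattern(subfolder):
    """Build a glob pattern that matches every file inside one repository subfolder."""
    return subfolder.strip("/") + "/*"


def download_model_folder(repo_id, subfolder, local_dir):
    """Download only `subfolder` of a Hugging Face repository into `local_dir`.

    Hypothetical helper for illustration; it is not part of the agent.
    """
    # Imported lazily so folder_pattern works without the package installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=repo_id,
        allow_patterns=[folder_pattern(subfolder)],  # skip everything outside the subfolder
        local_dir=local_dir,
    )
```

The files keep their repository-relative layout, so after a call with the subfolder `gpu/some-variant` (a placeholder name) and the local directory `C:\AI\models`, the model folder would be `C:\AI\models\gpu\some-variant`.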
Once your model folder is ready, configure it in Thio's Universal Agent web UI:
Navigate to the Config menu. Under Provider Settings, select Local ONNX.
Configure the settings:
- Click "Detect Capabilities" to load the available settings, such as Execution Provider, for your system.
- Model Folder Path: Enter the absolute path to your model directory (e.g., `C:\AI\models\phi3-vision-dml`). Do not append a file name to this path.
- Execution Provider (EP):
  - `DML` (DirectML): Recommended for Windows users. It runs on most hardware (AMD, Intel, NVIDIA).
  - `CUDA`: If this option is available, select it if you have an NVIDIA GPU and the CUDA Toolkit installed.
  - `CPU`: A compatible fallback, but slow for processing vision/screenshots.
-