Replies: 5 comments 5 replies
-
|
I found how to download models but have some trouble. so i removed the model: Usage: openarc list [OPTIONS] [MODEL_NAME] Try 'openarc list --help' for help root@d3a3cad88907:/app# openarc list --rm Echo9Zulu/Qwen3-8B-ShiningValiant3-int4-asym-ov Note that in the doc the command is openarc list --remove The i added back the model with a different option: and loaded it: ────────────────────────────────────────────────────────────
How do you unload a model ? edit: found. It's trivial (but not in doc) replace load with unload :) ──────────────────────────────────────────────────────────── Total models loaded: 1` I guess ovgenai is not the correct engine. |
Beta Was this translation helpful? Give feedback.
-
|
Step by step i progress but still not there: I tried with another model and changed the add command parameters ( model path was not correct) ──────────────────────────────────────────────────────────── |
Beta Was this translation helpful? Give feedback.
-
|
Finally got it working :) |
Beta Was this translation helpful? Give feedback.
-
|
And here is a result of the first bench with default config (using GPU): depth (prior): 0 benching... (5/5) Qwen3-8B-ShiningValiant3-int4-asym-ov ┏━━━━━┳━━━┳━━━━━┳━━━━━┳━━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━━━┓ Does it seem correct to you ? |
Beta Was this translation helpful? Give feedback.
-
|
If you are ok with that i will continue to experiment and post here my results. ──────────────────────────────────────────────────────────── I guess it's a driver incompatibility but i don't know how to figure it out. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
I'm very new to ai and struggle to choose what components should i choose to buil a local ai stack.
For now i don't have any dGPU.
My hardware is a minisforum ms02 with an intel core ultra 285HX with 32Gb ram for the iGPU (32 is the max allowed for the igpu. 256Gb total ram).
The pc is running proxmox and for now all ai stuff is running in docker containers (docker host is a lxc container).
Drivers are installed ans devices (/dev/dri and /dev/accel) are usable in docker containers.
Openarc seems very promising for my use case (intel hardware) as i plan to add an intel dGPU later if i succeed building a useful local ai stack.
But for now i know very little about inferencing, agents, llm,....
I build and run locally openarc and openwebui. I have also a container ollama-intel but the project doesn't seem to have recent update and is not compatible with recent models.
I have read this repo but i still can't figure out how to start.
From a beginner perspective it's still hard to understand.
For example once the container is running the first thing to do is to add a model.
I mapped a folder on my host to /models in container. But how do i put a model here? what should be the /models folder structure and what files are needed ?
is this strucutre right ?
/models/model_name1 with files model.xml model.bin
/models/model_name2 with files model.xml model.bin
How can i use openarc with agents ?
The vast majority of content available is about ollama which functionnalities not available in openarc like models download.
Beta Was this translation helpful? Give feedback.
All reactions