Replies: 23 comments 1 reply
-
|
yes, use sdnq |
Beta Was this translation helpful? Give feedback.
-
|
Will it work on on fly quantization |
Beta Was this translation helpful? Give feedback.
-
|
If sdnq quantization enable will it use gpu or cpu |
Beta Was this translation helpful? Give feedback.
-
|
yes, it works with on-the-fly-quantization. it cant use quantization acceleration methods such as triton, but its still perfectly fine. |
Beta Was this translation helpful? Give feedback.
-
|
I will try now thankyou for your valuable time |
Beta Was this translation helpful? Give feedback.
-
|
shall i use Disty0/FLUX.2-klein-4B-SDNQ-4bit-dynamic |
Beta Was this translation helpful? Give feedback.
-
|
those are very different, all i can say - try them |
Beta Was this translation helpful? Give feedback.
-
|
OpenVINO unfortunately doesn't support any quantization type. OpenVINO tries to be clever and converts the quantized weights to FP16 and keeps them at FP16. We don't have any control over this. Even their own NNCF quantization does this with OpenVINO. SDNQ works as it doesn't error out and produces correct outputs but OpenVINO casts INT8 to FP16 and keeps the model as FP16. |
Beta Was this translation helpful? Give feedback.
-
my setting is right as i want to SDNQ sdxl only with pre mode as i have 16 GB ram and intel irisx igpu |
Beta Was this translation helpful? Give feedback.
-
|
SDXL doesn't support pre mode, set it to auto. |
Beta Was this translation helpful? Give feedback.
-
|
ok |
Beta Was this translation helpful? Give feedback.
-
|
shall i use pre SDQN model from your huggingface model Disty0/Z-Image-Turbo-SDNQ-uint4-svd-r32 in my 16 GB RAM intel irisx igpu |
Beta Was this translation helpful? Give feedback.
-
|
If you want to use zimage turbo and don't want to deal with quantization process on model load, then yes. |
Beta Was this translation helpful? Give feedback.
-
|
Disty0/Z-Image-Turbo-SDNQ-uint4-svd-r32 will this model work with my gpu |
Beta Was this translation helpful? Give feedback.
-
|
Any guidance regarding zimage turbo in sdnext with openvino |
Beta Was this translation helpful? Give feedback.
-
SDNQ will work but you won't see any memory reduction because of OpenVINO. |
Beta Was this translation helpful? Give feedback.
-
|
Ok i try |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
|
Z-Image is not compatible with FP16. Try setting OpenVINO execution mode to accuracy. |
Beta Was this translation helpful? Give feedback.
-
|
If F16 not work then what setting should I use for z image turbo SDNQ quantization |
Beta Was this translation helpful? Give feedback.
-
|
btw, for general questions pls use discussions, transferring this thread |
Beta Was this translation helpful? Give feedback.
-
|
Step by step guide me which setting i use to run z image turbo SDNQ quantization model in sdnext as I have Intel irisX igpu with 16gb ram,what image size , sampler,steps i should use |
Beta Was this translation helpful? Give feedback.
-
|
SDNQ doesn’t work with OpenVINO or Intel iGPU — it’s CUDA/NVIDIA only. |
Beta Was this translation helpful? Give feedback.


Uh oh!
There was an error while loading. Please reload this page.
-
I have windows 11 with intel irisx iGPU with 16 GB RAM. I am using sdnext with openvino Can u guide me which quantization I should use for GPU and how setup That quantization.
I want to use flux, z-image turbo and sdxl SDNQ quantization model will they work on igpu, if SDNQ not then which quantization work with Openvino , i did not get any guidance from site doc and discord, GUIDE me.
i am in DEV Branch of sdnext
Beta Was this translation helpful? Give feedback.
All reactions