add Qwen-Image quantized version in the gallery #6323

@SuperPat45

LocalAI version:
3.5.4 with docker image localai/localai:latest-gpu-nvidia-cuda-12

Environment, CPU architecture, OS, and Version:
Linux 6.11.0-25-generic #25-Ubuntu SMP PREEMPT_DYNAMIC x86_64 GNU/Linux
RTX 5090

Describe the bug
Qwen-Image generation needs too much VRAM, even for an RTX 5090.
Could you add a quantized version of this model to the gallery?

To Reproduce

Expected behavior

Logs

Additional context
https://huggingface.co/city96/Qwen-Image-gguf/tree/main
