GGUF RTN gets OOM on 16GB Intel GPU #1028

@xin3he

ONEAPI_DEVICE_SELECTOR=level_zero:3 python3 -m auto_round Qwen/Qwen3-8B --format gguf:q2_k_s --iters 0 --device_map xpu
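
A rough back-of-envelope check on why this might run out of memory, as a sketch only. It assumes the model has about 8.2 billion parameters and that the weights are placed on the device in 16-bit precision before quantization; neither is stated in this report, and the helper name `weight_memory_gib` is hypothetical. Activations, quantization buffers, and runtime overhead are not counted.

```python
# Back-of-envelope estimate of the memory needed just to hold model weights.
# ASSUMED_PARAMS is an assumption (approximate size of an 8B-class model),
# not a value taken from this issue.

def weight_memory_gib(num_params: float, bytes_per_param: float) -> float:
    """Return the memory needed to store the weights, in GiB."""
    return num_params * bytes_per_param / 2**30

ASSUMED_PARAMS = 8.2e9   # assumption: roughly 8.2 billion parameters
DEVICE_MEMORY_GIB = 16   # the 16GB Intel GPU from the issue title

fp16_gib = weight_memory_gib(ASSUMED_PARAMS, 2)  # 16-bit weights
fp32_gib = weight_memory_gib(ASSUMED_PARAMS, 4)  # 32-bit weights

print(f"16-bit weights: {fp16_gib:.1f} GiB of {DEVICE_MEMORY_GIB} GiB")
print(f"32-bit weights: {fp32_gib:.1f} GiB of {DEVICE_MEMORY_GIB} GiB")
```

Under these assumptions the 16-bit weights alone would take most of the card, leaving little headroom for anything else, which would be consistent with an OOM even with `--iters 0`.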
