-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
wan2.1 transformer gguf load error #11088
Comments
This is working on my end, the config is downloaded from import torch
from diffusers import (
WanTransformer3DModel,
GGUFQuantizationConfig,
)
from huggingface_hub import hf_hub_download
path = hf_hub_download(
repo_id="city96/Wan2.1-I2V-14B-480P-gguf", filename="wan2.1-i2v-14b-480p-Q4_1.gguf"
)
transformer = WanTransformer3DModel.from_single_file(
path,
quantization_config=GGUFQuantizationConfig(compute_dtype=torch.bfloat16),
torch_dtype=torch.bfloat16,
) |
Hello @hlky Does diffusers Wan2.1 support GGUF for text encoder? |
@nitinmukesh, not atm, UMT5 comes from Here is a minimal reproduction you can share with import torch
from transformers import UMT5EncoderModel
from huggingface_hub import hf_hub_download
path = hf_hub_download(
repo_id="city96/umt5-xxl-encoder-gguf", filename="umt5-xxl-encoder-Q8_0.gguf"
)
text_encoder = UMT5EncoderModel.from_pretrained(
"Wan-AI/Wan2.1-I2V-14B-480P-Diffusers", # or any Hub path with the correct config
subfolder="text_encoder",
gguf_file=path,
torch_dtype=torch.bfloat16,
) |
Thank you. |
Describe the bug
I am testing the performance of the Wan2.1 image-to-video generation on an RTX 4090 using Diffusers' Wan2.1 model【https://huggingface.co/Wan-AI/Wan2.1-I2V-14B-480P-Diffusers】 and the City96 quantized GGUF model【https://huggingface.co/city96/Wan2.1-I2V-14B-480P-gguf/tree/main】. I referred to this documentation: https://huggingface.co/docs/diffusers/quantization/gguf. However, I encountered an error while trying to load the GGUF model—it seems to be downloading additional data that I don't require. If you could provide any suggestions to resolve this, I would greatly appreciate it.
Reproduction
Logs
System Info
Copy-and-paste the text below in your GitHub issue and FILL OUT the two last points.
NVIDIA GeForce RTX 4090, 24564 MiB
NVIDIA GeForce RTX 4090, 24564 MiB
NVIDIA GeForce RTX 4090, 24564 MiB
NVIDIA GeForce RTX 4090, 24564 MiB
NVIDIA GeForce RTX 4090, 24564 MiB
NVIDIA GeForce RTX 4090, 24564 MiB
NVIDIA GeForce RTX 4090, 24564 MiB
Who can help?
@DN6 @a-r-r-o-w
The text was updated successfully, but these errors were encountered: