-
Notifications
You must be signed in to change notification settings - Fork 6.4k
Open
Description
Model/Pipeline/Scheduler description
Lumina-DiMOO is a unified multimodal model built on fully discrete diffusion. It supports a wide range of tasks, including text-to-image, image-to-image (editing, subject-driven generation, inpainting), and image understanding. It also outperforms existing open-source multimodal models with stronger results and much higher sampling efficiency. It would be great to have this model in diffusers.
Open source status
- The model implementation is available.
- The model weights are available (Only relevant if addition is not a scheduler).
Provide useful links for the implementation
project page: https://synbol.github.io/Lumina-DiMOO
code: https://github.com/Alpha-VLLM/Lumina-DiMOO
model weights: https://huggingface.co/Alpha-VLLM/Lumina-DiMOO
Metadata
Metadata
Assignees
Labels
No labels