Skip to content

Add Lumina-DiMOO as a pipeline #12358

@qianyu-dlut

Description

@qianyu-dlut

Model/Pipeline/Scheduler description

Lumina-DiMOO is a unified multimodal model built on fully discrete diffusion. It supports a wide range of tasks, including text-to-image, image-to-image (editing, subject-driven generation, inpainting), and image understanding. It also outperforms existing open-source multimodal models with stronger results and much higher sampling efficiency. It would be great to have this model in diffusers.

Open source status

  • The model implementation is available.
  • The model weights are available (Only relevant if addition is not a scheduler).

Provide useful links for the implementation

project page: https://synbol.github.io/Lumina-DiMOO
code: https://github.com/Alpha-VLLM/Lumina-DiMOO
model weights: https://huggingface.co/Alpha-VLLM/Lumina-DiMOO

@synbol @ChinChyi

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions