[CVPR 2026] Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models

Installation

conda create --name lavida python=3.13
conda activate lavida
pip install -e .[lavida]
pip install wheel
MAX_JOBS=32 pip install flash-attn==2.7.4.post1 --no-build-isolation
pip install jupyter notebook
pip install -U huggingface_hub[hf_xet] --force-reinstall

Download Checkpoint

Please download checkpoints from [Huggingface]

Inference

Example inference script is provided as demo_sparse_lavida.py. This script will run both the standard decoding and sparse decoding with token truncation, and benchmark output latency.

Training

See Training Readme for general training of LaViDa-O.

Please run the training script to finetune the model for sparse parameterization. The script is just standard SFT script with the following key flags for sparsity:

    --block_causal True \
    --gen_enc_add_pos_emb True \
    --num_register_tokens 64,64 \
    --num_register_groups 25,25 \

LICENSE

Both the model and code are licensed with Adobe Research License, which is included here [License.pdf].

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
assets		assets
config/data		config/data
data		data
eval		eval
eval_img		eval_img
llava		llava
scripts		scripts
.gitignore		.gitignore
Demo.ipynb		Demo.ipynb
License.pdf		License.pdf
README.md		README.md
demo_sparse_lavida.py		demo_sparse_lavida.py
download_checkpoint.py		download_checkpoint.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

[CVPR 2026] Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models

Installation

Download Checkpoint

Inference

Training

LICENSE

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

[CVPR 2026] Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models

Installation

Download Checkpoint

Inference

Training

LICENSE

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 1

Languages

Packages