Addition of a not-strictly-block-diffusion model #27993

settheworldonfireiii · 2026-06-12T03:36:37Z

settheworldonfireiii
Jun 12, 2026

The dLLM framework currently prioritizes block-diffusion models (LLaDA 2.0, SDAR), and 'non-block diffusion LLMs' is unchecked on the roadmap. Do you consider adding Fast-dLLM v1's version of bidirectional LLaDA-8B / Dream-7B with approximate-KV-cache + confidence-based decoding thresholding with gptq_marlin quantization or is it out-of-scope? And if it's in scope, is anyone already working on it, and what's the approximate timeline/ETA?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Addition of a not-strictly-block-diffusion model #27993

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Addition of a not-strictly-block-diffusion model #27993

Uh oh!

settheworldonfireiii Jun 12, 2026

Replies: 0 comments

settheworldonfireiii
Jun 12, 2026