We propose DrafterVerifierDiffusion (DVD), a speculative decoding framework that combines the strengths of both approaches: a lightweight KV-cached MDLM rapidly drafts multiple denoising steps, while a bidirectional MDLM verifies the outputs. (Work in progress.)
Install the dependencies defined in `pyproject.toml` using uv or any standard Python package manager.
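For example, assuming a standard `pyproject.toml` with a build backend configured:

```bash
# With uv: create/refresh the environment from pyproject.toml
uv sync

# Or with a standard pip workflow (editable install)
pip install -e .
```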
This repository is heavily customizable through `registry.yaml` and the Python script interfaces.
See the Customization Guide for detailed instructions on:
- Adding your own custom models, interacting with `runtime.py`, and extending the `load`, `tokenize`, and `generate` methods (a sketch follows below).
- Passing and receiving automatic custom kwargs through the broker system.
- Adding algorithms and extending the `verifier` and `drafter` step functions.
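To illustrate the shape of such an adapter, here is a minimal, hypothetical sketch. The actual base interface, method signatures, and the kwargs the broker forwards are defined by the repository's adapter scripts, so treat every name below as an assumption:

```python
# adapters/my_model.py -- hypothetical sketch, not the repository's actual interface.
import torch
from transformers import AutoModel, AutoTokenizer


class MyDiffusionAdapter:
    def load(self, model_path: str, device: str = "cuda", **kwargs):
        # Initialize the model and tokenizer; extra kwargs arrive via the broker.
        self.tokenizer = AutoTokenizer.from_pretrained(model_path)
        self.model = AutoModel.from_pretrained(model_path).to(device)
        self.device = device

    def tokenize(self, prompt: str, tokenizer_max_len: int = 32, **kwargs):
        # Convert a prompt string into model-ready tensors.
        return self.tokenizer(
            prompt,
            max_length=tokenizer_max_len,
            truncation=True,
            return_tensors="pt",
        ).to(self.device)

    @torch.no_grad()
    def generate(self, inputs, gen_length: int = 64, steps: int = 32, **kwargs):
        # Run the model's denoising loop; drafter/verifier behavior for your
        # specific diffusion model is implemented here.
        raise NotImplementedError("Fill in the denoising loop for your model.")
```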
The project is divided into core logic and user-provided interfaces:
- `src/`: Contains the core math and background engine logic (`runtime.py`, `model.py`, `utils.py`, `evals`).
- `adapters/`: Contains the user-provided Python scripts defining the interface for initializing and calling various diffusion models.
- `algorithms/`: Contains the user-provided Python scripts defining the verification schedules and data-flow logic.
- `registry.yaml`: The configuration hub mapping human-readable names to your custom models and algorithm scripts (see the sketch below).
- `main.py`: The CLI entrypoint for executing runs.
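The exact schema of `registry.yaml` is defined by the repository; as a rough, hypothetical sketch, an entry might map a human-readable name to an adapter script and model path like this:

```yaml
# Hypothetical registry.yaml sketch -- keys and structure are assumptions,
# not the repository's actual schema.
models:
  llada_8b_base:
    adapter: adapters/llada.py
    path: GSAI-ML/LLaDA-8B-Base
  llada_8b_instruct:
    adapter: adapters/llada.py
    path: GSAI-ML/LLaDA-8B-Instruct

algorithms:
  baseline_cascade:
    script: algorithms/baseline_cascade.py
```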
Run speculative generation via `main.py`. Basic inference lets you generate from a single prompt or a batch of prompts.
```bash
python main.py \
--drafter "llada_8b_base" \
--verifier "llada_8b_instruct" \
--algorithm "baseline_cascade" \
--prompt "Give me a short story for a 5 year old" \
--output_dir "baseline_output" \
--tokenizer_max_len 32 \
--gen_length 64 \
--steps 32
```

- `--drafter`: Name of the drafter model (from `registry.yaml`).
- `--verifier`: Name of the verifier model (from `registry.yaml`).
- `--algorithm`: Name of the verification algorithm (from `registry.yaml`).
- `--prompt`: A single prompt string (mutually exclusive with `--prompt_file`).
- `--prompt_file`: Path to a text or JSONL file containing a list of prompts (example format below).
- `--output_dir`: (Optional) Directory to save the final `output.jsonl` results.
- `--verbose`: (Optional) Prints detailed trace logs for adapters and verification steps.
- `**kwargs`: Any extra arguments (e.g. `--steps`, `--gen_length`) are dynamically bundled and passed directly into your loaded algorithm and adapter scripts.
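For `--prompt_file`, a plain text file holds one prompt per line; for JSONL, the expected key depends on the loader, but a minimal file might look like this (the `prompt` key here is an assumption):

```json
{"prompt": "Give me a short story for a 5 year old"}
{"prompt": "Explain speculative decoding in one paragraph"}
```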
We integrate with lm-eval for benchmarking on popular tasks (e.g. GSM8K). Use the scripts `benchmark_suite.sh` or `benchmark.sh`.
Run all configured algorithms (Baseline, Threshold, Top-K) across a set of steps:
```bash
bash benchmark_suite.sh
```

This iterates over the configured `STEPS_ARR=(10 20 30)`, computes metrics, and finally generates a plot via `scripts/plot_metrics.py`.
You can also run `run_evals.py` manually:
```bash
python scripts/run_evals.py \
--drafter "llada_8b_base" \
--verifier "llada_8b_instruct" \
--algorithm "top_k_intersection" \
--tasks "gsm8k" \
--limit 10 \
--steps 20 \
--gen_length 64
```