NI Sampling: Accelerating Discrete Diffusion Sampling by Token Order Optimization

This is the official implementation of the paper NI Sampling: Accelerating Discrete Diffusion Sampling by Token Order Optimization (ICLR 2026).

Overview

Neural Indicator Sampling (NI Sampling) is a framework for accelerating sampling in diffusion Large Language Models (LLMs). A lightweight neural indicator is trained to predict which tokens should be sampled at each step, which reduces redundant computation while maintaining generation quality. The training code will be released once it has been reorganized.

Prepare

Please install the following dependencies:

pip install torch==2.1.2 transformers==4.45.2 accelerate
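
After installing, it can help to confirm that the environment matches the pinned versions. The following is a minimal sketch, not part of this repository: the helper name `check_dependencies` and the `PINNED` table are hypothetical, and the only library call used is the standard `importlib.metadata.version`.

```python
from importlib import metadata

# Versions pinned by the install command above; accelerate is unpinned there.
PINNED = {"torch": "2.1.2", "transformers": "4.45.2", "accelerate": None}


def check_dependencies(pinned=PINNED):
    """Return {package: (installed_version_or_None, matches_pin)}.

    Hypothetical helper for illustration; it is not shipped with the repo.
    """
    report = {}
    for name, wanted in pinned.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None
        # Local build tags such as "2.1.2+cu121" still count as a match.
        ok = installed is not None and (
            wanted is None or installed.split("+")[0] == wanted
        )
        report[name] = (installed, ok)
    return report


if __name__ == "__main__":
    for name, (installed, ok) in check_dependencies().items():
        status = "ok" if ok else "check"
        print(f"{name}: {installed or 'not installed'} ({status})")
```

A package reported as "check" is either missing or installed at a version other than the one pinned above.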

Download Trained Indicator

We release the checkpoints of our trained indicator at this link. Please download them before running the evaluation.

Evaluation

We provide commands to evaluate the indicator on several benchmarks.

Key Hyperparameters

prob_threshold: Confidence threshold for sampling.
indicator_threshold: Threshold for the neural indicator.
block_length, gen_length, steps: Adjust these to evaluate different efficiency settings.
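
To make the roles of the two thresholds concrete, here is a small sketch of one plausible selection rule. It is an assumption for illustration, not the logic in `eval_llada.py`: a masked position is unmasked if its top-1 probability reaches `prob_threshold`, or if the indicator's score for it reaches `indicator_threshold`. The function name, the dictionary inputs, and the fallback to the single most confident position are all hypothetical.

```python
def select_positions(confidences, indicator_scores=None,
                     prob_threshold=0.9, indicator_threshold=0.89):
    """Pick which masked positions to unmask in one sampling step.

    confidences: dict mapping masked position -> top-1 token probability.
    indicator_scores: optional dict mapping position -> indicator output
        in [0, 1]; None models the baseline without an indicator.
    """
    if not confidences:
        return []

    # Confidence rule: accept positions the model is already sure about.
    chosen = [p for p, c in confidences.items() if c >= prob_threshold]

    if indicator_scores is not None:
        # ASSUMPTION: the indicator can accept extra positions on its own,
        # independently of the confidence rule.
        for p, score in indicator_scores.items():
            if p in confidences and p not in chosen and score >= indicator_threshold:
                chosen.append(p)

    if not chosen:
        # ASSUMPTION: always unmask at least one position so sampling
        # makes progress; take the most confident one.
        chosen = [max(confidences, key=confidences.get)]

    return sorted(chosen)


if __name__ == "__main__":
    conf = {3: 0.97, 5: 0.60, 8: 0.92}
    print(select_positions(conf, prob_threshold=0.9))
    print(select_positions(conf, {5: 0.95, 8: 0.50}, prob_threshold=0.95))
```

Under this reading, a stricter `prob_threshold` combined with an indicator can still unmask more positions per step than the confidence rule alone, which is where any speedup would come from.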

Baseline (Confidence Threshold)

# GSM8K
accelerate launch --main_process_port 11450 --num_processes 1 eval_llada.py --tasks gsm8k --model llada_dist --model_args model_path='GSAI-ML/LLaDA-8B-Instruct',gen_length=256,steps=256,block_length=64,prob_threshold=0.9

# MATH
accelerate launch --main_process_port 11450 --num_processes 1 eval_llada.py --tasks minerva_math --model llada_dist --model_args model_path='GSAI-ML/LLaDA-8B-Instruct',gen_length=256,steps=256,block_length=32,prob_threshold=0.9

# HumanEval
accelerate launch --main_process_port 11450 --num_processes 1 eval_llada.py --tasks humaneval --model llada_dist --confirm_run_unsafe_code --model_args model_path='GSAI-ML/LLaDA-8B-Instruct',gen_length=256,steps=256,block_length=32,prob_threshold=0.9

# MBPP
accelerate launch --main_process_port 11450 --num_processes 1 eval_llada.py --tasks mbpp --model llada_dist --confirm_run_unsafe_code --model_args model_path='GSAI-ML/LLaDA-8B-Instruct',gen_length=256,steps=256,block_length=32,prob_threshold=0.9

NI Sampling

# GSM8K
accelerate launch --main_process_port 11450 --num_processes 1 eval_llada.py --tasks gsm8k --model llada_dist --model_args model_path='GSAI-ML/LLaDA-8B-Instruct',gen_length=256,steps=256,block_length=64,prob_threshold=0.95,indicator_path="/PATH/TO/INDICATOR",indicator_threshold=0.89,use_indicator=True

# MATH
accelerate launch --main_process_port 11450 --num_processes 1 eval_llada.py --tasks minerva_math --model llada_dist --model_args model_path='GSAI-ML/LLaDA-8B-Instruct',gen_length=256,steps=256,block_length=32,prob_threshold=0.95,indicator_path="/PATH/TO/INDICATOR",indicator_threshold=0.89,use_indicator=True

# HumanEval
accelerate launch --main_process_port 11450 --num_processes 1 eval_llada.py --tasks humaneval --model llada_dist --confirm_run_unsafe_code --model_args model_path='GSAI-ML/LLaDA-8B-Instruct',gen_length=256,steps=256,block_length=32,prob_threshold=0.95,indicator_path="/PATH/TO/INDICATOR",indicator_threshold=0.89,use_indicator=True

# MBPP
accelerate launch --main_process_port 11450 --num_processes 1 eval_llada.py --tasks mbpp --model llada_dist --confirm_run_unsafe_code --model_args model_path='GSAI-ML/LLaDA-8B-Instruct',gen_length=256,steps=256,block_length=32,prob_threshold=0.95,indicator_path="/PATH/TO/INDICATOR",indicator_threshold=0.89,use_indicator=True
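
The commands above differ only in the task name, the block length, the unsafe-code flag, and the indicator arguments, so they can be generated from a small table when sweeping settings. The sketch below is a hypothetical convenience wrapper (the name `build_command` is not part of this repository); it only reassembles the flags that already appear in the commands above.

```python
import shlex


def build_command(task, model_args, unsafe_code=False,
                  port=11450, num_processes=1):
    """Return the argv list for one evaluation run (hypothetical helper)."""
    # --model_args takes a single comma-separated key=value string.
    args = ",".join(f"{key}={value}" for key, value in model_args.items())
    cmd = [
        "accelerate", "launch",
        "--main_process_port", str(port),
        "--num_processes", str(num_processes),
        "eval_llada.py",
        "--tasks", task,
        "--model", "llada_dist",
    ]
    if unsafe_code:
        # Used by the code-generation tasks (HumanEval, MBPP) above.
        cmd.append("--confirm_run_unsafe_code")
    cmd += ["--model_args", args]
    return cmd


if __name__ == "__main__":
    base = {
        "model_path": "GSAI-ML/LLaDA-8B-Instruct",
        "gen_length": 256,
        "steps": 256,
        "block_length": 64,
        "prob_threshold": 0.95,
    }
    ni = {
        "indicator_path": "/PATH/TO/INDICATOR",  # placeholder, as above
        "indicator_threshold": 0.89,
        "use_indicator": True,
    }
    # Print a shell-ready command; pass the list to subprocess.run to execute.
    print(shlex.join(build_command("gsm8k", {**base, **ni})))
```

Returning an argv list rather than a single string avoids shell-quoting issues if the indicator path contains spaces.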

Acknowledgement

This codebase is heavily based on LLaDA. We thank the authors for their contribution.

Citation

If you find our work helpful, please consider citing:

@inproceedings{liuni,
  title={NI Sampling: Accelerating Discrete Diffusion Sampling by Token Order Optimization},
  author={Liu, Enshu and Ning, Xuefei and Wang, Yu and Lin, Zinan},
  booktitle={The Fourteenth International Conference on Learning Representations}
}
