Tokens are the fundamental unit of computation in modern autoregressive models, and generation length directly influences both inference cost and reasoning performance. Despite its importance, existing approaches lack fine-grained length modeling and operate primarily at the coarse sequence level. In this paper, we introduce the Length Value Model (LenVM), a token-level framework that models the remaining generation length at each decoding step. By formulating length modeling as a value estimation problem and assigning a constant negative reward to each generated token, LenVM predicts a bounded, discounted return that serves as a proxy for the remaining generation horizon.
- Token-level value prediction — Fine-grained length modeling beyond coarse sequence-level objectives
- Annotation-free pretraining — Scalable supervision from generated trajectories without manual labels
- Cross-modal support — Works seamlessly across language-only and vision-language models
- Inference-time control — Dynamic length adjustment and performance-efficiency trade-offs
- Rich visualization tools — Interactive demos and value inspection utilities
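The inference-time control bullet can be sketched in terms of the value formulation: invert the predicted discounted value (constant per-token reward of -1, discount gamma) into a remaining-length estimate, then bias the EOS logit when that estimate exceeds the remaining token budget. The helper names, default gamma, and the bias rule below are illustrative assumptions, not the repository's actual guided-decoding API:

```python
import math

def estimate_remaining_tokens(value: float, gamma: float = 0.99) -> float:
    """Invert value = -(1 - gamma**L) / (1 - gamma) to recover L.

    Valid for values in (-1/(1 - gamma), 0]; gamma and the helper
    name are illustrative assumptions.
    """
    return math.log(1.0 + value * (1.0 - gamma)) / math.log(gamma)

def eos_logit_bias(value: float, budget_left: int, strength: float = 0.1,
                   gamma: float = 0.99) -> float:
    """Additive bias for the EOS logit: push toward stopping in proportion
    to how far the predicted remaining horizon overshoots the budget
    (a hypothetical control rule, not the repo's implementation)."""
    overshoot = estimate_remaining_tokens(value, gamma) - budget_left
    return strength * max(0.0, overshoot)
```

A value of -1 corresponds to one remaining token, so a generous budget yields zero bias, while a strongly negative value (long predicted horizon) against a small budget yields a positive push toward EOS.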
Length-Value-Model/
├── data_generation/ # Data generation pipeline and sampling scripts
├── LlamaFactory-LenVM/ # LenVM training framework (LlamaFactory fork)
├── inference/ # SGLang serving and guided decoding
├── sglang-LenVM/ # LenVM-enabled SGLang runtime
└── tools/ # Visualization and demo utilities
The data generation pipeline is located in ./data_generation/.
Complete pipeline: data_generation/run_data_generation.sh
Pipeline steps:
- Downloads and prepares datasets (deepmath-103k, OpenCodeReasoning-2, wildchat, R1-Onevision)
- Launches an SGLang server for trajectory sampling
- Generates training/test data for math, code, chat, and VLM tasks
- Groups samples by prompt index for LenVM training
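The final grouping step above can be sketched as follows; the record layout (dicts carrying a prompt_idx field) is an assumed schema for illustration, not the pipeline's actual data format:

```python
from collections import defaultdict

def group_by_prompt(samples):
    """Group sampled trajectories by the prompt they were drawn from,
    so multiple completions of one prompt land in the same training group
    (the field name 'prompt_idx' is a hypothetical schema)."""
    groups = defaultdict(list)
    for sample in samples:
        groups[sample["prompt_idx"]].append(sample)
    return dict(groups)
```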
Data management:
Generated data can be downloaded or uploaded using hf.sh.
LenVM training is built on the customized LlamaFactory-LenVM/ fork.
Training configuration:
Example configs are available in Length-Value-Model/LlamaFactory-LenVM/examples/train_lenvm/
Launch training:
cd Length-Value-Model
llamafactory-cli train \
  LlamaFactory-LenVM/examples/train_lenvm/base-qwen2.5-7b-instruct-lenvm-qwen2.5-1.5b-instruct.yaml

More examples:
See Length-Value-Model/train_lf.sh for additional training configurations.
LenVM-enabled inference scripts are in Length-Value-Model/inference/.
Launch SGLang server: inference/sglang_server.sh
Supported models:
- Qwen3-30B-A3B-Instruct-2507
- Qwen2.5-3B-Instruct
- Qwen2.5-7B-Instruct
- Qwen2.5-VL-7B-Instruct
Quick testing:
For visualization and sanity checks: inference/test_sglang_lvm.sh
Build a standalone interactive demo from logged model outputs:
cd Length-Value-Model
python tools/build_lenvm_hover_demo.py

This generates an HTML demo for inspecting token-level values and generation dynamics.
If you find this work useful, please cite:
