Sync/publish v0.2.1 GitHub by PanAndy · Pull Request #366 · alibaba/ROLL

PanAndy · 2026-03-09T03:37:48Z

大家好！感谢大家对ROLL的关注。
ROLL近期更新了大量新功能，以下是近期更新的一些梳理，我们将持续对ROLL进行迭代更新，欢迎加入ROLL的社区。

🚀亮点:

rollout 重构为由router调度，支持sglang-router
新增[On-Policy Distillation](docs_roll/i18n/zh-Hans/docusaurus-plugin-content-docs/current/User Guides/Pipeline/on_policy_distill_pipeline_start.md)训练支持
支持Qwen3.5 Dense / MoE 系列模型

🚀主要新特性：

Rollout
- 重构router调度支持
  - sglang strategy重构，同时支持engine、server两种模式。
  - Scheduler重构(rlvr的DynamicScheduler/ agentic的Rolloutscheduler)，统一由Router提供调度
  - 迁移原LoadBalancer、RequestScheduler为PromptAffinityRouter、EnvAffinityRouter
  - 新增支持sglang-router
pipeline recipe
- 新增On-Policy Distillation训练支持
Models
- 支持Qwen3.5 Dense/MoE系列模型
docker
- torch2.10 、vllm 0.16.0 nightly、vllm0.15.1版本、mcore 0.16.0
bug fix:
- 默认设置vllm VLLM_USE_FLASHINFER_SAMPLER=0 for torch 280，解决reponse重复度过高
- fix sglang & vllm 偶现port conflict
- fix sglang multi-nodes fail when infer_dp > 1
- fix reward worker metrics 透出能力
- fix model download get_node_ip cache，可能导致死锁timeout
- fix CPU offload时FSDP2 DCE save
- fix FSDP2 model initialization casting

…endency on ray._private.ray_constants.

… DPO pipeline.

…llout chat_template function typo.

…to training strategy.

…pt check.

PanAndy · 2026-03-09T03:38:29Z

#360

PanAndy · 2026-03-09T03:42:51Z

#363

PanAndy and others added 30 commits March 6, 2026 15:22

(fix): set vllm VLLM_USE_FLASHINFER_SAMPLER=0 for torch 280.

a9ef061

(fix): set sglang port range to avoid conflicting.

e7feb1a

(fix): fix sglang multi-nodes fail when worker num > 1.

f80f88d

(fix): optimize port allocation logic with atomic operation.

1be764a

(chore): fix qwen3-vl-32B 80GB config.

6499fe0

(fix): hardcode default async concurrency limit to 1000 to remove dep…

a058dc6

…endency on ray._private.ray_constants.

(fix): fix reward metrics expo.

a55436f

(fix): fix batch num tokens.

08079a1

(fix): fix vllm process weights.

5d8d154

(fix): fix func download get_node_ip.

8627137

(fix): fix sglang process weights.

7fc50bf

(fix): Make offload states configurable and Fix batch size setting in…

0a6a47e

… DPO pipeline.

(feat): support vllm 0.15.1.

17c0dd8

(fix): FSDP2 DCP Saving when CPU Offload.

20a934d

(feat): support sglang-router.

aff9054

(feat): add Dockerfile for torch2.10.0, support vllm 0.16.dev.

9c5dd64

(fix): pyarrow>15.0.0 jemalloc coredump, add torch2.10.0 deps, fix ro…

9c163ff

…llout chat_template function typo.

(feat): update mcore adapter.

b6df273

(feat): support training for qwen3.5-27B.

4593cc7

(fix): refactor sharded state dict metadata handling and integrate in…

71da469

…to training strategy.

(chore): move EnvAffinityRouter and PartialGPUManager to router.py.

9a3efd3

(fix): gracefully shutdown of Router.

a76c1f2

(chore): release docker image for torch2.10.0.

03f7c77

(feat): add example config for qwen3_5_35ba3.

de8a8a4

(fix): correct parameter name when constructing reward cluster.

8d01dea

(feat): support onpolicy distillation.

17ffcfd

(fix): fix version compare of torch for pg_options_param_name.

5a87769

(fix): separated the system role check from the skip_mock_system_prom…

868c49d

…pt check.

(fix): prevent sync generate request execution during shutdown.

0883643

(docs): update readme.

dfbc41f

dilixiati.dlxtmhte and others added 3 commits March 9, 2026 11:25

(fix): FSDP2 Model Initialization & Casting.

ddaffab

fix bugs in strategy config and opd config

2a7899c

(fix): add context parallel loss reduction in trainer.

25bc649

PanAndy merged commit 2eba7c3 into main Mar 9, 2026
6 checks passed

PanAndy mentioned this pull request Mar 9, 2026

🚀 [2026/03/09] Recent Updates Summary for ROLL Project #367

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sync/publish v0.2.1 GitHub#366

Sync/publish v0.2.1 GitHub#366
PanAndy merged 33 commits intomainfrom
sync/publish_v0.2.1_github

PanAndy commented Mar 9, 2026

Uh oh!

PanAndy commented Mar 9, 2026

Uh oh!

PanAndy commented Mar 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Conversation

PanAndy commented Mar 9, 2026

Uh oh!

PanAndy commented Mar 9, 2026

Uh oh!

PanAndy commented Mar 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants