Skip to content

v0.6.0

Latest

Choose a tag to compare

@pan-x-c pan-x-c released this 26 Jun 10:04
f39a4d8

Overview

⭐️ Highlights

Trinity-RFT v0.6.0 adds SGLang support, optimizes fully asynchronous parameter synchronization and scheduling to reduce bubbles, improves MoE training stability, and upgrades the veRL backend to v0.8.0.

Explorer

  1. Add SGLang inference engine support, including SGLang OpenAI API responses with prompt_token_ids, token_ids, and routed_experts.
  2. Upgrade vLLM to v0.23.0 and SGLang to v0.5.13.
  3. Refactor rollout model creation and management to support cross-node deployment of rollout models.
  4. Optimize fully asynchronous parameter synchronization with partial rollouts.
  5. Refactor weight synchronization with vLLM's WeightTransferEngine.
  6. Improve MoE rollout support by returning routed experts from vLLM/SGLang OpenAI API paths.
  7. Add data, pipeline, and expert parallel settings for rollout models to improve rollout performance.
  8. Enhance multi-turn and multi-modal rollout support.
  9. Add rollout performance benchmarking tools.

Trainer

  1. Upgrade veRL to v0.8.0 while keeping v0.7.1 supported.
  2. Improve Qwen3.5 compatibility with FSDP and Megatron backends.
  3. Improve MoE model training stability and reduce OOM risk in FSDP/Megatron training paths.

Buffer

  1. Refactor Buffer with an asynchronous implementation.
  2. Update the SQL schema and add filtering support to the Experience Viewer.
  3. Optimize the Experience Viewer.

Others

  1. Add Connect the Dots (CoD) example.
  2. Add InterCode-SQL example/benchmark.
  3. Add the GiGPO and DAPO algorithms.
  4. Improve Docker and remote GPU unit test workflows.

🚨 Breaking Changes

Due to database schema changes, the database files in the existing checkpoint folder cannot be recognized by the new version. To continue running previous experiments, please remove the corresponding .db files (typically located in the checkpoint_root_dir/project/name/buffer directory).

What's Changed

New Contributors

Full Changelog: v0.5.2...v0.6.0