Skip to content

AstraFlow v0.1.0

Choose a tag to compare

@haizhongzheng haizhongzheng released this 20 May 06:44
· 51 commits to main since this release

First public release of AstraFlow, a dataflow-oriented RL system for (multi-)agentic LLMs.

Highlights

  • Fully async multi-policy collaborative RL
  • Elastic heterogeneous cross-region rollouts (RaaS)
  • Substitutable rollout and trainer services
  • Composable data algorithms (GRESO, dynamic sampling, buffer replay)

Recipes

math/, math-multi-agent/, math-efficient-data/, code/, code-multi-agent/, search/, alfworld/, webshop/. See the docs for details.

Install

docker run --gpus all --net=host --shm-size=512g -it astraflowai/astraflow:v0.1.0

Or from source — see the installation guide. Requires Linux, Python 3.10–3.12, CUDA 12.9.

Status

Alpha — runs end-to-end, but 0.x APIs may evolve. Issues welcome: https://github.com/Infini-AI-Lab/astraflow/issues