AstraFlow v0.1.0
First public release of AstraFlow, a dataflow-oriented RL system for (multi-)agentic LLMs.
- 📖 Paper: https://arxiv.org/abs/2605.15565
- 🌐 Site: https://Infini-AI-Lab.github.io/astraflow/
- 📚 Docs: https://Infini-AI-Lab.github.io/astraflow/docs/
Highlights
- Fully async multi-policy collaborative RL
- Elastic heterogeneous cross-region rollouts (RaaS)
- Substitutable rollout and trainer services
- Composable data algorithms (GRESO, dynamic sampling, buffer replay)
Recipes
math/, math-multi-agent/, math-efficient-data/, code/, code-multi-agent/, search/, alfworld/, webshop/. See the docs for details.
Install
docker run --gpus all --net=host --shm-size=512g -it astraflowai/astraflow:v0.1.0
Or install from source; see the installation guide. Requires Linux, Python 3.10–3.12, and CUDA 12.9.
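For a from-source install, a quick preflight check against the requirements above can save a failed build. The following is a minimal sketch, not part of AstraFlow: the function names `check_environment` and `current_environment_problems` are hypothetical, and the check covers only the OS and Python version (it does not verify CUDA 12.9).

```python
import platform
import sys
from typing import List, Tuple

# Supported range as stated in these release notes: Python 3.10 through 3.12.
MIN_PY = (3, 10)
MAX_PY = (3, 12)


def check_environment(system: str, py_version: Tuple[int, int]) -> List[str]:
    """Return human-readable problems; an empty list means both checks passed.

    Hypothetical helper for illustration. CUDA is intentionally not checked.
    """
    problems = []
    if system != "Linux":
        problems.append(f"unsupported OS: {system} (Linux required)")
    if not (MIN_PY <= py_version <= MAX_PY):
        found = ".".join(str(part) for part in py_version)
        problems.append(f"unsupported Python: {found} (3.10 to 3.12 required)")
    return problems


def current_environment_problems() -> List[str]:
    """Run the checks against the interpreter and OS executing this script."""
    return check_environment(platform.system(), tuple(sys.version_info[:2]))
```

Calling `current_environment_problems()` before installing returns an empty list on a supported host, or a list of messages describing what does not match.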
Status
Alpha — runs end-to-end, but 0.x APIs may evolve. Issues welcome: https://github.com/Infini-AI-Lab/astraflow/issues