EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence

[📜 Paper] [⭐️Project Page] [🤗 Model 7B] [🤗 Model 32B]

Introduction

Current models for embodied AI face challenges in agent alignment, real-time performance, and authentic evaluation. To address this, we propose EmbodiedBrain 1.0, a novel 7B and 32B vision-language foundation model that achieves state-of-the-art performance.

This repository provides the modular framework for training and evaluating our model. It is a comprehensive toolkit for embodied intelligence models, supporting both large-scale Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL).

Key features of our framework include:

An agent-aligned data structure.
A powerful RL training methodology, Step-Augmented Group Relative Policy Optimization (Step-GRPO).
A comprehensive and flexible reward system (e.g., a Generative Reward Model) with support for custom plugins.
A three-part evaluation benchmark, including a novel simulation environment, to ensure robust validation.

We will open-source all of our data, model weights, and evaluation methods to advance research in generalist embodied agents.

📁 Project Structure

Here is the core directory structure of this project:

EmbodiedBrain1.0/
├── data/
│   └── ...           (Coming Soon: Datasets will be released here)
│
├── evaluation/
│   └── ...           (Contains model evaluation scripts)
│
├── examples/
│   └── ...           (Contains partial examples of data construction)
│
└── train/
    ├── SFT/
    │   └── ...         (SFT training scripts based on LlamaFactory)
    │
    └── RL/
        └── ms-swift-main/
            ├── main_experiments/
            │   ├── plugin/
            │   │   └── ...     (Custom reward functions adapted for the SWIFT framework)
            │   │
            │   └── train_bash/
            │       └── train_rl.sh (RL training entry script)
            │
            └── ... (Other files for the SWIFT framework)

🚀 Quick Start

1. Setup

First, clone this repository to your local machine:

git clone [https://github.com/](https://github.com/)[Your-Username]/EmbodiedBrain1.0.git
cd EmbodiedBrain1.0

This project depends on the LlamaFactory and SWIFT frameworks. Please install the required dependencies according to their official documentation:

Install LlamaFactory (for SFT): Please refer to the LlamaFactory Official Documentation for environment setup.
Install SWIFT (for RL): Please refer to the SWIFT Official Documentation for environment setup.

2. SFT Training

SFT training scripts are located in the train/SFT/ directory.

cd train/SFT
# Run the LlamaFactory training command
# (Please modify according to LlamaFactory usage and the scripts you prepared in the SFT directory)
bash train_sft.sh

3. RL Training

RL training scripts are located in train/RL/ms-swift-main/main_experiments/train_bash/.

The ms-swift-main/main_experiments/plugin/ directory contains custom reward functions adapted for the SWIFT framework, which you can call in your training configuration.

# Enter the RL training script directory
cd train/RL/ms-swift-main/main_experiments/train_bash
# Start RL training
bash train_rl.sh

4. Evaluation

Evaluation scripts are located in the evaluation/ directory.

🙏 Acknowledgements

The development of this project relies on the following excellent open-source frameworks:

📄 License

This project is released under the MIT License.

Star History

🖊️ Citation

If you find this work helpful in your research, please consider giving this repo a star ⭐ and citing our paper:

@article{zou2025embodiedbrain,
  title={EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence},
  author={Zou, Ding and Wang, Feifan and Ge, Mengyu and Fan, Siyuan and Zhang, Zongbing and Chen, Wei and Wang, Lingfeng and Hu, Zhongyou and Yan, Wenrui and Gao, Zhengwei and Wang, Hao and Jin, Weizhao and Zhang Yu and Zhao, Hainan and Zhang, Mingliang and Xi, Xianxian and Zhang, Yaru and Li, Wenyuan and Wu, Yunpeng and Gao, Zhengguang and Zhu, Yurui},
  journal={arXiv preprint arXiv:2510.20578},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
evaluation		evaluation
examples		examples
images		images
train		train
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence

Introduction

📁 Project Structure

🚀 Quick Start

1. Setup

2. SFT Training

3. RL Training

4. Evaluation

🙏 Acknowledgements

📄 License

Star History

🖊️ Citation

About

Uh oh!

Releases

Packages

Languages

License

ZTERobot/EmbodiedBrain1.0

Folders and files

Latest commit

History

Repository files navigation

EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence

Introduction

📁 Project Structure

🚀 Quick Start

1. Setup

2. SFT Training

3. RL Training

4. Evaluation

🙏 Acknowledgements

📄 License

Star History

🖊️ Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages