ForceVLA is built on the π₀ model, a flow-matching-based vision-language-action (VLA) model; both training and inference follow the π₀ pipeline.
To run the models in this repository, you will need an NVIDIA GPU with at least the following specifications. These estimates assume a single GPU; you can also use multiple GPUs with model parallelism to reduce per-GPU memory requirements by setting fsdp_devices in the training config. Note that the current training script does not yet support multi-node training.
| Mode | Memory Required | Example GPU |
|---|---|---|
| Inference | > 8 GB | RTX 4090 |
| Fine-Tuning (LoRA) | > 22.5 GB | RTX 4090 |
| Fine-Tuning (Full) | > 70 GB | A100 (80GB) / H100 |
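As a rough back-of-envelope illustration of how fsdp_devices trades GPU count against per-GPU memory: sharding the model state evenly across N devices divides the table's single-GPU requirement by N. The helper below is a hypothetical sketch (activation memory and communication buffers add real-world overhead, so treat the result as a lower bound).

```python
# Illustrative estimate only: FSDP-style sharding splits parameter and
# optimizer state across `fsdp_devices`, so per-GPU state memory shrinks
# roughly linearly. Activations add extra overhead in practice.

def per_gpu_memory_gb(total_gb: float, fsdp_devices: int) -> float:
    """Approximate per-GPU memory when model state is sharded evenly."""
    return total_gb / fsdp_devices

# Full fine-tuning needs > 70 GB on a single GPU (see table above);
# sharding across 4 GPUs lowers the per-GPU state requirement.
print(per_gpu_memory_gb(70, 4))  # 17.5
```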
The repo has been tested on Ubuntu 22.04; other operating systems are not currently supported.
The real-world dataset is available on Hugging Face: https://huggingface.co/datasets/qiaojunyu/ForceVLA-real-data
When cloning this repo, make sure to update submodules:
git submodule update --init --recursive
conda create -n forcevla python=3.11 -y
conda activate forcevla
python -m pip install --upgrade pip setuptools wheel
conda install -c nvidia cuda-toolkit=12.8
cd lerobot/
conda install ffmpeg=7.1.1 -c conda-forge
pip install -e .
cd ./openpi
pip install -e .
cd dlimp/
pip install -e .
cd packages/
cd openpi-client/
pip install -e .
cd flaxformer/
pip install -e .
export HF_LEROBOT_HOME="xxxxxx"
python scripts/compute_norm_stats.py --config-name forcevla_lora
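The compute_norm_stats.py step precomputes dataset normalization statistics before training. As a hypothetical sketch of what such statistics are used for (the function names and the exact normalization scheme below are illustrative assumptions, not the repo's implementation): per-dimension statistics are computed once over the training data, then applied to normalize states/actions at both train and inference time.

```python
import numpy as np

# Illustrative sketch: per-dimension mean/std computed once over the
# dataset, then reused to normalize inputs consistently everywhere.

def compute_norm_stats(data: np.ndarray) -> dict:
    """Per-dimension statistics; epsilon guards against zero std."""
    return {"mean": data.mean(axis=0), "std": data.std(axis=0) + 1e-6}

def normalize(x: np.ndarray, stats: dict) -> np.ndarray:
    return (x - stats["mean"]) / stats["std"]

actions = np.array([[0.0, 1.0], [2.0, 3.0], [4.0, 5.0]])
stats = compute_norm_stats(actions)
print(normalize(actions, stats)[1])  # → [0. 0.] (middle row equals the mean)
```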
XLA_PYTHON_CLIENT_MEM_FRACTION=0.9 python scripts/train.py forcevla_lora --exp-name=my_experiment --overwrite --batch_size 32 --save_interval 2000 --keep_period 10000
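A quick illustration of how the checkpoint flags above interact, under the assumed semantics that a checkpoint is written every save_interval steps and only checkpoints at multiples of keep_period are retained long-term (this is a sketch of typical behavior, not a guarantee about the script's internals):

```python
# Hypothetical illustration (assumed semantics): checkpoints are written
# every `save_interval` steps; those at multiples of `keep_period` are
# retained long-term while intermediate ones may be rotated out.

def saved_steps(total_steps: int, save_interval: int) -> list[int]:
    return list(range(save_interval, total_steps + 1, save_interval))

def kept_long_term(steps: list[int], keep_period: int) -> list[int]:
    return [s for s in steps if s % keep_period == 0]

steps = saved_steps(20_000, 2_000)
print(kept_long_term(steps, 10_000))  # [10000, 20000]
```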