Go2_ARX_mjlab

Go2_ARX_mjlab adds a Unitree Go2 + ARX L5 arm robot to mjlab, with reinforcement-learning tasks for base velocity tracking and end-effector pose tracking.

This repository includes:

a combined Go2 + ARX L5 MuJoCo XML model
position-control actions for the Go2 legs and ARX arm
end-effector pose commands for the arm
base velocity commands for the quadruped
flat and rough terrain task registrations

Tasks

The added task IDs are:

Mjlab-Velocity-Flat-Go2arm
Mjlab-Velocity-Rough-Go2arm

Main implementation files:

src/mjlab/asset_zoo/robots/go2arm/
src/mjlab/tasks/velocity/config/go2arm/
src/mjlab/tasks/velocity/mdp/go2arm_lab.py

Installation

This project uses the same setup as mjlab. From the repository root:

uv sync

Training requires an NVIDIA GPU.

Sanity Check

Run the environment with zero actions:

uv run play Mjlab-Velocity-Flat-Go2arm \
  --agent zero \
  --viewer viser \
  --num-envs 1

Run the environment with random actions:

uv run play Mjlab-Velocity-Flat-Go2arm \
  --agent random \
  --viewer viser \
  --num-envs 1

Training

You can use either Weights & Biases or TensorBoard for logging. The commands below use TensorBoard for demonstration.

Train on flat terrain:

uv run train Mjlab-Velocity-Flat-Go2arm \
  --env.scene.num-envs 4096 \
  --agent.logger tensorboard

Train on rough terrain:

uv run train Mjlab-Velocity-Rough-Go2arm \
  --env.scene.num-envs 4096 \
  --agent.logger tensorboard

To resume from a local checkpoint:

uv run train Mjlab-Velocity-Flat-Go2arm \
  --env.scene.num-envs 4096 \
  --agent.resume True \
  --agent.load-run RUN_DIRECTORY_NAME \
  --agent.load-checkpoint model_1000.pt \
  --agent.logger tensorboard

Play a Trained Policy

Play a checkpoint:

uv run play Mjlab-Velocity-Flat-Go2arm \
  --checkpoint-file /path/to/model.pt \
  --viewer viser \
  --num-envs 1

For visualization-only debugging, you can disable terminations:

uv run play Mjlab-Velocity-Flat-Go2arm \
  --checkpoint-file /path/to/model.pt \
  --viewer viser \
  --num-envs 1 \
  --no-terminations True

Deployment

Deployment support is currently in progress. The repository includes native MuJoCo sim-to-sim scripts for checkpoint sanity checks before real-robot deployment.

Run a checkpoint in native MuJoCo with fixed commands:

uv run python deploy/simulation/sim2sim.py \
  --checkpoint /path/to/model.pt \
  --lin-vel-x 0.2 \
  --lin-vel-y 0.0 \
  --ang-vel-z 0.0 \
  --ee-x 0.48 \
  --ee-y 0.0 \
  --ee-z 0.36

Run the keyboard-controlled sim-to-sim viewer:

uv run python deploy/simulation/sim2sim_keyboard.py \
  --checkpoint /path/to/model.pt

Keyboard commands are read from the terminal while the MuJoCo window is used for visualization:

W / S: base forward velocity +/-
A / D: base lateral velocity +/-
Q / E: base yaw velocity +/-
I / K: EE x +/-
J / L: EE y +/-
U / O: EE z +/-
R / F: EE pitch offset +/-
T / G: EE yaw offset +/-
Z / X: EE roll +/-
Space: zero base velocity
C: reset EE command
V: print current command
Esc or Ctrl-C: quit

The sim-to-sim scripts visualize the target end-effector pose and the measured end-effector pose in the MuJoCo viewer.

Commands and Actions

The policy receives two commands:

base_velocity: desired base linear and yaw velocity
ee_pose: desired end-effector pose, represented as (x, y, z, qw, qx, qy, qz)

The action space has 18 dimensions:

12 Go2 leg joint position targets
6 ARX L5 arm joint position targets

Assets

Robot assets include Unitree Go2 and ARX L5 resources. See the included license files:

src/mjlab/asset_zoo/robots/go2arm/xmls/unitree_go2/LICENSE
src/mjlab/asset_zoo/robots/go2arm/xmls/arx_l5/LICENSE

Contact/Author

If you have questions, issues, or suggestions, please open a GitHub issue or contact the maintainer:

Zhongyu Chen
Email: chenzhongyu56@gmail.com

Acknowledgments

This project is built on top of the mjlab framework. Many thanks to the mjlab authors and contributors for making this work possible.

The reinforcement-learning task design and algorithm setup reference Go2Arm_Lab. We sincerely thank the authors for their open-source work.

License

This repository is based on mjlab and keeps the original Apache-2.0 license. See LICENSE.

Third-party assets and code retain their original licenses. In particular, check the license files bundled with the Go2 and ARX L5 assets before using them in commercial or redistributed projects.

If you use the underlying mjlab framework in research, please also cite the original mjlab project.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
deploy/simulation		deploy/simulation
docs/assets		docs/assets
src/mjlab		src/mjlab
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Go2_ARX_mjlab

Tasks

Installation

Sanity Check

Training

Play a Trained Policy

Deployment

Commands and Actions

Assets

Contact/Author

Acknowledgments

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Go2_ARX_mjlab

Tasks

Installation

Sanity Check

Training

Play a Trained Policy

Deployment

Commands and Actions

Assets

Contact/Author

Acknowledgments

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages