- [2025-11] We release the preview version of TextOp, including code, pretrained models, and a demo.
We propose TextOp, a novel framework for real-time, interactive, text-driven humanoid robot motion generation and control. It allows users to instruct the robot using natural language and modify commands on the fly, producing smooth, whole-body motions instantly.
Our system utilizes a two-layer architecture for execution. At the high level, a robot motion diffusion autoregressive model (RobotMDAR) processes the user's current text command to generate a kinematic motion trajectory. At the low level, a universal motion tracking policy (Tracker) handles motor control. In this way, TextOp achieves both instant responsiveness and precise robot control.
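As a rough illustration of this two-layer design, the sketch below shows how a high-level text-to-motion model and a low-level tracking policy could be composed in a control loop. All names here (RobotMDAR, TrackerPolicy, generate_chunk, act, the robot interface) are hypothetical placeholders for explanation only, not the actual TextOp API.

```python
# Minimal sketch of the two-layer architecture, assuming hypothetical
# class/method names; the real TextOp interfaces may differ.
import time


class RobotMDAR:
    """High level: diffusion autoregressive model mapping text to kinematic motion."""

    def generate_chunk(self, text_command, motion_history):
        # Autoregressively generate the next short window of whole-body
        # reference frames, conditioned on the text and recent motion.
        raise NotImplementedError


class TrackerPolicy:
    """Low level: universal motion tracking policy mapping references to motor commands."""

    def act(self, robot_state, reference_frame):
        # Map proprioceptive state + one reference frame to joint targets.
        raise NotImplementedError


def control_loop(mdar, tracker, robot, get_user_text, dt=0.02):
    """Compose the two layers: re-plan at a low rate, track at the control rate."""
    history = []
    while True:
        text = get_user_text()                      # can change on the fly
        chunk = mdar.generate_chunk(text, history)  # short kinematic trajectory
        for ref_frame in chunk:
            state = robot.read_state()
            robot.apply(tracker.act(state, ref_frame))
            time.sleep(dt)                          # e.g. a 50 Hz low-level loop
        history.extend(chunk)
```

The key point of the split is that the slow, text-conditioned planner and the fast tracking policy run at different rates, so new commands can be absorbed at the next planning step without interrupting low-level control.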
TextOp is highly versatile and supports a wide range of behaviors, from simple gestures to complex motion sequences, all without pre-recorded scripts or manual programming. This approach provides a significantly more intuitive human-robot interaction paradigm, unlocking the potential for highly adaptable and easily controllable robots in real-world applications.
Key features:
- End-to-end open-source pipeline covering dataset construction, model training, and real-robot deployment.
- High-fidelity motion tracking: our universal Tracker policy achieves nearly 100% success per sequence on cleaned training data.
- Clean and modular codebase, designed for readability, maintainability, and easy extension.
TextOp/
│
├── TextOpRobotMDAR/ # High-level text-to-motion model
├── TextOpTracker/ # Low-level whole-body universal motion tracking policy
├── TextOpDeploy/ # Sim2sim and Sim2real deployment
├── dataset/ # Scripts for dataset processing
├── deps/ # Third-party packages
└── docs/
We also provide the retargeted public datasets used in our experiments, as well as pretrained models for both RobotMDAR and the Tracker policy. These resources enable you to reproduce our results out of the box.
Our models are trained on a mixture of public datasets and a small private dataset. However, comparable performance should be achievable using only the public data.
See USAGE.md for details.
This project is licensed under the MIT License - see the LICENSE file for details.
TextOpTracker is built upon BeyondMimic. TextOpRobotMDAR is based on a reimplementation of DART, adapted for robot configurations.
We use publicly available human motion datasets, including AMASS with BABEL-TEACH annotations and LAFAN1, and employ GMR for retargeting.
Feel free to open an issue or discussion if you encounter any problems or have questions about this project.
For collaborations, feedback, or further inquiries, please reach out to:
- Weiji Xie: xieweiji249@sjtu.edu.cn or Weixin: shisoul
- Jiakun Zheng: zjk9098@gmail.com
- Chenjia Bai: baicj@chinatelecom.cn
- You can also join our Weixin discussion group for timely Q&A. Since the group already has more than 200 members, you'll first need to add one of the authors on Weixin to receive an invitation to join.
We welcome contributions and are happy to support the community in building upon this work!
