Shuanghao Bai*, Dakai Wang*, Cheng Chi*, Wanqi Zhou, Jing Lyu, Xiaoguang Zhao, Pengwei Wang, Zhongyuan Wang, Lei Xing, Shanghang Zhang, Badong Chen
-
Trajectory-level MEE reshapes action error distributions in VLA models.
-
Enhances the accuracy and robustness of action generation under standard, few-shot, noisy, and imbalanced settings within a characterized range.
-
A plug-and-play training objective with no inference-time overhead.
-
Provides theoretical analysis that characterizes its optimization properties, robustness, and range of applicability.
Refer to INSTALL.md for installation instructions.
Refer to DOWNLOAD_DATASET.md for instructions on downloading datasets.
Refer to DOWNLOAD_MODEL.md for instructions on downloading pre-trained models.
Refer to USAGE.md for instructions on training and evaluation.
If you have any questions, please create an issue on this repository or contact us at baishuanghao@stu.xjtu.edu.cn.
If you find our work useful, please consider citing:
@article{bai2026reshaping,
title={Reshaping Action Error Distributions for Reliable Vision-Language-Action Models},
author={Bai, Shuanghao and Wang, DaKai and Chi, Cheng and Zhou, Wanqi and Lyu, Jing and Zhao, Xiaoguang and Wang, Pengwei and Wang, Zhongyuan and Xing, Lei and Zhang, Shanghang and Chen, Badong},
journal={arXiv preprint arXiv:2602.04228},
year={2026}
}
@article{bai2025rethinking,
title={Rethinking Latent Redundancy in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation},
author={Bai, Shuanghao and Zhou, Wanqi and Ding, Pengxiang and Zhao, Wei and Wang, Donglin and Chen, Badong},
journal={arXiv preprint arXiv:2502.02853},
year={2025}
}
This project is primarily built upon starVLA, with the image noise components adapted from CronusVLA.
