Audio-VLA: Adding Contact Audio Perception to Vision-Language-Action Model for Robotic Manipulation

This repository contains the source code for AudioVLA. The demo website is based on the Nerfies website.

Website License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.