[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
[CVPR 2025] The official implementation of "Universal Actions for Enhanced Embodied Foundation Models"
✨✨Official implementation of BridgeVLA
Official implementation of paper "AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning"
Release of code, datasets and model for our work TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials
Official repo for AGNOSTOS, a cross-task manipulation benchmark, and X-ICM, a cross-task in-context manipulation (VLA) method
AGI-Elo: How Far Are We From Mastering A Task?
PickAgent: OpenVLA-powered Pick and Place Agent | Gradio&Simulation | Vision Language Action Model
Track 2: Social Navigation
VLAGen: Automated Data Collection for Generalizing Robotic Policies