A toolbox for tracking and visualizing the real-world deployment process of your VLA models.

🦾 VLA-Lab

The Missing Toolkit for Vision-Language-Action Model Deployment

Python 3.8+ License: MIT PyPI version

Debug • Visualize • Analyze your VLA deployments in the real world

🚀 Quick Start · 📖 Documentation · 🎯 Features · 🔧 Installation


🎯 Why VLA-Lab?

Deploying VLA models to real robots is hard. You face:

  • 🕵️ Black-box inference — Can't see what the model "sees" or why it fails
  • ⏱️ Hidden latencies — Transport delays, inference bottlenecks, control loop timing issues
  • 📊 No unified logging — Every framework logs differently, making cross-model comparison painful
  • 🔄 Tedious debugging — Replaying failures requires manual log parsing and visualization

VLA-Lab solves this. One unified toolkit for all your VLA deployment needs.

┌─────────────────────────────────────────────────────────────────────────────┐
│                              VLA-Lab Architecture                           │
├─────────────────────────────────────────────────────────────────────────────┤
│                                                                             │
│   ┌──────────────┐    ┌──────────────────────┐    ┌────────────────────┐    │
│   │   Robot      │    │   Inference Server   │    │    VLA-Lab         │    │
│   │   Client     │───▶│   (DP / GR00T / ...) │───▶│    RunLogger       │    │
│   └──────────────┘    └──────────────────────┘    └─────────┬──────────┘    │
│                                                             │               │
│                                                             ▼               │
│                          ┌──────────────────────────────────────────┐       │
│                          │            Unified Run Storage           │       │
│                          │   ┌──────────┬────────────┬───────────┐  │       │
│                          │   │meta.json │ steps.jsonl│ artifacts/│  │       │
│                          │   └──────────┴────────────┴───────────┘  │       │
│                          └──────────────────┬───────────────────────┘       │
│                                             │                               │
│                                             ▼                               │
│   ┌──────────────────────────────────────────────────────────────────────┐  │
│   │                        Visualization Suite                           │  │
│   │  ┌─────────────┐  ┌──────────────────┐  ┌─────────────────────────┐  │  │
│   │  │  Inference  │  │     Latency      │  │       Dataset           │  │  │
│   │  │   Viewer    │  │     Analyzer     │  │       Browser           │  │  │
│   │  └─────────────┘  └──────────────────┘  └─────────────────────────┘  │  │
│   └──────────────────────────────────────────────────────────────────────┘  │
│                                                                             │
└─────────────────────────────────────────────────────────────────────────────┘

✨ Features

📊 Unified Logging Format

Standardized run structure with JSONL + image artifacts. Works across all VLA frameworks.
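
As a sketch of what a unified record can look like, the snippet below writes and re-reads two JSONL step records using only the standard library. The field names mirror the `vlalab.log()` examples later in this README, but the exact on-disk schema here is an assumption, not VLA-Lab's documented format:

```python
import json

# Hypothetical step records: one JSON object per line, with images stored
# as relative artifact paths (assumed layout, see the directory structure below).
steps = [
    {"step_idx": 0, "state": [0.5, 0.2, 0.3], "action": [0.51, 0.21, 0.31],
     "images": {"front": "artifacts/images/step_000000_front.jpg"},
     "inference_latency_ms": 32.1},
    {"step_idx": 1, "state": [0.51, 0.21, 0.31], "action": [0.52, 0.22, 0.32],
     "images": {"front": "artifacts/images/step_000001_front.jpg"},
     "inference_latency_ms": 30.4},
]

# Serialize: one compact JSON object per line, as in steps.jsonl.
jsonl = "\n".join(json.dumps(s) for s in steps)

# Deserialize: any framework's log becomes comparable once in this shape.
parsed = [json.loads(line) for line in jsonl.splitlines()]
```

Because each line is an independent JSON object, logs can be appended crash-safely and streamed without loading the whole file.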

🔬 Inference Replay

Step-by-step playback with multi-camera views, 3D trajectory visualization, and action overlays.

📈 Deep Latency Analysis

Profile transport delays, inference time, and control loop frequency. Find your bottlenecks.

🗂️ Dataset Browser

Explore Zarr-format training/evaluation datasets with an intuitive UI.


🔧 Installation

pip install vlalab

Or install from source:

git clone https://github.com/ky-ji/VLA-Lab.git
cd VLA-Lab
pip install -e .

🚀 Quick Start

Minimal Example (3 Lines!)

import vlalab

# Initialize a run
run = vlalab.init(project="pick_and_place", config={"model": "diffusion_policy"})

# Log during inference
vlalab.log({"state": obs["state"], "action": action, "images": {"front": obs["image"]}})

Full Example

import time

import vlalab

# Initialize with detailed config
run = vlalab.init(
    project="pick_and_place",
    config={
        "model": "diffusion_policy",
        "action_horizon": 8,
        "inference_freq": 10,
    },
)

# Access config anywhere
print(f"Action horizon: {run.config.action_horizon}")

# Inference loop
for step in range(100):
    obs = get_observation()
    
    t_start = time.time()
    action = model.predict(obs)
    latency = (time.time() - t_start) * 1000
    
    # Log everything in one call
    vlalab.log({
        "state": obs["state"],
        "action": action,
        "images": {"front": obs["front_cam"], "wrist": obs["wrist_cam"]},
        "inference_latency_ms": latency,
    })

    robot.execute(action)

# Auto-finishes on exit, or call manually
vlalab.finish()

Launch Visualization

# One command to view all your runs
vlalab view

📸 Screenshots

Coming soon: Inference Viewer, Latency Analyzer, Dataset Browser screenshots


📖 Documentation

Core Concepts

Run — A single deployment session (one experiment, one episode, one evaluation)

Step — A single inference timestep with observations, actions, and timing

Artifacts — Images, point clouds, and other media saved alongside logs

API Reference

vlalab.init() — Initialize a run
run = vlalab.init(
    project: str = "default",     # Project name (creates subdirectory)
    name: str = None,             # Run name (auto-generated if None)
    config: dict = None,          # Config accessible via run.config.key
    dir: str = "./vlalab_runs",   # Base directory (or $VLALAB_DIR)
    tags: list = None,            # Optional tags
    notes: str = None,            # Optional notes
)
vlalab.log() — Log a step
vlalab.log({
    # Robot state
    "state": [...],                    # Full state vector
    "pose": [x, y, z, qx, qy, qz, qw], # Position + quaternion
    "gripper": 0.5,                    # Gripper opening (0-1)
    
    # Actions
    "action": [...],                   # Single action or action chunk
    
    # Images (multi-camera support)
    "images": {
        "front": np.ndarray,           # HWC numpy array
        "wrist": np.ndarray,
    },
    
    # Timing (any *_ms field auto-captured)
    "inference_latency_ms": 32.1,
    "transport_latency_ms": 5.2,
    "custom_metric_ms": 10.0,
})
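
Since every `*_ms` field is captured per step, latency summaries can be computed straight from the log. A minimal sketch of such an aggregation (illustrative, not VLA-Lab's own implementation):

```python
import statistics

# Example logged steps; any key ending in "_ms" is treated as a latency metric.
logged_steps = [
    {"inference_latency_ms": 32.1, "transport_latency_ms": 5.2},
    {"inference_latency_ms": 28.7, "transport_latency_ms": 4.8},
    {"inference_latency_ms": 35.0, "transport_latency_ms": 6.1},
]

def summarize_latencies(steps):
    """Collect every *_ms field across steps and report mean/max per metric."""
    metrics = {}
    for step in steps:
        for key, value in step.items():
            if key.endswith("_ms"):
                metrics.setdefault(key, []).append(value)
    return {
        key: {"mean": statistics.mean(vals), "max": max(vals)}
        for key, vals in metrics.items()
    }

summary = summarize_latencies(logged_steps)
```

The same per-metric lists also feed percentile plots, which is typically more informative than means for spotting tail latency.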

RunLogger — Advanced API

For fine-grained control over logging:

from vlalab import RunLogger

logger = RunLogger(
    run_dir="runs/experiment_001",
    model_name="diffusion_policy",
    model_path="/path/to/checkpoint.pt",
    task_name="pick_and_place",
    robot_name="franka",
    cameras=[
        {"name": "front", "resolution": [640, 480]},
        {"name": "wrist", "resolution": [320, 240]},
    ],
    inference_freq=10.0,
)

logger.log_step(
    step_idx=0,
    state=[0.5, 0.2, 0.3, 0, 0, 0, 1, 1.0],
    action=[[0.51, 0.21, 0.31, 0, 0, 0, 1, 1.0]],
    images={"front": image_rgb},
    timing={
        "client_send": t1,
        "server_recv": t2,
        "infer_start": t3,
        "infer_end": t4,
    },
)

logger.close()
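
The four timestamps above are enough to split a round trip into transport, queueing, and inference components. A sketch of that arithmetic, assuming client and server clocks are roughly synchronized (the helper name below is ours, not part of the VLA-Lab API):

```python
def latency_breakdown_ms(timing):
    """Split a round trip into transport, queue, and inference components.

    Assumes timestamps are in seconds (as from time.time()) and that the
    client and server clocks are approximately synchronized.
    """
    return {
        "transport_ms": (timing["server_recv"] - timing["client_send"]) * 1000,
        "queue_ms": (timing["infer_start"] - timing["server_recv"]) * 1000,
        "inference_ms": (timing["infer_end"] - timing["infer_start"]) * 1000,
    }

# Example: 5 ms on the wire, 2 ms queued, 30 ms inferring.
breakdown = latency_breakdown_ms({
    "client_send": 100.000,
    "server_recv": 100.005,
    "infer_start": 100.007,
    "infer_end":   100.037,
})
```

If the clocks are not synchronized, only the server-side intervals (queue and inference) are trustworthy; transport time then has to be estimated from the client-side round trip instead.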

CLI Commands

# Launch visualization dashboard
vlalab view [--port 8501]

# Convert legacy logs (auto-detects format)
vlalab convert /path/to/old_log.json -o /path/to/output

# Inspect a run
vlalab info /path/to/run_dir

πŸ“ Run Directory Structure

vlalab_runs/
└── pick_and_place/                 # Project
    └── run_20240115_103000/        # Run
        ├── meta.json               # Metadata (model, task, robot, cameras)
        ├── steps.jsonl             # Step records (one JSON per line)
        └── artifacts/
            └── images/             # Saved images
                ├── step_000000_front.jpg
                ├── step_000000_wrist.jpg
                └── ...
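
Because a run is just `meta.json` plus line-delimited step records, it can be loaded with nothing but the standard library. A sketch against the layout above (the field names inside the files are assumptions):

```python
import json
from pathlib import Path

def load_run(run_dir):
    """Load a run directory: meta.json plus one dict per line of steps.jsonl."""
    run_dir = Path(run_dir)
    meta = json.loads((run_dir / "meta.json").read_text())
    steps = [
        json.loads(line)
        for line in (run_dir / "steps.jsonl").read_text().splitlines()
        if line.strip()
    ]
    return meta, steps
```

Images referenced from the step records resolve relative to the run directory, so `run_dir / step["images"]["front"]` would locate the saved frame under this assumed layout.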

🗺️ Roadmap

  • Core logging API
  • Streamlit visualization suite
  • Diffusion Policy adapter
  • GR00T adapter
  • Pi adapter
  • Cloud sync & team collaboration
  • Real-time streaming dashboard
  • Automatic failure detection
  • Integration with robot simulators

🤝 Contributing

We welcome contributions!

git clone https://github.com/ky-ji/VLA-Lab.git
cd VLA-Lab
pip install -e .

📄 License

MIT License — see LICENSE for details.


⭐ Star us on GitHub if VLA-Lab helps your research!

Built with ❤️ for the robotics community
