RoboCasa: wire training-data co-training + 12-DoF projection head for meaningful eval (#379)

## Context

The initial RoboCasa365 sim-eval integration (`src/opentau/envs/robocasa.py`, `RoboCasaEnv` config, factory dispatch) makes the **eval half** of the loop work: parallel vec envs, success-rate aggregation, and `grid_summary` wandb videos all run against the real sim. But RoboCasa is currently **eval-only**: there is no RoboCasa training data in the dataset mixture and no projection head sized for its action/state, so there is no RoboCasa-trained policy for eval to run.

LIBERO, by contrast, is a closed loop: `TensorAuto/libero` (20 fps v2.1) is co-trained, then eval on the LIBERO sim yields benchmark-comparable success rates.

## Problem

- No RoboCasa LeRobot dataset is wired into `DatasetMixtureConfig`.
- RoboCasa's robot (PandaOmron) is **12-D action / 16-D state / 3 cameras**, distinct from LIBERO's 7-D action / 8-D state. There is no validated per-`(robot_type, control_mode)` projection head for it, and no norm stats.
- The example eval config (`configs/examples/pi05_robocasa_eval_config.json`) loads a **LIBERO** checkpoint as a plumbing smoke test, so rollouts are effectively random and the reported "success rate" is not yet meaningful for RoboCasa.
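
To make the dimension gap concrete, here is a minimal sketch that compares the sizes quoted above. `EmbodimentSpec` and `mismatches` are hypothetical names introduced for illustration; they are not part of the OpenTau codebase.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class EmbodimentSpec:
    """Hypothetical summary of an embodiment's action/state sizes."""
    action_dim: int
    state_dim: int


# Sizes as quoted in this issue.
LIBERO = EmbodimentSpec(action_dim=7, state_dim=8)
PANDA_OMRON = EmbodimentSpec(action_dim=12, state_dim=16)


def mismatches(trained_on: EmbodimentSpec, evaluated_on: EmbodimentSpec) -> List[str]:
    """Return human-readable dimension mismatches between checkpoint and env."""
    problems = []
    for field in ("action_dim", "state_dim"):
        ckpt, env = getattr(trained_on, field), getattr(evaluated_on, field)
        if ckpt != env:
            problems.append(f"{field}: checkpoint {ckpt} vs env {env}")
    return problems


for line in mismatches(LIBERO, PANDA_OMRON):
    print(line)
```

A check like this could run at eval start-up to warn that a checkpoint was not trained for the environment's embodiment, rather than silently reporting a success rate.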

## Why it matters

This is the single thing that turns RoboCasa from "validated plumbing" into a real benchmark sibling of LIBERO. Until a RoboCasa-trained policy exists, the eval metrics are not interpretable.

## Suggested approach

- Add a RoboCasa LeRobot dataset (e.g. one of the public `lerobot/robocasa_*` repos) to the mixture and confirm image/state/action keys line up with the env's `features_map`.
- Register a `(robot_type="PandaOmron", control_mode=...)` projection head for the 12-D action / 16-D state, reusing the per-`(robot_type, control_mode)` projection machinery (cf. #371 / #370 / #374), and confirm norm stats.
- Run a short smoke training run and confirm that a RoboCasa-trained policy beats random on `CloseFridge` (the end-to-end validation that makes "success rate" mean something).
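
The first two steps can be sketched as follows. This is an illustration under stated assumptions, not the existing projection machinery from #370 / #371 / #374: `PROJECTION_DIMS`, `register_projection`, and `missing_dataset_keys` are hypothetical names, `features_map` is assumed to map env observation keys to dataset/policy keys, the key strings are made up, and `"<control_mode>"` is a placeholder because the control mode is not specified in this issue.

```python
from typing import Dict, Set, Tuple

# Hypothetical registry: (robot_type, control_mode) -> (action_dim, state_dim).
ProjectionKey = Tuple[str, str]
PROJECTION_DIMS: Dict[ProjectionKey, Tuple[int, int]] = {}


def register_projection(robot_type: str, control_mode: str,
                        action_dim: int, state_dim: int) -> None:
    """Record the head sizes for one (robot_type, control_mode) pair."""
    key = (robot_type, control_mode)
    if key in PROJECTION_DIMS:
        raise ValueError(f"projection already registered for {key}")
    PROJECTION_DIMS[key] = (action_dim, state_dim)


def missing_dataset_keys(features_map: Dict[str, str],
                         dataset_keys: Set[str]) -> Set[str]:
    """Keys the env's mapping expects that the dataset does not provide."""
    return set(features_map.values()) - dataset_keys


# Placeholder control mode; sizes are the ones quoted in this issue.
register_projection("PandaOmron", "<control_mode>", action_dim=12, state_dim=16)

# Made-up keys, for illustration only.
features_map = {
    "agentview_image": "observation.images.cam0",
    "robot_state": "observation.state",
}
dataset_keys = {"observation.images.cam0", "action"}

print(sorted(missing_dataset_keys(features_map, dataset_keys)))
```

An empty result from `missing_dataset_keys` would indicate that every key the env maps to is present in the dataset; shapes and norm stats would still need a separate check.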

## References

- `src/opentau/envs/robocasa.py`, `RoboCasaEnv` in `src/opentau/envs/configs.py`
- per-`(robot_type, control_mode)` projections: PRs #370, #371, #374

_Follow-up to the initial RoboCasa env integration (branch `claude/lucid-albattani-b33067`)._
