
Starred repositories
[CVPR 2025] InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions
We present Object Images (Omages): An homage to the classic Geometry Images.
SpatialLM: Large Language Model for Spatial Understanding
NVIDIA Isaac GR00T N1 is the world's first open foundation model for generalized humanoid robot reasoning and skills.
Tactile Sensing and Simulation; Visual Tactile Manipulation; Open Source.
AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind
HumEnv is an SMPL humanoid environment enabling systematic model comparison and reproducibility
I love CV.
Machine Learning Toolset for Houdini
A.K.A. Fourier Advanced Robot Teleoperation System (F.A.R.T.S.) 💨
Official implementation of the paper "SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training"
Official implementation of "ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills"
Witness the aha moment of VLM with less than $3.
🔥 SpatialVLA: a spatial-enhanced vision-language-action model trained on 1.1 million real robot episodes.
Transformer-based 2D-to-3D Human Pose Estimation
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
An easy-to-use Jekyll theme for creating a workshop webpage (useful for AI / ML / CV / robotics folks)
[CVPR 2025] Official implementation of the solvers and estimators proposed in the paper "Relative Pose Estimation through Affine Corrections of Monocular Depth Priors"
Genesis Reinforcement Learning Environments
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence