Stars
[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
The repo for "AnyTouch: Learning Unified Static-Dynamic Representation across Multiple Visuo-tactile Sensors", ICLR 2025
Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"
[ICLR 2025] Reconstructive Visual Instruction Tuning
A suite of image and video neural tokenizers
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
[ICLR 2025] Codebase for "CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation"