Lists (1)
Sort Name ascending (A-Z)
Starred repositories
This package contains the original 2012 AlexNet code.
Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
A Python package that provides evaluation and visualization tools for the HO-Cap dataset
Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"
[ICLR 2024] M/EEG-based image decoding with contrastive learning. i. Propose a contrastive learning framework to align image and eeg. ii. Resolving brain activity for biological plausibility.
Solve Visual Understanding with Reinforced VLMs
MoBA: Mixture of Block Attention for Long-Context LLMs
[ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Official Code for ECCV 2024 paper "EgoPoser: Robust Real-Time Egocentric Pose Estimation from Sparse and Intermittent Observations Everywhere"
The open-source solutions of FourCastNet and GraphCast
Code repository for emg2pose dataset and model benchmarks
A library for human kinematic motion and numerical optimization solvers to apply human motion
Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah
LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.
HInt dataset from HaMeR: Reconstructing Hands in 3D with Transformers
Code repository for Weakly-Supervised 3D Hand Reconstruction with Knowledge Prior and Uncertainty Guidance, ECCV2024
Python script for rendering 3D human poses using Blender
[ECCV'24] Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops
[ECCV'24] 3D Hand Pose Estimation in Everyday Egocentric Images
SEED-Voken: A Series of Powerful Visual Tokenizers