Stars
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
4DHumans: Reconstructing and Tracking Humans with Transformers
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"
Official Github Repo for Neurips 2024 Paper Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment
MoVQGAN - model for the image encoding and reconstruction
[CVPR 2024] Official PyTorch implementation of SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering
Python library for loading and using triangular meshes.
Nvdiffrast - Modular Primitives for High-Performance Differentiable Rendering
COLMAP - Structure-from-Motion and Multi-View Stereo
Scaling Diffusion Transformers with Mixture of Experts
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Model Compression Toolbox for Large Language Models and Diffusion Models
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
Accelerated First Order Parallel Associative Scan
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"
Efficient vision foundation models for high-resolution generation and perception.
Implementation of Autoregressive Diffusion in Pytorch
Open-Sora: Democratizing Efficient Video Production for All
`std::execution`, the proposed C++ framework for asynchronous and parallel programming.
Implementation of "High Speed and Robust RGB-Thermal Tracking via Dual Attentive Stream Siamese Network" on Pytorch.
[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation
A Unreal Engine 5 (UE5) based plugin aiming to provide real-time visulization, management, editing, and scalable hybrid rendering of Guassian Splatting model.