Stars
NeurIPS 2024 - Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation
Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"
Video Generation Foundation Models: https://saiyan-world.github.io/goku/
Erasing Concepts from Diffusion Models
Qwen2.5-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
Cosmos is a world model development platform that consists of world foundation models, tokenizers, and a video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
A suite of image and video neural tokenizers
Make websites accessible for AI agents
UniVRM is a glTF-based VRM format implementation for Unity. English: https://vrm.dev/en/ Japanese: https://vrm.dev/
VRM Software for Windows to move avatar with minimal devices.
A generative world for general-purpose robotics & embodied AI learning.
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
Ablating Concepts in Text-to-Image Diffusion Models (ICCV 2023)
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"
InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds
CityGaussian Series for High-quality Large-Scale Scene Reconstruction with Gaussians
[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
A Unified Framework for Real-Time Rendering on the Web
[CVPR 2021] Multi-Modal-CelebA-HQ: A Large-Scale Text-Driven Face Generation and Understanding Dataset
[AAAI 2024] ConceptBed Evaluations for Personalized Text-to-Image Diffusion Models
[CVPR 2024] GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation