Nanyang Technological University
- Singapore
- shangchenzhou.com
- @ShangchenZhou
Stars
The official implementation of "Bokeh Diffusion: Defocus Blur Control in Text-to-Image Diffusion Models"
[CVPR2025] SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration
A node-based image processing GUI aimed at making chaining image processing tasks easy and customizable. Born as an AI upscaling application, chaiNNer has grown into an extremely flexible and power…
[CVPR 2025] 3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement
[ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization
Author's Implementation for E-LatentLPIPS
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Concept Sliders for Precise Control of Diffusion Models
[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation
Official inference repo for FLUX.1 models
BokehMe: When Neural Rendering Meets Classical Rendering (CVPR 2022 Oral)
🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)
Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"
Official repository for "SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE"
High-quality and editable surfel 3D Gaussian generation through native 3D diffusion (ICLR 2025)
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
State-of-the-art 2D and 3D Face Analysis Project
[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
SEED-Voken: A Series of Powerful Visual Tokenizers
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image