-
National University of Singapore
- Singapore
-
10:11
- 8h ahead - enderfga.cn
- https://orcid.org/0009-0004-7344-2333
- @Enderfga
- in/enderfga
Highlights
- Pro
Lists (3)
Sort Name ascending (A-Z)
Stars
Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.
[CVPR 2025] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Enjoy the magic of Diffusion models!
Official Repo for the Paper Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control
Lora traing script for Lightricks LTX-video
Solve Visual Understanding with Reinforced VLMs
[CVPR 2025] X-Dyna: Expressive Dynamic Human Image Animation
Fine-tuned LLMs generate accurate 3D human avatars from textual descriptions using the SMPL-X model, enhancing customization and simulation in virtual environments.
Memory-optimized training library for diffusion models
A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation
A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which aims to generate realistic composite image.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
The official Python library for the OpenAI API
A curated list of recent diffusion models for video generation, editing, and various other applications.
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
📹 A more flexible framework that can generate videos at any resolution and creates videos from images.
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
A minimal and universal controller for FLUX.1.
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
Three Ways of Generating Terrain with Erosion Features
Simple Controlnet module for CogvideoX model.
Official inference repo for FLUX.1 models