🌟 Align diffusion processes with detailed human preferences to improve machine learning models for richer, more accurate outputs.
reinforcement-learning offline animation popup rl alertview generative super-resolution actionsheet diffusion pose-estimation camera-pose-estimation score-based-models d4rl two-view-geometry srpo behavior-regularization eccv2024
-
Updated
Oct 14, 2025 - Python