

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first work to systematically explore R1 for video]

Python 185 4 Updated Mar 28, 2025

Generative Uncertainty in Diffusion Models

Jupyter Notebook 4 Updated Mar 6, 2025

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Jupyter Notebook 986 69 Updated Mar 25, 2023

Medical Diffusion: This repository contains the code to our paper Medical Diffusion: Denoising Diffusion Probabilistic Models for 3D Medical Image Synthesis

Jupyter Notebook 401 68 Updated May 12, 2023

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

1,738 156 Updated Jan 4, 2025

Align Anything: Training All-modality Model with Feedback

Python 3,102 394 Updated Mar 23, 2025

DepictQA: Depicted Image Quality Assessment with Vision Language Models

Python 129 5 Updated Feb 27, 2025
Python 23 3 Updated May 17, 2024
Jupyter Notebook 95 3 Updated Mar 14, 2024

CogView4, CogView3-Plus and CogView3(ECCV 2024)

Python 952 67 Updated Mar 21, 2025

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 2,346 196 Updated Mar 23, 2025

Reproduction of DDPO paper (RLHF for diffusion)

Jupyter Notebook 83 2 Updated Sep 20, 2023

Solve Visual Understanding with Reinforced VLMs

Python 4,396 271 Updated Mar 24, 2025

A comprehensive collection of IQA papers

TeX 1,147 74 Updated Mar 20, 2025

Witness the aha moment of VLM with less than $3.

Python 3,424 268 Updated Mar 1, 2025

[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution

Python 2,378 151 Updated Jul 12, 2024

②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.

Python 219 10 Updated Aug 12, 2024

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,404 1,443 Updated Mar 10, 2025

An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.

Python 229 6 Updated Feb 4, 2025

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Python 598 21 Updated Mar 17, 2025

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

Python 1,234 92 Updated Apr 25, 2024

[ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.

Python 65 4 Updated Feb 17, 2025
Python 2 Updated Dec 16, 2024

IP Adapter Instruct

Python 203 4 Updated Aug 10, 2024

③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.

Python 394 25 Updated Mar 12, 2025

Memory-optimized training library for diffusion models

Python 1,001 111 Updated Mar 24, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,452 792 Updated Mar 12, 2025

Let's finetune video generation models!

Python 431 18 Updated Mar 26, 2025