Wuziyi616
🎯 Focusing

Highlights

  • Pro

Organizations

@VectorInstitute @pairlab

Starred repositories

Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization

Python 7 Updated Mar 20, 2025

This package contains the original 2012 AlexNet code.

Cuda 2,294 289 Updated Mar 12, 2025

Code release for DynamicTanh (DyT)

Python 771 70 Updated Mar 18, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 795 43 Updated Mar 20, 2025

[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

762 19 Updated Mar 27, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 113 3 Updated Mar 26, 2025

[CVPR 2025] VGGT: Visual Geometry Grounded Transformer

Python 3,663 232 Updated Mar 30, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,168 322 Updated Mar 25, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 9,351 645 Updated Mar 27, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 45,654 5,583 Updated Mar 28, 2025

Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.

Python 70 2 Updated Mar 27, 2025

Improving Video Generation with Human Feedback

Python 146 1 Updated Feb 12, 2025

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 1,219 95 Updated Mar 28, 2025

Machine Learning Engineering Open Book

Python 13,271 805 Updated Mar 29, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 1,355 100 Updated Mar 13, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,023 51 Updated Feb 25, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,112 536 Updated Mar 28, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,726 183 Updated Mar 26, 2025

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Python 86 5 Updated Mar 23, 2025

Perceptual video quality assessment based on multi-method fusion.

Python 4,849 774 Updated Mar 13, 2025

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers"

Python 527 53 Updated Mar 3, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 9,342 1,014 Updated Mar 29, 2025

This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.

Jupyter Notebook 99 6 Updated Nov 26, 2024
Python 493 29 Updated Dec 21, 2024
Python 155 8 Updated Jul 12, 2024

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 563 23 Updated Jan 22, 2025

Official Repository of "Unpaired Image-to-Image Translation via Neural Schrödinger Bridge" (ICLR 2024)

Python 200 13 Updated Apr 30, 2024

Dual Diffusion Implicit Bridges for Image-to-Image Translation. ICLR 2023.

Python 384 33 Updated Feb 14, 2023