Skip to content
View mx-mark's full-sized avatar

Block or report mx-mark

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting

137 5 Updated Mar 13, 2025

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 560 23 Updated Jan 22, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 28,300 5,794 Updated Mar 28, 2025

[CVPR2025] MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model

65 Updated Mar 5, 2025

Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)

Python 963 62 Updated Jun 19, 2023
JavaScript 1 Updated Sep 27, 2024

More relighting!

Python 7,779 476 Updated Feb 20, 2025

UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing

Python 105 4 Updated Nov 7, 2024

Self-supervised Spatiotemporal Learning via Video Clip Order Prediction

Python 106 17 Updated Jul 22, 2023

EVA Series: Visual Representation Fantasies from BAAI

Python 2,456 182 Updated Aug 1, 2024

This is the official implementation of TrivialAugment and a mini-library for the application of multiple image augmentation strategies including RandAugment and TrivialAugment.

Python 156 26 Updated Mar 7, 2023

Official DeiT repository

Python 4,166 564 Updated Mar 15, 2024

ConvMAE: Masked Convolution Meets Masked Autoencoders

Python 497 41 Updated Mar 14, 2023

PyTorch implementation of Disjoint Masking with Joint Distillation for Efficient Masked Image Modeling

Python 10 3 Updated Jan 3, 2023

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 28,146 3,511 Updated Jul 23, 2024

Example models using DeepSpeed

Python 6,391 1,075 Updated Mar 27, 2025
Python 160 23 Updated Nov 9, 2023

Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video ta…

Python 1,597 383 Updated Feb 12, 2025

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Python 8,751 2,689 Updated Aug 13, 2024

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,526 1,270 Updated Aug 14, 2024

Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)

1 Updated Apr 12, 2022

Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned Sound (VAS) dataset.

Python 53 12 Updated Dec 15, 2020

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,670 1,254 Updated Jul 23, 2024

The Official PyTorch Implementation of "NVAE: A Deep Hierarchical Variational Autoencoder" (NeurIPS 2020 spotlight paper)

Python 1,046 166 Updated Dec 6, 2022

[CVPR2020] Adversarial Latent Autoencoders

Python 3,525 556 Updated Jan 23, 2021

Repository for the paper "Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images"

Python 440 88 Updated Apr 28, 2023

Official Implementation of the paper "A U-Net Based Discriminator for Generative Adversarial Networks" (CVPR 2020)

Python 391 59 Updated Apr 14, 2022

A mix of GAN implementations including progressive growing

Python 1,625 270 Updated Oct 12, 2021

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 33,634 4,872 Updated Feb 23, 2025
Next
Showing results