Skip to content
View dmarx's full-sized avatar

Organizations

@pytti-tools

Block or report dmarx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

ML finetune

349 repositories

Finetune the 1.4B latent diffusion text2img-large checkpoint from CompVis using deepspeed. (work-in-progress)

Python 36 Updated Apr 17, 2022

1.4B latent diffusion model fine tuning

Python 265 52 Updated May 16, 2022

Audio generation using diffusion models, in PyTorch.

Python 2,094 177 Updated Jun 12, 2023

Home of `erlich` and `ongo`. Finetune latent-diffusion/glid-3-xl text2image on your own data.

Python 181 17 Updated Aug 5, 2022

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 2,180 234 Updated May 20, 2024
Python 661 54 Updated Nov 28, 2023

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Python 1,699 163 Updated Feb 23, 2026

Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.

Jupyter Notebook 60 4 Updated Mar 31, 2022

Robust fine-tuning of zero-shot models

Python 760 75 Updated Apr 29, 2022

Official Source code of "One-Shot Adaptation of GAN in Just One CLIP" IEEE Transactions on Pattern Anaylsis and Machine Intelligence (TPAMI)

Jupyter Notebook 65 1 Updated Jun 5, 2023

The code repository for "Few-Shot Learning via Embedding Adaptation with Set-to-Set Functions"

Python 433 85 Updated Jul 31, 2020

Ensembling Off-the-shelf Models for GAN Training (CVPR 2022 Oral)

Python 420 32 Updated Sep 9, 2022

Official Implementation for "Pivotal Tuning for Latent-based editing of Real Images" (ACM TOG 2022) https://arxiv.org/abs/2106.05744

Jupyter Notebook 931 117 Updated Aug 1, 2024
Jupyter Notebook 350 48 Updated Aug 8, 2021

[CVPR 2022] Official PyTorch Implementation for DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models

Python 867 130 Updated Mar 27, 2023

A PyTorch toolbox for domain generalization, domain adaptation and semi-supervised learning.

Python 1,417 195 Updated Nov 3, 2023

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,554 250 Updated Apr 24, 2024

Prefix-Tuning: Optimizing Continuous Prompts for Generation

Python 958 165 Updated Apr 26, 2024

[CVPR 2022 Oral] Official implementation of DN-DETR

Python 603 71 Updated Dec 20, 2023

Visual Taste Approximator (VTA) is a very simple tool that helps anyone create an automatic replica of themselves that can approximate their own personal visual taste

Python 40 5 Updated Sep 11, 2022

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Jupyter Notebook 1,002 111 Updated Jan 3, 2024

Multi-modality pre-training

Python 510 35 Updated May 8, 2024

Train vision models using JAX and 🤗 transformers

Python 100 10 Updated Dec 14, 2025
Jupyter Notebook 118 13 Updated Sep 11, 2022

Metric learning and retrieval pipelines, models and zoo.

Python 984 73 Updated Nov 26, 2025

Official Implementation for "HyperDomainNet: Universal Domain Adaptation for Generative Adversarial Networks" (NeurIPS 2022)

Python 92 5 Updated Sep 6, 2023
Python 11 2 Updated Oct 30, 2022

A python library for self-supervised learning on images.

Python 3,689 322 Updated Feb 23, 2026

Code repository for the paper "Meta-Learning via Classifier(-free) Diffusion Guidance"

Python 32 3 Updated Apr 7, 2023