
feat(ml): export clip models to ONNX and host models on Hugging Face #4700

Merged
alextran1502 merged 16 commits into main from chore/ml-export-models on Oct 31, 2023

Conversation

mertalev
Contributor

@mertalev mertalev commented Oct 29, 2023

Description

We currently use clip-as-service for downloading CLIP models. The motivation for using it was to avoid exporting models ourselves and to have models ready to use immediately after download, without exporting to ONNX at runtime. However, this has caused a number of issues, particularly because the hosting server is intermittently unavailable.

This PR transitions away from clip-as-service; we now handle model exporting and hosting ourselves. The full ONNX catalog of clip-as-service is supported for feature parity and backwards compatibility, and although models are downloaded into a slightly different cache structure than before, the download happens automatically. As a result, this is a drop-in replacement that should not require any manual intervention.

Exported models are uploaded to a brand new set of Hugging Face repos under a new organization. The relevant model repos are downloaded at runtime, and each repo is completely self-contained with all the files it needs.
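For a rough idea of what the runtime download looks like (a minimal sketch, assuming huggingface_hub is available; the repo name is one of the new immich-app repos, but the cache path and helper function are illustrative rather than the service's actual code):

```python
# Minimal sketch: fetch a self-contained model repo into a local cache directory.
from pathlib import Path

from huggingface_hub import snapshot_download


def download_clip_model(model_name: str, cache_dir: Path) -> Path:
    # The org prefix and cache layout here are illustrative.
    repo_id = f"immich-app/{model_name}"
    target = cache_dir / model_name
    # Downloads every file in the repo (visual/textual ONNX models, tokenizer and
    # preprocessing configs), so the model runs without any export step at runtime.
    snapshot_download(repo_id, local_dir=target)
    return target


model_dir = download_clip_model("ViT-B-32__openai", Path("/cache/clip"))
```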

The CLIP implementation in the ML service has been refactored to integrate with these repos. Moreover, all dependence on PyTorch has been removed from this section of the code: preprocessing is now done exclusively with Pillow and NumPy. This paves the way for shrinking the image size considerably, leaving the image classification code as the only remaining reliance on PyTorch.
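For reference, here is a minimal sketch of CLIP image preprocessing using only Pillow and NumPy (the resolution and normalization constants below are the common OpenAI CLIP defaults and are shown purely for illustration; in practice these values come from each model repo's preprocessing config):

```python
import numpy as np
from PIL import Image

SIZE = 224  # illustrative; the actual size comes from the model's config
MEAN = np.array([0.48145466, 0.4578275, 0.40821073], dtype=np.float32)
STD = np.array([0.26862954, 0.26130258, 0.27577711], dtype=np.float32)


def preprocess(image: Image.Image) -> np.ndarray:
    # Resize the shorter side to SIZE, then center-crop a SIZE x SIZE patch.
    width, height = image.size
    scale = SIZE / min(width, height)
    image = image.resize((round(width * scale), round(height * scale)), Image.BICUBIC)
    left = (image.width - SIZE) // 2
    top = (image.height - SIZE) // 2
    image = image.crop((left, top, left + SIZE, top + SIZE)).convert("RGB")

    # Scale to [0, 1], normalize per channel, and reorder to NCHW for ONNX Runtime.
    arr = np.asarray(image, dtype=np.float32) / 255.0
    arr = (arr - MEAN) / STD
    return np.expand_dims(arr.transpose(2, 0, 1), 0)
```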

While this PR is focused on CLIP, using our own Hugging Face repos for models enables many exciting possibilities in the future. This is just the start.

How has this been tested?

Every model listed here has been tested with Postman for both image and text. Additionally, I tested text search with ViT-B-32__openai before running an Encode CLIP job, confirming the results were relevant (i.e. the model outputs are correct and compatible with existing embeddings). The Encode CLIP job ran successfully as well, as did changing the model to XLM-Roberta-Large-Vit-L-14 (an M-CLIP model that is handled differently than OpenAI and OpenCLIP models).

Fixes #4117


@fyfrey fyfrey (Contributor) left a comment

This is really good work! Looking forward to finally removing the large PyTorch dependency. Even so, removing the dependency on clip-as-service is already great! The fewer deps, the better :-)
I really like the new export functionality/image.

machine-learning/export/env.yaml (review thread resolved)
machine-learning/app/models/clip.py (review thread resolved, outdated)
@fyfrey
Contributor

fyfrey commented Oct 30, 2023

Could you add a short README for the exporter? What it does and how to use it with an example model.

@alextran1502 alextran1502 merged commit 87a0ba3 into main Oct 31, 2023
21 checks passed
@alextran1502 alextran1502 deleted the chore/ml-export-models branch October 31, 2023 10:02
@aviv926
Contributor

aviv926 commented Oct 31, 2023

I'm trying to check whether creating another download source for the models is a problem in terms of copyright. From the license conditions, as far as I understand, there shouldn't be an issue, but I'd rather ask anyway.

If you know the answer to my question, I would love to hear it

waclaw66 pushed a commit to waclaw66/immich that referenced this pull request Nov 1, 2023
…mmich-app#4700)

* export clip models

* export to hf

refactored export code

* export mclip, general refactoring

cleanup

* updated conda deps

* do transforms with pillow and numpy, add tokenization config to export, general refactoring

* moved conda dockerfile, re-added poetry

* minor fixes

* updated link

* updated tests

* removed `requirements.txt` from workflow

* fixed mimalloc path

* removed torchvision

* cleaner np typing

* review suggestions

* update default model name

* update test
@nodis

nodis commented Nov 1, 2023

The model used in my Smart Search is "M-CLIP/XLM-Roberta-Large-Vit-B-16Plus". Do I need to modify this name? For example, to "immich-app/XLM-Roberta-Large-Vit-B-16Plus"? Thank you.

@mertalev
Contributor Author

mertalev commented Nov 1, 2023

It ignores anything before the slash, so you should be fine.
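In other words (a tiny illustrative sketch, not the service's actual resolution code), the part before the slash is simply dropped when the model name is resolved:

```python
# Illustrative only: both spellings resolve to the same underlying model name.
def resolve_model_name(name: str) -> str:
    return name.split("/")[-1]


assert resolve_model_name("M-CLIP/XLM-Roberta-Large-Vit-B-16Plus") == "XLM-Roberta-Large-Vit-B-16Plus"
assert resolve_model_name("XLM-Roberta-Large-Vit-B-16Plus") == "XLM-Roberta-Large-Vit-B-16Plus"
```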

@nodis

nodis commented Nov 1, 2023

It ignores anything before the slash, so you should be fine.

My "model-cache\clip" directory originally had a folder named "M-CLIP_XLM-Robertsa-Large Vit-B-16Plus". After upgrading to 1.84, a folder named "XLM-Robertsa-Large Vit-B-16Plus" appeared. Can I delete the "M-CLIP_XLM-Robertsa-Large Vit-B-16Plus" folder?

@mertalev
Contributor Author

mertalev commented Nov 1, 2023

Yes, that's a stale folder at this point.

rikifrank pushed a commit to shefing/immich that referenced this pull request Nov 6, 2023
claabs pushed a commit to claabs/immich-machine-learning-openvino-kernel-fix that referenced this pull request Aug 11, 2024
@sushilkhadkaanon

Hi @mertalev, @aviv926. Do you provide any script for converting an open_clip model to ONNX? I've been stuck converting apple/DFN2B-CLIP-ViT-L-14. Also, is the model immich-app/ViT-L-14-quickgelu__dfn2b under your Hugging Face repos the same one?

@aviv926
Contributor

aviv926 commented Sep 6, 2024

Hi @mertalev, @aviv926. Do you provide any script for converting an open_clip model to ONNX? I've been stuck converting apple/DFN2B-CLIP-ViT-L-14. Also, is the model immich-app/ViT-L-14-quickgelu__dfn2b under your Hugging Face repos the same one?

@mertalev wrote a script that does this.

The export code is available here: https://github.com/immich-app/immich/tree/main/machine-learning/export

It downloads the open_clip model, traces it to TorchScript, and exports the TorchScript model to ONNX.
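For a rough idea of what that entails, here is a minimal sketch (assuming open_clip and torch are installed; the model/pretrained tags follow the question above, while the input shape, opset version, and output file name are illustrative, and the actual export script also handles the text tower, tokenizer config, and preprocessing settings):

```python
# Minimal sketch: trace the visual tower of an open_clip model and export it to ONNX.
import open_clip
import torch

model, _, _ = open_clip.create_model_and_transforms("ViT-L-14-quickgelu", pretrained="dfn2b")
model.eval()

dummy_image = torch.randn(1, 3, 224, 224)
traced_visual = torch.jit.trace(model.visual, dummy_image)

torch.onnx.export(
    traced_visual,
    dummy_image,
    "visual.onnx",
    input_names=["image"],
    output_names=["embedding"],
    dynamic_axes={"image": {0: "batch"}},  # allow variable batch size
    opset_version=17,
)
```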


Successfully merging this pull request may close these issues.

[BUG] Unable to download CLIP model for search
6 participants