fix: fix model sizes in supported models lists #167
Conversation
I went through the list of supported models and found some discrepancies. There were various cases; I'll try to describe how I chose the sizes:
intfloat/multilingual-e5-large had a wrong link: it was named differently in the cloud, so I renamed it there.
Wherever we have quantized models that pass the allclose tests, we'll use those as our defaults, without supporting direct ONNX import, to avoid confusion. We also prefer HF over URL download. The split here would have changed the underlying default model without informing the user, so I've tried to make that consistent for all models here.
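To make that preference order concrete, here is a minimal sketch of the rule described above. The function and the source-field names (`hf_quantized`, `hf`, `url`) are hypothetical illustrations, not fastembed's actual API:

```python
# Sketch of the default-source selection described above (hypothetical names).
def pick_default_source(model_entry: dict) -> str | None:
    sources = model_entry.get("sources", {})
    # Prefer the quantized model when it has passed the allclose tests ...
    if "hf_quantized" in sources:
        return sources["hf_quantized"]
    # ... then a plain Hugging Face repo over a direct URL download.
    if "hf" in sources:
        return sources["hf"]
    return sources.get("url")
```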
@@ -70,15 +61,6 @@
 "dim": 384,
 "description": "Fast and Default English model",
 "size_in_GB": 0.13,
I believe this should be adjusted, since the quantized model is about 2 times smaller.
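For illustration, a corrected entry might look like the sketch below. The structure follows the diff above; the model name and the 0.07 GB figure are assumptions, the latter based on the quantized file being roughly half of the original 0.13 GB:

```python
# Hypothetical corrected registry entry (values are illustrative).
# Assumes size_in_GB should reflect the quantized ONNX file,
# i.e. roughly half of the original 0.13 GB size.
{
    "model": "BAAI/bge-small-en",  # assumed model name for this entry
    "dim": 384,
    "description": "Fast and Default English model",
    "size_in_GB": 0.07,  # ~0.13 / 2 for the quantized model
}
```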