huggingface: use AutoModel instead of SentenceTransformer #1250

l4b4r4b4b4 · 2023-11-06T13:02:26Z

Switch SentenceTransformer for AutoModel in order to set trust_remote_code needed to use the encode method with embeddings models like jinai-v2

Description

This PR fixes #

Notes for Reviewers

Signed commits

Yes, I signed my commits.

Switch SentenceTransformer for AutoModel in order to set trust_remote_code needed to use the encode method with embeddings models like jinai-v2 Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>

mudler · 2023-11-07T09:47:31Z

the failure looks genuine @l4b4r4b4b4 - is that compatible with sentence formers? does it requires the models to be pulled manually?

Maybe it makes sense to have a separate backend for this instead

lunamidori5 · 2023-11-07T11:20:34Z

the failure looks genuine @l4b4r4b4b4 - is that compatible with sentence formers? does it requires the models to be pulled manually?

Maybe it makes sense to have a separate backend for this instead

We need to make sure that the trust_remote_code is setable in the yaml file if we could @mudler

l4b4r4b4b4 · 2023-11-07T12:25:58Z

the failure looks genuine @l4b4r4b4b4 - is that compatible with sentence formers? does it requires the models to be pulled manually?

Maybe it makes sense to have a separate backend for this instead

@mudler it was my understanding its compatible with transformers since its a component from HugginsFace's transformer library. Could also be implemented as fallback option in case normal SentenceTransformer does not work for not exposing trust_remote_code property.

l4b4r4b4b4 · 2023-11-09T04:57:01Z

the failure looks genuine @l4b4r4b4b4 - is that compatible with sentence formers? does it requires the models to be pulled manually?

Maybe it makes sense to have a separate backend for this instead

ah and no you don't have to download anything manually. Simply set the yml in ./models folder as before and it downloads the model and infers embedding vectors successfully.

So don't think a seperate backend is actually needed 🤷‍♂️

mudler · 2023-11-09T08:25:51Z

the failure looks genuine @l4b4r4b4b4 - is that compatible with sentence formers? does it requires the models to be pulled manually?
Maybe it makes sense to have a separate backend for this instead

ah and no you don't have to download anything manually. Simply set the yml in ./models folder as before and it downloads the model and infers embedding vectors successfully.

So don't think a seperate backend is actually needed 🤷‍♂️

I do agree 100% with you here, lets keep to one if it's possible, but it's weird - the test failed complaining that could not find the model, maybe it misses some option then?

mudler · 2023-11-19T11:12:17Z

@l4b4r4b4b4 friendly ping, are you looking into the failures, or shall I help here?

l4b4r4b4b4 · 2023-11-19T11:38:19Z

@l4b4r4b4b4 friendly ping, are you looking into the failures, or shall I help here?

@mudler That would be great! Since in my local installation it successfully compiles and does inference with jinai model. However I am not sure why it is not able to do that on build test. Maybe it calls for GPU inference by default? 🤔

mudler · 2023-11-20T16:55:49Z

seems in some cases automodel indeed requires quite few additional steps here:

https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2#usage-huggingface-transformers

@l4b4r4b4b4 friendly ping, are you looking into the failures, or shall I help here?

@mudler That would be great! Since in my local installation it successfully compiles and does inference with jinai model. However I am not sure why it is not able to do that on build test. Maybe it calls for GPU inference by default? 🤔

The tests-linux pipeline runs only on CPU, no GPU is involved here

mudler · 2023-11-20T16:58:30Z

the fact is - we will have as per #1126 a transformers backend - and that will make sense maybe to use Automodel? maybe we can have the transformers backend using AutoModel for embeddings as well and have a backend for transformers #1015

mudler · 2023-11-20T23:59:43Z

Closing as #1308 was merged,thanks @l4b4r4b4b4 !

Update huggingface.py

e46e77b

Switch SentenceTransformer for AutoModel in order to set trust_remote_code needed to use the encode method with embeddings models like jinai-v2 Signed-off-by: Lucas Hänke de Cansino <lhc@next-boss.eu>

lunamidori5 requested review from mudler and lunamidori5 November 6, 2023 15:59

Merge branch 'master' into patch-1

545c148

Merge branch 'master' into patch-1

4a247ec

mudler mentioned this pull request Nov 11, 2023

Feat: support jina-embeddings via sentence-transformers #1278

Closed

Merge branch 'master' into patch-1

c2982e5

mudler changed the title ~~Update huggingface.py~~ huggingface: use AutoModel instead of SentenceTransformer Nov 11, 2023

mudler mentioned this pull request Nov 20, 2023

feat(transformers): add embeddings with Automodel #1308

Merged

mudler closed this Nov 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

huggingface: use AutoModel instead of SentenceTransformer #1250

huggingface: use AutoModel instead of SentenceTransformer #1250

l4b4r4b4b4 commented Nov 6, 2023

mudler commented Nov 7, 2023

lunamidori5 commented Nov 7, 2023

l4b4r4b4b4 commented Nov 7, 2023 •

edited

l4b4r4b4b4 commented Nov 9, 2023

mudler commented Nov 9, 2023 •

edited

mudler commented Nov 19, 2023

l4b4r4b4b4 commented Nov 19, 2023 •

edited

mudler commented Nov 20, 2023 •

edited

mudler commented Nov 20, 2023

mudler commented Nov 20, 2023

huggingface: use AutoModel instead of SentenceTransformer #1250

huggingface: use AutoModel instead of SentenceTransformer #1250

Conversation

l4b4r4b4b4 commented Nov 6, 2023

mudler commented Nov 7, 2023

lunamidori5 commented Nov 7, 2023

l4b4r4b4b4 commented Nov 7, 2023 • edited

l4b4r4b4b4 commented Nov 9, 2023

mudler commented Nov 9, 2023 • edited

mudler commented Nov 19, 2023

l4b4r4b4b4 commented Nov 19, 2023 • edited

mudler commented Nov 20, 2023 • edited

mudler commented Nov 20, 2023

mudler commented Nov 20, 2023

l4b4r4b4b4 commented Nov 7, 2023 •

edited

mudler commented Nov 9, 2023 •

edited

l4b4r4b4b4 commented Nov 19, 2023 •

edited

mudler commented Nov 20, 2023 •

edited