Handle length safe embedding only if needed #3723
Merged
Re: #3722
Copy-pasting context from the issue:
https://github.com/hwchase17/langchain/blob/1bf1c37c0cccb7c8c73d87ace27cf742f814dbe5/langchain/embeddings/openai.py#L210-L211
This means the length-safe embedding method is "always" used. The initial implementation in #991 set
embedding_ctx_length
to -1 (meaning you had to opt in to the length-safe method); #2330 changed that to the max context length of OpenAI's v2 embeddings, so the length-safe method is now used at all times.

How about changing that `if` branch to use the length-safe method only when needed, i.e. when the text is longer than the max context length?
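The proposed dispatch can be sketched roughly as below. This is a minimal illustration, not the actual langchain code: the names `embed_query`, `count_tokens`, `embed`, and `length_safe_embed` are hypothetical stand-ins (the real implementation would count tokens with tiktoken and call the OpenAI API), and the `-1` case preserves the original opt-in behaviour from #991.

```python
# Hypothetical sketch of the proposed branch: take the length-safe
# (chunk-and-average) path only when the input exceeds the model's
# context length. All callables here are illustrative placeholders.
from typing import Callable, List


def embed_query(
    text: str,
    embedding_ctx_length: int,
    count_tokens: Callable[[str], int],              # tiktoken in real code
    embed: Callable[[str], List[float]],             # direct single-call path
    length_safe_embed: Callable[[str], List[float]], # chunked path
) -> List[float]:
    # -1 (or any non-positive value) means "never chunk", matching the
    # opt-in behaviour of the initial implementation (#991).
    if embedding_ctx_length > 0 and count_tokens(text) > embedding_ctx_length:
        return length_safe_embed(text)
    return embed(text)
```

With this shape, short inputs skip the chunking overhead entirely, while over-length inputs still get the length-safe treatment introduced in #2330.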