[ENH] add exponential backoff and jitter to embedding calls #1526

rancomp · 2023-12-14T22:29:31Z

This is a WIP, closes #1524

Summarize the changes made by this PR.

Improvements & Bug fixes
- Use tenacity to add exponential backoff and jitter
New functionality
- control the parameters of the exponential backoff and jitter and allow the user to use their own wait functions from tenacity's API

Test plan

How are these changes tested?

Tests pass locally with pytest for python, yarn test for js

Documentation Changes

None

github-actions · 2023-12-14T22:29:44Z

tazarov

@rancomp, have a look at my comments.

tazarov · 2023-12-15T13:37:10Z

chromadb/utils/embedding_functions.py

@@ -31,6 +33,19 @@
 logger = logging.getLogger(__name__)


+def _retry_call(call):


A few comments on this:

Does it make sense to move this to a separate util module

Does it make sense to have _ for a decorator? Seems odd

This decorator seems to erase type info.

Does it make sense to add configuration options e.g. how many retries, how much to wait, ignored exceptions?

The @retry decorator seems to throw RetryException. Does it make sense to raise the original exception as it will be better DX

Thanks @tazarov!

A few comments on this:

Does it make sense to move this to a separate util module

Yes that makes sense. But then again I don't see it used elsewhere at the moment.

Does it make sense to have _ for a decorator? Seems odd

I agree with you. It was just a leftover from my first commit where I had it as a private function within the class

This decorator seems to erase type info.

Is it because I'm using *args? This can be fixed by explicitly naming the arguments. I should also annotate the outputs Embeddings.

Does it make sense to add configuration options e.g. how many retries, how much to wait, ignored exceptions?

Yes that makes sense. My idea was to have each EmbeddingFunction instance carry its own arguments for Tenacity.retry through self.wait, which is what I wanted to achieve with the super().__init__(). Another thing I can do is avoid the super() all together and just let the call_wrapper_factory check whether self has attribute wait. If not, set the decorator to be the default wait_exponential_jitter().

The @retry decorator seems to throw RetryException. Does it make sense to raise the original exception as it will be better DX

Important catch here. I'll see how tenacity suggest raise the original exception.

AFAIK there's no way to implement a decorator like this in python <= 3.10 without destroying type info. Our auth and telemetry decorators destroy type info and it's something I would like to fix.

The fact that we can't use a decorator and preserve type info makes me think we shouldn't do it. Instead, we could (in order of my preference off the top of my head):

Use a contextmanager

Just use tenacity directly wherever we call out to embedding providers.

Have an explicit function try_with_retries or something, which takes retry parameters and the relevant method call.

Hey @beggers . I pushed a small update addressing @tazarov's remarks.

Regarding your points:

AFAIK there's no way to implement a decorator like this in python <= 3.10 without destroying type info. Our auth and telemetry decorators destroy type info and it's something I would like to fix.

The fact that we can't use a decorator and preserve type info makes me think we shouldn't do it. Instead, we could (in order of my preference off the top of my head):

Use a contextmanager

I'm not familiar with contextlib but I'm looking into it now. I want to ask, IIUC the problem with type-info exists in other places in the repo. Should we push for a solution that solves all of these together?

Just use tenacity directly wherever we call out to embedding providers.

Have an explicit function try_with_retries or something, which takes retry parameters and the relevant method call.

I can do both. Let me think about these.

chromadb/api/types.py

beggers · 2023-12-18T21:41:46Z

@rancomp I see this PR is still a draft -- tag me when it's ready for a review and I'll take a look

rancomp · 2023-12-21T23:22:15Z

Hey @beggers .I'm finding it challenging to create a straightforward backend along with a user-friendly API for this task. My initial attempt involved the use of the class attributes to edit the retry parameters, but that doesn't feel right.

Let's start with a simple solution which is your 3rd suggestion.
I added a class method EmbeddingFunction.embed_with_retries (in types.py), which simply returns retry(**retry_kwargs)(self.__call__)(input). This basically allows the user to access tenacity directly. I like it because it gives the user direct control over the retry parameters. Anything else is either masking retry or making it cumbersome to edit these parameters. What do you think?

beggers · 2023-12-23T23:14:38Z

Sorry for my lateness here.

I like the approach of having embed_with_retries on the top-level EmbeddingFunction so it's available to all other EmbeddingFunctions. If you get rid of @retry_decorator I'd be happy to have this in our codebase. I agree that giving users direct control over tenacity is the correct flow here.

One other option for us to consider: We could give each EmbeddingFunction a retry_kwargs dict as a field for users to set, and if it's populated we could wrap the actual embedding call in EmbeddingFunctions' __call__s with tenacity retries. In other words, every embedding function's __call__ method would internally check for the existence of the dict and use tenacity to make the call with retries. This could probably be abstracted to embed_with_retries on the top-level EmbeddingFunction but it would accept a function handle, args, and kwargs for the embedding and use the retry_kwargs. WDYT? I'm happy to do this either way.

rancomp · 2023-12-24T14:20:49Z

hey @beggers NP.
I was thinking about your other suggestion. I think this could give the user a slightly cleaner access to retry through the __call__ method, but that's at the cost of a "messier" back-end + more parameters to each EmbeddingFunction. Another thing I'm uncertain about is how Tenacity plays out with multiprocessing. That's another reason why I don't want to change the __call__ which probably should be a stable method of the the API.

Compare this with the newly added embed_with_retries: Clean back-end, no new parameters to the instantiation, and clear method. The downside is that the user would need to specify the kwargs at every call, or probably wrap it with a lambda function.

I'm leaning towards the embed_with_retries solution because it doesn't change existing methods.

…-exp-backoff-and-jitter-with-tenacity

beggers

hey @rancomp , sorry I dropped this. Looks good! If you fix the merge conflicts I'll run the precommit hooks and get this merged.

rancomp · 2024-01-11T13:36:32Z

alright @beggers np!

PS, I got a weird message flake error when merging main into my branch:

flake8...................................................................Failed
- hook id: flake8
- exit code: 1

chromadb/utils/embedding_functions.py:717:18: F821 undefined name 'boto3'

Is it worth adding noqa: F821 on this line?

rancomp · 2024-01-16T22:36:57Z

merged main (conflicts), but black hook caught some stuff from other modules and corrected it.

amanAtHoneyHealth · 2024-01-17T03:43:48Z

chromadb/api/types.py

@@ -194,6 +195,9 @@ def __call__(self: EmbeddingFunction[D], input: D) -> Embeddings:

        setattr(cls, "__call__", __call__)

+    def embed_with_retries(self, input: D, **retry_kwargs: Dict) -> Embeddings:
+        return retry(**retry_kwargs)(self.__call__)(input)


Now that I think about this, this retry will work even if the errors from OPENAI are non transient correct? In cases where let say user max send limit for their Openai is reached no amount of retry is going to fix it until they update their spend limit. Do we want to retry for non transient errors? I guess consumers of chromaDb should be handling this no 🤔

I am a user of chromaDB and what I have seen usually is Open AI waiting 600 secs and returning Timeouts. And literally so many folks complain about this error on their forum - https://community.openai.com/t/frequent-api-timeout-errors-recently/106903

We should also have a way to not wait 600 seconds and allow consumers to configure this. Ideally openAI should have given us a configuration option but there does not seem to be one.

thoughts? @tazarov @beggers

I haven't used tenacity directly before (others on the team have and it's what we use elsewhere in the codebase), but it looks like it allows you to set a timeout: https://tenacity.readthedocs.io/en/latest/#stopping . I didn't see anything in my skimming about retrying only certain error codes though I'm sure it's possible.

@rancomp designed this implementation so Chromadb users have full control over the tenacity retry logic so it should be plug-and-play to get this working.

beggers

@rancomp sorry for the slog here. CI is currently broken. I'll merge it into this branch and re-run tests once we've fixed it -- no action required from you and we'll get this over the finish line.

beggers · 2024-01-17T17:00:17Z

chromadb/api/types.py

@@ -194,6 +195,9 @@ def __call__(self: EmbeddingFunction[D], input: D) -> Embeddings:

        setattr(cls, "__call__", __call__)

+    def embed_with_retries(self, input: D, **retry_kwargs: Dict) -> Embeddings:
+        return retry(**retry_kwargs)(self.__call__)(input)


I haven't used tenacity directly before (others on the team have and it's what we use elsewhere in the codebase), but it looks like it allows you to set a timeout: https://tenacity.readthedocs.io/en/latest/#stopping . I didn't see anything in my skimming about retrying only certain error codes though I'm sure it's possible.

@rancomp designed this implementation so Chromadb users have full control over the tenacity retry logic so it should be plug-and-play to get this working.

wip

8767ca1

rancomp mentioned this pull request Dec 14, 2023

[Feature Request]: Exponential backoff retries in embedding functions #1524

Closed

wrap everything into a decorator and simplify attributes

cacdf80

tazarov reviewed Dec 15, 2023

View reviewed changes

rancomp added 3 commits December 16, 2023 13:30

move decorator to its own module

e97e66f

remove super

0f796cb

cleanup

14fc8c0

beggers assigned rancomp Dec 19, 2023

rancomp closed this Dec 19, 2023

rancomp reopened this Dec 19, 2023

add embed_with_retries class method

e614c67

revert decorator

adca7c8

rancomp marked this pull request as ready for review December 28, 2023 20:16

rancomp added 2 commits January 3, 2024 09:14

Merge branch 'main' of https://github.com/rancomp/chroma into enh/add…

8204813

…-exp-backoff-and-jitter-with-tenacity

Merge branch 'main' of https://github.com/rancomp/chroma into enh/add…

ede721b

…-exp-backoff-and-jitter-with-tenacity

beggers approved these changes Jan 10, 2024

View reviewed changes

merge main

dd86b42

rancomp added 2 commits January 17, 2024 00:33

merge main

32d87b4

merge main (added files were untracked)

e9292fd

amanAtHoneyHealth reviewed Jan 17, 2024

View reviewed changes

beggers reviewed Jan 17, 2024

View reviewed changes

beggers merged commit 9824336 into chroma-core:main Jan 17, 2024
94 of 97 checks passed

HammadB changed the title ~~[WIP] [ENH] add exponential backoff and jitter to embedding calls~~ [ENH] add exponential backoff and jitter to embedding calls Jan 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ENH] add exponential backoff and jitter to embedding calls #1526

[ENH] add exponential backoff and jitter to embedding calls #1526

rancomp commented Dec 14, 2023 •

edited

github-actions bot commented Dec 14, 2023

tazarov left a comment

tazarov Dec 15, 2023

rancomp Dec 15, 2023 •

edited

beggers Dec 15, 2023

rancomp Dec 19, 2023

beggers commented Dec 18, 2023

rancomp commented Dec 21, 2023

beggers commented Dec 23, 2023

rancomp commented Dec 24, 2023

beggers left a comment

rancomp commented Jan 11, 2024

rancomp commented Jan 16, 2024

amanAtHoneyHealth Jan 17, 2024

beggers Jan 17, 2024

beggers left a comment

beggers Jan 17, 2024

		@@ -31,6 +33,19 @@
		logger = logging.getLogger(__name__)


		def _retry_call(call):

[ENH] add exponential backoff and jitter to embedding calls #1526

[ENH] add exponential backoff and jitter to embedding calls #1526

Conversation

rancomp commented Dec 14, 2023 • edited

Test plan

Documentation Changes

github-actions bot commented Dec 14, 2023

Reviewer Checklist

Testing, Bugs, Errors, Logs, Documentation

System Compatibility

Quality

tazarov left a comment

Choose a reason for hiding this comment

tazarov Dec 15, 2023

Choose a reason for hiding this comment

rancomp Dec 15, 2023 • edited

Choose a reason for hiding this comment

beggers Dec 15, 2023

Choose a reason for hiding this comment

rancomp Dec 19, 2023

Choose a reason for hiding this comment

beggers commented Dec 18, 2023

rancomp commented Dec 21, 2023

beggers commented Dec 23, 2023

rancomp commented Dec 24, 2023

beggers left a comment

Choose a reason for hiding this comment

rancomp commented Jan 11, 2024

rancomp commented Jan 16, 2024

amanAtHoneyHealth Jan 17, 2024

Choose a reason for hiding this comment

beggers Jan 17, 2024

Choose a reason for hiding this comment

beggers left a comment

Choose a reason for hiding this comment

beggers Jan 17, 2024

Choose a reason for hiding this comment

rancomp commented Dec 14, 2023 •

edited

rancomp Dec 15, 2023 •

edited