
Adding Caching to OpenAI chat #603

Merged
6 commits into guidance-ai:main on Feb 1, 2024

Conversation

adamgordonbell
Contributor

Caching was no longer happening in the newest version. Reintroducing it here for OpenAI chat.

Not sure if this is the ideal place to insert caching, or if model.call could somehow handle caching for all models.
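
For reference, here is a minimal, hypothetical sketch of the idea under discussion: cache chat-completion results keyed on the request, and bypass the cache for non-deterministic (temperature > 0) calls, as described later in this thread. The names (`cached_chat_call`, `call_fn`, `_chat_cache`) are illustrative and not the PR's actual code; note also that, as raised below, keying only on the prompt means different models can share a cache entry.

```python
import hashlib
import json

# Hypothetical sketch; not the code merged in this PR.
_chat_cache = {}

def cached_chat_call(call_fn, model, messages, temperature=0.0):
    """Return a chat completion, reusing a cached result when it is safe to do so.

    call_fn is any callable call_fn(model, messages, temperature) that performs
    the actual OpenAI chat request and returns the completion text.
    """
    if temperature > 0:
        # Sampling is non-deterministic, so reuse would change behavior: skip the cache.
        return call_fn(model, messages, temperature)

    # Key only on the serialized messages. As discussed later in this thread,
    # omitting the model name from the key means two models that receive the
    # same prompt will share a cache entry.
    key = hashlib.sha256(json.dumps(messages, sort_keys=True).encode()).hexdigest()
    if key not in _chat_cache:
        _chat_cache[key] = call_fn(model, messages, temperature)
    return _chat_cache[key]
```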

@slundberg
Contributor

Thanks @adamgordonbell! I am just about to merge some extensive client/server updates, then I'll come back and try to loop this in.

@slundberg
Contributor

One quick question on the PR. Adding poetry support sounds nice, but do you know if there is a way to do that without creating two copies of our dependency lists? (we can also just leave that out for now if it slows things down too much)

@adamgordonbell
Contributor Author

> One quick question on the PR. Adding poetry support sounds nice, but do you know if there is a way to do that without creating two copies of our dependency lists? (we can also just leave that out for now if it slows things down too much)

It's outside my area of knowledge, but I think so. Though I do think the cpp files probably complicate that. I can remove it from this PR; it just helped with my development.

> Thanks @adamgordonbell! I am just about to merge some extensive client/server updates, then I'll come back and try to loop this in.

Awesome!

@slundberg
Contributor

Merged this with the latest updates. Thanks again!

@slundberg slundberg merged commit 496bc53 into guidance-ai:main Feb 1, 2024
4 checks passed
@prescod

prescod commented Feb 18, 2024

Thank you for adding this @adamgordonbell and @slundberg. Caching was about 50% of the reason I used Guidance. It's easy to implement myself, but adding it to each project becomes tedious.

@maximegmd

Why was this merged?

  • It cannot be disabled
  • It doesn't distinguish between models, so calling gpt4 with a prompt after calling gpt3.5 with the same prompt will return the gpt3.5 result
  • There is no documentation on resetting the cache

@Harsha-Nori
Collaborator

Harsha-Nori commented May 18, 2024

Hi @maximegmd, I was running into similar challenges with caching, and dropped it as part of a simplification we're making to the codebase (#820).

I think this PR was a strong initial concept (thank you @adamgordonbell!!), but as we've started changing the structure of guidance in the last few months, I'd like to pursue a more universal solution (beyond just OpenAI) that also addresses your concerns. If you want to use a no-cache version of the codebase today, just install from source.

@adamgordonbell
Contributor Author

Yeah, this caching implementation had some weaknesses. Excited to see what the next iteration is.

> Why was this merged?
>
>   • It cannot be disabled

It was disabled for a call when the temperature was above 0. I thought that made sense, since with temperature 0 you are asking for a deterministic result, which temperature 0 plus caching will give.

@maximegmd

> Yeah, this caching implementation had some weaknesses. Excited to see what the next iteration is.
>
> > Why was this merged?
> >
> >   • It cannot be disabled
>
> It was disabled for a call when the temperature was above 0. I thought that made sense, since with temperature 0 you are asking for a deterministic result, which temperature 0 plus caching will give.

Except it didn't distinguish between models, so with temperature 0 you get the cached result of whichever model was used first with a given prompt. We were doing model comparisons, so we scratched our heads a little trying to understand why all the OpenAI models had the same score, haha.
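
A small, hypothetical sketch of how the collision described above can be avoided: fold the model name (and other generation parameters) into the cache key so that different models never share an entry. This is illustrative only, not code from guidance.

```python
import hashlib
import json

def cache_key(model, messages, temperature=0.0):
    """Build a cache key that distinguishes models as well as prompts."""
    payload = {"model": model, "messages": messages, "temperature": temperature}
    return hashlib.sha256(json.dumps(payload, sort_keys=True).encode()).hexdigest()

# With the model in the key, gpt-4 and gpt-3.5 calls with the same prompt
# no longer collide in the cache.
assert cache_key("gpt-4", [{"role": "user", "content": "hi"}]) != \
       cache_key("gpt-3.5-turbo", [{"role": "user", "content": "hi"}])
```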
