Ability to deploy to a serverless environment #205

moltar · 2023-09-28T13:23:18Z

It is already possible to deploy a Go server, wrapped in a Docker image, to AWS Lambda using an adapter. This could be hugely beneficial for occasional-usage scenarios.

The one thing I am concerned about, though, is state.

Does Zep maintain any in-memory state for a long time?

Or is it mainly an API layer between the services and the storage (Postgres)?

Will it commit, or drain, the state on SIGTERM?

danielchalef · 2023-09-28T13:50:37Z

The open source version of Zep does retain state, as task management for async embedding, summarization, etc. has not yet been moved to an external message queue. I'd recommend using ECS or EKS rather than Lambda.

oesni · 2023-10-04T06:32:43Z

So, Zep itself is a stateful application? I want to deploy Zep to my k8s cluster, but I don't think it's safe to scale out if it's stateful.
@danielchalef

danielchalef · 2023-10-06T01:26:17Z

@oesni Zep can be scaled horizontally, but it can't be scaled in without being sure that the queues on the instance being removed have drained. You could potentially taint the pod, removing it from the load balancer, wait for the queues to drain, and then delete the pod. This is being improved to utilize message queues.
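For illustration, the "wait for the queues to drain" step might look like the following in Go. This is a minimal sketch under stated assumptions, not Zep's code: `TaskQueue`, `Submit`, `Drain`, and `runDemo` are hypothetical names, and it assumes tasks live in an in-process channel consumed by a fixed pool of workers.

```go
package main

import (
	"context"
	"fmt"
	"os"
	"os/signal"
	"sync"
	"sync/atomic"
	"syscall"
	"time"
)

// TaskQueue is a hypothetical in-memory task queue (not Zep's implementation).
type TaskQueue struct {
	tasks     chan func()
	wg        sync.WaitGroup
	processed int64
}

// NewTaskQueue starts the given number of workers reading from a buffered channel.
func NewTaskQueue(workers, buffer int) *TaskQueue {
	q := &TaskQueue{tasks: make(chan func(), buffer)}
	for i := 0; i < workers; i++ {
		q.wg.Add(1)
		go func() {
			defer q.wg.Done()
			for task := range q.tasks {
				task()
				atomic.AddInt64(&q.processed, 1)
			}
		}()
	}
	return q
}

// Submit enqueues a task. It must not be called after Drain.
func (q *TaskQueue) Submit(task func()) { q.tasks <- task }

// Drain stops intake and waits until all queued tasks finish or ctx expires.
// It reports whether the queue emptied before the deadline.
func (q *TaskQueue) Drain(ctx context.Context) bool {
	close(q.tasks)
	done := make(chan struct{})
	go func() {
		q.wg.Wait()
		close(done)
	}()
	select {
	case <-done:
		return true
	case <-ctx.Done():
		return false
	}
}

// runDemo submits a few short tasks and drains them with a generous deadline.
func runDemo() (processed int64, drained bool) {
	q := NewTaskQueue(2, 8)
	for i := 0; i < 5; i++ {
		q.Submit(func() { time.Sleep(10 * time.Millisecond) })
	}
	ctx, cancel := context.WithTimeout(context.Background(), 2*time.Second)
	defer cancel()
	drained = q.Drain(ctx)
	return atomic.LoadInt64(&q.processed), drained
}

func main() {
	// Kubernetes sends SIGTERM when a pod is deleted.
	sigCtx, stop := signal.NotifyContext(context.Background(), syscall.SIGTERM, os.Interrupt)
	defer stop()

	select {
	case <-sigCtx.Done():
	case <-time.After(200 * time.Millisecond): // demo only, so the example terminates
	}

	processed, drained := runDemo()
	fmt.Printf("processed=%d drained=%v\n", processed, drained)
}
```

In a real deployment the drain deadline would need to fit within the pod's termination grace period, otherwise the process is killed before the queue empties.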

moltar · 2023-10-17T11:39:14Z

@danielchalef Does this also mean that load balancing multiple instances is currently not recommended, as they would have split loads? Or would they own the loads entirely, and there's no cross-talk required?

danielchalef · 2023-10-17T14:00:38Z

Instances own the load and there's no crosstalk.

danielchalef · 2023-11-04T19:56:50Z

Closing this, as Zep no longer holds state since #246.

moltar · 2023-11-05T16:10:05Z

So running the API in a serverless environment is all clear, then. There's even an adapter available specifically for Go servers.

What about the jobs then? Would I need to execute the binary on schedule to process the jobs?

Is there a specific entry point or a flag that would run the workers without the http server?

Is there a way to obtain the queue size?

danielchalef · 2023-11-05T17:37:32Z

@moltar While Zep doesn't hold state anymore, it's still not designed for serverless deployment. I'm unsure how it would perform in such an environment, particularly as some warm-up time is required (<1 sec, but still meaningful). Deployment using Kubernetes now makes a lot more sense, since you can automatically scale in and out without concern for state.

We could potentially add some command-line flags that tell Zep to run only the API or the web UI, or only the TaskRouter. This may reduce startup time for API-only use. It would also mean you could run a persistent TaskRouter instance or two to execute tasks.

Would love a contribution if the above might be helpful!
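As a starting point for such a contribution, the flag idea could be sketched roughly as follows. Everything here is an assumption for illustration: Zep has no `-mode` flag in this thread, and `Components` and `parseMode` are hypothetical names. The sketch only decides which components would start; it does not start them.

```go
package main

import (
	"flag"
	"fmt"
	"os"
)

// Components records which parts of the server a given mode would start.
// Hypothetical type, not part of Zep.
type Components struct {
	API        bool
	WebUI      bool
	TaskRouter bool
}

// parseMode maps a hypothetical -mode value to the components to run.
func parseMode(mode string) (Components, error) {
	switch mode {
	case "all":
		return Components{API: true, WebUI: true, TaskRouter: true}, nil
	case "api":
		return Components{API: true}, nil
	case "web":
		return Components{WebUI: true}, nil
	case "tasks":
		return Components{TaskRouter: true}, nil
	default:
		return Components{}, fmt.Errorf("unknown mode %q (want all, api, web, or tasks)", mode)
	}
}

func main() {
	mode := flag.String("mode", "all", "components to run: all, api, web, or tasks (hypothetical flag)")
	flag.Parse()

	c, err := parseMode(*mode)
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(2)
	}

	// A real server would start each enabled component here.
	if c.API {
		fmt.Println("would start the HTTP API")
	}
	if c.WebUI {
		fmt.Println("would start the web UI")
	}
	if c.TaskRouter {
		fmt.Println("would start the TaskRouter")
	}
}
```

With a split like this, an API-only process could sit behind a Lambda adapter while one or two `-mode=tasks` processes run persistently to execute tasks.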

moltar · 2023-11-05T19:45:43Z

The use case I was thinking of is occasional, low-volume use. It'd be more economical to run in a Lambda, even with the warm-up times being a bit high.

For anything serious a proper container is of course better.

danielchalef · 2023-11-05T20:46:20Z

Cool. Let me know how it goes!

danielchalef closed this as completed Nov 4, 2023
