The way to load vLLM at startup for use by FastAPI (even with multiple workers) is to do this cold start in the lifespan events.

https://fastapi.tiangolo.com/advanced/events/#lifespan

Otherwise you will run into issues when these objects are forked.

Answer selected by Kludex