Skip to content

SIE preload models loads on all node types in k8s, blowing up the ones you didn't want to preload on #194

@apatrida

Description

@apatrida

If you want to preload a model on the SGLang nodes, but not the default workers (or vis versa), you cannot.

Add it to preload models, blows up one or the other. A mixed deployment can never use preload therefore.

You have to manually patch the nodes after they are created in k8s, and do it after every deploy update because there is only one variable for the helm chart and it applies universally. Yet, the two main current node types would never preload the same models, so there really needs to be a preload per node type.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions