Add support for serverless deployments #1220

Open
@RobertLucian

Description

The cluster.yaml config would no longer specify an instance type for its deployments.

Instead, the instance type would be determined at deploy time from the resources requested (number of GPUs, CPUs, Infs, amount of memory, etc.) and from the value of a field such as instance_selection_policy: cost/performance. It would still be possible to override the choice by setting the instance_type field in the cortex.yaml config.
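To make the proposal concrete, here is a minimal sketch of how such a selection could work. Everything in it is an assumption for illustration: the catalog entries, their prices, the `select_instance` function, and in particular the reading of the two policies ("cost" picks the cheapest instance that fits the request, "performance" picks the largest one). None of this is existing Cortex behavior.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class InstanceType:
    name: str
    cpus: int
    mem_gib: float
    gpus: int
    hourly_cost: float  # made-up prices, for illustration only


# Hypothetical catalog; a real one would come from the cloud provider.
CATALOG = [
    InstanceType("small-cpu", cpus=2, mem_gib=8, gpus=0, hourly_cost=0.10),
    InstanceType("large-cpu", cpus=8, mem_gib=32, gpus=0, hourly_cost=0.40),
    InstanceType("small-gpu", cpus=4, mem_gib=16, gpus=1, hourly_cost=0.55),
    InstanceType("large-gpu", cpus=8, mem_gib=64, gpus=1, hourly_cost=0.90),
]


def select_instance(
    cpus: int = 1,
    mem_gib: float = 1,
    gpus: int = 0,
    policy: str = "cost",
    instance_type: Optional[str] = None,
) -> str:
    """Return the name of the instance type to deploy on."""
    # An explicit instance_type always wins over automatic selection.
    if instance_type is not None:
        return instance_type

    # Keep only instances that satisfy the request. GPU instances are
    # skipped when no GPU was requested (an assumption of this sketch).
    candidates = [
        i for i in CATALOG
        if i.cpus >= cpus
        and i.mem_gib >= mem_gib
        and i.gpus >= gpus
        and (gpus > 0 or i.gpus == 0)
    ]
    if not candidates:
        raise ValueError("no instance type satisfies the requested resources")

    if policy == "cost":
        # Cheapest instance that fits.
        return min(candidates, key=lambda i: i.hourly_cost).name
    if policy == "performance":
        # Largest instance that fits, ranked by GPUs, then CPUs, then memory.
        return max(candidates, key=lambda i: (i.gpus, i.cpus, i.mem_gib)).name
    raise ValueError(f"unknown instance_selection_policy: {policy!r}")
```

With this sketch, a request for 2 CPUs and 4 GiB of memory would land on the smallest CPU instance under the cost policy and on the largest CPU instance under the performance policy, while passing instance_type bypasses selection entirely.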

Motivation

Removes the need for users to think about which resources they want and how they want them provisioned.
