Skip to content

Finish GLCC Project: Large Language Model (LLM) Operations Based on Kusion #1307

@kaysonyu

Description

@kaysonyu

I have completed the tasks outlined in issue #1134 for GLCC Project and have submitted several PRs to address the issue. Below are the links to each PR and a brief description of what they accomplish:

KusionStack/catalog#91
feat: finish schema of module inference

KusionStack/catalog#92
fix: correct inference module pod spec generate

KusionStack/catalog#93
fix(inference): enable generated YAML matches the correct resources

KusionStack/catalog#94
fix(inference): fix portName and env name; finish the example

KusionStack/catalog#96
refactor(inference): change the structure of the inference module

KusionStack/catalog#99
feat(inference): add a proxy to easily switch models

KusionStack/kusionstack.io#573
chore: add a introduction of changing the model in inference module

Metadata

Metadata

Assignees

Labels

help wantedExtra attention is needed

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions