Skip to content

Unify the chat api for all inference servers #218

@kerthcet

Description

@kerthcet

What would you like to be added:

The chat api:

  • POST /v1/chat/completions

The completion api:

  • POST /v1/completions

Why is this needed:

Reduce the operating cost for users. If we can unify the post content, this could be better.

Completion requirements:

This enhancement requires the following artifacts:

  • Design doc
  • API change
  • Docs update

The artifacts should be linked in subsequent comments.

Metadata

Metadata

Assignees

No one assigned

    Labels

    featureCategorizes issue or PR as related to a new feature.needs-priorityIndicates a PR lacks a label and requires one.needs-triageIndicates an issue or PR lacks a label and requires one.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions