This repository was archived by the owner on Jul 4, 2025. It is now read-only.

chore: Cortex Docs should be updated with Docs and API Endpoints #1613

@dan-menlo

Description

Goal

  • Cortex Docs and README should be up-to-date with latest changes
  • Goal: by end-Monday, Cortex Docs are good enough to
  • Goal: epics start with Docs; coding starts only once the Docs are approved

I would like the full team to understand how the OpenAPI format works, so that we can scale our documentation.

Docs Site

Overall

  • Document how to create Redirects for Docusaurus @hiento09
  • Can we split the OpenAPI file into smaller parts (e.g. by route)?
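
One way to look at the split-by-route question is a minimal Python sketch that groups the `paths` of an already-loaded OpenAPI document by the first URL segment. The function name `split_openapi_by_route` and the sample spec are hypothetical, not taken from the Cortex repo, and the sketch assumes the spec has been parsed into a plain dict.

```python
from collections import defaultdict


def split_openapi_by_route(spec: dict) -> dict:
    """Group an OpenAPI document's paths by their first URL segment.

    Hypothetical helper: returns one smaller document per prefix, each
    keeping the shared top-level keys (info, components, ...) so that
    internal $ref pointers still resolve inside every part.
    """
    shared = {key: value for key, value in spec.items() if key != "paths"}
    groups = defaultdict(dict)
    for path, item in spec.get("paths", {}).items():
        segments = [s for s in path.split("/") if s]
        # A versioned path such as "/v1/models" would group under "v1";
        # adjust the index if the spec uses a version prefix.
        prefix = segments[0] if segments else "root"
        groups[prefix][path] = item
    return {prefix: {**shared, "paths": paths} for prefix, paths in groups.items()}


# Made-up sample spec, for illustration only.
spec = {
    "openapi": "3.0.0",
    "info": {"title": "Example API", "version": "0.0.0"},
    "paths": {
        "/models": {"get": {"summary": "List Models"}},
        "/models/{id}": {"get": {"summary": "Get Model"}},
        "/engines": {"get": {"summary": "List available engines"}},
    },
}

parts = split_openapi_by_route(spec)
for name, part in sorted(parts.items()):
    print(name, sorted(part["paths"]))
```

Each part could then be written to its own file and recombined at build time; whether the docs tooling accepts multiple files is a separate question this sketch does not answer.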

Get Started

Installation

Owner: @hiento09

  • Installation Page (i.e., how our installer works)
    • Differences between Cortex stable, beta, and nightly. Remove repetition from elsewhere
    • Local vs network installer
    • Docker @hiento09 (thank you!)

Quickstart
Basic Usage

Owner: @gabrielle-ong

  • Combine Basic Usage and API Server

Architecture

Capabilities

We should talk about capabilities, and link to the appropriate API References.

  • Pulling Models @namchuai (thank you!)
    • Model Formats (i.e. our Cortex Model repo format, or a Hugging Face GGUF repo)
  • Running Models @nguyenhoangthuan99
    • Should explain runtime params vs. model.yaml
    • Should explain that Models use Engines
  • Engine Management @namchuai
    • Should give a high-level overview of Engine Management, e.g. different types, Versions, Variants
    • llama.cpp (should give instructions on how to install, update, and change versions/variants, and explain AVX2, AVX-512, etc.)
    • Building Engine Extensions (should give instructions on EngineI.h)
  • Hardware Awareness @vansangpfiev
  • Text Generation
  • Embeddings

Guides

API Reference

Chat

Assignee: @nguyenhoangthuan99

API Reference

  • /chat/completions

Pulling Models

Assignee: @namchuai

API Reference

  • Pull a Model
  • Stop Model Download
  • Import Model

Running Models

Assignee: @vansangpfiev

API Reference

  • List Models
  • Start Model
  • Stop Model
  • Get Model
  • Delete Model
  • Update Model

Engines

Assignee: @namchuai

API Reference

  • List available engines
  • Get an engine
  • Install an engine
  • Uninstall an engine

Server

Assignee: @vansangpfiev

Docs

  • Basic Usage

API Reference

  • Start, Stop
  • Update
  • Terminate
  • Check health

Hardware

Assignee: @vansangpfiev (write Docs before implementing, keep in PR)

Docs

  • Hardware Awareness section

API Reference

  • Hardware APIs
