This repository was archived by the owner on Jul 4, 2025. It is now read-only.

feat: Model Pull has clear API and CLI to support Huggingface Repos #1242

@dan-menlo

Goal

  • cortex model pull should have clear APIs that support different model repo sources
  • e.g. Huggingface, Cortex Hub

Tasklist

CLI

# Pulls immediately
cortex model pull <huggingface_url_specific_gguf>

# Lets user select quantization using CLI
cortex model pull <huggingface_url> 

# NOT SURE: Do we need an "info" equivalent?
# Gets repo type (e.g. GGUF; in future ONNX, TensorRT-LLM) and dumps possible versions
# Will power the "select quantization"
cortex model info <huggingface_url>
cortex model info <cortex_repo_url>    # Dumps tags
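
One way to tell the two `cortex model pull` cases apart is to inspect the URL path: a link to a single `.gguf` file can be pulled immediately, while a bare repo link needs a quantization choice first. The sketch below is only an illustration of that split, not Cortex's implementation. `PullTarget` and `parse_pull_target` are hypothetical names, the repo in the usage note is made up, and the code assumes Huggingface file links use the `/<owner>/<repo>/blob|resolve/<revision>/<file>` layout.

```python
from dataclasses import dataclass
from typing import Optional
from urllib.parse import urlparse


@dataclass(frozen=True)
class PullTarget:
    """Hypothetical parsed form of a `cortex model pull` argument."""
    repo_id: str              # "<owner>/<repo>"
    filename: Optional[str]   # set only when the URL names one .gguf file
    revision: Optional[str]   # branch, tag, or commit from the URL, if present

    @property
    def needs_selection(self) -> bool:
        # No specific file means the user still has to pick a quantization.
        return self.filename is None


def parse_pull_target(url: str) -> PullTarget:
    parsed = urlparse(url)
    if parsed.netloc not in ("huggingface.co", "www.huggingface.co"):
        raise ValueError(f"not a Huggingface URL: {url}")

    parts = [p for p in parsed.path.split("/") if p]
    if len(parts) < 2:
        raise ValueError(f"URL does not name a repo: {url}")
    repo_id = "/".join(parts[:2])

    # Assumed file link layout: /<owner>/<repo>/(blob|resolve)/<revision>/<path>
    if len(parts) >= 5 and parts[2] in ("blob", "resolve"):
        filename = "/".join(parts[4:])
        if not filename.lower().endswith(".gguf"):
            raise ValueError(f"not a GGUF file: {filename}")
        return PullTarget(repo_id, filename, parts[3])

    # Assumed repo-at-revision layout: /<owner>/<repo>/tree/<revision>
    if len(parts) >= 4 and parts[2] == "tree":
        return PullTarget(repo_id, None, parts[3])

    return PullTarget(repo_id, None, None)
```

With this split, `parse_pull_target("https://huggingface.co/example-org/example-model-GGUF")` reports `needs_selection == True`, and the CLI would show the quantization picker before downloading.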

API

- How do we represent Huggingface strings?
- How do we handle Huggingface Repos (i.e. where user has to select quantization)? 
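
For the second question, one option is for the API to return the repo's GGUF files grouped by quantization label, so a client can present the choice. The sketch below assumes the label is embedded in the filename (e.g. `Q4_K_M`, `Q8_0`, `F16`), which is a common naming convention but not something a repo is required to follow. `quantization_label` and `group_by_quantization` are hypothetical helpers, and the filenames in the usage note are made up.

```python
import re
from typing import Dict, List, Optional

# Assumed naming convention: a quantization tag such as Q4_K_M, IQ2_XXS, Q8_0,
# F16, BF16 or F32, preceded by ".", "-" or "_" (or the start of the name)
# and followed by "." or "-".
_QUANT_RE = re.compile(
    r"(?:^|[.\-_])((?:IQ|Q)\d+(?:_[A-Z0-9]+)*|BF16|F16|F32)(?=[.\-])",
    re.IGNORECASE,
)


def quantization_label(filename: str) -> Optional[str]:
    """Return the upper-cased quantization tag found in a filename, if any."""
    basename = filename.rsplit("/", 1)[-1]
    matches = list(_QUANT_RE.finditer(basename))
    # The tag usually sits near the end of the name, so prefer the last match.
    return matches[-1].group(1).upper() if matches else None


def group_by_quantization(filenames: List[str]) -> Dict[str, List[str]]:
    """Group .gguf files by quantization tag; files without a tag are skipped."""
    groups: Dict[str, List[str]] = {}
    for name in filenames:
        if not name.lower().endswith(".gguf"):
            continue
        label = quantization_label(name)
        if label is not None:
            groups.setdefault(label, []).append(name)
    return groups
```

Given `["README.md", "example-model.Q4_K_M.gguf", "example-model.Q8_0.gguf"]`, the result has two keys, `Q4_K_M` and `Q8_0`, each mapping to its file. Grouping into lists rather than single files leaves room for models split across several GGUF shards.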

Key Questions

  • Does Cortex need an API to get Huggingface Repo metadata?
  • Does this need to be bubbled to Jan, to support the Huggingface Repo page?
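
If Cortex does expose repo metadata, it could be derived from the Huggingface Hub model endpoint (`GET https://huggingface.co/api/models/<repo_id>`), whose JSON response lists repo files under `siblings`, each with an `rfilename`. The sketch below only shapes an already-fetched payload into a small summary. The summary fields (`id`, `format`, `files`) and the function names are assumptions for illustration, not an agreed Cortex response schema.

```python
from typing import Any, Dict, List

HF_MODEL_API = "https://huggingface.co/api/models/{repo_id}"


def repo_metadata_url(repo_id: str) -> str:
    """Build the Hub endpoint URL for a repo id such as "<owner>/<repo>"."""
    return HF_MODEL_API.format(repo_id=repo_id)


def summarize_repo(payload: Dict[str, Any]) -> Dict[str, Any]:
    """Reduce a Hub model payload to a hypothetical Cortex metadata response."""
    files: List[str] = [
        entry["rfilename"]
        for entry in payload.get("siblings", [])
        if "rfilename" in entry
    ]
    gguf_files = sorted(f for f in files if f.lower().endswith(".gguf"))
    return {
        "id": payload.get("id"),
        # Only GGUF is detected here; ONNX and TensorRT-LLM would need their own checks.
        "format": "gguf" if gguf_files else "unknown",
        "files": gguf_files,
    }
```

A client such as Jan could then render the returned `files` list on its Huggingface repo page without having to call the Hub directly.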

Linked Issues

Jan's Requirements

  1. User enters Huggingface URL in import box
  2. User clicks deep link from Huggingface

Cortex should expose an API that can support the following UI:

Image
