This repository was archived by the owner on Jul 4, 2025. It is now read-only.

Description
Goal
cortex model pull should have clear APIs that support different model repo sources
- e.g. Huggingface, Cortex Hub
Tasklist
CLI
# Pulls immediately
cortex model pull <huggingface_url_specific_gguf>
# Lets user select quantization using CLI
cortex model pull <huggingface_url>
# NOT SURE: Do we need an "info" equivalent?
# Gets repo type (e.g. GGUF; in future ONNX, TensorRT-LLM) and dumps possible versions
# Will power the "select quantization"
cortex model info <huggingface_url>
cortex model info <cortex_repo_url> # Dumps tags
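One way the "select quantization" step could be powered is by deriving the options from the GGUF filenames in a repo. The sketch below is only an illustration: `list_quantizations` is a hypothetical helper, not an existing Cortex function, and it assumes files follow the common naming pattern where the quantization label (e.g. Q4_K_M, Q8_0, F16) sits right before the `.gguf` extension. Split or differently named files would need extra handling.

```python
import re
from typing import Dict, List

# Assumption: the quantization label appears right before ".gguf",
# separated from the model name by ".", "-" or "_".
_QUANT_RE = re.compile(
    r"[.\-_]((?:I?Q\d+(?:_[A-Z0-9]+)*)|BF16|F16|F32)\.gguf$",
    re.IGNORECASE,
)


def list_quantizations(filenames: List[str]) -> Dict[str, str]:
    """Map quantization label -> filename for each GGUF file that carries one.

    Hypothetical helper for illustration; files without a recognizable
    label (README.md, config files, unlabeled GGUFs) are skipped.
    """
    options: Dict[str, str] = {}
    for name in filenames:
        match = _QUANT_RE.search(name)
        if match:
            options[match.group(1).upper()] = name
    return options


if __name__ == "__main__":
    # Made-up file listing, standing in for whatever a repo lookup returns.
    files = ["README.md", "model.Q4_K_M.gguf", "model.Q8_0.gguf", "model-f16.gguf"]
    for label, filename in list_quantizations(files).items():
        print(f"{label}\t{filename}")
```

The same mapping could back both the interactive prompt in `cortex model pull <huggingface_url>` and the version dump in `cortex model info`, if an "info" equivalent is added.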
API
- How do we represent Huggingface strings?
- How do we handle Huggingface Repos (i.e. where user has to select quantization)?
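As a starting point for the two questions above, here is a minimal sketch of one possible representation. It assumes the usual Hugging Face URL shapes: `https://huggingface.co/<owner>/<repo>` for a repo, and `.../blob/<revision>/<path>` or `.../resolve/<revision>/<path>` for a specific file. `HfModelRef` and `parse_hf_reference` are hypothetical names introduced for illustration, not part of any existing Cortex API, and revisions containing slashes are not handled.

```python
from dataclasses import dataclass
from typing import Optional
from urllib.parse import urlparse


@dataclass(frozen=True)
class HfModelRef:
    """Hypothetical structured form of a Hugging Face model reference."""
    owner: str
    repo: str
    revision: Optional[str] = None  # branch, tag, or commit; None if omitted
    filename: Optional[str] = None  # set only when the reference names one file

    @property
    def needs_quantization_choice(self) -> bool:
        # A repo-level reference names no file, so the user must pick one.
        return self.filename is None


def parse_hf_reference(text: str) -> HfModelRef:
    """Parse a huggingface.co URL or a bare "owner/repo" string."""
    text = text.strip()
    if "://" in text:
        parsed = urlparse(text)
        if parsed.netloc not in ("huggingface.co", "www.huggingface.co"):
            raise ValueError(f"not a Hugging Face URL: {text}")
        parts = [p for p in parsed.path.split("/") if p]
    else:
        parts = [p for p in text.split("/") if p]

    if len(parts) < 2:
        raise ValueError(f"expected owner/repo, got: {text}")
    owner, repo = parts[0], parts[1]

    if len(parts) == 2:
        return HfModelRef(owner, repo)
    # File URLs: /owner/repo/blob/<revision>/<path> or /resolve/<revision>/<path>
    if len(parts) >= 5 and parts[2] in ("blob", "resolve"):
        return HfModelRef(owner, repo, parts[3], "/".join(parts[4:]))
    # Repo browsed at a revision: /owner/repo/tree/<revision>
    if len(parts) == 4 and parts[2] == "tree":
        return HfModelRef(owner, repo, parts[3])
    raise ValueError(f"unsupported Hugging Face path: {text}")
```

With this shape, a file-level reference can be pulled immediately, while a repo-level reference (`needs_quantization_choice` is true) signals that the caller has to list the available files and ask the user to choose.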
Key Questions
- Does Cortex need an API to get Huggingface Repo metadata?
- Does this need to be bubbled to Jan, to support the Huggingface Repo page?
Linked Issues
Jan's Requirements
- User enters Huggingface URL in import box
- User clicks deep link from Huggingface
Cortex should provide an API that can support the following UI:
