This repository was archived by the owner on Jul 4, 2025. It is now read-only.
  
  
  
  
  
Description
Describe the bug
When pulling a model from HF, I could not select the model quantization, it downloads the first one by default (lowest quality)
To Reproduce
Steps to reproduce the behavior:
- run cortex
- pull any models on HF E.g. TheBlock/Tiny
- it pulls Q2 by default
- see error
Expected behavior
It should prompt users to select the quantization they want to pull