Skip to content
This repository was archived by the owner on Jul 4, 2025. It is now read-only.
This repository was archived by the owner on Jul 4, 2025. It is now read-only.

bug: Huge model size - overflow #1384

@gabrielle-ong

Description

@gabrielle-ong

Cortex version

v129

Describe the Bug

Mac M1

"Overflow issue" - Llama3.1-8B requires Additional 16777216.00 TB need to be downloaded

> cortex-nightly pull https://huggingface.co/bartowski/Meta-Llama-3.1-8B-Instruct-GGUF/Meta-Llama-3.1-8B-Instruct-Q2_K.gguf
Validating download items, please wait..
Start downloading: Meta-Llama-3.1-8B-Instruct-Q2_K.gguf
Found unfinished download! Additional 16777216.00 TB need to be downloaded.
Continue download [Y/n]:

Steps to Reproduce

No response

Screenshots / Logs

image

What is your OS?

  • MacOS
  • Windows
  • Linux

What engine are you running?

  • cortex.llamacpp (default)
  • cortex.tensorrt-llm (Nvidia GPUs)
  • cortex.onnx (NPUs, DirectML)

Metadata

Metadata

Assignees

No one assigned

    Labels

    P1: importantImportant feature / fixtype: bugSomething isn't working

    Type

    No type

    Projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions