Unable to Use GPU with llama-cpp-python on Jetson Orin

I'm trying to install the `llama-cpp-python` package on an **NVIDIA Jetson AGX Orin** (CUDA version: 12.2) so that my code runs on the **GPU**, but it runs on the **CPU** instead.

I attempted the following commands to enable CUDA support:

`CMAKE_ARGS="-DGGML_CUDA=on" FORCE_CMAKE=1 pip install llama-cpp-python --force-reinstall --upgrade --no-cache-dir`

`CMAKE_ARGS="-DGGML_CUDA=on" FORCE_CMAKE=1 pip install --no-cache-dir llama-cpp-python --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cu122`

However, I am still unable to get it to use the GPU.
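
To narrow this down, it may help to check two things separately: whether the CUDA compiler is visible to the build, and whether the installed package was actually compiled with GPU offload. The snippet below is a minimal diagnostic sketch, not a fix. The helper names `nvcc_path` and `gpu_offload_status` are made up for illustration. It assumes a reasonably recent `llama-cpp-python` that exposes `llama_supports_gpu_offload()` at the top level of the `llama_cpp` module, and it reports `unknown` if that function is missing.

```python
import importlib
import shutil


def nvcc_path():
    """Return the path to nvcc if it is on PATH, else None.

    A source build with -DGGML_CUDA=on needs CMake to find the CUDA
    compiler; if this returns None, the build environment is suspect.
    """
    return shutil.which("nvcc")


def gpu_offload_status():
    """Report whether the installed llama_cpp build supports GPU offload.

    Returns one of: 'not-installed', 'load-failed', 'unknown',
    'cpu-only', 'gpu-capable'.
    """
    try:
        llama_cpp = importlib.import_module("llama_cpp")
    except ImportError:
        return "not-installed"
    except Exception:
        # The package is present but its native library failed to load.
        return "load-failed"

    # Assumption: this function is exported by the installed version.
    check = getattr(llama_cpp, "llama_supports_gpu_offload", None)
    if check is None:
        return "unknown"
    return "gpu-capable" if check() else "cpu-only"


if __name__ == "__main__":
    print("nvcc on PATH:", nvcc_path())
    print("llama_cpp build:", gpu_offload_status())
```

If this prints `cpu-only`, the wheel was built (or downloaded) without CUDA, so the install step is the place to look. If it prints `gpu-capable` and the model still runs on the CPU, the cause may be in the calling code instead: as far as I know, `Llama(...)` defaults to `n_gpu_layers=0`, so layers are only offloaded when that argument is set explicitly (for example `n_gpu_layers=-1` to offload all layers).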

Unable to Use GPU with llama-cpp-python on Jetson Orin #1779
