This project provides a PowerShell script to automate the setup of the llama.cpp development environment on Windows. It installs all required prerequisites silently, selects an appropriate CUDA Toolkit version, and builds llama.cpp from source.
- No more `-CudaArch` flag. The script auto-detects your GPU's SM (compute capability) via the NVIDIA driver (NVML). If detection isn't possible, it falls back to `CMAKE_CUDA_ARCHITECTURES=native`.
- Headless VS Build Tools install (via winget). Includes the Windows SDK and required C++ components; no GUI.
- Sane CUDA selection. If your GPU is pre-Turing (SM < 70), the script uses CUDA 12.4 for compatibility; otherwise it uses the latest installed (≥ 12.4).
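For illustration, here is a minimal sketch of that selection logic. It is not the script's actual code: `Get-GpuSm` is a hypothetical helper, and it assumes a driver recent enough that `nvidia-smi` supports the `compute_cap` query field (the script itself uses NVML).

```powershell
# Hypothetical sketch of the CUDA-selection heuristic, not the script's exact code.
function Get-GpuSm {
    # Recent NVIDIA drivers expose the compute capability via nvidia-smi.
    if (-not (Get-Command nvidia-smi -ErrorAction SilentlyContinue)) { return $null }
    $out = & nvidia-smi --query-gpu=compute_cap --format=csv,noheader
    if ($LASTEXITCODE -ne 0 -or -not $out) { return $null }
    [int]([double]@($out)[0].Trim() * 10)   # "8.6" -> 86
}

$sm = Get-GpuSm
if ($null -eq $sm) {
    $cudaArch = 'native'                 # unknown GPU: let CMake probe the hardware
} elseif ($sm -lt 70) {
    $cudaArch = $sm; $cuda = '12.4'      # pre-Turing: pin CUDA 12.4 for compatibility
} else {
    $cudaArch = $sm                      # Turing or newer: latest installed CUDA (>= 12.4)
}
```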
Requirements:
- Windows 10/11 x64
- PowerShell 7
- Recent NVIDIA driver (no CUDA toolkit required)
- ~20 GB free disk space
- App Installer / winget available (to install dependencies)
- Administrator rights (elevated PowerShell)
The GPU SM auto-detect uses `nvml.dll` from the NVIDIA driver. If NVML isn't available, the script falls back to a WMI-based heuristic and then to `CMAKE_CUDA_ARCHITECTURES=native`.
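For the curious, this is roughly how NVML can be queried from PowerShell via P/Invoke. It is a minimal sketch, not the script's exact implementation; it assumes `nvml.dll` is resolvable on the DLL search path (the driver installs it under System32) and elides error handling.

```powershell
# Minimal NVML P/Invoke sketch: read the first GPU's compute capability.
Add-Type -Namespace Nvml -Name Api -MemberDefinition @'
[DllImport("nvml.dll")] public static extern int nvmlInit_v2();
[DllImport("nvml.dll")] public static extern int nvmlShutdown();
[DllImport("nvml.dll")] public static extern int nvmlDeviceGetHandleByIndex_v2(uint index, out IntPtr device);
[DllImport("nvml.dll")] public static extern int nvmlDeviceGetCudaComputeCapability(IntPtr device, out int major, out int minor);
'@

if ([Nvml.Api]::nvmlInit_v2() -eq 0) {           # 0 == NVML_SUCCESS
    $dev = [IntPtr]::Zero
    if ([Nvml.Api]::nvmlDeviceGetHandleByIndex_v2(0, [ref]$dev) -eq 0) {
        $major = 0; $minor = 0
        if ([Nvml.Api]::nvmlDeviceGetCudaComputeCapability($dev, [ref]$major, [ref]$minor) -eq 0) {
            "SM $($major)$($minor)"               # e.g. "SM 86" on an Ampere GPU
        }
    }
    [Nvml.Api]::nvmlShutdown() | Out-Null
}
```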
What the script does:

- Admin check (must be elevated).
- Installs prerequisites if missing (silent, via `winget`; a sketch of these installs follows this list):
  - Git
  - CMake
  - Visual Studio 2022 Build Tools (with C++ toolchain and Windows SDK)
  - Ninja (and a portable fallback if needed)
- Chooses the CUDA Toolkit:
  - Detects your GPU's SM via NVML.
  - SM < 70 (pre-Turing) → installs/uses CUDA 12.4.
  - SM ≥ 70 or unknown → uses latest installed CUDA (≥ 12.4).
- Clones and builds `llama.cpp` under `vendor\llama.cpp`.
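The silent installs look roughly like the sketch below. The package IDs are the usual winget IDs for these tools, but the script's exact IDs, flags, and VS Build Tools override may differ.

```powershell
# Illustrative sketch of the silent prerequisite installs.
$wingetArgs = @(
    '--exact'
    '--silent'
    '--accept-package-agreements'
    '--accept-source-agreements'
)

winget install --id Git.Git           @wingetArgs
winget install --id Kitware.CMake     @wingetArgs
winget install --id Ninja-build.Ninja @wingetArgs

# VS Build Tools needs an --override to add the C++ workload (and the
# Windows SDK it recommends) headlessly, i.e. with no installer GUI.
winget install --id Microsoft.VisualStudio.2022.BuildTools @wingetArgs `
    --override '--quiet --wait --add Microsoft.VisualStudio.Workload.VCTools --includeRecommended'
```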
Run from an elevated PowerShell prompt:

```powershell
# Allow script execution for this session
Set-ExecutionPolicy Bypass -Scope Process

# Run the installer (auto-detects GPU SM; falls back to native)
./install_llama_cpp.ps1
```

Optional: skip the build step (installs prerequisites + CUDA only):

```powershell
./install_llama_cpp.ps1 -SkipBuild
```

The built binaries will be in `vendor\llama.cpp\build\bin`.
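For a quick sanity check that the build runs (assuming the default build layout above), the binaries can print their build info:

```powershell
# Print version/build info from the freshly built binary.
& .\vendor\llama.cpp\build\bin\llama-cli.exe --version
```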
To uninstall, run from an elevated PowerShell prompt:

```powershell
Set-ExecutionPolicy Bypass -Scope Process
./uninstall_llama_cpp.ps1
```

This removes the winget-installed prerequisites and the `vendor` directory (and portable Ninja if it was created).
The `run_llama_cpp_server.ps1` script provides a convenient way to start the llama.cpp server with the Qwen3-4B model.
- Downloads the Model: It automatically downloads the `Qwen3-4B-Instruct-2507-Q8_0-GGUF` model to a `models` subdirectory if it's not already present.
- Starts the Server: It launches `llama-server.exe` with parameters optimized for the Qwen3-4B model.
- Opens Web UI: After starting the server, it automatically opens `http://localhost:8080` in your default web browser.
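For orientation, here is a condensed sketch of those three steps. The download URL is a placeholder and the server flags are illustrative tuning choices, not the script's exact values (though `-m`, `--port`, and `-ngl` are standard `llama-server` options).

```powershell
# Illustrative sketch of run_llama_cpp_server.ps1; URL and flags are placeholders.
$modelDir = Join-Path $PSScriptRoot 'models'
$model    = Join-Path $modelDir 'Qwen3-4B-Instruct-2507-Q8_0.gguf'

if (-not (Test-Path $model)) {
    New-Item -ItemType Directory -Force -Path $modelDir | Out-Null
    # Hypothetical download URL -- the script resolves the real one.
    Invoke-WebRequest -Uri 'https://example.com/Qwen3-4B-Instruct-2507-Q8_0.gguf' `
                      -OutFile $model
}

# -ngl 99 offloads all layers to the GPU; --port 8080 matches the web UI URL.
Start-Process .\vendor\llama.cpp\build\bin\llama-server.exe `
    -ArgumentList '-m', $model, '--port', '8080', '-ngl', '99'

Start-Process 'http://localhost:8080'   # open the web UI in the default browser
```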
To run the server, use the following command in PowerShell:

```powershell
./run_llama_cpp_server.ps1
```

You can also specify the number of CPU threads to use with the `-Threads` parameter:

```powershell
./run_llama_cpp_server.ps1 -Threads 12
```

Troubleshooting:

- winget not found: Install "App Installer" from the Microsoft Store, then re-run.
- Pending reboot: Some installs require a reboot (Windows Update/VS Installer). Reboot and re-run.
- CUDA side-by-side: Multiple CUDA toolkits can co-exist; the uninstaller can remove them via winget.
- NVML missing: The script falls back to a heuristic and then to `CMAKE_CUDA_ARCHITECTURES=native`.
- Locked files: Stop `llama-server`/`llama-cli` before uninstalling.
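For the pending-reboot case, one common heuristic (an assumption here, not something the script is documented to do) is to check the registry markers Windows Update and component servicing leave behind:

```powershell
# Heuristic pending-reboot check via well-known registry keys.
$keys = @(
    'HKLM:\SOFTWARE\Microsoft\Windows\CurrentVersion\Component Based Servicing\RebootPending'
    'HKLM:\SOFTWARE\Microsoft\Windows\CurrentVersion\WindowsUpdate\Auto Update\RebootRequired'
)
if ($keys | Where-Object { Test-Path $_ }) {
    Write-Warning 'A reboot is pending; reboot before re-running the installer.'
}
```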
This project is licensed under the MIT License. See the LICENSE file for details.
This project is a simplified version of the local-qwen3-coder-env repository, focusing solely on the installation of llama.cpp.