I'm not very familiar with Windows development, but here are some notes that I hope can help.
Please also refer to llama.cpp.
- Visual Studio Community installed with Desktop C++ Environment selected during installation
- Chocolatey (a package manager for Windows) installed
- CMake installed
- Python 3 installed
- LLaMA models downloaded (dalai can help)
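Before starting, it may help to confirm the tools are already on your PATH; these version checks are standard commands and safe to run in any PowerShell window:
cmake --version
git --version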
To install Make, open PowerShell as an administrator and run the following command:
choco install make
If Python is not installed, you can install it via Chocolatey as well:
choco install python
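After the installs finish, you may need to open a new PowerShell window so the PATH updates take effect; a quick check that both tools are picked up:
make --version
python --version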
Clone the repository using Git, or download it as a ZIP file and extract it to a directory on your machine.
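For example, to clone with Git (this assumes the upstream repository URL; use your fork's URL if you have one):
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp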
Use Visual Studio to open the llama.cpp directory.
Select "View" and then "Terminal" to open a command prompt within Visual Studio. Type the following commands:
cmake .
make
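If you want to see what the configure step generated, listing the solution and project files should show quantize.vcxproj and ALL_BUILD.vcxproj among others, assuming CMake picked the Visual Studio generator:
dir *.sln, *.vcxproj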
In the Solution Explorer panel on the right-hand side:
- Right-click quantize.vcxproj and select Build. This outputs .\Debug\quantize.exe.
- Right-click ALL_BUILD.vcxproj and select Build. This outputs .\Debug\llama.exe.
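If you prefer to stay in the terminal, the same projects can usually be built with CMake directly; the Debug configuration here matches the output paths above:
cmake --build . --config Debug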
Back in the PowerShell terminal, cd to the llama.cpp directory. Assuming the LLaMA models have been downloaded to the models directory, run:
python -m venv venv
.\venv\Scripts\pip.exe install torch torchvision torchaudio sentencepiece numpy
.\venv\Scripts\python.exe convert-pth-to-ggml.py models/7B/ 1
.\Debug\quantize.exe ./models/7B/ggml-model-f16.bin ./models/7B/ggml-model-q4_0.bin 2
.\Debug\llama.exe -m ./models/7B/ggml-model-q4_0.bin -t 8 -n 128
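As a quick usage example, you can also pass a prompt directly with the -p flag (the prompt text here is just an illustration):
.\Debug\llama.exe -m ./models/7B/ggml-model-q4_0.bin -t 8 -n 128 -p "Building llama.cpp on Windows is"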