Skip to content

teddykoker/llama-rs-server

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

llama-rs-server

In progress. OpenAI-style HTTP API for LLaMA, Alpaca, etc.

Usage

Generate ggml models with llama.cpp. Run with:

cargo run --release -- -m models/7B/ggml-model-q4_0.bin

Example usage:

$ curl --request POST \
  --url http://localhost:3000/completions \
  --header 'Content-Type: application/json' \
  --data '{
        "prompt": "Llamas are ",
        "max_tokens": 128
}'

{"text":"Llamas are 5-9 in height, with average weights of around one to two hundred pounds. They have an amazing memory and make great pets! Llama's live for anywhere between twenty five years up to thirty nine depending on the care they recieve...\nMiniature Goats (Nigerian Dwarfs) are adorable, smart animals that can grow from 20-36 inches tall. They have a great ability of adapting and surviving in harsh climates with their dense hair coating to protect them against the elements...."}

References

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages