# Local-LLM Tests and Examples

Choose a model from the models list and pass its name as the `model` argument in the API calls. The Models section below shows how to get the list.

Install the `openai` and `requests` packages:

```bash
pip install openai requests
```

**Note: you do not need an OpenAI API key. The API key is the `LOCAL_LLM_API_KEY` for your server, if you defined one in your `.env` file.**
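
To avoid pasting the key into every example, the settings can be read from the environment. This is a minimal sketch, assuming `LOCAL_LLM_API_KEY` is exported in the shell that runs the script. `LOCAL_LLM_BASE_URL` and the `"none"` placeholder are hypothetical names used only for illustration; whether the server accepts a placeholder key when none is configured is an assumption.

```python
import os


def local_llm_settings(env=os.environ):
    """Return (base_url, api_key) for the local server.

    LOCAL_LLM_BASE_URL is a hypothetical variable, not defined by the server.
    The "none" placeholder assumes the server ignores the key when no
    LOCAL_LLM_API_KEY was configured.
    """
    base_url = env.get("LOCAL_LLM_BASE_URL", "http://localhost:8091/v1/")
    api_key = env.get("LOCAL_LLM_API_KEY", "none")
    return base_url, api_key


base_url, api_key = local_llm_settings()
```

The two values can then be assigned to `openai.base_url` and `openai.api_key` in the examples that follow.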

## Models

Get a list of models to choose from if you don't already know which model you want to use.


```python
import requests

models = requests.get("http://localhost:8091/v1/models")
print(models.json())
```

['bakllava-1-7b', 'llava-v1.5-7b', 'llava-v1.5-13b', 'DiscoLM_German_7b_v1', 'DiscoLM_German_7b_v1', 'Garrulus', 'DareVox-7B', 'NexoNimbus-7B', 'Lelantos-Maid-DPO-7B', 'stable-code-3b', 'Dr_Samantha-7B', 'NeuralBeagle14-7B', 'tigerbot-13B-chat-v5', 'Nous-Hermes-2-Mixtral-8x7B-SFT', 'Thespis-13B-DPO-v0.7', 'Code-290k-13B', 'Nous-Hermes-2-Mixtral-8x7B-DPO', 'Venus-120b-v1.2', 'LLaMA2-13B-Estopia', 'medicine-LLM', 'finance-LLM-13B', 'Yi-34B-200K-DARE-megamerge-v8', 'phi-2-orange', 'laser-dolphin-mixtral-2x7b-dpo', 'bagel-dpo-8x7b-v0.2', 'Everyone-Coder-4x7b-Base', 'phi-2-electrical-engineering', 'Cosmosis-3x34B', 'HamSter-0.1', 'Helion-4x34B', 'Bagel-Hermes-2x34b', 'deepmoney-34b-200k-chat-evaluator', 'deepmoney-34b-200k-base', 'TowerInstruct-7B-v0.1', 'PiVoT-SUS-RP', 'Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss', 'TenyxChat-7B-v1', 'UNA-TheBeagle-7B-v1', 'WhiteRabbitNeo-33B-v1', 'WinterGoliath-123b', 'Open_Gpt4_8x7B_v0.2', 'neuronovo-7B-v0.3', 'Nous-Capybara-limarpv3-34B', 'MegaDolphin-120
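
Before calling the API, it can help to confirm that the chosen name is actually in the list. This is a minimal sketch, assuming `/v1/models` returns a flat list of model names as in the output above. `resolve_model` is a hypothetical helper, not part of the server or the `openai` package.

```python
import difflib


def resolve_model(requested, available):
    """Return `requested` if the server lists it; otherwise raise with suggestions."""
    if requested in available:
        return requested
    # Suggest up to three similarly spelled names from the server's list
    suggestions = difflib.get_close_matches(requested, available, n=3)
    raise ValueError(
        f"Model {requested!r} is not in the server's list; close matches: {suggestions}"
    )
```

For example, `resolve_model("stable-code-3b", models.json())` would return the name unchanged if the server lists it.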

## Completion

[OpenAI API Reference](https://platform.openai.com/docs/api-reference/completions/create)


```python
import openai

# Modify this prompt to generate different outputs
prompt = "Write a haiku about dragons."

openai.base_url = "http://localhost:8091/v1/"
openai.api_key = "Your LOCAL_LLM_API_KEY from your .env file"

completion = openai.completions.create(
    model="phi-2-dpo",
    prompt=prompt,
    temperature=1.31,
    max_tokens=1024,
    top_p=1.0,
    n=1,
    stream=False,
)
print(completion.choices[0].text)
```

InternalServerError: Internal Server Error

## Chat Completion

[OpenAI API Reference](https://platform.openai.com/docs/api-reference/chat)


```python
import openai

# Modify this prompt to generate different outputs
prompt = "Write a haiku about Taco Bell's Doritos Locos Tacos."

openai.api_key = "Your LOCAL_LLM_API_KEY from your .env file"
openai.base_url = "http://localhost:8091/v1/"

messages = [{"role": "system", "content": prompt}]
response = openai.chat.completions.create(
    model="phi-2-dpo",
    messages=messages,
    temperature=1.31,
    max_tokens=1024,
    top_p=1.0,
    n=1,
    stream=False,
)
# This reads the reply from the `messages` list in the server's response,
# at index 1, after the message that was sent. With the standard OpenAI
# response shape, the reply is at response.choices[0].message.content.
print(response.messages[1]["content"])
```

Dorito bites joy, crunchy and hot
  Sizzling glory wraps beef and beans tight
  Taco Bell's silly snacks, pure bliss

A playful fusion o' flavor
Sampled fingers cajole caps off gaze
You'll arrive your D-Lit!

  Endangered bird speaks
Chirp of carne loca erupts now
Trademark Tacos as Band, haha!!!
+ Mockingbirdmckitty.com  (We hope it's tongue twister right!)
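
The example above sends the whole prompt as a single `system` message. In the OpenAI chat format, instructions usually go in a `system` message and the actual request in a `user` message. The sketch below builds such a list; `build_messages` is a hypothetical helper, and whether the local server's prompt template treats the two roles differently is an assumption.

```python
def build_messages(user_prompt, system_prompt=None):
    """Build a chat `messages` list with an optional system instruction."""
    messages = []
    if system_prompt:
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": user_prompt})
    return messages


messages = build_messages(
    "Write a haiku about Taco Bell's Doritos Locos Tacos.",
    system_prompt="You are a poet who answers with exactly one haiku.",
)
```

The resulting list can be passed as the `messages` argument to `openai.chat.completions.create`.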


## Embeddings

[OpenAI API Reference](https://platform.openai.com/docs/api-reference/embeddings)


```python
import openai

# Modify this prompt to generate different outputs
prompt = "Columbus is the capital of Ohio."

openai.base_url = "http://localhost:8091/v1/"
openai.api_key = "Your LOCAL_LLM_API_KEY from your .env file"

response = openai.embeddings.create(
    input=prompt,
    model="phi-2-dpo",
)
print(response.data[0].embedding)
```

[0.6698899269104004, -0.32914745807647705, -0.4694478213787079, -0.4214792251586914, 0.3705234229564667, -0.47680866718292236, -1.4957921504974365, -0.44858619570732117, -0.3574576675891876, 0.8155132532119751, -0.7275238037109375, 0.25924068689346313, 0.8450661301612854, 0.9737700819969177, -0.992473840713501, 0.2610418200492859, 0.5055912733078003, 1.0409797430038452, -0.31559237837791443, -0.2266843616962433, -0.5597264766693115, 1.420119285583496, -0.5836111307144165, -0.8978241682052612, -0.394643634557724, 0.0677919089794159, 0.8806999921798706, -0.45033740997314453, 1.1135565042495728, 0.13567695021629333, 0.6969098448753357, 0.47800207138061523, 0.13237224519252777, 0.4649584889411926, 1.7724541425704956, 0.12804192304611206, -0.28987017273902893, 0.05258326604962349, 1.0611693859100342, 0.07527725398540497, 1.4735026359558105, 0.03624189645051956, -0.7749932408332825, -1.318582534790039, -0.032036032527685165, -0.5194717645645142, 1.005630612373352, -0.7919350862503052, -0.531
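
A common use of embedding vectors like the one above is comparing two texts by cosine similarity. This is a minimal pure-Python sketch; `cosine_similarity` is a hypothetical helper, and it assumes both vectors come from the same model so their lengths match.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (1.0 = same direction)."""
    if len(a) != len(b):
        raise ValueError("Vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        # Similarity is undefined for a zero vector; report 0.0 by convention
        return 0.0
    return dot / (norm_a * norm_b)
```

Given two responses from `openai.embeddings.create`, the call would look like `cosine_similarity(first.data[0].embedding, second.data[0].embedding)`.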