# Local-LLM Tests and Examples

Choose a model from the models list and pass its name as the `model` argument in the API calls. The Models section below shows how to list the available models.

Install the `openai` and `requests` Python packages:

```bash
pip install openai requests
```

**Note: you do not need an OpenAI API key. The API key is the `LOCAL_LLM_API_KEY` for your server, if you defined one in your `.env` file.**
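As a minimal sketch of the note above, the key can be read from the environment instead of being pasted into each script. This assumes `LOCAL_LLM_API_KEY` has been exported into the client's environment (the `.env` file itself belongs to the server). `resolve_api_key` and the `"none"` placeholder are hypothetical names used for illustration; the placeholder exists because the `openai` client expects some key value to be set.

```python
import os


def resolve_api_key(env=None, placeholder="none"):
    """Return LOCAL_LLM_API_KEY from the environment, or a placeholder if unset."""
    env = os.environ if env is None else env
    key = env.get("LOCAL_LLM_API_KEY", "").strip()
    # Fall back to a dummy value (an assumption) when no key is defined
    return key or placeholder


api_key = resolve_api_key()
# Then, as in the examples in this document: openai.api_key = api_key
```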

## Models

If you don't already know which model you want to use, get the list of available models.


In [1]:
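import requests

# Ask the local server which models it can serve
models = requests.get("http://localhost:8091/v1/models")
models.raise_for_status()  # fail early on an HTTP error
print(models.json())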

['bakllava-1-7b', 'llava-v1.5-7b', 'llava-v1.5-13b', 'DiscoLM_German_7b_v1', 'DiscoLM_German_7b_v1', 'Garrulus', 'DareVox-7B', 'NexoNimbus-7B', 'Lelantos-Maid-DPO-7B', 'stable-code-3b', 'Dr_Samantha-7B', 'NeuralBeagle14-7B', 'tigerbot-13B-chat-v5', 'Nous-Hermes-2-Mixtral-8x7B-SFT', 'Thespis-13B-DPO-v0.7', 'Code-290k-13B', 'Nous-Hermes-2-Mixtral-8x7B-DPO', 'Venus-120b-v1.2', 'LLaMA2-13B-Estopia', 'medicine-LLM', 'finance-LLM-13B', 'Yi-34B-200K-DARE-megamerge-v8', 'phi-2-orange', 'laser-dolphin-mixtral-2x7b-dpo', 'bagel-dpo-8x7b-v0.2', 'Everyone-Coder-4x7b-Base', 'phi-2-electrical-engineering', 'Cosmosis-3x34B', 'HamSter-0.1', 'Helion-4x34B', 'Bagel-Hermes-2x34b', 'deepmoney-34b-200k-chat-evaluator', 'deepmoney-34b-200k-base', 'TowerInstruct-7B-v0.1', 'PiVoT-SUS-RP', 'Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss', 'TenyxChat-7B-v1', 'UNA-TheBeagle-7B-v1', 'WhiteRabbitNeo-33B-v1', 'WinterGoliath-123b', 'Open_Gpt4_8x7B_v0.2', 'neuronovo-7B-v0.3', 'Nous-Capybara-limarpv3-34B', 'MegaDolphin-120
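
The output above is a flat list of names with at least one repeat (`DiscoLM_German_7b_v1` appears twice). A small helper can normalize it before you pick a model. This is only a sketch: `model_names` is a hypothetical helper, and the OpenAI-style `{"data": [{"id": ...}]}` branch is an assumption for servers that return that shape, not something this server is shown to do.

```python
def model_names(payload):
    """Return unique model names, in order, from a /v1/models JSON payload."""
    if isinstance(payload, dict):
        # Assumed OpenAI-style shape: {"data": [{"id": "name"}, ...]}
        items = [entry["id"] for entry in payload.get("data", [])]
    else:
        # Shape shown in this document: ["name", ...]
        items = list(payload)
    # dict.fromkeys drops duplicates while keeping first-seen order
    return list(dict.fromkeys(items))


print(model_names(["llava-v1.5-7b", "DiscoLM_German_7b_v1", "DiscoLM_German_7b_v1"]))
```

With the request shown earlier, this would be called as `model_names(models.json())`.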

## Completion

[OpenAI API Reference](https://platform.openai.com/docs/api-reference/completions/create)


In [52]:
import openai

# Modify this prompt to generate different outputs
prompt = "Write a haiku about dragons, then end with </s>."

openai.base_url = "http://localhost:8091/v1/"
openai.api_key = "Your LOCAL_LLM_API_KEY from your .env file"

completion = openai.completions.create(
    model="phi-2-dpo",
    prompt=prompt,
    temperature=0.3,
    max_tokens=1024,
    top_p=0.90,
    n=1,
    stream=False,
)
print(completion.choices[0].text)

Dragon's breath,  
   Flames dance in the night sky,  
   Mythic wonder.   

   Scales shimmer bright,  
   Ancient beasts guard their lair,  
   Legends never die.
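
The prompt asks the model to finish with `</s>`, and the sample output carries uneven indentation. If you want tidier text, a post-processing step along these lines may help. It is only a sketch: `clean_completion` is a hypothetical helper, and whether a given model emits the literal `</s>` marker in its text is an assumption.

```python
def clean_completion(text, eos="</s>"):
    """Cut text at the first end-of-sequence marker and strip each line."""
    # Keep only what comes before the marker, if the marker is present
    body = text.split(eos, 1)[0]
    # Remove leading/trailing whitespace per line, then outer blank lines
    lines = [line.strip() for line in body.splitlines()]
    return "\n".join(lines).strip()


raw = "Dragon's breath,  \n   Flames dance in the night sky,  \n   Mythic wonder.   </s> extra"
print(clean_completion(raw))
```

With the call above, this would be applied as `clean_completion(completion.choices[0].text)`.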


## Chat Completion

[OpenAI API Reference](https://platform.openai.com/docs/api-reference/chat)


In [41]:
import openai

# Modify this prompt to generate different outputs
prompt = "Write a haiku about Taco Bell's Doritos Locos Tacos, then end with </s>."

openai.api_key = "Your LOCAL_LLM_API_KEY from your .env file"
openai.base_url = "http://localhost:8091/v1/"

messages = [{"role": "user", "content": prompt}]
response = openai.chat.completions.create(
    model="phi-2-dpo",
    messages=messages,
    temperature=0.3,
    max_tokens=1024,
    top_p=0.90,
    stream=False,
)
print(response.choices[0].message.content)

Crunchy shells burst,
  Cheesy salsa and meat blend,
  Loyalty never fades.

  Taco Bell's Locos Tacos,
  A taste of childhood memories,
  Always a favorite.

  Doritos Locos Tacos,
  The perfect snack for any time,
  Satisfy your cravings.
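
The call above uses `stream=False`. With `stream=True`, the OpenAI Python client yields chunks whose `choices[0].delta.content` holds the next piece of text. The sketch below joins those pieces. `collect_stream` and `fake_chunk` are hypothetical helpers, the chunks are stand-ins built with `SimpleNamespace` so the block runs without a server, and whether your Local-LLM server supports streaming is an assumption to verify.

```python
from types import SimpleNamespace


def collect_stream(chunks):
    """Join the text deltas from a streamed chat completion."""
    parts = []
    for chunk in chunks:
        if not chunk.choices:
            continue  # skip chunks that carry no choices
        delta = chunk.choices[0].delta.content
        if delta:  # a closing chunk may have content=None
            parts.append(delta)
    return "".join(parts)


def fake_chunk(text):
    # Stand-in mimicking the shape of a streamed chunk, for illustration only
    delta = SimpleNamespace(content=text)
    return SimpleNamespace(choices=[SimpleNamespace(delta=delta)])


demo = [fake_chunk("Crunchy "), fake_chunk("shells"), fake_chunk(None)]
print(collect_stream(demo))
```

Against a live server, `chunks` would be the return value of `openai.chat.completions.create(..., stream=True)`.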


## Embeddings

[OpenAI API Reference](https://platform.openai.com/docs/api-reference/embeddings)


In [53]:
import openai

# Modify this text to embed a different input
prompt = "Tacos are great."

openai.base_url = "http://localhost:8091/v1/"
openai.api_key = "Your LOCAL_LLM_API_KEY from your .env file"

response = openai.embeddings.create(
    input=prompt,
    model="phi-2-dpo",
)
print(response.data[0].embedding)

[0.2244519144296646, -8.292693138122559, -5.11789083480835, 6.12165641784668, 0.1106947809457779, -3.0858023166656494, 4.464869499206543, -3.687350034713745, -4.991833209991455, -1.3502041101455688, 0.3602711856365204, 0.9228768944740295, 0.48679086565971375, 7.6126322746276855, -11.623971939086914, 1.3585466146469116, 2.2658090591430664, -4.2918572425842285, 1.5248425006866455, -1.7390656471252441, -1.6036713123321533, 0.4873034954071045, -1.803336501121521, 1.1754555702209473, -2.9386746883392334, -3.4137110710144043, -0.06379293650388718, -0.33252498507499695, -7.282681941986084, 0.5026590824127197, 7.986130237579346, -1.1864018440246582, 1.942865014076233, 1.8377327919006348, -0.6112569570541382, -4.406113624572754, -2.1407082080841064, 0.6969444155693054, -0.166226327419281, -0.6015466451644897, 3.156931161880493, -9.312156677246094, 8.893461227416992, 0.09793171286582947, 6.1327948570251465, -1.658631682395935, -4.0215253829956055, -3.54531192779541, 2.787688732147217, -1.4372832
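
Embedding vectors like the one printed above are usually compared with each other rather than read directly. A common choice is cosine similarity, sketched here in plain Python. `cosine_similarity` is a hypothetical helper, and the short vectors are made-up stand-ins, not real model output.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


print(cosine_similarity([1.0, 0.0], [0.0, 1.0]))  # orthogonal vectors
print(cosine_similarity([3.0, 4.0], [3.0, 4.0]))  # identical vectors
```

To compare two texts, you would request an embedding for each as shown above and pass the two `response.data[0].embedding` lists to this function.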