# Local-LLM Examples

Choose a model from the models list and pass its name as the `model` argument in the API calls. The first code cell below retrieves the list of models available on the server.

**Note: you do not need an OpenAI API key. The API key is the one you defined for your own server, if you defined one.**
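If a server key is defined, plain `requests` calls need to send it as well. The helper below is a minimal sketch, assuming the server accepts the same `Authorization: Bearer ...` header that the `openai` client sends; the function name `auth_headers` and the environment variable `LOCAL_LLM_API_KEY` are hypothetical names chosen for this example.

```python
import os


def auth_headers(api_key=None):
    """Build request headers, adding a bearer token only when a key is set."""
    # LOCAL_LLM_API_KEY is a hypothetical variable name used for illustration.
    if api_key is None:
        api_key = os.environ.get("LOCAL_LLM_API_KEY", "")
    headers = {"Content-Type": "application/json"}
    if api_key:
        # Assumption: the server expects an OpenAI-style bearer token.
        headers["Authorization"] = f"Bearer {api_key}"
    return headers
```

The result can be passed as `headers=auth_headers()` to `requests.get` or `requests.post`. With no key defined, no `Authorization` header is sent.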


In [None]:
import requests

models = requests.get("http://localhost:8091/v1/models")
print(models.json())
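To pick a name for the `model` argument, it can help to reduce the response to just the model identifiers. This is a sketch that assumes the server returns the OpenAI-style list shape (`{"object": "list", "data": [{"id": ...}]}`), which the source does not confirm; `model_ids` is a hypothetical helper name.

```python
def model_ids(payload):
    """Collect model identifiers from an OpenAI-style model list payload."""
    # Accept either the full payload dict or a bare list of entries.
    entries = payload.get("data", []) if isinstance(payload, dict) else payload
    return [
        entry["id"]
        for entry in entries
        if isinstance(entry, dict) and "id" in entry
    ]


# Usage against the running server (requires the server to be up):
# import requests
# print(model_ids(requests.get("http://localhost:8091/v1/models").json()))
```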

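Install the OpenAI Python package. The examples below use the pre-1.0 interface (`openai.Completion`, `openai.ChatCompletion`, `openai.Embedding`), so the version is pinned to 0.28.1.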


In [None]:
%pip install openai==0.28.1

## Completion

[OpenAI API Reference](https://platform.openai.com/docs/api-reference/completions/create)


In [16]:
import openai

openai.api_base = "http://localhost:8091/v1"
openai.api_key = ""
prompt = "Tell me a short story about dragons."

response = openai.Completion.create(
    model="Mistral-7B-OpenOrca",
    prompt=prompt,
    temperature=1.31,
    max_tokens=8192,
    top_p=1.0,
    n=1,
    stream=False,
)
print(response)

{
  "id": "cmpl-59e8a65d-41a3-4d48-be8b-7a3c8fc36361",
  "object": "text_completion",
  "created": 1705516910,
  "model": "Mistral-7B-OpenOrca",
  "choices": [
    {
      "text": "Once upon a time, in a land far far away full of all kinds of magical creatures, there lived large and fantastic dragons who could breathe powerful fire.\n\nThe elders, giants renowned for their storytelling abilities, loved to remember the tale when the lands were full of life and joy unlike anything ever seen before. Long ago, the world had plenty of resources; oceans swarming with fish, mountains of precious stones, and meadows of wildflowers. But soon as these magical creatures started disappearing, others stepped into their habitat which changed the earth's balance forever but among all those creatures, DRAGONS always had a special place in human\u2019s mind.\n\nAn enigmatic volcanic dragon named Malgoryst stood tall in an azure mountain valley surrounded by greenery as far as his mighty red and scaly e

## Chat Completion

[OpenAI API Reference](https://platform.openai.com/docs/api-reference/chat)


In [17]:
import openai

openai.api_base = "http://localhost:8091/v1"
openai.api_key = ""
prompt = "What is the capital of Ohio?"
# Send the question as a user message, matching the role shown in the output.
messages = [{"role": "user", "content": prompt}]

response = openai.ChatCompletion.create(
    model="Mistral-7B-OpenOrca",
    messages=messages,
    temperature=1.31,
    max_tokens=8192,
    top_p=1.0,
    n=1,
    stream=False,
)
print(response)

{
  "id": "cmpl-13bd0802-0175-49e9-9a5a-5eec0986c835",
  "object": "text_completion",
  "created": 1705517135,
  "model": "Mistral-7B-OpenOrca",
  "usage": {
    "prompt_tokens": 62,
    "completion_tokens": 51,
    "total_tokens": 113
  },
  "messages": [
    {
      "role": "user",
      "content": "What is the capital of Ohio?"
    },
    {
      "role": "assistant",
      "content": "The capital of Ohio is Columbus. It's where the state's legislative, executive, and judicial branch buildings are all located, along with offices for governmental entities on a state or federal level."
    }
  ]
}
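The output above returns the conversation under a `messages` key, whereas OpenAI's own chat responses place the reply under `choices`. The helper below is a small sketch for pulling out the assistant text from either shape; `assistant_reply` is a hypothetical name, and the `choices` branch is an assumption about what other server versions might return.

```python
def assistant_reply(response):
    """Return the assistant's text from a chat response, or None if absent."""
    # OpenAI-style shape: choices[0].message.content (or .text for completions).
    choices = response.get("choices")
    if choices:
        first = choices[0]
        message = first.get("message")
        if message:
            return message.get("content")
        return first.get("text")
    # Shape shown in the output above: a "messages" list ending with the reply.
    for message in reversed(response.get("messages", [])):
        if message.get("role") == "assistant":
            return message.get("content")
    return None
```

Calling `assistant_reply(response)` on the response printed above would return the sentence beginning "The capital of Ohio is Columbus."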


## Embeddings

[OpenAI API Reference](https://platform.openai.com/docs/api-reference/embeddings)

The embeddings endpoint currently uses an ONNX embedder with a maximum input length of 256 tokens.
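Because of the 256-token limit, longer texts may need to be split before they are embedded. The sketch below splits on whitespace and uses word count as a rough proxy for token count. The source does not name the embedder's tokenizer or say how the server handles longer inputs, so `chunk_words` and its default budget of 150 words are assumptions chosen to leave some headroom.

```python
def chunk_words(text, max_words=150):
    """Split text into pieces of at most max_words whitespace-separated words."""
    # Word count only approximates token count; 150 is an assumed safe budget.
    words = text.split()
    return [
        " ".join(words[start:start + max_words])
        for start in range(0, len(words), max_words)
    ]
```

Each chunk can then be passed as `input` to `openai.Embedding.create` in its own call.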


In [18]:
import openai

openai.api_base = "http://localhost:8091/v1"
openai.api_key = ""
prompt = "Columbus is the capital of Ohio."

response = openai.Embedding.create(
    input=prompt,
    engine="Mistral-7B-OpenOrca",
)

print(response)

{
  "object": "list",
  "data": [
    {
      "object": "embedding",
      "embedding": [
        3.397202491760254,
        -4.438299655914307,
        2.4834322929382324,
        -4.250019550323486,
        2.6028881072998047,
        4.35982084274292,
        4.733896732330322,
        0.24392911791801453,
        5.598224639892578,
        -3.29296612739563,
        -3.223780632019043,
        0.8454574942588806,
        -1.2303225994110107,
        1.2776594161987305,
        0.7008570432662964,
        -3.6054036617279053,
        0.6928118467330933,
        0.5641763806343079,
        3.0058746337890625,
        -0.24843311309814453,
        -2.102562665939331,
        -2.213374376296997,
        -1.8864010572433472,
        6.977653503417969,
        -7.276993274688721,
        3.125082492828369,
        3.601240873336792,
        0.5221434831619263,
        -8.706903457641602,
        0.8038478493690491,
        0.6118654012680054,
        -4.164190292358398,
        -3.433185