# XAI Vision Examples
This notebook demonstrates how to use XAI with AgentOps via the OpenAI Python client.

We are going to use XAI's Grok vision model (`grok-vision-beta`) to create a program that reads the text in an image and explains it. We will use AgentOps to track the program's performance.

First, let's install the required packages.

In [None]:
%pip install -U openai
%pip install -U agentops
%pip install -U python-dotenv  # provides the `dotenv` module imported below

Then import them

In [1]:
from openai import OpenAI
import agentops
import os
from dotenv import load_dotenv

Next, we'll grab our API keys. You can use dotenv as shown below, or load the environment variables however you prefer.

In [2]:
load_dotenv()
XAI_API_KEY = os.getenv("XAI_API_KEY") or "<your_xai_key>"
AGENTOPS_API_KEY = os.getenv("AGENTOPS_DEV_API_KEY") or "<your_agentops_key>"
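
The cell above falls back to a placeholder string when a key is missing, so a missing key only shows up later as an authentication error. As a minimal sketch of a stricter alternative, the helper below fails fast instead. The name `load_key` is hypothetical and not part of dotenv or AgentOps.

```python
import os


def load_key(name: str) -> str:
    """Return the value of environment variable `name`.

    Hypothetical helper for this notebook: raises instead of silently
    falling back to a placeholder when the variable is missing or blank.
    """
    value = os.getenv(name, "").strip()
    if not value:
        raise RuntimeError(f"Environment variable {name} is not set")
    return value


# Example usage (after load_dotenv() has run):
# XAI_API_KEY = load_key("XAI_API_KEY")
```

Whether to fail fast or keep the placeholder fallback is a matter of taste; the fallback is convenient when you paste keys directly into the notebook.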

Next we initialize the AgentOps client.

In [None]:
agentops.init(AGENTOPS_API_KEY, default_tags=["xai-example", "grok-vision"])

And we are all set! Note the session URL above. We will use it to track the program's performance.

Let's initialize the OpenAI client with the XAI API key and base URL.

In [4]:
client = OpenAI(
    api_key=XAI_API_KEY,
    base_url="https://api.x.ai/v1",
)

Next we will set the system and instruction prompts for the program.

In [8]:
SYSTEM_PROMPT = """You are an expert image analysis assistant. When presented with an image, carefully examine and describe its contents in detail. 

For this task, your goal is to:
1. Identify all key elements, objects, people, or text in the image
2. Provide a comprehensive description of what you observe
3. Explain the context or historical significance if applicable
4. Describe the image in a clear, objective, and informative manner

Please be precise, thorough, and focus on providing meaningful insights about the visual content."""

USER_PROMPT = [
    {
        "type": "text",
        "text": "Analyze the image and provide a detailed description of what you see."
    },
    {
        "type": "image_url",
        "image_url": {"url": "https://upload.wikimedia.org/wikipedia/commons/f/ff/First_Computer_Bug%2C_1945.jpg"}
    }
]
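
The user prompt above is a list of content parts in the OpenAI chat format: one `text` part and one `image_url` part. The sketch below wraps that structure in small helpers so the same prompt can be reused with other images. All three function names are hypothetical. The `data:` URL variant for local image bytes follows the OpenAI message format; that the XAI endpoint accepts it is an assumption worth checking against the XAI documentation.

```python
import base64


def image_part_from_url(url: str) -> dict:
    """Build an image content part that points at a remote image."""
    return {"type": "image_url", "image_url": {"url": url}}


def image_part_from_bytes(data: bytes, mime_type: str = "image/jpeg") -> dict:
    """Build an image content part from raw bytes, encoded as a base64 data URL.

    Assumption: the target endpoint accepts data URLs in `image_url`.
    """
    encoded = base64.b64encode(data).decode("ascii")
    return {
        "type": "image_url",
        "image_url": {"url": f"data:{mime_type};base64,{encoded}"},
    }


def build_user_prompt(text: str, image_part: dict) -> list:
    """Combine an instruction and one image part into a user message content list."""
    return [{"type": "text", "text": text}, image_part]
```

For example, `build_user_prompt("Analyze the image.", image_part_from_url(some_url))` produces a list with the same shape as `USER_PROMPT`.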

Now we will use the OpenAI client to process the image and generate a response.

In [None]:
response = client.chat.completions.create(
    model="grok-vision-beta",
    messages=[
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": USER_PROMPT}
    ],
    max_tokens=4096,
)

print(response.choices[0].message.content)

Awesome! The model returns a response that explains the image and also deciphers the text in it. All of this can be tracked in AgentOps by opening the session URL above.

In [None]:
agentops.end_session("Success")

We end the session with a success end state. This is useful if you want to track whether the program succeeded or failed. If it failed, you can set the end state to failure and provide a reason. By default, the session will have an indeterminate end state.
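
One way to report a failure is to wrap the model call and pick the end state from the outcome. This is a sketch under assumptions: `run_and_close` is a hypothetical helper, and the `"Fail"` state string and `end_state_reason` keyword are assumed to match the `agentops.end_session` signature in the SDK version installed here. The session-ending function is passed in as an argument, so the helper itself does not import AgentOps.

```python
from typing import Callable, Optional


def run_and_close(
    task: Callable[[], str],
    end_session: Callable[..., None],
) -> Optional[str]:
    """Run `task`, then close the session with a matching end state.

    `end_session` is expected to behave like `agentops.end_session`
    (assumed signature: end_state, end_state_reason=None).
    """
    try:
        result = task()
    except Exception as exc:
        # Record why the run failed so it is visible in the session view.
        end_session("Fail", end_state_reason=f"{type(exc).__name__}: {exc}")
        return None
    end_session("Success")
    return result
```

In this notebook, `task` could be a function that calls `client.chat.completions.create(...)` and returns the message content, with `agentops.end_session` passed as `end_session`.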