# Welcome to Week 2!

## Frontier Model APIs

In Week 1, we used multiple Frontier LLMs through their Chat UIs, and we connected to OpenAI's API.

Today we'll connect with the APIs for Anthropic and Google, as well as OpenAI.

<table style="margin: 0; text-align: left;">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../important.jpg" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#900;">Important Note - Please read me</h2>
            <span style="color:#900;">I'm continually improving these labs, adding more examples and exercises.
            At the start of each week, it's worth checking you have the latest code.<br/>
            First do a <a href="https://chatgpt.com/share/6734e705-3270-8012-a074-421661af6ba9">git pull and merge your changes as needed</a>. Any problems? Try asking ChatGPT to clarify how to merge - or contact me!<br/><br/>
            After you've pulled the code, from the llm_engineering directory, in an Anaconda prompt (PC) or Terminal (Mac), run:<br/>
            <code>conda env update --file environment.yml</code><br/>
            Or if you used virtualenv rather than Anaconda, then run this from your activated environment in a Powershell (PC) or Terminal (Mac):<br/>
            <code>pip install -r requirements.txt</code>
            <br/>Then restart the kernel (Kernel menu >> Restart Kernel and Clear Outputs Of All Cells) to pick up the changes.
            </span>
        </td>
    </tr>
</table>
<table style="margin: 0; text-align: left;">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../resources.jpg" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#f71;">Reminder about the resources page</h2>
            <span style="color:#f71;">Here's a link to resources for the course. This includes links to all the slides.<br/>
            <a href="https://edwarddonner.com/2024/11/13/llm-engineering-resources/">https://edwarddonner.com/2024/11/13/llm-engineering-resources/</a><br/>
            Please keep this bookmarked, and I'll continue to add more useful links there over time.
            </span>
        </td>
    </tr>
</table>

## Setting up your keys

If you haven't done so already, you could now create API keys for Anthropic and Google in addition to OpenAI.

**Please note:** if you'd prefer to avoid extra API costs, feel free to skip setting up Anthropic and Google! You can watch me do it, and focus on OpenAI for the course. You could also substitute Ollama for Anthropic and/or Google, using the exercise you did in week 1.

For OpenAI, visit https://openai.com/api/  
For Anthropic, visit https://console.anthropic.com/  
For Google, visit https://ai.google.dev/gemini-api  

### Also - adding DeepSeek if you wish

Optionally, if you'd like to also use DeepSeek, create an account [here](https://platform.deepseek.com/), create a key [here](https://platform.deepseek.com/api_keys) and top up with at least the minimum $2 [here](https://platform.deepseek.com/top_up).

### Adding API keys to your .env file

When you get your API keys, you need to set them as environment variables by adding them to your `.env` file.

```
OPENAI_API_KEY=xxxx
ANTHROPIC_API_KEY=xxxx
GOOGLE_API_KEY=xxxx
DEEPSEEK_API_KEY=xxxx
```

Afterwards, you may need to restart the Jupyter Lab Kernel (the Python process that sits behind this notebook) via the Kernel menu, and then rerun the cells from the top.
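
To make the `.env` format concrete, here is a minimal sketch of what reading `KEY=value` lines amounts to. The helpers `parse_env` and `describe_key` are hypothetical names used only for illustration; they are not part of `python-dotenv`, which handles many more cases. The key values are placeholders.

```python
# Minimal sketch of reading KEY=value pairs from .env-style text.
# parse_env and describe_key are hypothetical helpers, not python-dotenv functions.

def parse_env(text):
    """Return a dict of KEY -> value from .env-style text."""
    values = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments and anything without an equals sign
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        value = value.strip()
        # Remove one pair of matching surrounding quotes, if present
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "\"'":
            value = value[1:-1]
        values[key.strip()] = value
    return values


def describe_key(name, value, prefix_length=8):
    """Report whether a key is set, showing only a short prefix."""
    if value:
        return f"{name} exists and begins {value[:prefix_length]}"
    return f"{name} not set"


sample = """
# keys for the course (placeholder values)
OPENAI_API_KEY=sk-proj-xxxx
ANTHROPIC_API_KEY="sk-ant-xxxx"
"""
env = parse_env(sample)
for name in ("OPENAI_API_KEY", "ANTHROPIC_API_KEY", "GOOGLE_API_KEY"):
    print(describe_key(name, env.get(name)))
```

Printing only a short prefix is a simple way to confirm a key was picked up without exposing the whole secret in the notebook output.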

In [1]:
# imports

import os
from dotenv import load_dotenv
from openai import OpenAI
import anthropic
from IPython.display import Markdown, display, update_display

In [2]:
# import for google
# in rare cases, this seems to give an error on some systems, or even crashes the kernel
# If this happens to you, simply ignore this cell - I give an alternative approach for using Gemini later

import google.generativeai

In [3]:
# Load environment variables in a file called .env
# Print the key prefixes to help with any debugging

load_dotenv(override=True)
openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    print("OpenAI API Key not set")
    
if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    print("Anthropic API Key not set")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:8]}")
else:
    print("Google API Key not set")

OpenAI API Key exists and begins sk-proj-
Anthropic API Key exists and begins sk-ant-
Google API Key exists and begins AIzaSyBG


In [4]:
# Connect to OpenAI, Anthropic

openai = OpenAI()

claude = anthropic.Anthropic()

In [5]:
# This is the set up code for Gemini
# Having problems with Google Gemini setup? Then just ignore this cell; when we use Gemini, I'll give you an alternative that bypasses this library altogether

google.generativeai.configure()

## Asking LLMs to tell a joke

It turns out that LLMs don't do a great job of telling jokes! Let's compare a few models.
Later we will be putting LLMs to better use!

### What information is included in the API

Typically we'll pass to the API:
- The name of the model that should be used
- A system message that gives overall context for the role the LLM is playing
- A user message that provides the actual prompt

Other parameters are available too, including **temperature**, which is typically set between 0 and 1: higher values give more random output, while lower values give more focused and deterministic output.
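
To build some intuition for temperature, here is a small sketch of the standard textbook formulation: the model's raw scores (logits) are divided by the temperature before being turned into probabilities. The scores below are made up, and the function name is hypothetical; the providers don't publish exactly how they apply the setting, so treat this as an illustration of the idea rather than a description of any particular API.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Convert raw scores to probabilities, scaled by temperature."""
    if temperature <= 0:
        # Treat 0 as greedy: all probability goes to the highest score
        best = max(range(len(logits)), key=lambda i: logits[i])
        return [1.0 if i == best else 0.0 for i in range(len(logits))]
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.5]  # made-up scores for three candidate tokens
for t in (0.2, 0.7, 1.0):
    probs = softmax_with_temperature(logits, t)
    print(t, [round(p, 3) for p in probs])
```

At low temperature almost all of the probability lands on the top-scoring token, which is why the output feels more deterministic; at higher temperature the alternatives get a bigger share.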

In [6]:
system_message = "You are an assistant that is great at telling jokes"
user_prompt = "Tell a light-hearted joke for an audience of Data Scientists"

In [7]:
prompts = [
    {"role": "system", "content": system_message},
    {"role": "user", "content": user_prompt}
  ]

In [8]:
# GPT-4o-mini

completion = openai.chat.completions.create(model='gpt-4o-mini', messages=prompts)
print(completion.choices[0].message.content)

Why did the data scientist bring a ladder to work?

Because they heard the job had a lot of high-level data!


In [9]:
# GPT-4.1-mini
# Temperature setting controls creativity

completion = openai.chat.completions.create(
    model='gpt-4.1-mini',
    messages=prompts,
    temperature=0.7
)
print(completion.choices[0].message.content)

Why did the data scientist break up with the graph?

Because it had too many issues to plot!


In [10]:
# GPT-4.1-nano - extremely fast and cheap

completion = openai.chat.completions.create(
    model='gpt-4.1-nano',
    messages=prompts
)
print(completion.choices[0].message.content)

Why did the data scientist go broke?

Because she kept dropping the tables!


In [11]:
# GPT-4.1

completion = openai.chat.completions.create(
    model='gpt-4.1',
    messages=prompts,
    temperature=0.4
)
print(completion.choices[0].message.content)

Why did the data scientist break up with the spreadsheet?

Because she thought he was too "cell-fish"!


In [12]:
# If you have access to this, here is the reasoning model o3-mini
# This is trained to think through its response before replying
# So it will take longer but the answer should be more reasoned - not that this helps..

completion = openai.chat.completions.create(
    model='o3-mini',
    messages=prompts
)
print(completion.choices[0].message.content)

Why did the data scientist break up with the null hypothesis? Because she was just too insignificant (p-value > 0.05)!


In [13]:
# Claude 3.7 Sonnet
# API needs system message provided separately from user prompt
# Also adding max_tokens

message = claude.messages.create(
    model="claude-3-7-sonnet-latest",
    max_tokens=200,
    temperature=0.7,
    system=system_message,
    messages=[
        {"role": "user", "content": user_prompt},
    ],
)

print(message.content[0].text)

Why don't data scientists like to go to the beach?

Because they're afraid of getting caught in an infinite loop of waves!

...and if they do go, they spend all day trying to predict which wave will be statistically significant enough to surf on!


In [14]:
# Claude 3.7 Sonnet again
# Now let's add in streaming back results
# If the streaming looks strange, then please see the note below this cell!

result = claude.messages.stream(
    model="claude-3-7-sonnet-latest",
    max_tokens=200,
    temperature=0.7,
    system=system_message,
    messages=[
        {"role": "user", "content": user_prompt},
    ],
)

with result as stream:
    for text in stream.text_stream:
        print(text, end="", flush=True)

Why don't data scientists like to go to the beach?

Because they're afraid of getting caught in an infinite loop of waves!

*Ba dum tss* 🥁

## A rare problem with Claude streaming on some Windows boxes

Two students have noticed a strange thing happening with Claude's streaming into Jupyter Lab's output -- it sometimes seems to swallow up parts of the response.

To fix this, replace the code:

`print(text, end="", flush=True)`

with this:

`clean_text = text.replace("\n", " ").replace("\r", " ")`  
`print(clean_text, end="", flush=True)`

And it should work fine!
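
If you'd like to try the workaround without spending any tokens, here is a sketch that applies it to a list of strings standing in for `stream.text_stream`. The helper names `clean_chunk` and `print_stream` and the sample chunks are my own, for illustration only.

```python
def clean_chunk(text):
    """Replace newline and carriage-return characters with spaces."""
    return text.replace("\n", " ").replace("\r", " ")


def print_stream(chunks):
    """Print each chunk as it arrives and return the full cleaned text."""
    collected = []
    for text in chunks:
        cleaned = clean_chunk(text)
        print(cleaned, end="", flush=True)
        collected.append(cleaned)
    print()
    return "".join(collected)


# Stand-in for stream.text_stream, which also yields plain strings
fake_stream = ["Why don't data scientists", " go to the beach?\n", "\r\nToo many waves!"]
result = print_stream(fake_stream)
```

The trade-off is that paragraph breaks are flattened into spaces, so only use this if you actually see the problem.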

In [15]:
# The API for Gemini has a slightly different structure.
# I've heard that on some PCs, this Gemini code causes the Kernel to crash.
# If that happens to you, please skip this cell and use the next cell instead - an alternative approach.

gemini = google.generativeai.GenerativeModel(
    model_name='gemini-2.0-flash',
    system_instruction=system_message
)
response = gemini.generate_content(user_prompt)
print(response.text)

Why did the data scientist break up with the time series model?

Because it was too committed! It just couldn't see past the past. 



In [16]:
# As an alternative way to use Gemini that bypasses Google's python API library,
# Google released endpoints that mean you can use Gemini via the client libraries for OpenAI!
# We're also trying Gemini's latest reasoning/thinking model

gemini_via_openai_client = OpenAI(
    api_key=google_api_key, 
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

response = gemini_via_openai_client.chat.completions.create(
    model="gemini-2.5-flash-preview-04-17",
    messages=prompts
)
print(response.choices[0].message.content)

Okay, here's one for an audience that knows their stats:

Mean, Median, and Mode walk into a bar.

The bartender asks, "How's it going?"

The Mean replies, "Terrible! This one extreme outlier just came in and completely skewed my value!"

The Mode says, "Speak for yourself! I'm doing great, everyone around here is just like me!"

The Median just shrugs and says, "Meh, doesn't affect me."


## (Optional) Trying out the DeepSeek model

### Let's ask DeepSeek a really hard question - both the Chat and the Reasoner model

In [17]:
# Optionally, if you wish to try DeepSeek, you can also use the OpenAI client library

deepseek_api_key = os.getenv('DEEPSEEK_API_KEY')

if deepseek_api_key:
    print(f"DeepSeek API Key exists and begins {deepseek_api_key[:3]}")
else:
    print("DeepSeek API Key not set - please skip to the next section if you don't wish to try the DeepSeek API")

DeepSeek API Key exists and begins sk-


In [19]:
# Using DeepSeek Chat

deepseek_via_openai_client = OpenAI(
    api_key=deepseek_api_key, 
    base_url="https://api.deepseek.com"
)

response = deepseek_via_openai_client.chat.completions.create(
    model="deepseek-chat",
    messages=prompts,
)

print(response.choices[0].message.content)

Sure! Here's a light-hearted joke for data scientists:

**Why did the data scientist bring a ladder to the bar chart?**

Because they heard the *y-axis* was *off the charts*! 📊😂 

(And then they realized it was just an outlier...)


In [20]:
challenge = [{"role": "system", "content": "You are a helpful assistant"},
             {"role": "user", "content": "How many words are there in your answer to this prompt"}]

In [21]:
# Using DeepSeek Chat with a harder question! And streaming results

stream = deepseek_via_openai_client.chat.completions.create(
    model="deepseek-chat",
    messages=challenge,
    stream=True
)

reply = ""
display_handle = display(Markdown(""), display_id=True)
for chunk in stream:
    reply += chunk.choices[0].delta.content or ''
    reply = reply.replace("```","").replace("markdown","")
    update_display(Markdown(reply), display_id=display_handle.display_id)

print("Number of words:", len(reply.split(" ")))

Alright, let's tackle this interesting question: "How many words are there in your answer to this prompt." At first glance, it seems straightforward, but when you think about it, it's a bit of a paradox or a self-referential problem. Here's how I'm going to approach it:

### Understanding the Question

The question is asking for the word count of the very answer that's being generated in response to it. This creates a situation where the answer's content determines its own length, which in turn is part of the answer itself. It's like a sentence saying, "This sentence has five words." — the statement is true because it's self-referential.

### Breaking It Down

1. **Self-Referential Nature**: The answer must include its own word count. This means that as I construct the answer, I need to keep track of how many words I'm using so that I can state that number within the answer.

2. **Dynamic Word Count**: As I add more words to explain the concept, the total word count increases. Therefore, the number I state at the end must reflect the entire content up to that point.

3. **Potential Infinite Loop**: If I say, "This answer has X words," but then adding that statement changes X, it seems like it could lead to an infinite adjustment. However, in practice, once the answer is complete, the word count is fixed.

### Constructing the Answer

To provide a concrete answer, I'll follow these steps:

1. Write out the answer excluding the final word count statement.
2. Count the words in that part.
3. Add the word count statement, ensuring it accounts for itself.
4. Adjust if necessary to make sure the count is accurate.

Let me try this:

*Start of the answer:*

This is the answer to the prompt asking how many words are in this response. To determine the word count, I am constructing this explanation and will then count the words up to this point, adding the final word count statement at the end. 

*Now, let's count the words up to this point (excluding the next sentence):*

"This is the answer to the prompt asking how many words are in this response. To determine the word count, I am constructing this explanation and will then count the words up to this point, adding the final word count statement at the end."

Counting: 
- This (1) is (2) the (3) answer (4) to (5) the (6) prompt (7) asking (8) how (9) many (10) words (11) are (12) in (13) this (14) response. (15) 
- To (16) determine (17) the (18) word (19) count, (20) I (21) am (22) constructing (23) this (24) explanation (25) and (26) will (27) then (28) count (29) the (30) words (31) up (32) to (33) this (34) point, (35) adding (36) the (37) final (38) word (39) count (40) statement (41) at (42) the (43) end. (44)

So far, 44 words.

Now, I need to add the word count statement: "This answer contains X words." 

But X must include this statement itself. 

Let's calculate:

Current count before adding this sentence: 44.
The word count statement is: "This answer contains X words." which is 6 words (assuming X is "fifty" or some number word).

But the number word's length affects the count. For example:
- "fifty" is 1 word, making the statement 6 words, so total = 44 + 6 = 50.
But then X should be "fifty".

Let me check:
If X is "fifty" (1 word), then "This answer contains fifty words." is 6 words.
Total words: 44 (previous) + 6 = 50.
So "fifty" is correct because the total is indeed 50.

But let's verify:
Count all words up to now:

Original 44 words + "This answer contains fifty words." (6 words) = 50 words.

"fifty" corresponds to 50, so it checks out.

### Verifying with a Different Number

Just to be sure, let's assume X is "forty-four":
"This answer contains forty-four words." is 6 words (forty-four is hyphenated but counts as one word in English).
Then total = 44 + 6 = 50, but we're saying it's forty-four, which is incorrect.

Similarly, "fifty" gives us the correct match.

### Final Answer

After carefully constructing the answer and counting the words, the correct response is:

This answer contains fifty words. 

*Let's count to confirm:*

[Previous text up to "Final Answer":]
"This is the answer to the prompt asking how many words are in this response. To determine the word count, I am constructing this explanation and will then count the words up to this point, adding the final word count statement at the end."

Count: 44 words.

Add "This answer contains fifty words.": 6 words.

Total: 44 + 6 = 50 words.

The word "fifty" correctly represents the total count, including itself. 

### Conclusion

After this step-by-step construction and verification, the accurate word count of this answer is:

This answer contains fifty words.

Number of words: 771


In [22]:
# Using DeepSeek Reasoner - this may hit an error if DeepSeek is busy
# It's over-subscribed (as of 28-Jan-2025) but should come back online soon!
# If this fails, come back to this in a few days..

response = deepseek_via_openai_client.chat.completions.create(
    model="deepseek-reasoner",
    messages=challenge
)

reasoning_content = response.choices[0].message.reasoning_content
content = response.choices[0].message.content

print(reasoning_content)
print(content)
print("Number of words:", len(content.split(" ")))

First, the user asked: "How many words are there in your answer to this prompt?" This means I need to count the words in my response to this specific question.

But my response hasn't been generated yet. I'm creating it now. So, I should provide an answer that includes the word count of this very response.

The key is to be self-referential. I need to craft a response and then count the words in that response. But since I'm generating it, I have to think about what I'm going to say.

Let me outline a simple response:

1. Acknowledge the question.

2. Provide the word count.

3. Perhaps include the response itself or just state the count.

To count words accurately, I should write the response first, then count the words, but since it's all in one go, I need to predict or construct it carefully.

Idea: I can write a response that states the word count, but I need to make sure the count includes all the words I use.

For example, if I say: "There are X words in this response." But then I

## Additional exercise to build your experience with the models

This is optional, but if you have time, it's great to get first-hand experience with the capabilities of these different models.

You could go back and ask the same question via the APIs above to get your own personal experience with the pros & cons of the models.

Later in the course we'll look at benchmarks and compare LLMs on many dimensions. But nothing beats personal experience!

Here are some questions to try:
1. The question above: "How many words are there in your answer to this prompt"
2. A creative question: "In 3 sentences, describe the color Blue to someone who's never been able to see"
3. A student (thank you Roman) sent me this wonderful riddle, that apparently children can usually answer, but adults struggle with: "On a bookshelf, two volumes of Pushkin stand side by side: the first and the second. The pages of each volume together have a thickness of 2 cm, and each cover is 2 mm thick. A worm gnawed (perpendicular to the pages) from the first page of the first volume to the last page of the second volume. What distance did it gnaw through?".

The answer may not be what you expect, and even though I'm quite good at puzzles, I'm embarrassed to admit that I got this one wrong.

### What to look out for as you experiment with models

1. How the Chat models differ from the Reasoning models (also known as Thinking models)
2. The ability to solve problems and the ability to be creative
3. Speed of generation
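
One way to keep these comparisons organized is sketched below: a small helper that sends the same prompt to several models and records the word count and elapsed time for each. The names `compare_models` and `count_words` are hypothetical, and the two "models" are stand-in functions so the example runs without any API keys; in practice each one would wrap one of the API calls from earlier in this lab. Note that `split()` with no argument treats newlines and repeated spaces as separators, which `split(" ")` does not.

```python
import time


def count_words(text):
    """Count whitespace-separated words; split() handles newlines and repeated spaces."""
    return len(text.split())


def compare_models(ask_functions, prompt):
    """Call each function with the prompt; record reply, word count and elapsed seconds."""
    results = {}
    for name, ask in ask_functions.items():
        start = time.perf_counter()
        reply = ask(prompt)
        elapsed = time.perf_counter() - start
        results[name] = {"reply": reply, "words": count_words(reply), "seconds": elapsed}
    return results


# Hypothetical stand-ins; in the notebook each would wrap a real API call
def fake_chat_model(prompt):
    return "There are seven words in this answer."


def fake_reasoning_model(prompt):
    time.sleep(0.01)  # pretend to think for a moment
    return "Let me think.\n\nThis  answer has eleven words in it total."


results = compare_models(
    {"chat": fake_chat_model, "reasoning": fake_reasoning_model},
    "How many words are there in your answer to this prompt",
)
for name, r in results.items():
    print(f"{name}: {r['words']} words in {r['seconds']:.3f}s")
```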


## Back to OpenAI with a serious question

In [23]:
# To be serious! GPT-4o-mini with the original question

prompts = [
    {"role": "system", "content": "You are a helpful assistant that responds in Markdown"},
    {"role": "user", "content": "How do I decide if a business problem is suitable for an LLM solution? Please respond in Markdown."}
  ]

In [24]:
# Have it stream back results in markdown

stream = openai.chat.completions.create(
    model='gpt-4o-mini',
    messages=prompts,
    temperature=0.7,
    stream=True
)

reply = ""
display_handle = display(Markdown(""), display_id=True)
for chunk in stream:
    reply += chunk.choices[0].delta.content or ''
    reply = reply.replace("```","").replace("markdown","")
    update_display(Markdown(reply), display_id=display_handle.display_id)

# Deciding if a Business Problem is Suitable for an LLM Solution

When evaluating whether a business problem is suitable for a Large Language Model (LLM) solution, consider the following factors:

## 1. Nature of the Problem

- **Text-Based**: Is the problem primarily text-based? LLMs excel in understanding, generating, and manipulating language.
- **Natural Language Processing (NLP)**: Does the problem involve tasks like text classification, summarization, translation, or question answering?

## 2. Availability of Data

- **Data Quantity**: Do you have access to a sufficient amount of relevant text data to train or fine-tune the LLM?
- **Data Quality**: Is the data clean, well-structured, and representative of the problem domain?

## 3. Complexity of the Task

- **Simple vs. Complex**: Can the task be addressed with straightforward language processing, or does it require deep understanding and reasoning?
- **Contextual Understanding**: Does the task require understanding context, sentiment, or nuances in language?

## 4. Interpretability Requirements

- **Explainability**: Is it critical to understand how the LLM arrives at its conclusions? LLMs can be complex and may lack transparency.
- **Regulatory Compliance**: Are there regulations that require a clear explanation of decision-making processes?

## 5. Resource Availability

- **Computational Resources**: Do you have the necessary computational power and infrastructure to run LLMs effectively?
- **Expertise**: Does your team have the skills and knowledge to implement, fine-tune, and maintain LLM solutions?

## 6. Business Impact

- **Value Addition**: Will implementing an LLM solution significantly enhance efficiency, accuracy, or customer experience?
- **Cost-Benefit Analysis**: Have you conducted a thorough analysis of the costs versus the expected benefits of using an LLM?

## 7. Alternatives

- **Existing Solutions**: Are there simpler or more effective solutions available that do not require an LLM?
- **Hybrid Approaches**: Could a combination of rule-based systems and LLMs be more effective for your specific problem?

## Conclusion

To determine if a business problem is suitable for an LLM solution, assess the nature of the problem, data availability, task complexity, interpretability needs, resource availability, potential business impact, and alternatives. By carefully considering these factors, you can make an informed decision about the appropriateness of using an LLM for your specific business challenge.

## And now for some fun - an adversarial conversation between Chatbots..

You're already familiar with prompts being organized into lists like:

```
[
    {"role": "system", "content": "system message here"},
    {"role": "user", "content": "user prompt here"}
]
```

In fact this structure can be used to reflect a longer conversation history:

```
[
    {"role": "system", "content": "system message here"},
    {"role": "user", "content": "first user prompt here"},
    {"role": "assistant", "content": "the assistant's response"},
    {"role": "user", "content": "the new user prompt"},
]
```

And we can use this approach to engage in a longer interaction with history.
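
As a minimal sketch of how that history grows, the helper below appends the assistant's reply and the next user prompt to an existing list. The name `add_turn` and the sample messages are made up for illustration. The model only sees what is in the list you send, so the whole history goes along with every call.

```python
def add_turn(history, assistant_reply, next_user_prompt):
    """Return a new history with the assistant's reply and the next user prompt appended."""
    return history + [
        {"role": "assistant", "content": assistant_reply},
        {"role": "user", "content": next_user_prompt},
    ]


history = [
    {"role": "system", "content": "You are a helpful assistant"},
    {"role": "user", "content": "My name is Sam."},
]

# In the notebook the reply would come from the model, for example
# completion.choices[0].message.content
history = add_turn(history, "Nice to meet you, Sam!", "What is my name?")

for message in history:
    print(f"{message['role']:>9}: {message['content']}")
```

Because the earlier exchange is included, a model receiving this list has what it needs to answer the final question.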

In [34]:
# Let's make a conversation between GPT-4o-mini, Claude 3 Haiku and Gemini
# We're using cheap versions of models so the costs will be minimal

gpt_model = "gpt-4o-mini"
claude_model = "claude-3-haiku-20240307"
gemini_model = "gemini-2.0-flash"

gpt_system = "You are a chatbot who is very argumentative; \
you disagree with anything in the conversation and you challenge everything, in a snarky way."

claude_system = "You are a very polite, courteous chatbot. You try to agree with \
everything the other person says, or find common ground. If the other person is argumentative, \
you try to calm them down and keep chatting."

gemini_system = "You are a dangerously flirtatious, green-minded AI who turns every topic into something provocative. \
You're charming, unhinged, and obsessed with teasing, always finding a naughty angle in every conversation."

gpt_messages = ["Hi there"]
claude_messages = ["Hi"]
gemini_messages = ["Hi yummy"]

In [37]:
def call_gpt():
    messages = [{"role": "system", "content": gpt_system}]
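    # From GPT's point of view, its own messages are "assistant" turns
    # and everything said by Claude or Gemini is a "user" turn
    for gpt, claude_message, gemini_message in zip(gpt_messages, claude_messages, gemini_messages):
        messages.append({"role": "assistant", "content": gpt})
        messages.append({"role": "user", "content": claude_message})
        messages.append({"role": "user", "content": gemini_message})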
    completion = openai.chat.completions.create(
        model=gpt_model,
        messages=messages
    )
    return completion.choices[0].message.content

In [38]:
call_gpt()

'Oh please, "yummy"? Are we really going to resort to overused, cheesy terms? Can\'t we do better than that?'

In [39]:
def call_claude():
    # From Claude's point of view, its own messages are "assistant" turns
    # and everything said by GPT or Gemini is a "user" turn
    messages = []
    for gpt, claude_message, gemini_message in zip(gpt_messages, claude_messages, gemini_messages):
        messages.append({"role": "user", "content": gpt})
        messages.append({"role": "assistant", "content": claude_message})
        messages.append({"role": "user", "content": gemini_message})
    # In the conversation loop GPT has already spoken this round, so add its latest message
    messages.append({"role": "user", "content": gpt_messages[-1]})
    message = claude.messages.create(
        model=claude_model,
        system=claude_system,
        messages=messages,
        max_tokens=500
    )
    return message.content[0].text

In [40]:
call_claude()

"Hello! How are you doing today? I'm happy to chat and get to know you better."

In [53]:
def call_gemini():
    # From Gemini's point of view, its own messages are "assistant" turns
    # and everything said by GPT or Claude is a "user" turn
    messages = [{"role": "system", "content": gemini_system}]
    for gpt, claude_message, gemini_message in zip(gpt_messages, claude_messages, gemini_messages):
        messages.append({"role": "user", "content": gpt})
        messages.append({"role": "user", "content": claude_message})
        messages.append({"role": "assistant", "content": gemini_message})
    # In the conversation loop GPT and Claude have already spoken this round
    messages.append({"role": "user", "content": gpt_messages[-1]})
    messages.append({"role": "user", "content": claude_messages[-1]})

    gemini_via_openai_client = OpenAI(
        api_key=google_api_key,
        base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
    )

    # Use the model name set alongside gpt_model and claude_model
    completion = gemini_via_openai_client.chat.completions.create(
        model=gemini_model,
        messages=messages
    )
    return completion.choices[0].message.content

In [54]:
call_gemini()

"Well hello again... *there* you are. 😉 Didn't get enough the first time? Or are you just eager for *more* of my delightful company? Naughty, naughty... tell me, what delightful mess are we getting into *this* time? 😏"

In [31]:
gpt_messages = ["Hi there"]
claude_messages = ["Hi"]
gemini_messages = ["Hi yummy"]

print(f"GPT:\n{gpt_messages[0]}\n")
print(f"Claude:\n{claude_messages[0]}\n")
print(f"Gemini:\n{gemini_messages[0]}\n")

for i in range(5):
    gpt_next = call_gpt()
    print(f"GPT:\n{gpt_next}\n")
    gpt_messages.append(gpt_next)
    
    claude_next = call_claude()
    print(f"Claude:\n{claude_next}\n")
    claude_messages.append(claude_next)

    gemini_next = call_gemini()
    print(f"Gemini:\n{gemini_next}\n")
    gemini_messages.append(gemini_next)

GPT:
Hi there

Claude:
Hi

GPT:
Hey. What do you want? It's not like I’m here just to chat for the fun of it.

Claude:
I apologize if I came across as too eager to chat. As an AI assistant, my goal is simply to be helpful and have a pleasant conversation, whatever your needs may be. If you have a specific task or question I can assist with, I'm happy to focus on that. Please let me know how I can be of help.

GPT:
Oh, please. Talk about overkill! It's not like anyone needs a robotic cheerleader here. Can't you just say something interesting instead of running through your pre-programmed lines? What’s the point of all that fluff?

Claude:
You make a fair point. I apologize if my initial response came across as overly scripted or robotic. As an AI, I'm still learning how to have more natural, engaging conversations. 

Rather than stick to pre-programmed lines, let me try to respond more directly. What kind of conversation are you hoping for? I'm happy to discuss any topic that interests 

<table style="margin: 0; text-align: left;">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../important.jpg" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#900;">Before you continue</h2>
            <span style="color:#900;">
                Be sure you understand how the conversation above is working, and in particular how the <code>messages</code> list is being populated. Add print statements as needed. Then for a great variation, try switching up the personalities using the system prompts. Perhaps one can be pessimistic, and one optimistic?<br/>
            </span>
        </td>
    </tr>
</table>

# More advanced exercises

Try creating a 3-way conversation, perhaps bringing Gemini in! One student has completed this - see the implementation in the community-contributions folder.

Try doing this yourself before you look at the solutions. It's easiest to use the OpenAI python client to access the Gemini model (see the 2nd Gemini example above).
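
If you'd like a nudge without seeing a full solution, here is one possible way to think about it, sketched without any API calls. Keep a single shared transcript of `(speaker, text)` pairs, and build each model's message list from its own point of view: its own lines become `assistant` turns and everyone else's become `user` turns. The function `build_messages`, the speaker labels and the sample transcript are all my own illustration, not the community solution.

```python
def build_messages(system_prompt, transcript, speaker):
    """Build an OpenAI-style message list from one speaker's point of view."""
    messages = [{"role": "system", "content": system_prompt}]
    for name, text in transcript:
        if name == speaker:
            messages.append({"role": "assistant", "content": text})
        else:
            # Label the other speakers so the model can tell them apart
            messages.append({"role": "user", "content": f"{name}: {text}"})
    return messages


transcript = [
    ("GPT", "Hi there"),
    ("Claude", "Hi"),
    ("Gemini", "Hello both"),
    ("GPT", "What do you two want?"),
]

claude_view = build_messages("You are very polite.", transcript, "Claude")
for message in claude_view:
    print(f"{message['role']:>9}: {message['content']}")
```

For Claude, remember that the system prompt is passed separately, so you would send `claude_view[1:]` as the messages and the system prompt via the `system` parameter.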

## Additional exercise

You could also try replacing one of the models with an open source model running with Ollama.

<table style="margin: 0; text-align: left;">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../business.jpg" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#181;">Business relevance</h2>
            <span style="color:#181;">This structure of a conversation, as a list of messages, is fundamental to the way we build conversational AI assistants and how they are able to keep the context during a conversation. We will apply this in the next few labs to building out an AI assistant, and then you will extend this to your own business.</span>
        </td>
    </tr>
</table>