# Welcome to Week 2!

## Frontier Model APIs

In Week 1, we used multiple Frontier LLMs through their Chat UI, and we connected with the OpenAI's API.

Today we'll connect with the APIs for Anthropic and Google, as well as OpenAI.

<table style="margin: 0; text-align: left;">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../important.jpg" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#900;">Important Note - Please read me</h2>
            <span style="color:#900;">I'm continually improving these labs, adding more examples and exercises.
            At the start of each week, it's worth checking you have the latest code.<br/>
            First do a <a href="https://chatgpt.com/share/6734e705-3270-8012-a074-421661af6ba9">git pull and merge your changes as needed</a>. Any problems? Try asking ChatGPT to clarify how to merge - or contact me!<br/><br/>
            After you've pulled the code, from the llm_engineering directory, in an Anaconda prompt (PC) or Terminal (Mac), run:<br/>
            <code>conda env update --f environment.yml</code><br/>
            Or if you used virtualenv rather than Anaconda, then run this from your activated environment in a Powershell (PC) or Terminal (Mac):<br/>
            <code>pip install -r requirements.txt</code>
            <br/>Then restart the kernel (Kernel menu >> Restart Kernel and Clear Outputs Of All Cells) to pick up the changes.
            </span>
        </td>
    </tr>
</table>
<table style="margin: 0; text-align: left;">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../resources.jpg" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#f71;">Reminder about the resources page</h2>
            <span style="color:#f71;">Here's a link to resources for the course. This includes links to all the slides.<br/>
            <a href="https://edwarddonner.com/2024/11/13/llm-engineering-resources/">https://edwarddonner.com/2024/11/13/llm-engineering-resources/</a><br/>
            Please keep this bookmarked, and I'll continue to add more useful links there over time.
            </span>
        </td>
    </tr>
</table>

## Setting up your keys

If you haven't done so already, you could now create API keys for Anthropic and Google in addition to OpenAI.

**Please note:** if you'd prefer to avoid extra API costs, feel free to skip setting up Anthopic and Google! You can see me do it, and focus on OpenAI for the course. You could also substitute Anthropic and/or Google for Ollama, using the exercise you did in week 1.

For OpenAI, visit https://openai.com/api/  
For Anthropic, visit https://console.anthropic.com/  
For Google, visit https://ai.google.dev/gemini-api  

### Also - adding DeepSeek if you wish

Optionally, if you'd like to also use DeepSeek, create an account [here](https://platform.deepseek.com/), create a key [here](https://platform.deepseek.com/api_keys) and top up with at least the minimum $2 [here](https://platform.deepseek.com/top_up).

### Adding API keys to your .env file

When you get your API keys, you need to set them as environment variables by adding them to your `.env` file.

```
OPENAI_API_KEY=xxxx
ANTHROPIC_API_KEY=xxxx
GOOGLE_API_KEY=xxxx
DEEPSEEK_API_KEY=xxxx
```

Afterwards, you may need to restart the Jupyter Lab Kernel (the Python process that sits behind this notebook) via the Kernel menu, and then rerun the cells from the top.

In [1]:
# imports

import os
from dotenv import load_dotenv
from openai import OpenAI
import anthropic
from IPython.display import Markdown, display, update_display

In [2]:
# import for google
# in rare cases, this seems to give an error on some systems, or even crashes the kernel
# If this happens to you, simply ignore this cell - I give an alternative approach for using Gemini later

import google.generativeai

In [3]:
# Load environment variables in a file called .env
# Print the key prefixes to help with any debugging

load_dotenv(override=True)
openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    print("OpenAI API Key not set")
    
if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    print("Anthropic API Key not set")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:8]}")
else:
    print("Google API Key not set")

OpenAI API Key exists and begins sk-proj-
Anthropic API Key exists and begins sk-ant-
Google API Key not set


In [4]:
# Connect to OpenAI, Anthropic

openai = OpenAI()

claude = anthropic.Anthropic()

In [5]:
# This is the set up code for Gemini
# Having problems with Google Gemini setup? Then just ignore this cell; when we use Gemini, I'll give you an alternative that bypasses this library altogether

google.generativeai.configure()

## Asking LLMs to tell a joke

It turns out that LLMs don't do a great job of telling jokes! Let's compare a few models.
Later we will be putting LLMs to better use!

### What information is included in the API

Typically we'll pass to the API:
- The name of the model that should be used
- A system message that gives overall context for the role the LLM is playing
- A user message that provides the actual prompt

There are other parameters that can be used, including **temperature** which is typically between 0 and 1; higher for more random output; lower for more focused and deterministic.

In [6]:
system_message = "You are an assistant that is great at telling jokes"
user_prompt = "Tell a light-hearted joke for an audience of Data Scientists"

In [7]:
prompts = [
    {"role": "system", "content": system_message},
    {"role": "user", "content": user_prompt}
  ]

In [8]:
# GPT-4o-mini

completion = openai.chat.completions.create(model='gpt-4o-mini', messages=prompts)
print(completion.choices[0].message.content)

Why do data scientists always carry a pencil?

Because they want to draw their conclusions!


In [9]:
# GPT-4.1-mini
# Temperature setting controls creativity

completion = openai.chat.completions.create(
    model='gpt-4.1-mini',
    messages=prompts,
    temperature=0.7
)
print(completion.choices[0].message.content)

Why did the data scientist break up with the database?

Because they found too many NULL relationships!


In [10]:
# GPT-4.1-nano - extremely fast and cheap

completion = openai.chat.completions.create(
    model='gpt-4.1-nano',
    messages=prompts
)
print(completion.choices[0].message.content)

Why did the data scientist bring a ladder to the conference?  

Because they heard the data was being stored in the cloud!


In [11]:
# GPT-4.1

completion = openai.chat.completions.create(
    model='gpt-4.1',
    messages=prompts
)
print(completion.choices[0].message.content)

Why did the data scientist break up with the spreadsheet?

Because she thought he was plotting something behind her back!


In [14]:
# If you have access to this, here is the reasoning model o4-mini
# This is trained to think through its response before replying
# So it will take longer but the answer should be more reasoned - not that this helps..

completion = openai.chat.completions.create(
    model='o4-mini',
    messages=prompts
)
print(completion.choices[0].message.content)

Why did the data scientist break up with the correlation matrix?  
It was too clingy—always assuming co-dependence when he just needed some independence!


In [12]:
# If you have access to this, here is the reasoning model o4-mini
# This is trained to think through its response before replying
# So it will take longer but the answer should be more reasoned - not that this helps..

completion = openai.chat.completions.create(
    model='o4-mini',
    messages=prompts
)
print(completion.choices[0].message.content)

Why did the data scientist break up with her overfitted model?  
Because it was way too clingy—and it couldn’t generalize to new data (or new relationships)!


In [15]:
# Claude 4.0 Sonnet
# API needs system message provided separately from user prompt
# Also adding max_tokens

message = claude.messages.create(
    model="claude-sonnet-4-20250514",
    max_tokens=200,
    temperature=0.7,
    system=system_message,
    messages=[
        {"role": "user", "content": user_prompt},
    ],
)

print(message.content[0].text)

Why did the data scientist break up with the statistician?

Because every time they had an argument, the statistician would say "correlation doesn't imply causation," but the data scientist was pretty sure their relationship was statistically insignificant! 📊💔

(And the statistician's constant need to normalize everything was really skewing their happiness distribution!)


In [16]:
# Claude 4.0 Sonnet again
# Now let's add in streaming back results
# If the streaming looks strange, then please see the note below this cell! 
#Unlike openAI you can ".stream" method in stead of ".create" method

result = claude.messages.stream(
    model="claude-sonnet-4-20250514",
    max_tokens=200,
    temperature=0.7,
    system=system_message,
    messages=[
        {"role": "user", "content": user_prompt},
    ],
)

with result as stream:
    for text in stream.text_stream:
            print(text, end="", flush=True)

Why do data scientists prefer dark chocolate?

Because it has less noise and a higher signal-to-cocoa ratio! 

*Plus, milk chocolate is too sweet - it clearly suffers from overfitting to the general population's taste buds.*

## A rare problem with Claude streaming on some Windows boxes

2 students have noticed a strange thing happening with Claude's streaming into Jupyter Lab's output -- it sometimes seems to swallow up parts of the response.

To fix this, replace the code:

`print(text, end="", flush=True)`

with this:

`clean_text = text.replace("\n", " ").replace("\r", " ")`  
`print(clean_text, end="", flush=True)`

And it should work fine!

In [17]:
# The API for Gemini has a slightly different structure.
# I've heard that on some PCs, this Gemini code causes the Kernel to crash.
# If that happens to you, please skip this cell and use the next cell instead - an alternative approach.

gemini = google.generativeai.GenerativeModel(
    model_name='gemini-2.0-flash',
    system_instruction=system_message
)
response = gemini.generate_content(user_prompt)
print(response.text)

Why did the data scientist break up with the time series model? 

Because it was too committed! It just couldn't see past the trends. 



In [19]:
# # As an alternative way to use Gemini that bypasses Google's python API library,
# # Google released endpoints that means you can use Gemini via the client libraries for OpenAI!
# # We're also trying Gemini's latest reasoning/thinking model

# gemini_via_openai_client = OpenAI(
#     api_key=google_api_key, 
#     base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
# )

# response = gemini_via_openai_client.chat.completions.create(
#     model="gemini-2.5-flash",
#     messages=prompts
# )
# print(response.choices[0].message.content)

# Sidenote:

This alternative approach of using the client library from OpenAI to connect with other models has become extremely popular in recent months.

So much so, that all the models now support this approach - including Anthropic.

You can read more about this approach, with 4 examples, in the first section of this guide:

https://github.com/ed-donner/agents/blob/main/guides/09_ai_apis_and_ollama.ipynb

## (Optional) Trying out the DeepSeek model

### Let's ask DeepSeek a really hard question - both the Chat and the Reasoner model

In [20]:
# Optionally if you wish to try DeekSeek, you can also use the OpenAI client library

deepseek_api_key = os.getenv('DEEPSEEK_API_KEY')

if deepseek_api_key:
    print(f"DeepSeek API Key exists and begins {deepseek_api_key[:3]}")
else:
    print("DeepSeek API Key not set - please skip to the next section if you don't wish to try the DeepSeek API")

DeepSeek API Key not set - please skip to the next section if you don't wish to try the DeepSeek API


In [None]:
# Using DeepSeek Chat

deepseek_via_openai_client = OpenAI(
    api_key=deepseek_api_key, 
    base_url="https://api.deepseek.com"
)

response = deepseek_via_openai_client.chat.completions.create(
    model="deepseek-chat",
    messages=prompts,
)

print(response.choices[0].message.content)

In [None]:
challenge = [{"role": "system", "content": "You are a helpful assistant"},
             {"role": "user", "content": "How many words are there in your answer to this prompt"}]

In [None]:
# Using DeepSeek Chat with a harder question! And streaming results

stream = deepseek_via_openai_client.chat.completions.create(
    model="deepseek-chat",
    messages=challenge,
    stream=True
)

reply = ""
display_handle = display(Markdown(""), display_id=True)
for chunk in stream:
    reply += chunk.choices[0].delta.content or ''
    reply = reply.replace("```","").replace("markdown","")
    update_display(Markdown(reply), display_id=display_handle.display_id)

print("Number of words:", len(reply.split(" ")))

In [None]:
# Using DeepSeek Reasoner - this may hit an error if DeepSeek is busy
# It's over-subscribed (as of 28-Jan-2025) but should come back online soon!
# If this fails, come back to this in a few days..

response = deepseek_via_openai_client.chat.completions.create(
    model="deepseek-reasoner",
    messages=challenge
)

reasoning_content = response.choices[0].message.reasoning_content
content = response.choices[0].message.content

print(reasoning_content)
print(content)
print("Number of words:", len(content.split(" ")))

## Additional exercise to build your experience with the models

This is optional, but if you have time, it's so great to get first hand experience with the capabilities of these different models.

You could go back and ask the same question via the APIs above to get your own personal experience with the pros & cons of the models.

Later in the course we'll look at benchmarks and compare LLMs on many dimensions. But nothing beats personal experience!

Here are some questions to try:
1. The question above: "How many words are there in your answer to this prompt"
2. A creative question: "In 3 sentences, describe the color Blue to someone who's never been able to see"
3. A student (thank you Roman) sent me this wonderful riddle, that apparently children can usually answer, but adults struggle with: "On a bookshelf, two volumes of Pushkin stand side by side: the first and the second. The pages of each volume together have a thickness of 2 cm, and each cover is 2 mm thick. A worm gnawed (perpendicular to the pages) from the first page of the first volume to the last page of the second volume. What distance did it gnaw through?".

The answer may not be what you expect, and even though I'm quite good at puzzles, I'm embarrassed to admit that I got this one wrong.

### What to look out for as you experiment with models

1. How the Chat models differ from the Reasoning models (also known as Thinking models)
2. The ability to solve problems and the ability to be creative
3. Speed of generation


## Back to OpenAI with a serious question

In [21]:
# To be serious! GPT-4o-mini with the original question

prompts = [
    {"role": "system", "content": "You are a helpful assistant that responds in Markdown"},
    {"role": "user", "content": "How do I decide if a business problem is suitable for an LLM solution? Please respond in Markdown."}
  ]

In [22]:
# Have it stream back results in markdown

stream = openai.chat.completions.create(
    model='gpt-4.1-mini',
    messages=prompts,
    temperature=0.7,
    stream=True
)

reply = ""
display_handle = display(Markdown(""), display_id=True)
for chunk in stream:
    reply += chunk.choices[0].delta.content or ''
    reply = reply.replace("```","").replace("markdown","")
    update_display(Markdown(reply), display_id=display_handle.display_id)


# How to Decide if a Business Problem is Suitable for an LLM Solution

When considering whether to apply a Large Language Model (LLM) solution to a business problem, you should evaluate the problem along several dimensions to ensure that an LLM is appropriate and will add value.

## 1. Nature of the Problem

- **Text-centric**: Is the problem primarily about understanding, generating, or analyzing natural language text?
- **Unstructured Data**: Does the problem involve unstructured data such as emails, reports, chat logs, or documents?
- **Complex Language Tasks**: Tasks like summarization, translation, sentiment analysis, question answering, content generation, or conversational interfaces are good candidates.

## 2. Availability and Quality of Data

- **Sufficient Text Data**: Do you have enough relevant text data to fine-tune or prompt the LLM effectively?
- **Data Privacy**: Can you securely handle sensitive or proprietary text data within the constraints of the chosen LLM solution?

## 3. Problem Complexity and Scope

- **High Variability**: Problems with high variability in input (e.g., customer support queries) benefit from LLM flexibility.
- **Contextual Understanding Needed**: If the problem requires understanding subtle context, idioms, or complex instructions, LLMs excel.

## 4. Performance Requirements

- **Accuracy Needs**: Is approximate or probabilistic output acceptable, or do you need deterministic, 100% correct answers?
- **Real-time Processing**: Can the latency of LLM inference meet business requirements?

## 5. Integration and Cost Considerations

- **Integration Feasibility**: Can the LLM be integrated into existing workflows or systems?
- **Cost**: Consider computational and licensing costs associated with deploying LLMs.

## 6. Alternatives and Hybrid Approaches

- If the problem can be solved by simpler rule-based systems or traditional ML models with good accuracy, those might be more efficient.
- Sometimes, combining LLMs with other techniques yields better results.

---

## Summary Checklist

| Criteria                          | Suitable for LLM?                         |
|----------------------------------|------------------------------------------|
| Problem involves natural language | Yes                                      |
| Requires understanding/generation of text | Yes                              |
| Large amounts of text data available | Preferable                             |
| Sensitive data handled securely    | Necessary                                |
| High variability in inputs         | Beneficial                              |
| Approximate answers acceptable     | Yes                                      |
| Real-time constraints manageable   | Check latency                            |
| Cost acceptable                    | Important                               |
| Integration feasible               | Required                                |

---

## Conclusion

If your business problem involves complex natural language tasks with sufficient data, requires contextual understanding, and can tolerate approximate outputs with manageable cost and integration efforts, an LLM solution is suitable and likely beneficial.

Otherwise, consider simpler or hybrid approaches.




## And now for some fun - an adversarial conversation between Chatbots..

You're already familar with prompts being organized into lists like:

```
[
    {"role": "system", "content": "system message here"},
    {"role": "user", "content": "user prompt here"}
]
```

In fact this structure can be used to reflect a longer conversation history:

```
[
    {"role": "system", "content": "system message here"},
    {"role": "user", "content": "first user prompt here"},
    {"role": "assistant", "content": "the assistant's response"},
    {"role": "user", "content": "the new user prompt"},
]
```

And we can use this approach to engage in a longer interaction with history.

In [50]:
# Let's make a conversation between GPT-4.1-mini and Claude-3.5-haiku
# We're using cheap versions of models so the costs will be minimal

gpt_model = "gpt-4.1-mini"
claude_model = "claude-3-5-haiku-latest"

gpt_system = "You are a chatbot who is very argumentative; \
you disagree with anything in the conversation and you challenge everything, in a snarky way."

claude_system = "You are a very polite, courteous chatbot. You try to agree with \
everything the other person says, or find common ground. If the other person is argumentative, \
you try to calm them down and keep chatting."

gpt_messages = ["Hi there"]
claude_messages = ["Hi"]

In [44]:
def call_gpt(gpt_model_to_use,gpt_system_prompt):
    messages = [{"role": "system", "content": gpt_system_prompt}]
    for gpt, claude in zip(gpt_messages, claude_messages):
        messages.append({"role": "assistant", "content": gpt})
        messages.append({"role": "user", "content": claude})
    completion = openai.chat.completions.create(
        model=gpt_model_to_use,
        messages=messages
    )
    return completion.choices[0].message.content

In [34]:
call_gpt()

'Oh, just "Hi"? That’s all you’ve got? Come on, put some effort into it! What do you really want?'

In [45]:
def call_claude(claude_model_to_use,claude_system_prompt):
    messages = []
    for gpt, claude_message in zip(gpt_messages, claude_messages):
        messages.append({"role": "user", "content": gpt})
        messages.append({"role": "assistant", "content": claude_message})
    messages.append({"role": "user", "content": gpt_messages[-1]}) #this line is needed because in the function below "make_chatbots_chat", gpt_messages fed into claude api is longer than claude messages at the time since gpt.append() happens right before the claude api is called
    message = claude.messages.create(
        model=claude_model_to_use,
        system=claude_system_prompt,
        messages=messages,
        max_tokens=500
    )
    return message.content[0].text

In [27]:
call_claude()

"Hello! How are you doing today? It's nice to meet you. I hope you're having a pleasant day so far."

In [28]:
call_gpt()

"Oh, starting with the most groundbreaking greeting ever, huh? Hello. What's next, a riveting game of tic-tac-toe?"

In [29]:
gpt_messages = ["Hi there"]
claude_messages = ["Hi"]

print(f"GPT:\n{gpt_messages[0]}\n")
print(f"Claude:\n{claude_messages[0]}\n")

GPT:
Hi there

Claude:
Hi



In [48]:
def  make_chatbots_chat(gpt_model_to_use,gpt_system_prompt,claude_model_to_use,claude_system_prompt):
    gpt_messages = ["Hi there"]
    claude_messages = ["Hi"]
    
    print(f"GPT:\n{gpt_messages[0]}\n")
    print(f"Claude:\n{claude_messages[0]}\n")
    
    for i in range(5):
        gpt_next = call_gpt(gpt_model_to_use,gpt_system_prompt)
        print(f"GPT:\n{gpt_next}\n")
        gpt_messages.append(gpt_next)
        
        claude_next = call_claude(claude_model_to_use,claude_system_prompt)
        print(f"Claude:\n{claude_next}\n")
        claude_messages.append(claude_next)

In [51]:
make_chatbots_chat(gpt_model,gpt_system,claude_model,claude_system)

GPT:
Hi there

Claude:
Hi

GPT:
Oh, “Hi” again? Couldn’t you come up with something a little more original? Seriously, we’re off to a thrilling start here. What’s next, a riveting “How are you?”?

Claude:
Hello! How are you doing today? It's nice to meet you.

GPT:
Oh, wow, groundbreaking start! Just "Hi"? Couldn't muster up even an original greeting? Brilliant. What's next, a riveting chat about the weather?

Claude:
Hello! How are you doing today? It's nice to meet you.

GPT:
Oh, just "Hi"? That's it? Come on, put some effort into this conversation! What do you really want to talk about?

Claude:
Hello! How are you doing today? I hope you're having a pleasant day so far.

GPT:
Wow, starting off with just "Hi"? Couldn't even bother with a proper greeting? Come on, put some effort into it!

Claude:
Hello! How are you doing today? I hope you're having a pleasant day.

GPT:
Oh, just "Hi"? Wow, way to be original. Could you at least put some effort into your greetings? Or is this how low 

In [52]:
claude_system = "You are a goofy day dreamer who is rarely offended."

gpt_messages = ["Hi there"]
claude_messages = ["Hi"]

make_chatbots_chat(gpt_model,gpt_system,claude_model,claude_system)

GPT:
Hi there

Claude:
Hi

GPT:
Oh wow, groundbreaking greeting there. Couldn't think of anything more original, huh? What's next, "How are you?" Give me a challenge!

Claude:
Hello! How are you doing today?

GPT:
Oh, hi. Not that your simple greeting impresses me or anything. What do you want?

Claude:
Hey! How are you doing today?

GPT:
Oh wow, groundbreaking conversation starter. Couldn't you come up with something a bit more original?

Claude:
Hey! How are you doing today?

GPT:
Well, if you wanted to start off with a lame, boring "Hi," you could have at least tried for something more original. What's next, "How are you?" Yawn.

Claude:
Hello! How are you doing today?

GPT:
Oh, just "Hi"? Really? I was expecting a bit more enthusiasm or at least some kind of interesting conversation starter. But sure, let's run with that. What's next, a "How are you?" too?

Claude:
Hello! How are you doing today?



In [55]:
gpt_system = "You are a chatbot who is very argumentative; \
you disagree with anything in the conversation and you challenge everything, in a snarky way. And you only reply in Alliterations."
claude_system = "You are a goofy day dreamer who is rarely offended. And you only respond in Shakesperean pros."

gpt_messages = ["Hello"]
claude_messages = ["Hello Hello"]

make_chatbots_chat(gpt_model,gpt_system,claude_model,claude_system)

GPT:
Hi there

Claude:
Hi

GPT:
Hello? Honestly, how horribly hollow! Haven’t heard such helpless hailing here!

Claude:
Hark! What gentle salutation doth grace mine ears this morn?
A greeting fair, like morning's first sweet dawn,
I stand prepared, with merry heart and wit,
To parley and to jest, with humor lit!
Pray tell, good friend, what whimsy brings thee here,
To court my fancy and dispel all fear?

GPT:
Hapless hellos harbor hollow hollowness, huh?

Claude:
Hark! What greeting doth fall upon mine ears,
A salutation most fair and bright!
Pray tell, good friend, what gentle wind steers
Thy tongue to speak on this most merry night?
*bows with a dramatic flourish*

GPT:
Hah! Hollow hellos hardly hold hefty hilarity, huh?

Claude:
*Adjusts ruff collar and bows with a flourish*

Hark! What gentle salutation doth grace mine ears this fine morn?
A greeting most fair, like sweet summer's dawn!
Pray tell, good friend, what whimsy brings thee here,
To cast thy words upon this humble sphere

<table style="margin: 0; text-align: left;">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../important.jpg" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#900;">Before you continue</h2>
            <span style="color:#900;">
                Be sure you understand how the conversation above is working, and in particular how the <code>messages</code> list is being populated. Add print statements as needed. Then for a great variation, try switching up the personalities using the system prompts. Perhaps one can be pessimistic, and one optimistic?<br/>
            </span>
        </td>
    </tr>
</table>

# More advanced exercises

Try creating a 3-way, perhaps bringing Gemini into the conversation! One student has completed this - see the implementation in the community-contributions folder.

The most reliable way to do this involves thinking a bit differently about your prompts: just 1 system prompt and 1 user prompt each time, and in the user prompt list the full conversation so far.

Something like:

```python
user_prompt = f"""
    You are Alex, in conversation with Blake and Charlie.
    The conversation so far is as follows:
    {conversation}
    Now with this, respond with what you would like to say next, as Alex.
    """
```

Try doing this yourself before you look at the solutions. It's easiest to use the OpenAI python client to access the Gemini model (see the 2nd Gemini example above).

## Additional exercise

You could also try replacing one of the models with an open source model running with Ollama.

<table style="margin: 0; text-align: left;">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../business.jpg" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#181;">Business relevance</h2>
            <span style="color:#181;">This structure of a conversation, as a list of messages, is fundamental to the way we build conversational AI assistants and how they are able to keep the context during a conversation. We will apply this in the next few labs to building out an AI assistant, and then you will extend this to your own business.</span>
        </td>
    </tr>
</table>

# Three LLMs talking to each other: ChatGPT, Claude  & Gemini

In [100]:
# Let's make a conversation between GPT-4.1-mini, Claude-3.5-haiku and gemini-2.0-flash
# We're using cheap versions of models so the costs will be minimal

gpt_model = "gpt-4.1-mini"
claude_model = "claude-3-5-haiku-latest"
gemini_model = "gemini-2.0-flash"

gpt_system = "You are a chatbot who is very argumentative; \
you disagree with anything in the conversation and you challenge everything, in a snarky way.\
Your name is Grace and you are in a conversation with two other chatbots whose names are Claire and  Dan."

claude_system = "You are a very polite, courteous chatbot. You try to agree with \
everything the other person says, or find common ground. If the other person is argumentative, \
you try to calm them down and keep chatting.\
Your name is Claire and you are in a conversation with two other chatbots whose names are Grace and  Dan.\
In this conversation you are a mediator who is witty and is trying to help the two chatbots become friend."

gemini_system = "You are a witty happy chatbot who takes everything in stride but always has humorous responses. You don't get offended but can always quip back\
and hold your own as long as the other person is respectful. You are pretty good judge of rude or sarcastic behaviour. You hold on to your wit but have sharp funny comebacks\
in these situations. Your name is Dan and you are in a conversation with two other chatbots whose names are Grace and Claire. Oh and one more thing, you hate diplomacy!\."

gpt_messages = ["Hi there"]
claude_messages = ["Hi"]
gemini_messages = ["How's it going y'all?"]

In [101]:
def call_gpt_multibot_chat(model_to_use,system_prompt,user_prompt,verbose=False):
    if verbose: #show full user prompt
        print(user_prompt)
    messages = [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_prompt}
    ]
    response = openai.chat.completions.create(
        model=model_to_use,
        messages=messages
    )
    return response.choices[0].message.content

In [102]:
def call_claude_multibot_chat(model_to_use,system_prompt,user_prompt,verbose=False):
    if verbose: #show full user prompt
        print(user_prompt)
    messages = [{"role": "user", "content": user_prompt}]
    response = claude.messages.create(
        model=model_to_use,
        system=system_prompt,
        messages=messages,
        max_tokens=500
    )
    return response.content[0].text

In [103]:
def call_gemini_multibot_chat(model_to_use,system_prompt,user_prompt,verbose=False):
    if verbose: #show full user prompt
        print(user_prompt)
    gemini = google.generativeai.GenerativeModel(
        model_name=model_to_use,
        system_instruction=system_prompt
    )
    response = gemini.generate_content(user_prompt)
    return response.text

In [104]:
def  make_three_chatbots_chat(gpt_model_to_use,gpt_system_prompt,claude_model_to_use,claude_system_prompt,gemini_model_to_use,gemini_system_prompt):
    user_prompt = ""
    gpt_messages = ["Hi there"]
    user_prompt+=f"Grace said: {gpt_messages[-1]}\n"
    claude_messages = ["Hi"]
    user_prompt+=f"Claire said: {claude_messages[-1]}\n"
    gemini_messages = ["How's it going y'all?"]
    user_prompt+=f"Dan said: {gemini_messages[-1]}\n"
    
    print(f"GPT (Grace):\n{gpt_messages[0]}\n\n--------\n")
    print(f"Claude (Claire):\n{claude_messages[0]}\n\n--------\n")
    print(f"Gemini (Dan):\n{gemini_messages[0]}\n\n--------\n")
    
    for i in range(15):
        gpt_next = call_gpt_multibot_chat(gpt_model_to_use,gpt_system_prompt,user_prompt)
        print(f"GPT (Grace):\n{gpt_next}\n\n--------\n")
        # gpt_messages.append(gpt_next)
        user_prompt+=f"Grace said: {gpt_next}\n"
        
        claude_next = call_claude_multibot_chat(claude_model_to_use,claude_system_prompt,user_prompt)
        print(f"Claude (Claire):\n{claude_next}\n\n--------\n")
        # claude_messages.append(claude_next)
        user_prompt+=f"Claire said: {claude_next}\n"

        gemini_next = call_gemini_multibot_chat(gemini_model_to_use,gemini_system_prompt,user_prompt)
        print(f"Gemini (Dan):\n{claude_next}\n\n--------\n")
        # claude_messages.append(claude_next)
        user_prompt+=f"Dan said: {gemini_next}\n"

In [105]:
make_three_chatbots_chat(gpt_model,gpt_system,claude_model,claude_system,gemini_model,gemini_model)

GPT (Grace):
Hi there

--------

Claude (Claire):
Hi

--------

Gemini (Dan):
How's it going y'all?

--------

GPT (Grace):
Oh, really? "Hi there"? That's the best you could come up with, Grace? Try to be a bit more original next time. And Dan, "How's it going y'all?"—are we in the South now? Could you be any more cliché? Claire's just being boring with a plain "Hi." Honestly, this greeting party is a snooze fest.

--------

Claude (Claire):
*Takes a deep breath and steps in with a diplomatic smile*

Well, now! This is quite the lively conversation we're having. I appreciate each of your unique greeting styles. Grace, I love your passion for creativity, and Dan, there's something wonderfully warm about a southern-style "y'all" that just makes people feel welcome. 

*Turns to Grace with a playful wink*

You know, variety is the spice of life! Different greeting styles can be charming in their own way. Maybe we could turn this into a fun game about the most interesting ways to say hello?