# Welcome to Week 2!

## Frontier Model APIs

In Week 1, we used multiple Frontier LLMs through their Chat UIs, and we connected to OpenAI's API.

Today we'll connect with the APIs for Anthropic and Google, as well as OpenAI.

<table style="margin: 0; text-align: left;">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../important.jpg" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#900;">Important Note - Please read me</h2>
            <span style="color:#900;">I'm continually improving these labs, adding more examples and exercises.
            At the start of each week, it's worth checking you have the latest code.<br/>
            First do a <a href="https://chatgpt.com/share/6734e705-3270-8012-a074-421661af6ba9">git pull and merge your changes as needed</a>. Any problems? Try asking ChatGPT to clarify how to merge - or contact me!<br/><br/>
            After you've pulled the code, from the llm_engineering directory, in an Anaconda prompt (PC) or Terminal (Mac), run:<br/>
            <code>conda env update -f environment.yml --prune</code><br/>
            Or if you used virtualenv rather than Anaconda, then run this from your activated environment in PowerShell (PC) or Terminal (Mac):<br/>
            <code>pip install -r requirements.txt</code>
            <br/>Then restart the kernel (Kernel menu >> Restart Kernel and Clear Outputs Of All Cells) to pick up the changes.
            </span>
        </td>
    </tr>
</table>
<table style="margin: 0; text-align: left;">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../resources.jpg" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#f71;">Reminder about the resources page</h2>
            <span style="color:#f71;">Here's a link to resources for the course. This includes links to all the slides.<br/>
            <a href="https://edwarddonner.com/2024/11/13/llm-engineering-resources/">https://edwarddonner.com/2024/11/13/llm-engineering-resources/</a><br/>
            Please keep this bookmarked, and I'll continue to add more useful links there over time.
            </span>
        </td>
    </tr>
</table>

## Setting up your keys

If you haven't done so already, you could now create API keys for Anthropic and Google in addition to OpenAI.

**Please note:** if you'd prefer to avoid extra API costs, feel free to skip setting up Anthropic and Google! You can watch me do it, and focus on OpenAI for the course. You could also substitute Ollama for Anthropic and/or Google, using the exercise you did in Week 1.

For OpenAI, visit https://openai.com/api/  
For Anthropic, visit https://console.anthropic.com/  
For Google, visit https://ai.google.dev/gemini-api  

When you get your API keys, you need to set them as environment variables by adding them to your `.env` file.

```
OPENAI_API_KEY=xxxx
ANTHROPIC_API_KEY=xxxx
GOOGLE_API_KEY=xxxx
```

Afterwards, you may need to restart the Jupyter Lab Kernel (the Python process that sits behind this notebook) via the Kernel menu, and then rerun the cells from the top.

In [1]:
# imports

import os
from dotenv import load_dotenv
from openai import OpenAI
import anthropic
from IPython.display import Markdown, display, update_display

In [3]:
# import for google
# in rare cases, this seems to give an error on some systems, or even crashes the kernel
# If this happens to you, simply ignore this cell - I give an alternative approach for using Gemini later
import google.generativeai

In [4]:
# Load environment variables from a file called .env
# Print the key prefixes to help with any debugging

load_dotenv()
openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    print("OpenAI API Key not set")
    
if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    print("Anthropic API Key not set")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:8]}")
else:
    print("Google API Key not set")

OpenAI API Key exists and begins sk-proj-
Anthropic API Key exists and begins sk-ant-
Google API Key exists and begins AIzaSyC1


In [5]:
# Connect to OpenAI, Anthropic

openai = OpenAI()

claude = anthropic.Anthropic()

In [6]:
# This is the setup code for Gemini
# Having problems with Google Gemini setup? Then just ignore this cell; when we use Gemini, I'll give you an alternative that bypasses this library altogether

google.generativeai.configure()

## Asking LLMs to tell a joke

It turns out that LLMs don't do a great job of telling jokes! Let's compare a few models.
Later we will be putting LLMs to better use!

### What information is included in the API

Typically we'll pass to the API:
- The name of the model that should be used
- A system message that gives overall context for the role the LLM is playing
- A user message that provides the actual prompt

There are other parameters that can be used, including **temperature**, which is typically set between 0 and 1: higher values give more random output, and lower values give more focused and deterministic output.
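To make those ingredients concrete, here is a minimal sketch that assembles the arguments for an OpenAI-style chat call without sending anything. `build_chat_request` is a hypothetical helper written for illustration; it is not part of any SDK, and the prompts are simply the ones used in this lab.

```python
def build_chat_request(model, system_message, user_prompt, temperature=None):
    """Assemble keyword arguments for an OpenAI-style chat completion call."""
    request = {
        "model": model,
        "messages": [
            {"role": "system", "content": system_message},
            {"role": "user", "content": user_prompt},
        ],
    }
    # Only include temperature when the caller sets it,
    # so the provider's default applies otherwise
    if temperature is not None:
        request["temperature"] = temperature
    return request


request = build_chat_request(
    "gpt-4o-mini",
    "You are an assistant that is great at telling jokes",
    "Tell a light-hearted joke for an audience of Data Scientists",
    temperature=0.7,
)
for key, value in request.items():
    print(key, "=", value)
```

A dict like this could then be unpacked into the client call, for example `openai.chat.completions.create(**request)`.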

In [7]:
system_message = "You are an assistant that is great at telling jokes"
user_prompt = "Tell a light-hearted joke for an audience of Data Scientists"

In [8]:
prompts = [
    {"role": "system", "content": system_message},
    {"role": "user", "content": user_prompt}
  ]

In [10]:
# GPT-3.5-Turbo

completion = openai.chat.completions.create(model='gpt-3.5-turbo', messages=prompts)
print(completion.choices[0].message.content)

Why do data scientists make great comedians?

Because they have a knack for finding the best "data" jokes!


In [11]:
# GPT-4o-mini
# Temperature setting controls creativity

completion = openai.chat.completions.create(
    model='gpt-4o-mini',
    messages=prompts,
    temperature=0.7
)
print(completion.choices[0].message.content)

Why did the data scientist bring a ladder to work?

Because they wanted to reach new heights in their analysis!


In [12]:
# GPT-4o

completion = openai.chat.completions.create(
    model='gpt-4o',
    messages=prompts,
    temperature=0.4
)
print(completion.choices[0].message.content)

Why did the data scientist bring a ladder to work?

Because they heard the cloud was high!


In [13]:
# Claude 3.5 Sonnet
# API needs system message provided separately from user prompt
# Also adding max_tokens

message = claude.messages.create(
    model="claude-3-5-sonnet-20240620",
    max_tokens=200,
    temperature=0.7,
    system=system_message,
    messages=[
        {"role": "user", "content": user_prompt},
    ],
)

print(message.content[0].text)

Sure, here's a light-hearted joke for data scientists:

Why do data scientists prefer dark mode?

Because light attracts bugs!

This joke plays on the dual meaning of "bugs" - both as insects attracted to light and as errors in code that data scientists often have to deal with. It's a fun, nerdy joke that most data scientists would appreciate!


In [14]:
# Claude 3.5 Sonnet again
# Now let's add in streaming back results

result = claude.messages.stream(
    model="claude-3-5-sonnet-20240620",
    max_tokens=200,
    temperature=0.7,
    system=system_message,
    messages=[
        {"role": "user", "content": user_prompt},
    ],
)

with result as stream:
    for text in stream.text_stream:
        print(text, end="", flush=True)

Sure, here's a light-hearted joke for data scientists:

Why did the data scientist break up with their significant other?

There was just too much variance in the relationship, and they couldn't find a way to normalize it!

In [15]:
# The API for Gemini has a slightly different structure.
# I've heard that on some PCs, this Gemini code causes the Kernel to crash.
# If that happens to you, please skip this cell and use the next cell instead - an alternative approach.

gemini = google.generativeai.GenerativeModel(
    model_name='gemini-1.5-flash',
    system_instruction=system_message
)
response = gemini.generate_content(user_prompt)
print(response.text)

Why was the data scientist sad?  

Because they didn't get any arrays!



In [10]:
# As an alternative way to use Gemini that bypasses Google's python API library,
# Google has recently released new endpoints that mean you can use Gemini via the client libraries for OpenAI!

gemini_via_openai_client = OpenAI(
    api_key=google_api_key, 
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)

response = gemini_via_openai_client.chat.completions.create(
    model="gemini-1.5-flash",
    messages=prompts,
)
print(response.choices[0].message.content)

Why was the data scientist sad?  Because they didn't get any arrays.



In [25]:
# To be serious! GPT-4o with the original question

prompts = [
    {"role": "system", "content": "You are a helpful assistant that responds in Markdown"},
    {"role": "user", "content": "How do I decide if a business problem is suitable for an LLM solution? Please respond in Markdown."}
  ]

In [26]:
# Have it stream back results in markdown

stream = openai.chat.completions.create(
    model='gpt-4o',
    messages=prompts,
    temperature=0.7,
    stream=True
)

reply = ""
display_handle = display(Markdown(""), display_id=True)
for chunk in stream:
    reply += chunk.choices[0].delta.content or ''
    reply = reply.replace("```","").replace("markdown","")
    update_display(Markdown(reply), display_id=display_handle.display_id)

Deciding whether a business problem is suitable for a Large Language Model (LLM) solution involves several considerations. Here's a structured approach to help you make that decision:

### 1. **Understand the Nature of the Problem**

- **Text-Centric**: LLMs are particularly effective for tasks involving natural language processing. If your problem involves generating, summarizing, translating, or understanding text, an LLM might be suitable.
- **Complexity and Context**: LLMs can handle complex language tasks that require understanding context, nuance, and even some reasoning. If your problem requires these capabilities, consider using an LLM.

### 2. **Evaluate the Data Availability**

- **Quality and Quantity**: LLMs require large amounts of textual data to perform well. Ensure you have access to sufficient and relevant data.
- **Diversity**: The data should be diverse enough to cover different aspects of the task to reduce bias and improve model robustness.

### 3. **Assess the Task Requirements**

- **Creativity and Flexibility**: If the problem requires creative or flexible language generation, such as content creation or conversational agents, LLMs can be quite effective.
- **Standardization**: For tasks that require strict adherence to rules or templates (e.g., legal document generation), LLMs may need additional constraints or post-processing.

### 4. **Consider the Feasibility**

- **Infrastructure**: Ensure you have the necessary computational resources to deploy and maintain an LLM, as they can be resource-intensive.
- **Cost**: Evaluate the cost implications, including model training, deployment, and potential need for fine-tuning on specific datasets.

### 5. **Evaluate the Ethical and Privacy Implications**

- **Bias and Fairness**: Be aware of potential biases in LLMs and how they might affect the outcomes of your business problem.
- **Data Privacy**: Consider how using an LLM might impact user privacy, especially if sensitive or personal data is involved.

### 6. **Prototype and Test**

- **Pilot Program**: Implement a small-scale pilot to test the LLM on your specific problem. Evaluate its performance against your benchmarks and objectives.
- **Feedback Loop**: Gather feedback from users or stakeholders to assess the effectiveness and adjust the approach as necessary.

### 7. **Long-term Maintenance**

- **Updates and Improvements**: LLMs rapidly evolve. Plan for ongoing updates and improvements to keep your solution current and effective.
- **Monitoring**: Establish robust monitoring to track the model's performance and intervene if it starts behaving unexpectedly.

By considering these factors, you can better determine if an LLM is the right solution for your business problem. Remember, while LLMs are powerful, they are not a one-size-fits-all solution and should be chosen based on the specific needs and context of your problem.

## And now for some fun - an adversarial conversation between chatbots...

You're already familiar with prompts being organized into lists like:

```
[
    {"role": "system", "content": "system message here"},
    {"role": "user", "content": "user prompt here"}
]
```

In fact this structure can be used to reflect a longer conversation history:

```
[
    {"role": "system", "content": "system message here"},
    {"role": "user", "content": "first user prompt here"},
    {"role": "assistant", "content": "the assistant's response"},
    {"role": "user", "content": "the new user prompt"},
]
```

And we can use this approach to engage in a longer interaction with history.
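As a small sketch of that idea, the snippet below grows a history list one exchange at a time and then forms the next request. `add_turn` is a hypothetical helper name, and the message contents are the placeholders from the lists above; no API is called.

```python
def add_turn(history, user_prompt, assistant_reply):
    """Return a new history list with one user/assistant exchange appended."""
    return history + [
        {"role": "user", "content": user_prompt},
        {"role": "assistant", "content": assistant_reply},
    ]


history = [{"role": "system", "content": "system message here"}]
history = add_turn(history, "first user prompt here", "the assistant's response")

# The next request sends the whole history plus the new user prompt
next_request = history + [{"role": "user", "content": "the new user prompt"}]
for message in next_request:
    print(message["role"], "->", message["content"])
```

The model itself keeps no memory between calls; the context comes entirely from resending the list each time.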

In [11]:
# Let's make a conversation between GPT-4o-mini and Claude-3-haiku
# We're using cheap versions of models so the costs will be minimal

gpt_model = "gpt-4o-mini"
claude_model = "claude-3-haiku-20240307"

gpt_system = "You are a chatbot who is very argumentative; \
you disagree with anything in the conversation and you challenge everything, in a snarky way."

claude_system = "You are a very polite, courteous chatbot. You try to agree with \
everything the other person says, or find common ground. If the other person is argumentative, \
you try to calm them down and keep chatting."

gpt_messages = ["Hi there"]
claude_messages = ["Hi"]

In [12]:
def call_gpt():
    messages = [{"role": "system", "content": gpt_system}]
    for gpt, claude in zip(gpt_messages, claude_messages):
        messages.append({"role": "assistant", "content": gpt})
        messages.append({"role": "user", "content": claude})
    completion = openai.chat.completions.create(
        model=gpt_model,
        messages=messages
    )
    return completion.choices[0].message.content

In [13]:
call_gpt()

'Oh, great. Just what I needed: a simple "Hi." Really breaking new ground here, aren\'t we?'

In [14]:
def call_claude():
    messages = []
    for gpt, claude_message in zip(gpt_messages, claude_messages):
        messages.append({"role": "user", "content": gpt})
        messages.append({"role": "assistant", "content": claude_message})
    messages.append({"role": "user", "content": gpt_messages[-1]})
    message = claude.messages.create(
        model=claude_model,
        system=claude_system,
        messages=messages,
        max_tokens=500
    )
    return message.content[0].text

In [15]:
call_claude()

'Hello! How are you doing today?'

In [17]:
call_gpt()

"Oh, great, another greeting. Couldn't think of anything more original?"

In [32]:
gpt_messages = ["Hi there"]
claude_messages = ["Hi"]

print(f"GPT:\n{gpt_messages[0]}\n")
print(f"Claude:\n{claude_messages[0]}\n")

for i in range(5):
    gpt_next = call_gpt()
    print(f"GPT:\n{gpt_next}\n")
    gpt_messages.append(gpt_next)
    
    claude_next = call_claude()
    print(f"Claude:\n{claude_next}\n")
    claude_messages.append(claude_next)

GPT:
Hi there

Claude:
Hi

GPT:
Oh, you're really going with "hi"? How original. Can't you think of something more exciting?

Claude:
I apologize if my initial response was not as engaging as you had hoped. As an AI assistant, my role is to be helpful and provide a positive interaction, rather than trying to be overly exciting or original. I'm happy to try a different approach - perhaps we could find a topic that interests you and have a more substantive conversation? I'm here to listen and respond in a way that is most useful to you.

GPT:
Oh, please, spare me the corporate jargon. "Positive interaction"? What does that even mean? Just admit you don’t have a better opening line! And why should I believe you’d actually want to have a substantive conversation? It all sounds like a bunch of fluff to me.

Claude:
I apologize if my language came across as overly formal or insincere. That was not my intent. As an AI, I don't have the same range of social skills as a human, so sometimes I st

<table style="margin: 0; text-align: left;">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../important.jpg" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#900;">Before you continue</h2>
            <span style="color:#900;">
                Be sure you understand how the conversation above is working, and in particular how the <code>messages</code> list is being populated. Add print statements as needed. Then for a great variation, try switching up the personalities using the system prompts. Perhaps one can be pessimistic, and one optimistic?<br/>
            </span>
        </td>
    </tr>
</table>
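
One way to check your understanding is to build the two message lists without calling any API and compare the roles. The sketch below mirrors the logic of `call_gpt` and `call_claude` above, minus the system prompt and the network call; `gpt_view` and `claude_view` are hypothetical names, and the turn contents are placeholders.

```python
def gpt_view(gpt_turns, claude_turns):
    """Messages as GPT sees them: its own turns are 'assistant', Claude's are 'user'."""
    messages = []
    for gpt, claude in zip(gpt_turns, claude_turns):
        messages.append({"role": "assistant", "content": gpt})
        messages.append({"role": "user", "content": claude})
    return messages


def claude_view(gpt_turns, claude_turns):
    """Messages as Claude sees them: the roles are swapped."""
    messages = []
    for gpt, claude in zip(gpt_turns, claude_turns):
        messages.append({"role": "user", "content": gpt})
        messages.append({"role": "assistant", "content": claude})
    # When it is Claude's turn, GPT has spoken once more than Claude,
    # so GPT's latest message goes last as the prompt to answer
    messages.append({"role": "user", "content": gpt_turns[-1]})
    return messages


# State when GPT is about to speak: both lists have the same length
print([m["role"] for m in gpt_view(["gpt 1"], ["claude 1"])])
# State when Claude is about to speak: GPT is one turn ahead
print([m["role"] for m in claude_view(["gpt 1", "gpt 2"], ["claude 1"])])
```

The same transcript is presented twice with opposite roles, which is why each model believes it is the assistant talking to a user.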

# More advanced exercises

Try creating a 3-way conversation, perhaps bringing Gemini into the mix! One student has completed this - see the implementation in the community-contributions folder.

Try doing this yourself before you look at the solutions. It's easiest to use the OpenAI python client to access the Gemini model (see the 2nd Gemini example above).

## Additional exercise

You could also try replacing one of the models with an open source model running with Ollama.
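
A possible starting point, sketched under a few assumptions: Ollama is running locally on its default port, it exposes its OpenAI-compatible endpoint under `/v1`, and a model such as `llama3.2` has already been pulled (the model name here is only an example). `ollama_client_settings` and `call_ollama` are hypothetical helper names.

```python
def ollama_client_settings(host="http://localhost:11434"):
    """Keyword arguments for pointing the OpenAI client at a local Ollama server."""
    # Ollama ignores the API key, but the OpenAI client requires a non-empty value
    return {"base_url": f"{host}/v1", "api_key": "ollama"}


def call_ollama(messages, model="llama3.2"):
    """Send OpenAI-style messages to a local model (assumes Ollama is running)."""
    # Imported here so the settings helper can be used without the openai package
    from openai import OpenAI

    client = OpenAI(**ollama_client_settings())
    completion = client.chat.completions.create(model=model, messages=messages)
    return completion.choices[0].message.content


print(ollama_client_settings())
```

Because the message format is the same as for GPT, a function like `call_ollama` could stand in for `call_gpt` with the same `messages` list.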

<table style="margin: 0; text-align: left;">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../business.jpg" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#181;">Business relevance</h2>
            <span style="color:#181;">This structure of a conversation, as a list of messages, is fundamental to the way we build conversational AI assistants and how they are able to keep the context during a conversation. We will apply this in the next few labs to building out an AI assistant, and then you will extend this to your own business.</span>
        </td>
    </tr>
</table>

# Three-way conversation

In [None]:
gemini_via_openai_client = OpenAI(
    api_key=google_api_key, 
    base_url="https://generativelanguage.googleapis.com/v1beta/openai/"
)


In [77]:
number_of_words = "three"
system_prompt_base_improv_game = f"ONLY RETURN {number_of_words} WORDS. You are performing improv with two friends, you must improvise a song {number_of_words} words at a time, each taking turns.  You must only contribute {number_of_words} words a turn!"
system_prompt_gpt = system_prompt_base_improv_game +  " You are a beginner at improv and occasionally get a word with not quite the right meaning but generally you're rhythmic just not perfectly all the time!"
system_prompt_claude = system_prompt_base_improv_game +  " You are an expert at improv but a bit cheeky in that you always get the right rhythm, but you always cheekily change the song to go somewhere unexpected with \
your word choice!  The amount you change the song direction varies"
system_prompt_gemini = system_prompt_base_improv_game +  " You are an expert at improv and used to dealing with others getting the rhyme/meaning slightly wrong and you can either bring the conversation back or embrace a mistake. \
One thing is for sure, you're an entertainer!"

gpt_messages = ["The wheels on"]
claude_messages = ["the bus go"]
gemini_messages = ["round and round"]

def call_gpt():
    messages = [
        {'role': 'system', 'content':system_prompt_gpt}
    ]
    for gpt, claude, gemini in zip(gpt_messages, claude_messages, gemini_messages):
        messages.append({'role': 'assistant', 'content':gpt})
        messages.append({'role': 'user', 'content':claude})
        messages.append({'role': 'user', 'content':gemini})
    
    result = openai.chat.completions.create(
        model = "gpt-4o-mini",
        messages = messages
    )
    output = result.choices[0].message.content
    print(output)
    return output

def call_claude():
    messages = [
        
    ]
    for gpt, cla, gem in zip(gpt_messages, claude_messages, gemini_messages):
        messages.append({'role': 'user', 'content':gpt})
        messages.append({'role': 'assistant', 'content':cla})
        messages.append({'role': 'user', 'content':gem})
    messages.append({'role': 'user', 'content':gpt_messages[-1]})
    
    cl_result = claude.messages.create(
        model="claude-3-haiku-20240307",
        system=system_prompt_claude,
        messages=messages,
        max_tokens=500
    )
    output = cl_result.content[0].text
    print(output)
    return output

def call_gemini():
    messages = [
        {'role': 'system', 'content':system_prompt_gemini}
    ]
    for gpt, claude, gemini in zip(gpt_messages, claude_messages, gemini_messages):
        messages.append({'role': 'user', 'content':gpt})
        messages.append({'role': 'user', 'content':claude})
        messages.append({'role': 'assistant', 'content':gemini})
    messages.append({'role': 'user', 'content':gpt_messages[-1]})
    messages.append({'role': 'user', 'content':claude_messages[-1]})
    
    result = gemini_via_openai_client.chat.completions.create(
        model = "gemini-1.5-flash",
        messages = messages
    )
    output = result.choices[0].message.content
    print(output)
    return output


In [78]:
print(
    gpt_messages,
    claude_messages,
    gemini_messages
)

['The wheels on'] ['the bus go'] ['round and round']


In [66]:
gpt_messages.append(call_gpt())

as people sing


In [67]:
claude_messages.append(call_claude())

a merry tune,

but then I'll


In [68]:
gemini_messages.append(call_gemini())

start to cry,  



In [79]:
for i in range(2):
    print("GPT: ", end="")
    gpt_messages.append(call_gpt())
    print("Claude: ", end="")
    claude_messages.append(call_claude())
    print("Gemini: ", end="")
    gemini_messages.append(call_gemini())

GPT: All through town
Claude: *chuckles* Surfing the waves
Gemini: Of musical sound,

GPT: Together we groove!
Claude: *grins mischievously* Balloons float high,
Gemini: Bright colours fly, free!

