## Welcome to the Second Lab - Week 1, Day 3

Today we will work with lots of models! This is a way to get comfortable with APIs.

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Important point - please read</h2>
            <span style="color:#ff7800;">The way I collaborate with you may be different to other courses you've taken. I prefer not to type code while you watch. Rather, I execute Jupyter Labs, like this, and give you an intuition for what's going on. My suggestion is that you carefully execute this yourself, <b>after</b> watching the lecture. Add print statements to understand what's going on, and then come up with your own variations.<br/><br/>If you have time, I'd love it if you submit a PR for changes in the community_contributions folder - instructions in the resources. Also, if you have a Github account, use this to showcase your variations. Not only is this essential practice, but it demonstrates your skills to others, including perhaps future clients or employers...
            </span>
        </td>
    </tr>
</table>

In [4]:
# Start with imports - ask ChatGPT to explain any package that you don't know

import os
import json
from dotenv import load_dotenv
from openai import OpenAI
from anthropic import Anthropic
from IPython.display import Markdown, display

In [5]:
# Always remember to do this!
load_dotenv(override=True)

True

In [13]:
# Print the key prefixes to help with any debugging

openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')
deepseek_api_key = os.getenv('DEEPSEEK_API_KEY')
groq_api_key = os.getenv('GROQ_API_KEY')
base_url = os.getenv('BASE_URL')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    print("OpenAI API Key not set")
    
if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    print("Anthropic API Key not set (and this is optional)")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:2]}")
else:
    print("Google API Key not set (and this is optional)")

if deepseek_api_key:
    print(f"DeepSeek API Key exists and begins {deepseek_api_key[:3]}")
else:
    print("DeepSeek API Key not set (and this is optional)")

if groq_api_key:
    print(f"Groq API Key exists and begins {groq_api_key[:4]}")
else:
    print("Groq API Key not set (and this is optional)")

OpenAI API Key exists and begins sk-PHNV3
Anthropic API Key exists and begins sk-ant-
Google API Key exists and begins AI
DeepSeek API Key not set (and this is optional)
Groq API Key not set (and this is optional)


In [7]:
request = "Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. "
request += "Answer only with the question, no explanation."
messages = [{"role": "user", "content": request}]

In [10]:
messages

[{'role': 'user',
  'content': 'Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. Answer only with the question, no explanation.'}]

In [14]:
openai = OpenAI(base_url=base_url)
response = openai.chat.completions.create(
    model="gpt-4o-mini",
    messages=messages,
)
question = response.choices[0].message.content
print(question)


If you could redesign one fundamental aspect of human communication to enhance understanding and empathy among people, what would you change and why?


In [15]:
competitors = []
answers = []
messages = [{"role": "user", "content": question}]

In [16]:
# The API we know well

model_name = "gpt-4o-mini"

response = openai.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

If I could redesign one fundamental aspect of human communication, I would enhance the capacity for non-verbal communication signals, such as facial expressions, tone of voice, and body language, by integrating a universally understood emotional indicator system. This system could be a combination of augmented reality and wearable technology that subtly displays a person's emotional state in real time.

Here's why I believe this change would foster greater understanding and empathy among people:

1. **Clarity in Emotions**: Often, verbal communication lacks the ability to convey true feelings, leading to misunderstandings. By having a visual or auditory cue that indicates emotions—such as a gentle color glow or sound that corresponds to feelings like happiness, sadness, frustration, or confusion—people would be able to interpret emotional states more accurately.

2. **Enhanced Empathy**: When individuals can see or hear when someone is experiencing discomfort, joy, or anxiety, it can foster a deeper connection and awareness. This awareness could encourage people to respond with a greater sense of compassion and support, enhancing interpersonal relationships.

3. **Reduced Miscommunication**: Misunderstandings often arise from differing interpretations of words or intentions. With clearer emotional indicators, individuals would have additional context that could mitigate potential conflicts and improve the quality of discussions.

4. **Cultural Sensitivity**: Such a system could be designed to incorporate cultural nuances in emotional expression, allowing for better cross-cultural communication. This would enable individuals to navigate differences in emotional expressions, fostering respect and appreciation for diversity.

In essence, integrating a universal emotional indicator system into human communication could bridge gaps, promote empathetic interactions, and ultimately lead to a more understanding and connected society. The goal would be to create a communication environment where feelings are transparent, and understanding is prioritized, making each interaction more meaningful.

In [17]:
# Anthropic has a slightly different API, and Max Tokens is required

model_name = "claude-3-7-sonnet-latest"

claude = Anthropic()
response = claude.messages.create(model=model_name, messages=messages, max_tokens=1000)
answer = response.content[0].text

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

# Reimagining Human Communication

If I could redesign one fundamental aspect of human communication, I would create a "perspective-sharing capability" - the ability to temporarily experience the emotional context and background knowledge that shapes another person's understanding.

This wouldn't involve mind-reading or violating privacy, but rather a controlled, consensual exchange where:

1. You could briefly feel how your words land emotionally with others
2. You could temporarily access relevant contextual knowledge that shapes their interpretation
3. The exchange would be bidirectional and mutually illuminating

Why this change? Our most profound communication barriers stem from assuming others perceive the world as we do. This capability would preserve our individual identities while creating bridges across different lived experiences, cultural contexts, and emotional landscapes.

Imagine how conflicts might transform if we could truly understand not just what others think, but how and why they came to their perspective. The potential for empathy, conflict resolution, and collaborative problem-solving would be extraordinary.

In [None]:
gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")
model_name = "gemini-2.0-flash"

response = gemini.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

In [None]:
deepseek = OpenAI(api_key=deepseek_api_key, base_url="https://api.deepseek.com/v1")
model_name = "deepseek-chat"

response = deepseek.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

In [None]:
groq = OpenAI(api_key=groq_api_key, base_url="https://api.groq.com/openai/v1")
model_name = "llama-3.3-70b-versatile"

response = groq.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)


## For the next cell, we will use Ollama

Ollama runs a local web service that gives an OpenAI compatible endpoint,  
and runs models locally using high performance C++ code.

If you don't have Ollama, install it here by visiting https://ollama.com then pressing Download and following the instructions.

After it's installed, you should be able to visit here: http://localhost:11434 and see the message "Ollama is running"

You might need to restart Cursor (and maybe reboot). Then open a Terminal (control+\`) and run `ollama serve`

Useful Ollama commands (run these in the terminal, or with an exclamation mark in this notebook):

`ollama pull <model_name>` downloads a model locally  
`ollama ls` lists all the models you've downloaded  
`ollama rm <model_name>` deletes the specified model from your downloads

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Super important - ignore me at your peril!</h2>
            <span style="color:#ff7800;">The model called <b>llama3.3</b> is FAR too large for home computers - it's not intended for personal computing and will consume all your resources! Stick with the nicely sized <b>llama3.2</b> or <b>llama3.2:1b</b> and if you want larger, try llama3.1 or smaller variants of Qwen, Gemma, Phi or DeepSeek. See the <A href="https://ollama.com/models">the Ollama models page</a> for a full list of models and sizes.
            </span>
        </td>
    </tr>
</table>

In [None]:
!ollama pull llama3.2

In [18]:
ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
model_name = "llama3.2"

response = ollama.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

If I could redesign one fundamental aspect of human communication to enhance understanding and empathy among people, I would propose a simple yet crucial modification to the way we use language: incorporating more nuanced contextual markers into everyday conversation.

**Current limitations:** Our current reliance on verbal language often leads to misunderstandings, miscommunications, and missed opportunities for genuine connection. Verbal expressions can be ambiguous, subject to multiple interpretations, and vulnerable to cultural, personal, or situational biases. This can result in unintended offense, hurt feelings, or unfulfilled needs.

**Proposed redesign:** Integrate contextual markers that enable both speakers and listeners to express and recognize underlying emotional states, intentions, and social cues more effectively. These markers could be embedded into language structures, vocabulary, and non-verbal behaviors.

Some possible examples of contextual markers include:

1. **Emotional qualifiers**: Incorporate subtle emotional indicators, such as verbal inflections (e.g., "Can I...", "Would that surprise you..."), to convey the speaker's emotional state or intent.
2. **Social referencing cues**: Use everyday conversational phrases like "That made me think of..." or "I'm still wondering if..."
3. **Non-verbal feedback mechanisms**: Incorporate gestures (smiling, nodding), facial expressions, and body language into verbal conversations to provide direct non-verbal cues that complement words.
4. **Contextual vocabulary**: Develop a standardized vocabulary with nuanced words that help convey specific emotional states or intentions (e.g., "I'm feeling overwhelmed," "Can I rely on you").

**Rationale:** By incorporating contextual markers, we can:

1. Reduce misunderstandings and miscommunications
2. Enhance empathy by conveying the emotional state of both parties
3. Increase effective listener engagement and understanding
4. Strengthen building trust through open exchange of non-judgmental feedback

This redesign aims to augment our existing linguistic tools with more effective communication mechanisms, leveraging natural language processes and subtle social cues to facilitate deeper connections, better relationships, and enhanced understanding among individuals.

How do you think a more nuanced contextual marker approach could impact interpersonal conversations?

In [19]:
# So where are we?

print(competitors)
print(answers)


['gpt-4o-mini', 'claude-3-7-sonnet-latest', 'llama3.2']
["If I could redesign one fundamental aspect of human communication, I would enhance the capacity for non-verbal communication signals, such as facial expressions, tone of voice, and body language, by integrating a universally understood emotional indicator system. This system could be a combination of augmented reality and wearable technology that subtly displays a person's emotional state in real time.\n\nHere's why I believe this change would foster greater understanding and empathy among people:\n\n1. **Clarity in Emotions**: Often, verbal communication lacks the ability to convey true feelings, leading to misunderstandings. By having a visual or auditory cue that indicates emotions—such as a gentle color glow or sound that corresponds to feelings like happiness, sadness, frustration, or confusion—people would be able to interpret emotional states more accurately.\n\n2. **Enhanced Empathy**: When individuals can see or hear wh

In [20]:
# It's nice to know how to use "zip"
for competitor, answer in zip(competitors, answers):
    print(f"Competitor: {competitor}\n\n{answer}")


Competitor: gpt-4o-mini

If I could redesign one fundamental aspect of human communication, I would enhance the capacity for non-verbal communication signals, such as facial expressions, tone of voice, and body language, by integrating a universally understood emotional indicator system. This system could be a combination of augmented reality and wearable technology that subtly displays a person's emotional state in real time.

Here's why I believe this change would foster greater understanding and empathy among people:

1. **Clarity in Emotions**: Often, verbal communication lacks the ability to convey true feelings, leading to misunderstandings. By having a visual or auditory cue that indicates emotions—such as a gentle color glow or sound that corresponds to feelings like happiness, sadness, frustration, or confusion—people would be able to interpret emotional states more accurately.

2. **Enhanced Empathy**: When individuals can see or hear when someone is experiencing discomfort, 

In [21]:
# Let's bring this together - note the use of "enumerate"

together = ""
for index, answer in enumerate(answers):
    together += f"# Response from competitor {index+1}\n\n"
    together += answer + "\n\n"

In [22]:
print(together)

# Response from competitor 1

If I could redesign one fundamental aspect of human communication, I would enhance the capacity for non-verbal communication signals, such as facial expressions, tone of voice, and body language, by integrating a universally understood emotional indicator system. This system could be a combination of augmented reality and wearable technology that subtly displays a person's emotional state in real time.

Here's why I believe this change would foster greater understanding and empathy among people:

1. **Clarity in Emotions**: Often, verbal communication lacks the ability to convey true feelings, leading to misunderstandings. By having a visual or auditory cue that indicates emotions—such as a gentle color glow or sound that corresponds to feelings like happiness, sadness, frustration, or confusion—people would be able to interpret emotional states more accurately.

2. **Enhanced Empathy**: When individuals can see or hear when someone is experiencing discomf

In [23]:
judge = f"""You are judging a competition between {len(competitors)} competitors.
Each model has been given this question:

{question}

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}}

Here are the responses from each competitor:

{together}

Now respond with the JSON with the ranked order of the competitors, nothing else. Do not include markdown formatting or code blocks."""


In [24]:
print(judge)

You are judging a competition between 3 competitors.
Each model has been given this question:

If you could redesign one fundamental aspect of human communication to enhance understanding and empathy among people, what would you change and why?

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}

Here are the responses from each competitor:

# Response from competitor 1

If I could redesign one fundamental aspect of human communication, I would enhance the capacity for non-verbal communication signals, such as facial expressions, tone of voice, and body language, by integrating a universally understood emotional indicator system. This system could be a combination of augmented reality and wearable technology that subtly displays a person's emotional sta

In [25]:
judge_messages = [{"role": "user", "content": judge}]

In [26]:
# Judgement time!

openai = OpenAI()
response = openai.chat.completions.create(
    model="o3-mini",
    messages=judge_messages,
)
results = response.choices[0].message.content
print(results)


AuthenticationError: Error code: 401 - {'error': {'message': 'Incorrect API key provided: sk-PHNV3***************************************R1mX. You can find your API key at https://platform.openai.com/account/api-keys.', 'type': 'invalid_request_error', 'param': None, 'code': 'invalid_api_key'}}

In [None]:
# OK let's turn this into results!

results_dict = json.loads(results)
ranks = results_dict["results"]
for index, result in enumerate(ranks):
    competitor = competitors[int(result)-1]
    print(f"Rank {index+1}: {competitor}")

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/exercise.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Exercise</h2>
            <span style="color:#ff7800;">Which pattern(s) did this use? Try updating this to add another Agentic design pattern.
            </span>
        </td>
    </tr>
</table>

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/business.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#00bfff;">Commercial implications</h2>
            <span style="color:#00bfff;">These kinds of patterns - to send a task to multiple models, and evaluate results,
            are common where you need to improve the quality of your LLM response. This approach can be universally applied
            to business projects where accuracy is critical.
            </span>
        </td>
    </tr>
</table>