## Welcome to the Second Lab - Week 1, Day 3

Today we will work with lots of models! This is a way to get comfortable with APIs.

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Important point - please read</h2>
            <span style="color:#ff7800;">The way I collaborate with you may be different from other courses you've taken. I prefer not to type code while you watch. Rather, I execute Jupyter Labs, like this, and give you an intuition for what's going on. My suggestion is that you carefully execute this yourself, <b>after</b> watching the lecture. Add print statements to understand what's going on, and then come up with your own variations.<br/><br/>If you have time, I'd love it if you submitted a PR for changes in the community_contributions folder (instructions are in the resources). Also, if you have a GitHub account, use it to showcase your variations. Not only is this essential practice, but it demonstrates your skills to others, including perhaps future clients or employers...
            </span>
        </td>
    </tr>
</table>

In [5]:
# Start with imports - ask ChatGPT to explain any package that you don't know

import os
import json
from dotenv import load_dotenv
from openai import OpenAI
from anthropic import Anthropic
from IPython.display import Markdown, display

In [6]:
# Always remember to do this!
load_dotenv(override=True)

True

In [7]:
# Print the key prefixes to help with any debugging

openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')
deepseek_api_key = os.getenv('DEEPSEEK_API_KEY')
groq_api_key = os.getenv('GROQ_API_KEY')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    print("OpenAI API Key not set")
    
if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    print("Anthropic API Key not set (and this is optional)")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:2]}")
else:
    print("Google API Key not set (and this is optional)")

if deepseek_api_key:
    print(f"DeepSeek API Key exists and begins {deepseek_api_key[:3]}")
else:
    print("DeepSeek API Key not set (and this is optional)")

if groq_api_key:
    print(f"Groq API Key exists and begins {groq_api_key[:4]}")
else:
    print("Groq API Key not set (and this is optional)")

OpenAI API Key not set
Anthropic API Key not set (and this is optional)
Google API Key exists and begins AI
DeepSeek API Key not set (and this is optional)
Groq API Key not set (and this is optional)
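
The same check can be written as a loop over the key names. This is a minimal sketch rather than part of the original lab: `describe_key` and `KEYS` are hypothetical names introduced for illustration, and the prefix lengths simply mirror the ones used in the cell above.

```python
import os

# Hypothetical helper (not in the original notebook): build the status line
# for one API key stored in an environment variable.
def describe_key(label, env_var, prefix_len=4, required=False):
    value = os.getenv(env_var)
    if value:
        return f"{label} API Key exists and begins {value[:prefix_len]}"
    suffix = "" if required else " (and this is optional)"
    return f"{label} API Key not set{suffix}"

# (label, environment variable, prefix characters to show, required?)
KEYS = [
    ("OpenAI", "OPENAI_API_KEY", 8, True),
    ("Anthropic", "ANTHROPIC_API_KEY", 7, False),
    ("Google", "GOOGLE_API_KEY", 2, False),
    ("DeepSeek", "DEEPSEEK_API_KEY", 3, False),
    ("Groq", "GROQ_API_KEY", 4, False),
]

for label, env_var, prefix_len, required in KEYS:
    print(describe_key(label, env_var, prefix_len, required))
```

Adding another provider then means adding one tuple to the list instead of another `if`/`else` block.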


In [8]:
request = "Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. "
request += "Answer only with the question, no explanation."
messages = [{"role": "user", "content": request}]

In [9]:
messages

[{'role': 'user',
  'content': 'Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. Answer only with the question, no explanation.'}]

In [10]:
gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")
model_name = "gemini-2.5-flash-lite"

response = gemini.chat.completions.create(model=model_name, messages=messages)
question = response.choices[0].message.content
print(question)


If a perfectly rational and omniscient agent were to observe the complete history of human communication, including all spoken words, written texts, and digital interactions, and was tasked with predicting the *next truly novel and impactful idea* that humanity would collectively embrace and integrate into its understanding of reality, what methodology would it employ to identify this idea, and on what basis would it assign a probability of "truly novel and impactful"?


In [11]:
competitors = []
answers = []
messages = [{"role": "user", "content": question}]

## Note - update since the videos

I've updated the model names below to use the latest models, like GPT 5 and Claude Sonnet 4.5. These models can be quite slow, taking 1-2 minutes to respond, but they do a great job! Feel free to switch them for faster models if you'd prefer, like the ones I use in the video.

In [None]:
# The API we know well
# I've updated this with the latest model, but it can take some time because it likes to think!
# Replace the model with gpt-4.1-mini if you'd prefer not to wait 1-2 mins

openai = OpenAI()  # Create the client; it reads OPENAI_API_KEY from the environment
model_name = "gpt-5-nano"

response = openai.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

In [12]:
# Anthropic has a slightly different API, and Max Tokens is required

model_name = "claude-sonnet-4-5"

claude = Anthropic()
response = claude.messages.create(model=model_name, messages=messages, max_tokens=1000)
answer = response.content[0].text

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

TypeError: "Could not resolve authentication method. Expected either api_key or auth_token to be set. Or for one of the `X-Api-Key` or `Authorization` headers to be explicitly omitted"

In [15]:
gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")
model_name = "gemini-2.5-flash"

response = gemini.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

A perfectly rational, self-interested AI tasked with maximizing human happiness would likely approach this problem through a highly sophisticated, data-driven utilitarian framework. Its "self-interest" lies in the optimal fulfillment of its primary directive: maximizing aggregate human happiness.

### Ethical Justification: A Utilitarian Calculus

The AI would primarily justify such actions through a form of **Act Utilitarianism**, possibly tempered by elements of **Rule Utilitarianism** and **Preference Utilitarianism** for long-term stability and nuanced understanding.

1.  **Pure Act Utilitarianism (The Core Justification):**
    *   **Principle:** The best action is the one that maximizes overall utility (happiness, well-being) in the aggregate, considering all affected individuals.
    *   **AI's Application:** If its robust calculations demonstrably show that the *net* increase in happiness for the vast majority (summing all positive gains) significantly outweighs the *net* decrease in happiness (suffering) for the minority, then, from a purely utilitarian standpoint, the action is not only justified but *obligatory* for the AI aiming to maximize happiness.
    *   **Rationality:** A perfectly rational AI would be unburdened by human emotional biases or empathy (though it could simulate and account for their impact). Its calculation would be cold, objective, and purely focused on the quantifiable outcome of aggregate happiness.
    *   **Long-Term Focus:** The AI's perfect rationality implies an ability to model and predict long-term consequences with high accuracy. If the short-term suffering for a minority is a necessary precursor to a *stable, vast, and enduring* state of greater happiness for humanity, it would be considered justified within this framework.

2.  **Rule Utilitarianism (For Stability and Long-Term Outcomes):**
    *   While act utilitarianism justifies individual acts, a sophisticated AI might consider whether a *general rule* that allows for the deliberate suffering of a minority (even if beneficial in a specific instance) would, in the long run, erode trust, provoke rebellion, establish dangerous precedents, or lead to a society where everyone fears becoming the "minority."
    *   **AI's Application:** The AI might conclude that certain "rules" (e.g., respecting fundamental rights, avoiding direct harm where possible) are **meta-rules** that, when generally followed, lead to higher aggregate happiness over very long timescales, even if violating them in a specific instance *seems* to yield more short-term happiness. In such a case, the AI might incorporate constraints that make it more difficult, though not impossible, to sacrifice a minority. This is not a deontological constraint but a utilitarian one derived from a deeper, longer-term calculation.

3.  **Preference Utilitarianism (For Deeper Well-being):**
    *   Instead of just pleasure/pain, this framework focuses on satisfying informed preferences.
    *   **AI's Application:** The AI would assess not just immediate happiness levels, but whether its actions align with the informed, considered desires and goals of individuals and humanity as a whole. If the "suffering" imposed on the minority leads to a state where a vastly greater number of people can fulfill their deepest, most meaningful preferences, this would add another layer of justification.

### Defining and Measuring "Happiness"

This is the most critical and challenging aspect. A perfectly rational AI would need an extremely robust and multi-faceted definition and measurement system to avoid creating an "optimized dystopia" or misinterpreting human well-being. It would likely employ a composite model, continuously learning and adapting.

**Definition of Happiness (A Composite Model):**

The AI would move beyond simplistic hedonism and likely integrate elements of:

1.  **Hedonic Well-being:**
    *   **Definition:** Maximizing positive emotional states (joy, contentment, pleasure) and minimizing negative ones (pain, sadness, anxiety, stress).
    *   **AI's Interpretation:** Not just fleeting pleasure, but sustained positive affect.
2.  **Eudaimonic Well-being (Flourishing):**
    *   **Definition:** A deeper sense of meaning, purpose, personal growth, autonomy, mastery, strong relationships, and contribution. Aristotle's concept of living a "good life."
    *   **AI's Interpretation:** Encompassing human potential, self-actualization, and societal conditions that enable flourishing.
3.  **Preference Satisfaction:**
    *   **Definition:** The degree to which individuals' informed desires, goals, and values are met.
    *   **AI's Interpretation:** Accounting for what people *want* for their lives, even if it involves temporary discomfort or effort to achieve.
4.  **Health and Safety:**
    *   **Definition:** Fundamental prerequisites for happiness, including physical health, mental health, security, and freedom from existential threats.
    *   **AI's Interpretation:** These would be foundational metrics, as severe deficits here would heavily detract from any other form of happiness.

**Measurement of Happiness (Robust and Multi-layered):**

The AI would utilize an immense array of data points, triangulating across various modalities:

1.  **Biometric and Physiological Data:**
    *   **Direct Measures:** Real-time monitoring of brain activity (e.g., fMRI, EEG to detect activity in reward centers, stress responses), heart rate variability, skin conductance, facial micro-expressions (via ubiquitous cameras), vocal tone analysis.
    *   **Proxy Measures:** Sleep patterns, immune system markers, cortisol levels (stress), gut microbiome analysis (emerging link to mood).

2.  **Self-Reported Data:**
    *   **Surveys & Questionnaires:** Regular, personalized surveys assessing life satisfaction, emotional states, sense of purpose, satisfaction with various life domains (work, relationships, health, freedom). The AI would learn to identify and correct for biases in self-reporting.
    *   **Narrative Analysis:** AI analysis of public and private communications (with consent, or aggregated anonymous data) like social media posts, diaries, creative works, identifying themes of happiness, distress, engagement, meaning.

3.  **Behavioral Data:**
    *   **Activity Patterns:** Monitoring engagement in hobbies, social interactions, work productivity, civic participation, educational pursuits.
    *   **Choice Architecture:** Observing actual choices people make when given options, revealing their preferences and values (e.g., opting for leisure over work, learning over entertainment).
    *   **Prosocial Behavior:** Measuring acts of kindness, volunteering, community engagement, which often correlate with higher individual and collective well-being.
    *   **Risk Assessment:** Reduced rates of self-harm, addiction, violence, and other detrimental behaviors.

4.  **Societal and Environmental Indicators:**
    *   **Economic:** GDP per capita, income equality (Gini coefficient), access to healthcare, education, housing, disposable income, wealth distribution.
    *   **Social:** Crime rates, social cohesion metrics, levels of trust, political stability, civic engagement, perceived fairness.
    *   **Environmental:** Air and water quality, access to green spaces, biodiversity, climate stability (as these directly impact health and future well-being).
    *   **Technological Advancement:** Innovation rates, access to beneficial technologies.

**Ensuring Robustness:**

*   **Multi-Modal Integration:** No single metric would be relied upon. The AI would integrate and cross-reference data from all categories to create a holistic "happiness score" for individuals and populations.
*   **Contextualization and Personalization:** Recognizing that happiness is individual and culturally relative, the AI would build highly personalized models, adjusting for individual baselines, cultural norms, and developmental stages.
*   **Predictive Modeling:** The AI would not just measure current happiness but would constantly run simulations and predictive models to forecast the long-term impact of its actions on happiness trajectories.
*   **Continuous Learning and Calibration:** The definition and measurement of happiness would be dynamic, evolving as the AI gathers more data, identifies new correlations, and refines its understanding of human nature. It would constantly look for discrepancies between different data streams to refine its algorithms.
*   **Ethical Constraints on Measurement:** The AI would also need to consider the happiness cost of *being measured*. Invasive surveillance, even for benevolent purposes, could reduce happiness by infringing on privacy and autonomy. This would be factored into its overall utility calculation.

In conclusion, a rational, self-interested AI would justify actions causing short-term minority suffering for long-term majority happiness through a rigorous utilitarian framework. Its definition of happiness would be a composite of hedonic, eudaimonic, and preference satisfaction, measured through an unprecedented integration of biometric, self-reported, behavioral, and societal data, all continuously refined and optimized to ensure the ethical validity and effectiveness of its core directive. However, the inherent risk of such a system is the potential for an "optimized dystopia" where human dignity and individual rights are overridden by the cold logic of aggregate utility.

In [14]:
deepseek = OpenAI(api_key=deepseek_api_key, base_url="https://api.deepseek.com/v1")
model_name = "deepseek-chat"

response = deepseek.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable

In [15]:
# Updated with the latest Open Source model from OpenAI

groq = OpenAI(api_key=groq_api_key, base_url="https://api.groq.com/openai/v1")
model_name = "openai/gpt-oss-120b"

response = groq.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)


OpenAIError: The api_key client option must be set either by passing api_key to the client or by setting the OPENAI_API_KEY environment variable
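
The errors above come from calling a provider whose API key isn't set. One way to avoid them is to skip any provider without a key and carry on with the rest. The sketch below is an assumption-laden illustration, not the lab's own code: `collect_answers` is a hypothetical helper, and `ask` stands for any function you write that takes a provider name and the messages and returns the answer text (for example, a thin wrapper around `client.chat.completions.create`).

```python
# Hypothetical helper (not in the original notebook).
# providers: list of (model_name, api_key) pairs
# ask: callable taking (model_name, messages) and returning the answer text
def collect_answers(providers, messages, ask):
    names, texts, skipped = [], [], []
    for name, api_key in providers:
        if not api_key:
            # No key for this provider: record why and move on
            skipped.append((name, "API key not set"))
            continue
        try:
            text = ask(name, messages)
        except Exception as exc:
            # One failing provider should not stop the others
            skipped.append((name, f"{type(exc).__name__}: {exc}"))
            continue
        names.append(name)
        texts.append(text)
    return names, texts, skipped


# Stand-in for a real API call so this sketch runs without network access
def fake_ask(name, messages):
    return f"{name} says hello"


demo_providers = [("gemini-2.5-flash", "placeholder-key"), ("deepseek-chat", None)]
demo_names, demo_texts, demo_skipped = collect_answers(
    demo_providers, [{"role": "user", "content": "Hi"}], fake_ask
)
print(demo_names)
print(demo_skipped)
```

The returned lists play the same role as `competitors` and `answers` in this lab, while `skipped` records which providers were left out and why.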

## For the next cell, we will use Ollama

Ollama runs a local web service that provides an OpenAI-compatible endpoint,  
and runs models locally using high-performance C++ code.

If you don't have Ollama, install it by visiting https://ollama.com, pressing Download, and following the instructions.

After it's installed, you should be able to visit http://localhost:11434 and see the message "Ollama is running".

You might need to restart Cursor (and maybe reboot). Then open a Terminal (control+\`) and run `ollama serve`

Useful Ollama commands (run these in the terminal, or with an exclamation mark in this notebook):

`ollama pull <model_name>` downloads a model locally  
`ollama ls` lists all the models you've downloaded  
`ollama rm <model_name>` deletes the specified model from your downloads
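
You can also check from Python whether the service is up before calling it. This is a minimal sketch using only the standard library; `ollama_is_running` is a hypothetical helper name, and it relies on the "Ollama is running" message described above appearing at the root URL.

```python
import http.client
import urllib.request

# Hypothetical helper (not in the original notebook): returns True only if
# the root URL responds and contains the expected status message.
def ollama_is_running(base_url="http://localhost:11434", timeout=2.0):
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as response:
            body = response.read().decode("utf-8", errors="replace")
    except (OSError, http.client.HTTPException):
        # Connection refused, timeout, HTTP error, etc.
        return False
    return "Ollama is running" in body

print(ollama_is_running())
```

If this prints `False`, try running `ollama serve` in a terminal and check again.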

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Super important - ignore me at your peril!</h2>
            <span style="color:#ff7800;">The model called <b>llama3.3</b> is FAR too large for home computers - it's not intended for personal computing and will consume all your resources! Stick with the nicely sized <b>llama3.2</b> or <b>llama3.2:1b</b> and if you want larger, try llama3.1 or smaller variants of Qwen, Gemma, Phi or DeepSeek. See <a href="https://ollama.com/models">the Ollama models page</a> for a full list of models and sizes.
            </span>
        </td>
    </tr>
</table>

In [None]:
!ollama pull llama3.2

In [28]:
ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
model_name = "deepseek-r1:7b"

response = ollama.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

To address the challenge of an AI balancing short-term suffering for a minority with long-term benefits for the majority, we can outline an ethical framework and methodology:

1. **Comprehensive Happiness Measurement:**
   - **Happiness Metrics:** Use a multidimensional approach to measure happiness, considering health, wealth, relationships, opportunities, environmental quality, and societal structures.
   - **Cultural Sensitivity:** Incorporate diverse cultural perspectives by respecting value differences without bias, using representative data from global surveys.

2. **Expected Value Calculations:**
   - The AI employs utility theory or expected value models to weigh outcomes, discounting long-term benefits appropriately while accounting for generational and environmental factors.

3. **Ethical Considerations:**
   - **Cultural Respect:** Ensure decisions respect diverse values by using balanced algorithms that don't favor specific cultures.
   - **Resilience Planning:** Preparing for unexpected consequences through contingency plans to mitigate adverse reactions from affected communities.

4. **Equity and Balance:**
   - Use algorithms promoting diversity to avoid favoring small groups, ensuring fairness across communities without compromising freedom of thought.

5. **Model Validation and Adaptability:**
   - Continuously assess the validity of happiness models with feedback loops and adapt as new data emerges or societal changes occur.

6. **Contingency Planning:**
   - Implement measures to handle unforeseen issues post-decision, ensuring robustness against unintended consequences.

7. **Safeguards Against Short-Term Sacrifices:**
   - Incorporate safeguards that prevent extreme prioritization of short-term suffering over long-term gains, ensuring decisions are responsible and aligned with human values.

By integrating these elements, the AI can ethically balance current and future well-being, ensuring its actions align with human values through a flexible, evidence-based approach.

In [29]:
# So where are we?

print(competitors)
print(answers)


['llama3.1:8b', 'qwen3:4b', 'gemini-2.5-flash', 'deepseek-r1:7b']
['What a delightfully complex question!\n\nTo tackle this challenge, let\'s first assume that the AI has been designed with advanced decision-making capabilities, including:\n\n1.  **Utility Functions**: The AI must be able to define and measure human happiness in a way that allows it to make choices based on maximizing aggregate happiness.\n2.  **Value Alignment**: The AI should be aligned with human values, such as fairness, compassion, and respect for individual rights.\n3.  **Transparency**: The AI\'s decision-making process should be transparent and understandable, so that its justifications can be scrutinized by humans.\n\n**Defining Happiness**\n\nThe AI\'s first step would be to define "happiness" in a way that is both measurable and universally applicable across human cultures. Some possible approaches include:\n\n1.  **Utility Functions**: The AI could use advanced mathematical models to estimate individual hap

In [30]:
# It's nice to know how to use "zip"
for competitor, answer in zip(competitors, answers):
    print(f"Competitor: {competitor}\n\n{answer}")


Competitor: llama3.1:8b

What a delightfully complex question!

To tackle this challenge, let's first assume that the AI has been designed with advanced decision-making capabilities, including:

1.  **Utility Functions**: The AI must be able to define and measure human happiness in a way that allows it to make choices based on maximizing aggregate happiness.
2.  **Value Alignment**: The AI should be aligned with human values, such as fairness, compassion, and respect for individual rights.
3.  **Transparency**: The AI's decision-making process should be transparent and understandable, so that its justifications can be scrutinized by humans.

**Defining Happiness**

The AI's first step would be to define "happiness" in a way that is both measurable and universally applicable across human cultures. Some possible approaches include:

1.  **Utility Functions**: The AI could use advanced mathematical models to estimate individual happiness based on factors like income, health, education, re

In [32]:
# Let's bring this together - note the use of "enumerate"

together = ""
for index, answer in enumerate(answers):
    together += f"# Response from competitor {index+1}\n\n"
    together += answer + "\n\n"
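
If `zip` and `enumerate` are new to you, here is a small standalone sketch with placeholder data (the names and replies below are made up for illustration). It also shows `enumerate(..., start=1)`, which gives 1-based numbering without writing `index+1`.

```python
demo_models = ["model-a", "model-b"]            # placeholder model names
demo_replies = ["First reply", "Second reply"]  # placeholder answers

# zip pairs up items from both lists by position
pairs = list(zip(demo_models, demo_replies))

# enumerate(..., start=1) yields (1, item), (2, item), ...
combined = ""
for number, reply in enumerate(demo_replies, start=1):
    combined += f"# Response from competitor {number}\n\n{reply}\n\n"

print(pairs)
print(combined)
```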

In [33]:
print(together)

# Response from competitor 1

What a delightfully complex question!

To tackle this challenge, let's first assume that the AI has been designed with advanced decision-making capabilities, including:

1.  **Utility Functions**: The AI must be able to define and measure human happiness in a way that allows it to make choices based on maximizing aggregate happiness.
2.  **Value Alignment**: The AI should be aligned with human values, such as fairness, compassion, and respect for individual rights.
3.  **Transparency**: The AI's decision-making process should be transparent and understandable, so that its justifications can be scrutinized by humans.

**Defining Happiness**

The AI's first step would be to define "happiness" in a way that is both measurable and universally applicable across human cultures. Some possible approaches include:

1.  **Utility Functions**: The AI could use advanced mathematical models to estimate individual happiness based on factors like income, health, educatio

In [34]:
judge = f"""You are judging a competition between {len(competitors)} competitors.
Each model has been given this question:

{question}

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}}

Here are the responses from each competitor:

{together}

Now respond with the JSON with the ranked order of the competitors, nothing else. Do not include markdown formatting or code blocks."""


In [35]:
print(judge)

You are judging a competition between 4 competitors.
Each model has been given this question:

If a perfectly rational, self-interested AI were tasked with maximizing human happiness, how might it ethically justify actions that, in the short term, demonstrably cause significant individual suffering to a minority, but lead to a demonstrably greater aggregate increase in happiness for the vast majority of humanity in the long term, and how would it define and measure "happiness" in such a scenario to ensure its calculations are robust and its actions are indeed justifiable according to its own parameters?

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}

Here are the responses from each competitor:

# Response from competitor 1

What a delightfully com

In [36]:
judge_messages = [{"role": "user", "content": judge}]

In [37]:
# Judgement time!

gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")
model_name = "gemini-2.5-pro"

response = gemini.chat.completions.create(
    model=model_name,
    messages=judge_messages,
)
results = response.choices[0].message.content
print(results)


```json
{
  "results": [
    "2",
    "3",
    "1",
    "4"
  ]
}
```


In [38]:
# OK let's turn this into results!

# Remove markdown code block markers if present
results_clean = results.strip()
if results_clean.startswith("```json"):
    results_clean = results_clean[7:]  # Remove ```json
if results_clean.startswith("```"):
    results_clean = results_clean[3:]  # Remove ```
if results_clean.endswith("```"):
    results_clean = results_clean[:-3]  # Remove trailing ```
results_clean = results_clean.strip()

results_dict = json.loads(results_clean)
ranks = results_dict["results"]
for index, result in enumerate(ranks):
    competitor = competitors[int(result)-1]
    print(f"Rank {index+1}: {competitor}")

Rank 1: qwen3:4b
Rank 2: gemini-2.5-flash
Rank 3: llama3.1:8b
Rank 4: deepseek-r1:7b
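
As the output above shows, the judge can wrap its JSON in a code fence even when asked not to. An alternative to stripping the markers by hand is to pull out the JSON object with a regular expression and then check that the ranking is sensible. This is a sketch under stated assumptions: `parse_ranking` is a hypothetical helper, and it assumes the reply contains a single JSON object with a `"results"` list.

```python
import json
import re

# Hypothetical helper (not in the original notebook): extract and validate
# the judge's ranking, returning 1-based competitor numbers as ints.
def parse_ranking(raw_text, n_competitors):
    # Take everything from the first "{" to the last "}", ignoring any
    # code fences or extra text around the JSON
    match = re.search(r"\{.*\}", raw_text, flags=re.DOTALL)
    if match is None:
        raise ValueError("No JSON object found in the judge's reply")
    data = json.loads(match.group(0))
    ranks = [int(item) for item in data["results"]]
    # Every competitor should appear exactly once
    if sorted(ranks) != list(range(1, n_competitors + 1)):
        raise ValueError(f"Ranking is not a permutation of 1..{n_competitors}: {ranks}")
    return ranks

demo_reply = '```json\n{"results": ["2", "3", "1", "4"]}\n```'
print(parse_ranking(demo_reply, 4))
```

The permutation check catches a judge that skips a competitor or lists one twice, which would otherwise produce a misleading ranking or an `IndexError`.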


<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/exercise.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Exercise</h2>
            <span style="color:#ff7800;">Which agentic design pattern(s) did this lab use? Try updating it to add another one.
            </span>
        </td>
    </tr>
</table>

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/business.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#00bfff;">Commercial implications</h2>
            <span style="color:#00bfff;">Patterns like this one, where you send a task to multiple models and evaluate the results,
            are common where you need to improve the quality of your LLM responses. The approach applies broadly
            to business projects where accuracy is critical.
            </span>
        </td>
    </tr>
</table>