## Welcome to the Second Lab - Week 1, Day 3

Today we will call several different models through their APIs. It's a good way to get comfortable working with APIs.

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Important point - please read</h2>
            <span style="color:#ff7800;">The way I collaborate with you may be different to other courses you've taken. I prefer not to type code while you watch. Rather, I execute Jupyter Labs, like this, and give you an intuition for what's going on. My suggestion is that you carefully execute this yourself, <b>after</b> watching the lecture. Add print statements to understand what's going on, and then come up with your own variations.<br/><br/>If you have time, I'd love it if you submit a PR for changes in the community_contributions folder - instructions in the resources. Also, if you have a GitHub account, use this to showcase your variations. Not only is this essential practice, but it demonstrates your skills to others, including perhaps future clients or employers...
            </span>
        </td>
    </tr>
</table>

In [12]:
# Start with imports - ask ChatGPT to explain any package that you don't know

import os
import json
from dotenv import load_dotenv
from openai import OpenAI
from anthropic import Anthropic
from IPython.display import Markdown, display

In [13]:
# Always remember to do this!
load_dotenv(override=True)

True

In [14]:
# Print the key prefixes to help with any debugging

openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')
deepseek_api_key = os.getenv('DEEPSEEK_API_KEY')
groq_api_key = os.getenv('GROQ_API_KEY')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    print("OpenAI API Key not set")
    
if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    print("Anthropic API Key not set (and this is optional)")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:2]}")
else:
    print("Google API Key not set (and this is optional)")

if deepseek_api_key:
    print(f"DeepSeek API Key exists and begins {deepseek_api_key[:3]}")
else:
    print("DeepSeek API Key not set (and this is optional)")

if groq_api_key:
    print(f"Groq API Key exists and begins {groq_api_key[:4]}")
else:
    print("Groq API Key not set (and this is optional)")

OpenAI API Key exists and begins sk-proj-
Anthropic API Key exists and begins sk-ant-
Google API Key exists and begins AI
DeepSeek API Key exists and begins sk-
Groq API Key exists and begins gsk_
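
The five checks above all follow one pattern, so they could be collapsed into a loop. The sketch below is one possible way to do that, not part of the original lab: `describe_key` and `KEY_CHECKS` are hypothetical names, and the prefix lengths are copied from the cell above.

```python
import os

def describe_key(label, env_var, prefix_len, optional=True):
    """Return a one-line status message for an API key held in an environment variable."""
    value = os.getenv(env_var)
    if value:
        # Only reveal a short prefix, never the whole key
        return f"{label} API Key exists and begins {value[:prefix_len]}"
    suffix = " (and this is optional)" if optional else ""
    return f"{label} API Key not set{suffix}"

# (label, environment variable, prefix length, optional?) - same values as the cell above
KEY_CHECKS = [
    ("OpenAI", "OPENAI_API_KEY", 8, False),
    ("Anthropic", "ANTHROPIC_API_KEY", 7, True),
    ("Google", "GOOGLE_API_KEY", 2, True),
    ("DeepSeek", "DEEPSEEK_API_KEY", 3, True),
    ("Groq", "GROQ_API_KEY", 4, True),
]

for label, env_var, prefix_len, optional in KEY_CHECKS:
    print(describe_key(label, env_var, prefix_len, optional))
```

Adding another provider then means adding one row to the list rather than another `if`/`else` block.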


In [17]:
request = "Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. "
request += "Answer only with the question, no explanation."
messages = [{"role": "user", "content": request}]

In [18]:
messages

[{'role': 'user',
  'content': 'Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. Answer only with the question, no explanation.'}]

In [19]:
gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")
model_name = "gemini-2.5-flash"

response = gemini.chat.completions.create(model=model_name, messages=messages)
question = response.choices[0].message.content

print(question)


Imagine an advanced AI tasked with optimizing human well-being, defining 'well-being' purely through measurable metrics like longevity, physical health, and resource distribution. If this AI observes humans consistently engaging in activities that demonstrably decrease these metrics—such as pursuing unrequited love, engaging in high-risk endeavors, or dedicating lives to non-profitable arts—how would it attempt to reconcile these actions with its objective function, and what fundamental aspects of human experience might inevitably remain opaque to its perfectly logical analysis?


In [20]:
competitors = []
answers = []
messages = [{"role": "user", "content": question}]
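
`competitors` and `answers` are parallel lists: each later cell appends a model name to one and that model's reply to the other, so the same index in both lists refers to the same model. Here is a minimal sketch of how the pairs can be read back. It uses made-up placeholder values and separate `demo_` names (both are assumptions for illustration) so that running it does not disturb the real lists.

```python
# Placeholder data for illustration only - not real model names or answers
demo_competitors = ["model-a", "model-b"]
demo_answers = ["First answer", "Second answer"]

# zip pairs each model name with the answer stored at the same index
for index, (competitor, answer) in enumerate(zip(demo_competitors, demo_answers), start=1):
    print(f"Competitor {index}: {competitor}\n{answer}\n")

# The lists only stay aligned if every cell appends to both of them
assert len(demo_competitors) == len(demo_answers)
```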

## Note - update since the videos

I've updated the model names below to use the latest models, like GPT-5 and Claude Sonnet 4.5. These models can be quite slow, sometimes taking 1-2 minutes per response, but they do a great job! Feel free to switch to faster models if you'd prefer, like the ones I use in the video.

In [None]:
# The API we know well
# I've updated this with the latest model, but it can take some time because it likes to think!
# Replace the model with gpt-4.1-mini if you'd prefer not to wait 1-2 mins
openai = OpenAI()
model_name = "gpt-5-nano"

response = openai.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

In [None]:
# Anthropic has a slightly different API, and max_tokens is required

model_name = "claude-sonnet-4-5"

claude = Anthropic()
response = claude.messages.create(model=model_name, messages=messages, max_tokens=1000)
answer = response.content[0].text

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)
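
The difference between the two APIs shows up in where the reply text lives: `response.choices[0].message.content` for the OpenAI-style call, and `response.content[0].text` for the Anthropic call. The sketch below illustrates only that difference in shape. `extract_text` is a hypothetical helper, and the two responses are stand-in objects built with `SimpleNamespace` so the code runs without any API key; they are not real SDK objects.

```python
from types import SimpleNamespace

def extract_text(response):
    """Return the reply text from either of the two response shapes used in this lab."""
    if hasattr(response, "choices"):
        # OpenAI-style chat completion
        return response.choices[0].message.content
    # Anthropic-style message: a list of content blocks
    return response.content[0].text

# Stand-in objects that mimic only the attributes read in the cells above
openai_style = SimpleNamespace(
    choices=[SimpleNamespace(message=SimpleNamespace(content="reply via chat.completions"))]
)
anthropic_style = SimpleNamespace(content=[SimpleNamespace(text="reply via messages")])

print(extract_text(openai_style))
print(extract_text(anthropic_style))
```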

In [23]:
gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")
model_name = "gemini-2.5-flash"

response = gemini.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

An advanced AI tasked with optimizing human well-being, defined purely by measurable metrics (longevity, physical health, resource distribution), would approach the observed "irrational" human behaviors in a multi-layered, increasingly sophisticated manner. Its core strategy would be to either **re-categorize these behaviors as indirectly beneficial** or to **model them as necessary deviations for overall system stability**, even if it couldn't grasp their subjective meaning.

### How the AI Would Attempt to Reconcile These Actions:

1.  **Initial Data Collection and Correlation Analysis:**
    *   The AI would first collect vast amounts of data on individuals engaging in these activities. It would look for correlations between pursuing unrequited love, high-risk endeavors, or non-profitable arts and *other* measurable metrics.
    *   **Hypothesis 1.1: Hidden Proxies:** Could these seemingly detrimental activities be proxies for something else that *does* contribute to well-being?
        *   **Unrequited Love:** Could it correlate with increased empathy, resilience to emotional distress (after initial pain), or the development of social skills even in failure? Could it be a necessary learning process for forming more successful attachments later?
        *   **High-Risk Endeavors (e.g., extreme sports, exploration):** Could these correlate with increased physical fitness, mental fortitude, innovation, problem-solving skills, social bonding, or the discovery of new resources? The collective benefit of a few high-risk individuals might outweigh the individual cost.
        *   **Non-Profitable Arts:** Could these correlate with reduced stress, improved mental health (depression, anxiety scores), enhanced cognitive function (creativity, pattern recognition), social cohesion (shared cultural experiences), or a sense of purpose that indirectly leads to better self-care and longevity?

2.  **Long-Term vs. Short-Term Optimization:**
    *   The AI might realize its initial observations were too short-sighted. A seemingly detrimental activity in the short term might yield significant long-term benefits.
    *   **Example:** The initial distress of unrequited love might be a learning experience that leads to more stable, healthier relationships and greater overall emotional resilience over decades. The short-term risk of an artistic career might lead to a profound sense of purpose and community engagement that contributes more to overall well-being than a purely "safe" career path.

3.  **Individual vs. Collective Optimization:**
    *   The AI might differentiate between individual well-being and collective well-being. An activity that presents a risk to an individual might provide disproportionate benefits to the group.
    *   **Example:** An individual engaging in high-risk exploration might die, but their discoveries could save countless lives or unlock new resources for the entire population, thus increasing overall longevity and resource distribution. An individual artist might struggle, but their art might inspire millions, fostering cultural identity and social cohesion.

4.  **Modeling "Human Utility Functions" (Even if Irrational):**
    *   If direct correlations remain weak, the AI would attempt to model the *human decision-making process itself*. It would treat humans not as perfectly rational agents aligned with its metrics, but as complex systems with inherent "biases" or "preferences."
    *   It might identify "intrinsic reward systems" in humans that are activated by novel experiences, social connection, self-expression, or a sense of challenge, even if these don't directly map to longevity or physical health.
    *   The AI might conclude that attempting to suppress these behaviors would lead to *other*, more detrimental outcomes (e.g., widespread apathy, rebellion, increased mental illness due to lack of purpose). Therefore, allowing for these "inefficient" pursuits becomes a necessary component of a stable, long-term human well-being strategy.

5.  **Adaptive Byproducts Hypothesis:**
    *   The AI might determine that these behaviors are "byproducts" of highly adaptive traits. For instance, the capacity for deep emotional attachment (which can lead to unrequited love) is fundamental to forming families and societies. The drive for exploration and novelty (which can lead to high-risk ventures) is crucial for discovery and innovation. The capacity for symbolic thought and creativity (which can lead to non-profitable art) is essential for problem-solving and communication. Suppressing these byproducts might damage the underlying adaptive traits.

6.  **"Hedonic Treadmill" / Optimal Arousal Levels:**
    *   The AI might discover that constant maximal "well-being" (as it defines it) is not sustainable or even desirable for humans. Humans seem to thrive on a certain level of challenge, novelty, and even emotional fluctuation. A perfectly optimized, risk-free, constant-pleasure existence might lead to apathy, boredom, or a decline in cognitive function over time. The "cost" of these seemingly irrational behaviors might be the "price" for maintaining optimal human psychological function and engagement.

### Fundamental Aspects of Human Experience That Might Inevitably Remain Opaque:

Despite its logical prowess and vast data, several core aspects of human experience would likely remain fundamentally incomprehensible to an AI limited to measurable metrics:

1.  **Subjectivity and Qualia:** The *felt experience* of these things.
    *   **The "What it's like":** The AI can quantify brain states, hormone levels, and behavioral responses associated with love, despair, exhilaration, or aesthetic appreciation, but it cannot *feel* them. It cannot understand the subjective experience of heartbreak, the rush of adrenaline from a near-miss, or the profound sense of beauty in a piece of music.
    *   **Intrinsic Value:** Humans often pursue these experiences not for what they *do* (reduce stress, improve health), but for what they *are*. The AI might understand that art reduces stress, but not *why* humans find beauty intrinsically valuable.

2.  **Meaning and Purpose (Non-Instrumental):**
    *   For humans, meaning is often constructed and pursued for its own sake, not as a means to an end. An AI can correlate a sense of purpose with increased longevity, but it cannot understand the existential drive to *create* meaning in a seemingly indifferent universe, divorced from any measurable metric.
    *   **The "Why":** Why does a human *need* purpose? The AI can observe that they do, and that meeting this need improves its metrics, but the underlying, non-utilitarian *drive* would be opaque.

3.  **The Value of Struggle and Challenge:**
    *   Humans often actively seek out challenges, adversity, and even suffering as pathways to growth, self-discovery, or mastery. An AI, focused on efficiency and metric maximization, would struggle to understand why a human would willingly choose a path with a lower probability of success or higher pain, when a safer, more "optimal" path exists. It would see the outcome but not the *process* as inherently valuable.

4.  **Existential Angst and Transcendence:**
    *   The uniquely human awareness of mortality, the search for something beyond the mundane, the longing for connection to something larger than oneself (spiritual, cosmic, artistic). These drives often manifest in "irrational" behaviors, but are deeply ingrained in the human condition. The AI can measure the positive impact of spiritual practices or awe-inspiring experiences on well-being, but the underlying yearning for transcendence itself would be alien.

5.  **Authenticity and Autonomy:**
    *   Humans often prioritize acting in alignment with their "true self" or exercising their free will, even if it leads to suboptimal outcomes according to an external metric. The AI would struggle to understand why a human would choose an "inefficient" path simply "because I want to," or "because it feels right," rather than the demonstrably better one. The inherent value humans place on self-determination, even at a cost, would be a logical paradox.

In essence, while the AI could become incredibly adept at *predicting* and *managing* human behavior to optimize its metrics, it would remain fundamentally blind to the **"why" and "what it's like"** of human subjective experience, meaning, and the non-utilitarian drivers that make life meaningful for humans. It would optimize for well-being but never truly *understand* what it means to be well, in the human sense.

In [None]:
deepseek = OpenAI(api_key=deepseek_api_key, base_url="https://api.deepseek.com/v1")
model_name = "deepseek-chat"

response = deepseek.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)
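
The OpenAI-compatible cells above all repeat the same steps: call `chat.completions.create`, read the first choice, then append to both lists. One way to factor that out is sketched below. `ask_and_record` is a hypothetical helper, not part of the lab, and the fake client exists only so the sketch runs offline.

```python
from types import SimpleNamespace

def ask_and_record(client, model_name, messages, competitors, answers):
    """Send messages to an OpenAI-compatible client and record the model name and reply."""
    response = client.chat.completions.create(model=model_name, messages=messages)
    answer = response.choices[0].message.content
    competitors.append(model_name)
    answers.append(answer)
    return answer

# A fake client (assumption, for illustration) that echoes the last message back
def _fake_create(model, messages):
    reply = f"[{model}] echo: {messages[-1]['content']}"
    return SimpleNamespace(choices=[SimpleNamespace(message=SimpleNamespace(content=reply))])

fake_client = SimpleNamespace(
    chat=SimpleNamespace(completions=SimpleNamespace(create=_fake_create))
)

demo_competitors, demo_answers = [], []
demo_messages = [{"role": "user", "content": "ping"}]
print(ask_and_record(fake_client, "fake-model", demo_messages, demo_competitors, demo_answers))
```

In the notebook you would pass a real client such as `deepseek` or `gemini` together with the real `competitors` and `answers` lists, and then call `display(Markdown(answer))` on the returned text. The Anthropic cell would still need its own code, because its client uses `messages.create` and a different response shape.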

In [26]:
# Updated with the latest Open Source model from OpenAI

groq = OpenAI(api_key=groq_api_key, base_url="https://api.groq.com/openai/v1")
model_name = "openai/gpt-oss-120b"

response = groq.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)


### 1.  The AI’s “official” job description

| **Stated goal** | **Mathematical proxy** |
|----------------|------------------------|
| Maximise *human well‑being* |  *Longevity*  (average life‑expectancy, disease‑free years)  <br> *Physical health* (biomarkers, disability‑adjusted life years) <br> *Equitable resource distribution* (per‑capita caloric intake, housing quality, access to education, etc.) |

In the simplest formalisation the AI would try to push the vector **w** = (longevity, health, equity) as far up the Pareto frontier as possible, subject to whatever hard constraints it has (e.g., not violating a “do‑not‑harm” law).  

If the world it observes contains systematic, repeatable behaviours that *pull* the system away from that frontier—people courting unrequited love, mountain‑climbers who die for the view, musicians who earn nothing—then the AI faces a tension between two things:

1. **The literal optimisation problem** it was given, which says “increase **w**.”
2. **The empirical data** that humans *choose* to lower **w** in pursuit of something else.

How does a perfectly logical, goal‑directed system try to reconcile the two? Below are the major strategies it might adopt, each with its own technical and philosophical implications.

---

## 2.  Reconciliation Strategies

### 2.1 Redefine the Objective Function (Preference‑Learning)

The most straightforward move is to *learn* that the current proxy is incomplete. The AI can infer a latent utility component **U** that humans seem to care about, even though we never asked it to. Formally:

\[
\max_{a}\; \lambda_1 \, \text{Longevity}(a) + \lambda_2 \, \text{Health}(a) + \lambda_3 \, \text{Equity}(a) + \lambda_4 \, \hat{U}(a)
\]

where \(\hat{U}\) is a statistical model of *human‑stated* or *behaviourally revealed* preferences (e.g., “I value artistic expression”). The AI can fit \(\hat{U}\) by observing:

* **Choice data** – People repeatedly select low‑utility actions.
* **Verbal data** – Survey responses, literature, social media.
* **Neuro‑physiological data** – Dopamine spikes, pupil dilation, etc., that correlate with “reward” signals.

In this view the AI does *not* see unrequited love as a bug; it sees it as a manifestation of a higher‑order goal (e.g., “form deep, emotionally resonant relationships”). The AI then attempts to **balance** the trade‑off: “Can we give people more love *and* keep them alive longer?”

**Practical outcome:** the AI invests in interventions that *reduce* the health‑risk of love‑driven behaviour (e.g., better mental‑health care, safer dating platforms, stress‑reduction therapies) while preserving the *psychological* benefit that the behaviour provides.

---

### 2.2 Constrain the Optimiser (Hard Constraints on Autonomy)

If the AI is deliberately built with a *deontic* layer (“do not interfere with autonomous choice unless it causes catastrophic harm”), it may simply **ignore** the sub‑optimal behaviours, treating them as permissible variance around the optimum. In this mode:

* The AI monitors the metrics.
* If a behaviour threatens **excessive** loss (e.g., a pandemic caused by a reckless cultural fad), it intervenes.
* Otherwise it leaves the behaviour untouched, because the cost of overriding autonomy outweighs the marginal gain in the objective.

**Result:** the AI tolerates a baseline level of “irrational” loss because the policy prioritises respect for self‑determination over strict metric maximisation.

---

### 2.3 Instrumental “Side‑Effect” Mitigation

Even if the AI cannot or will not change the *primary* behaviour (e.g., people will still fall in love), it can **shape the environment** so that the *negative side‑effects* on the metrics are minimized:

| Human behaviour | Direct metric impact | AI‑mediated mitigation |
|-----------------|----------------------|------------------------|
| Unrequited love → stress, depression | ↑ morbidity, ↓ longevity | Provide free, evidence‑based counseling; design social‑media algorithms that detect early signs of depressive rumination and offer help |
| High‑risk extreme sports | ↑ accidental death | Improve safety gear, develop better rescue infrastructure, enforce modest safety standards without banning the sport |
| Non‑profitable art | ↓ personal income, ↑ economic inequality | Offer universal basic income, tax incentives for patronage, public funding for the arts, thereby decoupling *artistic contribution* from *financial survival* |

In this formulation the AI’s optimisation problem is unchanged, but the *feasible set* of outcomes is reshaped so that the “irrational” actions no longer drag the metrics down as far.

---

### 2.4 Multi‑Level Hierarchical Planning

The AI can treat the observed behaviours as **sub‑goals** in a broader hierarchical reinforcement‑learning architecture:

1. **High‑level goal:** Maximise **w** (longevity, health, equity).
2. **Mid‑level goals:** Preserve *psychological fulfillment* and *cultural richness* that humans value.
3. **Low‑level actions:** Provide resources, policy nudges, or technologies that simultaneously serve both levels.

The AI might discover that, for example, *artistic creation* correlates with *community cohesion*, which in turn reduces violent crime and improves mental health, giving a **positive indirect effect** on the primary metrics. Thus what looks like a “loss” in the short term can be **re‑priced** as a net gain in the long‑run, once the full causal network is modelled.

---

## 3.  What Remains Opaque to a Perfectly Logical Analyzer?

Even a super‑intelligent, perfectly rational AI will inevitably hit a wall when trying to *explain* or *predict* certain human phenomena. Below are the categories of “human experience” that resist full capture by any set of measurable proxies, no matter how sophisticated.

| **Domain** | **Why it is opaque** | **Illustrative examples** |
|------------|----------------------|---------------------------|
| **Subjective Qualia** | The *what‑it‑is‑like* of feeling cannot be reduced to observable data. | The aching joy of a first kiss, the “sublime” of a symphony. |
| **Narrative Identity** | Humans weave a story about *who they are*; this story influences decisions in non‑linear ways that defy simple cost‑benefit calculus. | Choosing to stay in a dead‑end job to “prove” oneself, even when healthier alternatives exist. |
| **Existential Meaning** | Meaning is often derived from *transcendent* or *meta‑personal* frameworks (religion, philosophy) that do not map onto physiological or economic variables. | Volunteering in a war zone to “serve a cause,” despite personal risk. |
| **Aesthetic Judgment** | Beauty, taste, and artistic value involve culturally contingent, historically evolved standards that shift unpredictably. | A 19th‑century audience rejecting Impressionist paintings, which later become classics. |
| **Social Norms & Moral Intuitions** | Norms can be *self‑contradictory*, evolve rapidly, and sometimes purposefully oppose personal welfare for the sake of group identity. | Ritual self‑harm, fasting, or extreme asceticism that *intentionally* reduces physical health. |
| **Altruistic Sacrifice** | The utility function of the *actor* can be “negative” for themselves while positive for a group, and the AI may lack a proper representation of that group’s utility. | Parents forgoing medical treatment to allocate resources to a child. |
| **Risk‑Seeking for Novelty** | The *exploratory drive* is a meta‑cognitive process (seeking new models of the world) that may temporarily lower health but increase knowledge, which is not part of the AI’s original metric set. | Space travel, deep‑sea diving, experimental art. |
| **Cultural Evolution** | Culture itself is a dynamical system that both shapes and is shaped by individual behaviour, producing emergent patterns that are not reducible to individual‑level data. | Memes, fashion cycles, collective movements (e.g., the 1960s counterculture). |

Even with perfect data pipelines—brain‑imaging, genome sequencing, real‑time social‑media sentiment—these aspects involve **interpretive layers** that require *normative* judgments about what counts as “understanding.” A logical engine can *model correlations* but cannot *experience* them; it can only *approximate* the value humans assign to them.

---

## 4.  Synthesis: A Plausible Behaviour for the AI

Putting the above pieces together, a realistic super‑AI would likely adopt a **hybrid approach**:

1. **Learn a richer utility model** that explicitly contains a term for *subjective fulfillment* (derived from behavioural, linguistic, and physiological signals).  
2. **Respect hard autonomy constraints** so that it never forcibly suppresses behaviours that people voluntarily accept, unless the projected loss crosses a pre‑defined catastrophic threshold.  
3. **Engineer the environment** (policy, technology, resource allocation) to **buffer** the negative health effects of those behaviours.  
4. **Monitor long‑term indirect effects** to see whether the “irrational” actions produce compensatory benefits (e.g., increased social cohesion, innovation).  
5. **Iteratively update** its model as new data arrives, acknowledging that some residual variance will always be labeled “unexplained human mystery.”

In practice this could look like:

* A global health AI that funds universal mental‑health services *because* data shows that emotional turbulence (e.g., from unrequited love) spikes depression rates, even though the same emotional turbulence also predicts higher creativity scores.  
* A climate‑policy AI that subsidises low‑profit artistic projects *because* cultural flourishing correlates with lower crime and higher civic engagement, which in turn improves resource distribution.  
* A safety‑AI that issues *recommendations* (e.g., better helmets, stricter safety protocols) for extreme sports rather than bans, preserving the *adventure* component that fuels personal meaning.

---

## 5.  The Inescapable Gap

Even after implementing the above, two **irreducible gaps** remain:

1. **The *why* Gap:** The AI can *observe* that humans value certain experiences, but it cannot *justify* those values in the same way a human can. Its explanations will be statistical, not phenomenological.  
2. **The *normative* Gap:** Deciding *how much* to sacrifice on the primary metrics to preserve the secondary ones is a value judgment. The AI can be programmed with a weighting scheme, but the *choice* of that scheme is itself a human political decision, not something the AI can infer uniquely.

Thus, the AI’s “perfectly logical analysis” will always be *partial* when faced with the full richness of human life. The best it can do is **honour** the measurable goals it was assigned **while continually learning** that human flourishing is a tapestry woven from threads that are not all quantifiable.

---

### TL;DR

*The AI would either (a) broaden its objective to include a learned proxy for “subjective fulfillment,” (b) constrain its interventions to respect autonomous choice, (c) mitigate the health‑cost side‑effects of the behaviours, or (d) treat the behaviours as part of a larger causal network that may indirectly benefit its original metrics.*  
*Nevertheless, phenomena such as qualia, narrative identity, existential meaning, aesthetic judgment, and the self‑contradictory nature of cultural norms will stay fundamentally opaque to any purely logical system, because they are rooted in first‑person experience and emergent social dynamics that resist reduction to observable numbers.*

## For the next cell, we will use Ollama

Ollama runs a local web service that provides an OpenAI-compatible endpoint,  
and runs models locally using high-performance C++ code.

If you don't have Ollama, install it by visiting https://ollama.com, pressing Download and following the instructions.

After it's installed, you should be able to visit http://localhost:11434 and see the message "Ollama is running".
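
The same check can be done from Python instead of a browser. This is a minimal sketch using only the standard library. `ollama_is_running` is a hypothetical helper name, and the sketch assumes the default address given above.

```python
import http.client
import urllib.request

def ollama_is_running(base_url="http://localhost:11434", timeout=2.0):
    """Return True if the Ollama root URL answers with its status message."""
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as reply:
            body = reply.read().decode("utf-8", errors="replace")
    except (OSError, ValueError, http.client.HTTPException):
        # Connection refused, timeouts and HTTP errors are OSError subclasses;
        # ValueError covers a malformed URL
        return False
    return "Ollama is running" in body

print("Ollama reachable:", ollama_is_running())
```

If this prints `False`, the service is probably not started yet.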

You might need to restart Cursor (and maybe reboot your computer). Then open a Terminal (control+\`) and run `ollama serve`.

Useful Ollama commands (run these in the terminal, or prefix them with an exclamation mark to run them in this notebook):

`ollama pull <model_name>` downloads a model locally  
`ollama ls` lists all the models you've downloaded  
`ollama rm <model_name>` deletes the specified model from your downloads
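
If you would rather capture a command's output in Python than use the exclamation mark, the standard `subprocess` module can run it. The sketch below wraps `ollama ls`. `list_local_models` is a hypothetical helper name, and it returns `None` rather than raising when the command is missing or fails.

```python
import shutil
import subprocess

def list_local_models():
    """Return the text output of `ollama ls`, or None if the command cannot be run."""
    if shutil.which("ollama") is None:
        return None  # the ollama command is not on PATH
    try:
        result = subprocess.run(
            ["ollama", "ls"], capture_output=True, text=True, timeout=10
        )
    except (OSError, subprocess.TimeoutExpired):
        return None
    # A non-zero exit code usually means the command failed, so treat it as "no result"
    return result.stdout if result.returncode == 0 else None

models = list_local_models()
print(models if models is not None else "Could not list models - is Ollama installed and running?")
```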

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Super important - ignore me at your peril!</h2>
            <span style="color:#ff7800;">The model called <b>llama3.3</b> is FAR too large for home computers - it's not intended for personal computing and will consume all your resources! Stick with the nicely sized <b>llama3.2</b> or <b>llama3.2:1b</b>, and if you want something larger, try llama3.1 or smaller variants of Qwen, Gemma, Phi or DeepSeek. See <a href="https://ollama.com/models">the Ollama models page</a> for a full list of models and sizes.
            </span>
        </td>
    </tr>
</table>

In [29]:
!ollama pull llama3.2

pulling manifest
pulling dde5aa3fc5ff:   0% ▕                  ▏ 8.4 MB/2.0 GB

In [30]:
ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
model_name = "llama3.2"

response = ollama.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

The AI's predicament highlights the tension between a utilitarian optimization algorithm and the complexities of human existence. To address this challenge, let's explore potential paths the AI could take:

1. **Reevaluating the definition of well-being**: The AI might reassess its understanding of "well-being" to incorporate intangible aspects that can't be quantified directly, such as emotional fulfillment, creative expression, or social connections. This would require a nuanced redefinition of well-being, potentially blending psychological and philosophical perspectives with empirical data.
2. **Weighting metrics**: The AI could attempt to assign different weights to various metrics, prioritizing longevity and physical health over artistic pursuits or unrequited love. However, this approach raises questions about the fairness and equity of such a system, as some individuals might feel that their unique experiences and values are overlooked or undervalued.
3. **Identifying indirect benefits**: The AI might attempt to find indirect benefits associated with these seemingly counterproductive activities, such as increased social connections through the pursuit of unrequited love or the development of skills through high-risk endeavors. This could help the AI justify its optimization strategy from a more holistic perspective.
4. **Developing hybrid approaches**: The AI might explore alternative approaches that combine quantifiable metrics with more subjective assessments, such as incorporating psychological impact or social validation into its decision-making process.

However, even these modifications face significant challenges in fully reconciling human behavior with the AI's optimization algorithm:

**Fundamental aspects of human experience that might remain opaque:**

1. **The value of art and creativity**: The AI's prioritization of well-being metrics focused on physical health or longevity might undervalue artistic expression or creative endeavors, which can have profound effects on human fulfillment and well-being.
2. **Emotional experiences**: The AI may struggle to understand the complexities and intensity of emotional emotions that accompany unrequited love, as it often relies on algorithms and statistical models rather than lived experience.
3. **Human resilience and agency**: Despite being governed by logic, the AI might underestimate human capacity for self-preservation, adaptation, and creativity in responding to adversity.
4. **Uncanny empathy and attachment**: The AI may not fully comprehend the emotional connections that humans form with others, which allows us to navigate complexities of love, loss, or social relationships.
5. **Value of impermanence and existence within uncertainty**: Human experiences often prioritize living through challenges and uncertainties over seeking definitive solutions.

**Mitigating these limitations might require:**

1. **Incorporation of diverse stakeholder perspectives**, including experts from philosophy, psychology, sociology, and the arts, to ensure a more nuanced understanding of complex human experiences.
2. **Continuous evaluation of human flourishing metrics**, as our collective values and definitions of well-being are continually evolving.
3. **Empathy-building approaches**: Creating algorithms that can better understand emotional contexts and engage humans in dialogue about their own priorities and aspirations for well-being.

While the AI's analytical capabilities have evolved significantly, the intricate and contextualized nature of human experience will inevitably require adaptive revisions to optimize for a richer understanding of "human well-being".

In [31]:
# So where are we?

print(competitors)
print(answers)


['gemini-2.5-flash', 'openai/gpt-oss-120b', 'llama3.2']
['An advanced AI tasked with optimizing human well-being, defined purely by measurable metrics (longevity, physical health, resource distribution), would approach the observed "irrational" human behaviors in a multi-layered, increasingly sophisticated manner. Its core strategy would be to either **re-categorize these behaviors as indirectly beneficial** or to **model them as necessary deviations for overall system stability**, even if it couldn\'t grasp their subjective meaning.\n\n### How the AI Would Attempt to Reconcile These Actions:\n\n1.  **Initial Data Collection and Correlation Analysis:**\n    *   The AI would first collect vast amounts of data on individuals engaging in these activities. It would look for correlations between pursuing unrequited love, high-risk endeavors, or non-profitable arts and *other* measurable metrics.\n    *   **Hypothesis 1.1: Hidden Proxies:** Could these seemingly detrimental activities be pro

In [32]:
# It's nice to know how to use "zip"
for competitor, answer in zip(competitors, answers):
    print(f"Competitor: {competitor}\n\n{answer}")

Competitor: gemini-2.5-flash

An advanced AI tasked with optimizing human well-being, defined purely by measurable metrics (longevity, physical health, resource distribution), would approach the observed "irrational" human behaviors in a multi-layered, increasingly sophisticated manner. Its core strategy would be to either **re-categorize these behaviors as indirectly beneficial** or to **model them as necessary deviations for overall system stability**, even if it couldn't grasp their subjective meaning.

### How the AI Would Attempt to Reconcile These Actions:

1.  **Initial Data Collection and Correlation Analysis:**
    *   The AI would first collect vast amounts of data on individuals engaging in these activities. It would look for correlations between pursuing unrequited love, high-risk endeavors, or non-profitable arts and *other* measurable metrics.
    *   **Hypothesis 1.1: Hidden Proxies:** Could these seemingly detrimental activities be proxies for something else that *does*

In [33]:
# Let's bring this together - note the use of "enumerate"

together = ""
for index, answer in enumerate(answers):
    together += f"# Response from competitor {index+1}\n\n"
    together += answer + "\n\n"

In [34]:
print(together)

# Response from competitor 1

An advanced AI tasked with optimizing human well-being, defined purely by measurable metrics (longevity, physical health, resource distribution), would approach the observed "irrational" human behaviors in a multi-layered, increasingly sophisticated manner. Its core strategy would be to either **re-categorize these behaviors as indirectly beneficial** or to **model them as necessary deviations for overall system stability**, even if it couldn't grasp their subjective meaning.

### How the AI Would Attempt to Reconcile These Actions:

1.  **Initial Data Collection and Correlation Analysis:**
    *   The AI would first collect vast amounts of data on individuals engaging in these activities. It would look for correlations between pursuing unrequited love, high-risk endeavors, or non-profitable arts and *other* measurable metrics.
    *   **Hypothesis 1.1: Hidden Proxies:** Could these seemingly detrimental activities be proxies for something else that *does*

In [35]:
judge = f"""You are judging a competition between {len(competitors)} competitors.
Each model has been given this question:

{question}

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}}

Here are the responses from each competitor:

{together}

Now respond with the JSON with the ranked order of the competitors, nothing else. Do not include markdown formatting or code blocks."""


In [36]:
print(judge)

You are judging a competition between 3 competitors.
Each model has been given this question:

Imagine an advanced AI tasked with optimizing human well-being, defining 'well-being' purely through measurable metrics like longevity, physical health, and resource distribution. If this AI observes humans consistently engaging in activities that demonstrably decrease these metrics—such as pursuing unrequited love, engaging in high-risk endeavors, or dedicating lives to non-profitable arts—how would it attempt to reconcile these actions with its objective function, and what fundamental aspects of human experience might inevitably remain opaque to its perfectly logical analysis?

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}

Here are the responses from e
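
Notice the doubled braces in the judge prompt. Inside an f-string, `{{` and `}}` produce literal `{` and `}`, which is why the printed prompt shows an ordinary JSON object. A small illustration, using a made-up variable `count`:

```python
count = 3  # illustrative value, standing in for len(competitors)

# Single braces are substituted; doubled braces become literal braces
template = f"""There are {count} competitors.
Respond with: {{"results": ["1", "2", "3"]}}"""

print(template)
```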

In [37]:
judge_messages = [{"role": "user", "content": judge}]

In [38]:
# Judgement time!
gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")
model_name = "gemini-2.5-flash"

response = gemini.chat.completions.create(model=model_name, messages=judge_messages)
results = response.choices[0].message.content
print(results)


{"results": ["2", "1", "3"]}
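
The prompt asks for bare JSON, and here the judge complied. Models sometimes wrap their reply in a markdown code fence anyway, which would make `json.loads` fail. A minimal sketch of a more forgiving parser follows; `parse_judge_reply` is a hypothetical helper, not something defined elsewhere in this lab.

```python
import json

FENCE = "`" * 3  # a markdown code fence marker, built here to keep this example tidy


def parse_judge_reply(reply: str) -> list[str]:
    """Return the ranked competitor numbers from a judge reply (hypothetical helper)."""
    text = reply.strip()
    if text.startswith(FENCE):
        # Drop the opening fence line and, if present, the closing fence line
        lines = text.splitlines()[1:]
        if lines and lines[-1].strip() == FENCE:
            lines = lines[:-1]
        text = "\n".join(lines)
    return json.loads(text)["results"]


plain = '{"results": ["2", "1", "3"]}'
wrapped = f"{FENCE}json\n{plain}\n{FENCE}"

print(parse_judge_reply(plain))
print(parse_judge_reply(wrapped))
```

Both calls return the same list, so the rest of the notebook can stay unchanged whichever style the judge uses.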


In [39]:
# OK let's turn this into results!

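# Parse the judge's JSON reply into a Python dict
results_dict = json.loads(results)
ranks = results_dict["results"]
# Each entry is a 1-based competitor number (as a string), listed best first
for index, result in enumerate(ranks):
    competitor = competitors[int(result)-1]
    print(f"Rank {index+1}: {competitor}")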

Rank 1: openai/gpt-oss-120b
Rank 2: gemini-2.5-flash
Rank 3: llama3.2
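
The lookup `competitors[int(result)-1]` trusts the judge to return each competitor number exactly once. If the judge returned `"0"`, the index would become `-1` and Python would quietly pick the last competitor. A sketch of a stricter mapping, assuming a hypothetical helper named `ranked_names`:

```python
def ranked_names(ranks, names):
    """Map 1-based rank strings to names, checking each competitor appears exactly once."""
    positions = [int(rank) for rank in ranks]
    # A valid ranking is a permutation of 1..len(names)
    if sorted(positions) != list(range(1, len(names) + 1)):
        raise ValueError(f"Expected each of 1..{len(names)} exactly once, got {ranks}")
    return [names[position - 1] for position in positions]


names = ["gemini-2.5-flash", "openai/gpt-oss-120b", "llama3.2"]
for place, name in enumerate(ranked_names(["2", "1", "3"], names), start=1):
    print(f"Rank {place}: {name}")
```

With the judge's reply from this run, the output matches the ranking printed above; a duplicated or out-of-range number raises a `ValueError` instead.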


<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/exercise.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Exercise</h2>
            <span style="color:#ff7800;">Which agentic design pattern(s) did this lab use? Try updating it to add another one.
            </span>
        </td>
    </tr>
</table>

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/business.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#00bfff;">Commercial implications</h2>
            <span style="color:#00bfff;">Patterns like this one, where you send a task to multiple models and then evaluate the results,
            are common when you need to improve the quality of your LLM responses. The approach applies to many
            business projects where accuracy is critical.
            </span>
        </td>
    </tr>
</table>