## Welcome to the Second Lab - Week 1, Day 3

Today we will work with lots of models! Sending the same question to each one is a good way to get comfortable with calling APIs.

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Important point - please read</h2>
            <span style="color:#ff7800;">The way I collaborate with you may be different from other courses you've taken. I prefer not to type code while you watch. Rather, I execute Jupyter Labs, like this one, and give you an intuition for what's going on. My suggestion is that you carefully execute this yourself, <b>after</b> watching the lecture. Add print statements to understand what's going on, and then come up with your own variations.<br/><br/>If you have time, I'd love it if you submitted a PR for your changes in the community_contributions folder (instructions are in the resources). Also, if you have a GitHub account, use it to showcase your variations. Not only is this essential practice, but it also demonstrates your skills to others, including perhaps future clients or employers...
            </span>
        </td>
    </tr>
</table>

In [1]:
# Start with imports - ask ChatGPT to explain any package that you don't know

import os
import json
from dotenv import load_dotenv
from openai import OpenAI
from anthropic import Anthropic
from IPython.display import Markdown, display

In [2]:
# Always remember to do this!
load_dotenv(override=True)

python-dotenv could not parse statement starting at line 6
python-dotenv could not parse statement starting at line 7


True
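
The two `python-dotenv could not parse statement` warnings above point at lines 6 and 7 of the `.env` file. Here is a minimal sketch for spotting such lines. It assumes a simplified rule (a line should be blank, a `#` comment, or `KEY=value` with an optional `export` prefix), which is an approximation and not python-dotenv's actual grammar. `find_suspect_lines` is a hypothetical helper, and it reports only line numbers so that no secret values get printed.

```python
import re

# Simplified approximation of an acceptable .env line (an assumption, not
# python-dotenv's real parser): optional "export", a name, then "=".
ASSIGNMENT = re.compile(r"^\s*(?:export\s+)?[A-Za-z_][A-Za-z0-9_]*\s*=")


def find_suspect_lines(env_text):
    """Return 1-based numbers of lines that are not blank, a comment, or KEY=value."""
    suspects = []
    for number, line in enumerate(env_text.splitlines(), start=1):
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue  # blank lines and comments are fine
        if not ASSIGNMENT.match(line):
            suspects.append(number)
    return suspects


# Made-up sample content; line 3 uses ":" instead of "="
sample = (
    "OPENAI_API_KEY=sk-example\n"
    "# a comment\n"
    "GOOGLE_API_KEY: not-an-assignment\n"
    "\n"
    "export GROQ_API_KEY=gsk_example\n"
)
print(find_suspect_lines(sample))  # [3]
```

To check a real file, pass its text instead of `sample`, for example `find_suspect_lines(open(".env").read())`.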

In [None]:
# Print the key prefixes to help with any debugging

openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')
deepseek_api_key = os.getenv('DEEPSEEK_API_KEY')
groq_api_key = os.getenv('GROQ_API_KEY')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    print("OpenAI API Key not set")
    
if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    print("Anthropic API Key not set (and this is optional)")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:2]}")
else:
    print("Google API Key not set (and this is optional)")

if deepseek_api_key:
    print(f"DeepSeek API Key exists and begins {deepseek_api_key[:3]}")
else:
    print("DeepSeek API Key not set (and this is optional)")

if groq_api_key:
    print(f"Groq API Key exists and begins {groq_api_key[:4]}")
else:
    print("Groq API Key not set (and this is optional)")
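
The five `if`/`else` blocks above differ only in the variable name, the label, the prefix length, and whether the key is optional. As a sketch, the same report can be driven from a small table. `KEYS` and `describe_key` are hypothetical names introduced here for illustration; the prefix lengths are copied from the cell above.

```python
import os

# (environment variable, label, prefix length, optional?) copied from the cell above
KEYS = [
    ("OPENAI_API_KEY", "OpenAI", 8, False),
    ("ANTHROPIC_API_KEY", "Anthropic", 7, True),
    ("GOOGLE_API_KEY", "Google", 2, True),
    ("DEEPSEEK_API_KEY", "DeepSeek", 3, True),
    ("GROQ_API_KEY", "Groq", 4, True),
]


def describe_key(label, value, prefix_len, optional):
    """Build the same status message as the if/else blocks above."""
    if value:
        return f"{label} API Key exists and begins {value[:prefix_len]}"
    suffix = " (and this is optional)" if optional else ""
    return f"{label} API Key not set{suffix}"


for env_name, label, prefix_len, optional in KEYS:
    print(describe_key(label, os.getenv(env_name), prefix_len, optional))
```

Adding another provider then means adding one row to `KEYS` rather than another `if`/`else` block.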

In [3]:
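# Read the endpoint and key for the Azure-hosted deployment from the environment
endpoint = os.getenv('AZURE_OPENAI_ENDPOINT')
api_key = os.getenv('AZURE_OPENAI_API_KEY')

# Every model in this lab is called through this one client
client = OpenAI(
    base_url=endpoint,
    api_key=api_key
)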
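
`os.getenv` returns `None` for a variable that is not set, which can lead to confusing errors further on. A minimal sketch of a fail-fast check that could run before the client is created follows. `require_env` is a hypothetical helper, not part of the `openai` package, and the endpoint and key in the example are placeholders.

```python
import os


def require_env(names, env=None):
    """Return the values for `names`, raising if any is unset or empty."""
    env = os.environ if env is None else env
    missing = [name for name in names if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return [env[name] for name in names]


# Example with an explicit mapping (placeholder values) instead of the real environment
endpoint, api_key = require_env(
    ["AZURE_OPENAI_ENDPOINT", "AZURE_OPENAI_API_KEY"],
    env={
        "AZURE_OPENAI_ENDPOINT": "https://example.invalid/v1",
        "AZURE_OPENAI_API_KEY": "placeholder",
    },
)
print(endpoint)
```

In the notebook, calling `require_env([...])` without the `env` argument would read from the real environment loaded by `load_dotenv`.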

In [4]:
request = "Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. "
request += "Answer only with the question, no explanation."
messages = [{"role": "user", "content": request}]

In [5]:
messages

[{'role': 'user',
  'content': 'Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. Answer only with the question, no explanation.'}]

In [6]:

response = client.chat.completions.create(
    model="gpt-oss-120b",
    messages=messages
)

question = response.choices[0].message.content
print(question)

Consider a hypothetical society in which every person can only communicate using 8‑bit binary strings, and the semantic meaning of each string is not fixed but changes daily according to a hidden deterministic rule that depends solely on the calendar date. On March 14, 2025 the binary string “10101010” conveys the meaning “hello,” while on March 15, 2025 the same string conveys “goodbye.”  

**Question:** Propose a plausible, mathematically well‑defined algorithm that, given any calendar date, maps each of the 256 possible 8‑bit strings to a distinct lexical item (e.g., a word or concept) in a way that is reversible and allows every member of the society to infer the correct mapping for any given day without external reference. Explain how such a system could sustain coherent communication despite the daily semantic shift, addressing issues of learning, ambiguity, and error correction.


In [7]:

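# Ask again with the same request; the reply differs from run to run, and this
# latest question is the one sent to the models below
response = client.chat.completions.create(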
    model="gpt-oss-120b",
    messages=messages,
)
question = response.choices[0].message.content
print(question)


A seasoned detective is investigating a series of three seemingly unrelated crimes: (1) a rare book is stolen from a locked library vault without any signs of forced entry; (2) a priceless painting is found hidden inside a hollowed-out sculpture in a museum that was under constant surveillance; (3) a high‑security data server is compromised, yet the logs show no unauthorized access attempts. Each crime scene contains a single clue: a red feather, a torn piece of a blue envelope, and a half‑eaten almond. Assuming the perpetrator is the same individual in all three cases and that each clue is intentionally placed to mislead investigators, determine the most plausible motive behind the crimes, the likely profession or expertise of the perpetrator, and the hidden connection among the three stolen items. Explain your reasoning step by step.


In [8]:
competitors = []
answers = []
messages = [{"role": "user", "content": question}]

## Note - update since the videos

I've updated the model names to use the latest models, like GPT 5 and Claude Sonnet 4.5. It's worth noting that these models can be quite slow, taking 1-2 minutes to respond, but they do a great job! Feel free to switch them for faster models if you'd prefer, like the ones I use in the video.
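
Each competitor cell in this lab repeats the same steps: call `client.chat.completions.create`, read `response.choices[0].message.content`, then append to `competitors` and `answers`. That pattern can be sketched as a loop, which also makes swapping in faster models a one-line change. `collect_answers` is a hypothetical helper, and `FakeCompletions` is a stand-in that mimics only the response attributes used here, so the sketch runs without network access or API keys.

```python
from types import SimpleNamespace


def collect_answers(client, model_names, messages):
    """Ask each model the same question; return parallel lists of names and answers."""
    competitors, answers = [], []
    for model_name in model_names:
        response = client.chat.completions.create(model=model_name, messages=messages)
        competitors.append(model_name)
        answers.append(response.choices[0].message.content)
    return competitors, answers


class FakeCompletions:
    """Stand-in for client.chat.completions so the sketch runs offline."""

    def create(self, model, messages):
        text = f"{model} answering: {messages[-1]['content']}"
        message = SimpleNamespace(content=text)
        return SimpleNamespace(choices=[SimpleNamespace(message=message)])


fake_client = SimpleNamespace(chat=SimpleNamespace(completions=FakeCompletions()))
names, replies = collect_answers(
    fake_client,
    ["gpt-oss-120b", "Mistral-Large-3"],
    [{"role": "user", "content": "What is 2 + 2?"}],
)
for name, reply in zip(names, replies):
    print(name, "->", reply)
```

With the real `client` from earlier in the lab, passing it in place of `fake_client` should behave the same way, at the cost of seeing every answer only after all the calls have finished.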

In [9]:
# The API we know well
# I've updated this with the latest model, but it can take some time because it likes to think!
# Replace the model with gpt-4.1-mini if you'd prefer not to wait 1-2 mins


model_name = "gpt-oss-120b"

response = client.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

**The short answer**

* **Motive:**  To assemble the three complementary pieces of a single, carefully‑protected secret – a set of coordinates / a decryption key that will unlock a priceless hidden treasure (or a piece of extremely valuable intelligence).  
* **Who the criminal is:**  A former intelligence / cryptanalysis specialist who has turned his knowledge of both physical security (art‑theft, vault‑breaking) and digital security (hacking, steganography) into a professional “high‑value‑information thief.”  
* **How the three stolen objects are linked:**  Each of the objects – the rare book, the painting concealed in a sculpture, and the data server – contains one‑third of the same encoded message.  When the three halves are brought together the message can be decoded and the location of the hidden prize is revealed.

Below is a step‑by‑step reconstruction of how that conclusion is reached from the facts that were presented.

---

## 1.  What the three clues really say  

| Clue found on the scene | Literal reading | What the clue *actually* hints at |
|--------------------------|-----------------|------------------------------------|
| **Red feather**          | A bird feather, “red.” | **Red herring** – a deliberate false lead. |
| **Torn piece of a blue envelope** | A fragment of paper, blue. | **Paper‑trail / “mail”** – something that is meant to be *sent* or *delivered* elsewhere. |
| **Half‑eaten almond**    | A nut that has been cracked. | **A nut to be cracked** – a puzzle, a code, something that must be broken. |

The detective quickly recognises that each object is a classic *misdirection* (red‑herring) that also points to the *real* nature of the crime: **a hidden puzzle that must be cracked and then delivered**.  The colour “blue” is not random – in cryptographic jargon “blue‑paper” is often used to describe a *one‑time‑pad* or a *paper‑based key*.

Thus the three clues together form a *meta‑message*:  

> **“Red herring – a paper key – a nut to crack.”**  

In other words: *Ignore the surface, look for a paper‑based cipher that needs to be cracked*.

---

## 2.  Why the three thefts fit that meta‑message  

### a) The rare book stolen from a locked vault  

* A “rare book” is the classic carrier of a **book‑cipher**.  
* The vault was opened without a forced entry – the only plausible way is that the thief already possessed the *combination* (i.e., the key).  
* In a book‑cipher the *key* is the book itself; stealing it gives the thief the only thing that can decode the cipher text hidden elsewhere.

### b) The priceless painting hidden inside a hollowed‑out sculpture  

* The painting was concealed inside a sculpture that was under constant visual surveillance – the only way to get it out without being seen is to *replace* it with something that looks identical.  
* The painting itself is a perfect vehicle for **steganography** (hidden images, invisible inks, micro‑dots).  
* By taking the painting the thief also removes the *steganographic carrier* that holds another third of the secret.

### c) The high‑security data server that shows no unauthorized logins  

* Modern servers can hide data in *file‑system metadata*, *unused disk sectors* or in *encrypted containers* that only reveal themselves when the correct *key* (the other two pieces) is supplied.  
* The absence of any login attempts tells us the breach was *pass‑through* – the attacker simply “plugged in” a device that already possessed the decryption key (the stolen book and painting).  

All three crime scenes therefore contain **one component of a three‑part encoded message**:

| Component | How it stores the secret |
|-----------|--------------------------|
| Rare book | Textual book‑cipher key (the “paper” part). |
| Painting  | Visual steganographic layer (the “image” part). |
| Server    | Digital ciphertext (the “data” part). |

When the three are combined, the ciphertext on the server can be decrypted **using the book as the one‑time‑pad** and **the painting’s hidden image as the final checksum**.  The result is a set of GPS coordinates (or a pass‑phrase) that points to a fourth, hidden cache – the real prize.

---

## 3.  Who could pull off all three crimes?  

| Needed skill | Typical profession / expertise |
|--------------|--------------------------------|
| **Vault‑breaking / art‑theft** – knowledge of physical security, ability to move priceless objects without triggering alarms. | Professional art thief, former museum security consultant, or “cat‑ burglar.” |
| **Steganography & cryptanalysis** – extracting hidden data from paintings, creating and reading book‑ciphers. | Cryptographer / intelligence analyst, especially one with a background in *signals intelligence* (SIGINT) or *historical code‑breaking*. |
| **Network intrusion without logs** – using custom firmware, hardware implants, or “air‑gap” attacks that bypass normal authentication. | Cyber‑security specialist / hacker, preferably one who has worked in a government or corporate “red‑team” capacity. |

The only realistic profile that unites *all* three abilities is **a former intelligence officer (or a “black‑ops” operative) who later turned freelance**.  Such a person would have:

* **Training in covert entry** (special forces, CIA “operations” units).  
* **Deep knowledge of classical cipher systems** (used in WWII‑era espionage) as well as modern steganography.  
* **Hands‑on experience with air‑gap attacks and hardware implants** (the kind of skill set taught in elite cyber‑warfare schools).

Thus the perpetrator is most plausibly a **cryptologic‑intelligence specialist turned high‑value information thief**.

---

## 4.  The hidden connection among the stolen items  

1. **The rare book** – contains a *book‑cipher key* (a list of page‑line‑word numbers).  
2. **The painting** – hides a *visual map* (micro‑dots forming a faint diagram) that tells the thief which pages/lines of the book to use.  
3. **The server** – stores the *cipher‑text* that, once the correct page/line selections are applied, decrypts to a *set of coordinates* (or a password) that unlocks a fourth, undisclosed cache (often imagined as a “lost treasure” or a highly classified dossier).  

In short: **the three objects are the three legs of a single, sophisticated information‑hiding system**.  The thief’s only goal is to bring the three legs together, solve the puzzle, and claim whatever lies at the end of the decoded message.

---

## 5.  Putting it all together – the story in one paragraph  

A former intelligence‑officer‑turned‑professional thief discovered that a 17th‑century codex, a masterpiece concealed inside a museum sculpture, and a high‑security data server each hold a third of a single, highly‑encrypted message.  The book is the one‑time‑pad, the painting carries a hidden visual map that points to the exact passages in the book, and the server houses the ciphertext that can only be unlocked with those passages.  By stealing the three pieces the perpetrator can finally crack the “nut” – a set of coordinates that lead to a hidden cache of priceless antiquities (or state secrets).  The red feather, blue‑envelope scrap, and half‑eaten almond left at each scene are intentional “red herrings” that merely point the investigator toward the true nature of the crime: **a sophisticated, three‑part cryptographic heist**.

In [10]:
model_name = "Mistral-Large-3"

response = client.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Alright, let's tackle this intriguing mystery step by step. We have three seemingly unrelated crimes, each with a peculiar clue left behind. Our goal is to determine:

1. The most plausible motive behind the crimes.
2. The likely profession or expertise of the perpetrator.
3. The hidden connection among the three stolen items.

### Step 1: Analyzing the Crimes and Clues

**Crime 1: Rare book stolen from a locked library vault without forced entry.**
- *Clue:* A red feather.
- *Observations:*
  - No signs of forced entry suggest the thief had access or knowledge to bypass security (e.g., keys, codes, or lock-picking skills).
  - A red feather could symbolize something related to birds, writing (quill pens), or even a signature or mark.

**Crime 2: Priceless painting hidden inside a hollowed-out sculpture in a museum under constant surveillance.**
- *Clue:* A torn piece of a blue envelope.
- *Observations:*
  - The painting wasn't taken out of the museum but hidden within it, suggesting the thief might return or that the theft is part of a larger plan.
  - Constant surveillance implies the thief is adept at avoiding or manipulating surveillance (e.g., timing, distractions, or technical skills).
  - A blue envelope might hint at mail, correspondence, or something sent/received (possibly related to communication or secrets).

**Crime 3: High-security data server compromised with no unauthorized access in logs.**
- *Clue:* A half-eaten almond.
- *Observations:*
  - No unauthorized access in logs suggests the thief either had legitimate access or used sophisticated methods to cover their tracks (e.g., insider threat, zero-day exploits, or log manipulation).
  - A half-eaten almond could symbolize something incomplete, a snack (perhaps the thief was there for a while), or something related to health/nuts (e.g., allergies, but that seems less likely).

### Step 2: Considering the Clues as Misleading

The problem states that each clue is intentionally placed to mislead investigators. This means the actual significance of the clues might be opposite or unrelated to their apparent meanings. Let's brainstorm possible misleading aspects:

- **Red feather:**
  - Misleading: Could suggest a bird, freedom, or something light and airy.
  - Actual: Maybe it's about something heavy, grounded, or the opposite of flight (e.g., a weight, a burden).

- **Torn blue envelope:**
  - Misleading: Could imply mail, communication, or something sent.
  - Actual: Maybe it's about something not sent, silence, or a lack of communication.

- **Half-eaten almond:**
  - Misleading: Could suggest incompleteness, a snack, or something left behind.
  - Actual: Maybe it's about something complete, a full almond, or something not edible (e.g., a code word).

### Step 3: Looking for Patterns or Connections

Let's list the stolen items:
1. Rare book.
2. Priceless painting.
3. Data from a high-security server.

At first glance, these seem like items of value, but their connection isn't obvious. Maybe they're not just valuable in monetary terms but in information or symbolic value.

Possible connections:
- **Information/Knowledge:** All three items can contain or represent information (book = written knowledge, painting = visual information, data server = digital information).
- **Historical or Cultural Significance:** Maybe they're all related to a specific historical event, person, or secret.
- **Ownership:** Perhaps they're all owned by the same entity or related entities.

### Step 4: Considering the Perpetrator's Profession or Expertise

Given the nature of the crimes:
1. **Library vault:** Knowledge of locks, security systems, or insider access.
2. **Museum surveillance:** Ability to avoid or manipulate surveillance, possibly with technical skills or insider knowledge.
3. **Data server:** High-level hacking skills or insider access to bypass security and alter logs.

Possible professions/expertise:
- **Security Expert or Hacker:** Someone who knows how to bypass security systems.
- **Insider (Librarian, Museum Staff, IT Personnel):** Someone with legitimate access who can exploit it.
- **Art Thief or Spy:** Someone skilled in stealing valuable items and covering tracks.
- **Historian or Archivist:** Someone interested in rare items with historical significance.

Given the sophistication, a combination of insider knowledge and technical expertise seems likely.

### Step 5: Hypothesizing the Motive

Possible motives:
- **Financial Gain:** Selling the items or data.
- **Espionage:** Stealing information for a third party.
- **Personal Obsession:** Collecting rare items for personal satisfaction.
- **Revenge or Blackmail:** Targeting specific entities for a hidden reason.
- **Uncovering a Secret:** The stolen items might reveal a hidden truth.

Given that the items are hidden or their theft is concealed, the motive might not be immediate financial gain but something more long-term or personal.

### Step 6: Re-evaluating the Clues with Possible Themes

Let's think about the clues in the context of misleading:

- **Red feather:**
  - Misleading: Bird, flight, freedom.
  - Actual: Maybe "quill" (writing), "pen" (as in a spy's tool), or "weight" (something heavy).

- **Blue envelope:**
  - Misleading: Mail, communication.
  - Actual: Maybe "sealed" (something hidden), "empty" (lack of communication), or "blue" as in "blueprint" or "blue chip."

- **Half-eaten almond:**
  - Misleading: Incomplete, snack.
  - Actual: Maybe "whole almond" (completeness), "bitter almond" (cyanide, poison), or "almond" as in "almond-shaped" (something specific).

### Step 7: Connecting the Stolen Items

Let's think about what a rare book, a painting, and data might have in common beyond value:

- **Historical Documents:** Maybe they all relate to a specific historical event or secret.
- **Cryptography:** They might contain codes or encrypted information.
- **Ownership:** Perhaps they're all tied to a secret society or organization.
- **Artifacts of a Person:** Maybe they're all related to a specific individual (e.g., a famous author, artist, or scientist).

### Step 8: Formulating a Theory

Given the above, here's a plausible theory:

**Perpetrator:**
- A former or current employee with access to all three locations (library, museum, data center), possibly with a background in security, IT, or archival work.
- Alternatively, a highly skilled independent thief with expertise in bypassing security systems, possibly with insider help.

**Motive:**
- The stolen items are not for immediate sale but are pieces of a larger puzzle. The thief is collecting them to uncover or protect a hidden secret.
- The secret could be related to a historical event, a conspiracy, or personal history (e.g., proving a family lineage, uncovering a betrayal).

**Hidden Connection:**
- The rare book, painting, and data all contain parts of a hidden message, code, or evidence related to the same secret.
  - For example:
    - The book might contain a cipher or reference.
    - The painting might have a hidden image or symbol.
    - The data might contain encrypted files or correspondence.

**Clues as Misleading:**
- The red feather could symbolize a "quill" or "pen," hinting at writing or authorship (misleading us to think of birds).
- The blue envelope might hint at "sealed" or "confidential" (misleading us to think of mail).
- The half-eaten almond could hint at "bitter" or "poison" (misleading us to think of food).

### Step 9: Refining the Theory

Let's think more about the hidden connection:

- **Rare Book:** Could be a first edition with annotations or a specific passage that's key.
- **Painting:** Might have a hidden layer or underpainting revealing something.
- **Data:** Could contain encrypted files that, when combined with the other items, reveal the secret.

Perhaps the thief is trying to reconstruct a lost or hidden truth that's been fragmented across these items.

**Profession:**
- Given the need for access and technical skills, the thief could be a **security consultant, archivist, or historian with hacking skills**.

**Motive:**
- **Uncovering a Historical Secret:** Maybe the items prove a historical event or figure in a new light, or they're part of a treasure map.
- **Personal Vendetta:** The secret could implicate someone the thief wants to expose or take revenge upon.

### Step 10: Finalizing the Answer

After considering all angles, here's the most plausible conclusion:

**Most Plausible Motive:**
The perpetrator is collecting these items to uncover or reconstruct a hidden secret that is fragmented across them. The motive is likely **intellectual or personal**—perhaps to reveal a historical truth, solve a mystery, or expose a conspiracy—rather than immediate financial gain.

**Likely Profession or Expertise:**
The perpetrator is likely a **security expert, historian, or archivist with advanced technical skills**, possibly a former insider with access to all three locations. They have expertise in **bypassing physical and digital security systems**, suggesting a background in **security consulting, IT, or museum/library sciences**.

**Hidden Connection Among the Stolen Items:**
The rare book, painting, and data server all contain **pieces of a larger puzzle or secret**. They might:
- Contain parts of a code or cipher that, when combined, reveal a hidden message.
- Be artifacts related to a specific historical event or individual, with their theft aimed at piecing together a lost narrative.
- Hold evidence of a conspiracy or hidden truth that the thief is trying to expose or protect.

**Explanation of Clues as Misleading:**
- **Red feather:** Likely misleads to think of birds or flight; actually hints at "quill" or "pen," symbolizing writing or authorship related to the book.
- **Torn blue envelope:** Misleads to think of mail or communication; actually hints at "sealed" or "confidential," pointing to hidden or secret information.
- **Half-eaten almond:** Misleads to think of food or incompleteness; actually hints at "bitter almond" (cyanide), possibly symbolizing poisoned information or a dangerous secret.

### Final Answer:

**Motive:** The perpetrator is collecting the stolen items to uncover or reconstruct a hidden secret fragmented across them, likely driven by a desire to reveal a historical truth, solve a personal mystery, or expose a conspiracy.

**Profession/Expertise:** The perpetrator is most likely a **security expert, historian, or archivist with advanced technical skills**, capable of bypassing both physical and digital security measures. Their expertise suggests a background in security consulting, IT, or museum/library sciences, possibly with insider access to the targeted locations.

**Hidden Connection:** The rare book, priceless painting, and compromised data server all contain **pieces of a larger puzzle or secret**. They may hold parts of a code, evidence related to a specific historical event or individual, or information that, when combined, reveals a hidden truth or conspiracy. The thefts are not for immediate financial gain but are steps toward assembling this concealed knowledge.

In [11]:

model_name = "grok-4-fast-reasoning"

response = client.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

### Step-by-Step Reasoning

1. **Analyze the Nature of the Crimes and Methods Used**:
   - **Crime 1 (Rare Book Theft)**: The book was taken from a locked vault with no signs of forced entry. This implies the perpetrator had legitimate access (e.g., a key, code, or insider privileges) rather than breaking in. No alarm triggers or physical damage suggest careful, authorized entry.
   - **Crime 2 (Painting "Theft")**: The painting was removed from display and hidden inside a hollowed-out sculpture in a surveilled museum. Constant surveillance rules out obvious outsiders; the hollowing likely occurred prior to the theft (e.g., during routine maintenance), allowing the painting to be stashed without immediate detection on cameras. The fact that it was hidden (rather than removed entirely) suggests intent to retrieve it later, not immediate sale or destruction.
   - **Crime 3 (Data Server Compromise)**: The server was breached with no logged unauthorized attempts, indicating the access was legitimate (e.g., via valid credentials) but misused. This points to an insider downloading, altering, or copying data without raising flags, rather than a brute-force hack.
   - **Common Thread in Methods**: All crimes involve no "breaking and entering" or detectable external intrusion, pointing to an insider with multi-departmental access. The placement of misleading clues (red feather, torn blue envelope piece, half-eaten almond) suggests deliberate misdirection—e.g., the feather might imply a costumed intruder, the envelope a postal thief, and the almond a careless snacker—to create false suspect profiles and divert attention from an internal actor.

2. **Establish the Single Perpetrator and Required Expertise**:
   - Since the same individual committed all three, they must have overlapping access and skills across a library vault (rare books), a museum (art and sculptures), and a high-security data server (likely a digital archive or catalog system).
   - This rules out narrow specialists (e.g., a pure hacker lacks physical art access; a librarian lacks sculpture manipulation skills). The ideal profile is someone in a large cultural institution (e.g., a national library-museum complex like the Getty or British Museum, which house rare books, art collections, and digital databases for provenance and inventory).
   - **Likely Profession/Expertise**: An art/book conservator or restorer. These professionals handle rare books (restoring bindings/pages), paintings and sculptures (cleaning, repairing, or modifying for display), and often have access to digital systems for cataloging high-res scans, authentication records, and provenance data. They work "behind the scenes" with minimal suspicion, can enter vaults or disable temporary surveillance for "maintenance," and use authorized logins for servers. Hollowing a sculpture fits restoration work; altering server data fits updating digital records.

3. **Interpret the Misleading Clues**:
   - The clues are explicitly "intentionally placed to mislead," so they aren't direct evidence of the perpetrator's identity or method but rather fabricated distractions. For example:
     - Red feather: Could suggest a birdwatcher, costumed thief, or exotic collector, leading investigators down irrelevant paths.
     - Torn blue envelope piece: Might imply involvement in mail fraud, extortion, or a disgruntled correspondent.
     - Half-eaten almond: Could hint at a snack left by a hurried outsider, suggesting casual opportunism.
   - Collectively, they create a "serial eccentric" narrative (e.g., a quirky kleptomaniac), obscuring the methodical insider. This aligns with a conservator's expertise in staging scenes subtly, as they deal with delicate artifacts and evidence preservation.

4. **Identify the Hidden Connection Among the Stolen Items**:
   - Surface level: The items seem disparate (physical book, physical painting, intangible data), but all are high-value cultural assets in the same institutional ecosystem.
   - Deeper level: Rare books and paintings often have digital counterparts—high-resolution scans, authentication certificates, and provenance histories stored on secure servers to verify authenticity and ownership. The book and painting are physical originals; the server holds their "digital souls" (e.g., metadata, images, and legal records proving genuineness).
   - **Hidden Connection**: These items form an interdependent triad for authenticating irreplaceable cultural treasures. Stealing the book and painting provides the physical originals to study/replicate; compromising the server allows alteration of digital proofs (e.g., editing scan metadata or ownership logs to validate fakes). Without all three, forgeries would be easily detected via mismatched records. This isn't random theft but a targeted acquisition of components for replication.

5. **Determine the Most Plausible Motive**:
   - Random theft for resale doesn't fit: The painting was hidden (not fenced immediately), the book left no escape trace, and data alone has limited street value without context.
   - Vandalism or ideological theft (e.g., exposing security flaws) ignores the careful hiding and lack of destruction.
   - **Plausible Motive**: Financial gain via an elaborate forgery and insurance fraud scheme. The conservator steals the originals to create high-fidelity forgeries (using insider knowledge of materials/techniques), alters server data to "authenticate" the fakes (e.g., updating provenance to erase the originals' traces), and hides the painting for later retrieval/sale on the black market. The real book could be forged and returned (or sold abroad), triggering an insurance payout for the "stolen" painting while the fake remains on display. This exploits institutional trust, with forgeries potentially worth millions (rare books and old masters sell for tens of millions). The clues mislead to buy time for the scheme, as investigations chase phantoms while fakes integrate seamlessly.
   - Why plausible? Art forgery is a real, lucrative crime (e.g., cases like the Getty kouros or Han van Meegeren's Vermeers), often involving insiders with access to originals and records. It explains all elements without contradictions: no forced entry (authorized), surveillance evasion (employee), clean logs (legit access), and the triad's synergy.

### Summary of Conclusions
- **Most Plausible Motive**: Financial gain through forging the stolen items and manipulating records to sell replicas or claim insurance, leveraging insider access for a long-term black-market profit.
- **Likely Profession/Expertise**: Art and book conservator/restorer in a major cultural institution, providing hands-on access to physical artifacts and digital catalogs.
- **Hidden Connection Among the Stolen Items**: They are linked components of authentication for high-value cultural assets—the physical book and painting as originals, the server data as their verifiable digital records—enabling the creation and undetected sale of forgeries.

In [12]:

model_name = "DeepSeek-R1-0528"

response = client.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)


### Step-by-Step Reasoning:

To solve this series of crimes, I analyzed the details of each incident, the intentionally misleading clues, and the common elements pointing to the perpetrator's identity, motive, and the hidden connection among the items involved. Here's the step-by-step reasoning:

1. **Analyze the Crimes and Methods:**
   - **Crime 1 (Stolen Rare Book):** The book was taken from a locked vault with no signs of forced entry. This suggests the perpetrator had legitimate access (e.g., a key, code, or authority to enter) or sophisticated lock-picking skills. The lack of forced entry implies insider knowledge or expertise in bypassing physical security.
   - **Crime 2 (Hidden Painting):** A priceless painting was found concealed inside a hollowed-out sculpture in a museum under constant surveillance. This indicates the perpetrator had:
     - Access to the sculpture (likely requiring advance preparation to hollow it out).
     - Knowledge of surveillance blind spots or the ability to disable/evade cameras.
     - Time to move and hide the painting without detection, pointing to someone with recurring access, such as staff or a contractor.
   - **Crime 3 (Compromised Data Server):** The server was breached with no unauthorized access logs, suggesting:
     - Use of authorized credentials (e.g., stolen or insider login).
     - Advanced hacking techniques to avoid detection (e.g., log deletion, exploiting zero-day vulnerabilities).
     - Expertise in cybersecurity.
   - **Common Thread in Methods:** All crimes involved bypassing high-security measures without leaving traces, indicating the perpetrator has:
     - **Diverse Expertise:** Skills in physical security (lock-picking, surveillance evasion), art handling (for hollowing the sculpture and moving the painting), and cybersecurity (server intrusion).
     - **Insider Access or Social Engineering:** Likely legitimate access to all locations, allowing them to operate without raising suspicion. This could come from a profession involving multiple high-security environments.

2. **Evaluate the Clues as Red Herrings:**
   - Each clue is explicitly stated to be intentionally misleading, so they should not be taken at face value. Instead, they are designed to divert attention from the perpetrator's true identity or motive.
   - **Red Feather (Library Crime):** Might suggest an avian-themed distraction (e.g., a bird enthusiast, a costume, or a false alibi involving wildlife). But since it's a red herring, it could point to an unrelated suspect, like someone with a pet bird or a hobbyist.
   - **Torn Piece of Blue Envelope (Museum Crime):** Could imply correspondence (e.g., a threatening letter, a note from a disgruntled employee, or mail-related crime). The "blue" color might evoke associations with sadness or official documents, but it's likely a plant to frame someone with access to mail or office supplies.
   - **Half-Eaten Almond (Server Crime):** Might indicate carelessness (e.g., a perpetrator snacking, suggesting an amateur) or a health-related trait (e.g., nut allergies). As a red herring, it could mislead investigators toward a suspect with dietary habits or a non-professional profile.
   - **Why Misleading?** These clues are too obvious and incongruent with the sophisticated crimes. A seasoned perpetrator would not leave such evidence unless to create false leads (e.g., implicating an innocent person with a red-feather hobby, a mailroom worker, or a health-conscious individual). The real evidence lies in the crimes' execution, not the clues.

3. **Determine the Perpetrator's Profession or Expertise:**
   - The crimes require a rare combination of skills:
     - **Physical Security Bypass:** For the vault and museum, expertise in locks, surveillance systems, and art handling (e.g., hollowing sculptures) is needed. This suggests experience in facilities management, art installation, or security auditing.
     - **Cybersecurity:** The server breach with no logs indicates advanced IT skills, such as penetration testing or network exploitation.
     - **Insider Access:** The ability to move undetected in a library, museum, and data center implies a role with legitimate access across these environments. Common professions include:
       - **Security Consultant/Penetration Tester:** Often hired to assess vulnerabilities in diverse settings (cultural, digital, physical). They have the knowledge to exploit weaknesses without leaving traces and might turn rogue for personal gain.
       - **Facilities Manager or Contractor:** Could have cross-institutional access for maintenance, allowing them to prepare the sculpture, enter the vault, and access server rooms.
     - **Art and Technical Knowledge:** Hollowing a sculpture requires art-handling skills, while the perpetrator also handled digital intrusion, pointing to a versatile, tech-savvy individual.
   - **Most Plausible Profession:** A **security consultant or penetration tester** with experience in both physical and digital security systems. This role provides the necessary access, skills, and knowledge of vulnerabilities across different environments. They could be a disgruntled employee or someone operating independently to prove a point.

4. **Identify the Motive:**
   - **Financial Gain vs. Demonstration of Skill:** The book was stolen (suggesting profit), but the painting was only hidden (not stolen), and the data was compromised (not necessarily stolen). This inconsistency rules out pure theft for profit. Instead:
     - The hidden painting and server compromise serve as "calling cards" to demonstrate security flaws, while the stolen book could be for personal value or sale.
     - **Motive:** To **expose systemic security failures** across different institutions, potentially for notoriety, revenge (e.g., against a former employer), or to undermine trust in the security measures. This fits a rogue security expert who wants to prove their expertise or highlight incompetence.
   - **Why the Clues?** The misleading clues reinforce this by creating chaos and diverting resources, amplifying the embarrassment for the investigators and institutions.

5. **Uncover the Hidden Connection Among the Stolen/Compromised Items:**
   - The items involved are:
     - Stolen rare book (physical, informational asset).
     - Hidden painting (physical, cultural asset; not stolen but compromised).
     - Compromised data (digital asset).
   - **Surface Differences:** They seem unrelated—book (literary), painting (art), data (digital). But the crimes' execution suggests a deeper link.
   - **Hidden Connection:** All three items are **part of a single digital preservation or cultural heritage project**. For example:
     - The rare book could contain historical data or codes.
     - The painting might be linked to digital records (e.g., provenance or restoration data stored on the server).
     - The server could hold digitized versions of the book and painting, or sensitive project files.
   - **Reasoning:**
     - Libraries, museums, and data centers often collaborate on digitization initiatives (e.g., archiving rare artifacts). Compromising all three could disrupt or expose vulnerabilities in such a project.
     - The perpetrator's actions (stealing the book, hiding the painting to delay detection, and accessing the server) suggest an intent to target the project's integrity, not just individual items.
     - This connection explains why the crimes seem unrelated on the surface—they span physical and digital realms—but are linked by a shared purpose (e.g., a specific cultural database or research project).

### Conclusion:
- **Most Plausible Motive:** To expose and exploit systemic security vulnerabilities in high-value cultural and digital assets, likely driven by a desire for notoriety, revenge, or to demonstrate personal expertise. The stolen book may provide direct financial gain, while the other acts amplify the message.
- **Likely Profession/Expertise:** A **security consultant or penetration tester** with cross-disciplinary skills in physical security, art handling, and cybersecurity. This individual has insider-level access to multiple facilities, allowing them to execute the crimes without detection.
- **Hidden Connection Among the Items:** The rare book, painting, and compromised data are all **components of a unified digital preservation project** (e.g., a cultural heritage initiative where the book and painting are being digitized, with the server storing related data). The perpetrator targeted this project to highlight its security flaws or sabotage it.

This solution accounts for the misleading clues as distractions, the perpetrator's diverse expertise, and the underlying link between the items, providing a cohesive explanation for the crimes.

In [13]:


model_name = "Llama-3.3-70B-Instruct"
response = client.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)


To solve this puzzle, let's analyze the clues and the crimes step by step, considering the nature of each crime and the clues left behind.

### Step 1: Analyzing the Crimes
1. **Theft of a Rare Book**: The lack of signs of forced entry suggests the thief had access or a key, implying someone with authorized access or an expert in lock manipulation.
2. **Discovery of a Hidden Painting**: The use of a hollowed-out sculpture to hide a painting indicates a level of creativity and knowledge about art and possibly the museum's collection and layout.
3. **Compromised Data Server**: The absence of unauthorized access attempts in the logs suggests the perpetrator had legitimate access or found a way to bypass security measures without triggering alarms, indicating high-level IT expertise.

### Step 2: Examining the Clues
- **Red Feather**: Could symbolize danger, warning, or even a reference to a specific mythological or cultural symbol.
- **Torn Piece of a Blue Envelope**: Might suggest a connection to postal services or communication, but the torn aspect could imply a deliberate attempt to mislead.
- **Half-Eaten Almond**: Could indicate a snack preference, but its presence at a crime scene is unusual, suggesting it might be a red herring or a clue to the perpetrator's identity or habits.

### Step 3: Considering the Motive
Given the diverse nature of the crimes (theft of a book, hiding a painting, and compromising a data server), the motive might not be purely financial. The crimes seem to be more about the challenge, the art of execution, or possibly sending a message. The perpetrator might be seeking to prove a point about security vulnerabilities or to leave a signature that is not immediately apparent.

### Step 4: Profiling the Perpetrator
The diversity of skills required to execute these crimes (expertise in security, art, and IT) suggests the perpetrator is highly intelligent, versatile, and possibly has a background in multiple fields. They might be someone who values knowledge, art, and the challenge of outsmarting security systems. Given the lack of immediate financial gain and the complexity of the crimes, the perpetrator could be a former security expert, artist, or someone with a unique combination of skills and interests.

### Step 5: Uncovering the Hidden Connection
The connection among the stolen items might not be immediately apparent, but considering the crimes:
- The **rare book** could contain valuable information or be a collector's item.
- The **priceless painting** is a work of art with potentially significant cultural or historical value.
- The **high-security data server** could contain sensitive information, intellectual property, or data with significant monetary or strategic value.

The connection might be that all these items represent **knowledge** or **information** in different forms (literary, artistic, digital). The perpetrator might be collecting or manipulating these forms of knowledge for a larger purpose, such as creating a unique repository of valuable information, proving a point about the vulnerability of secure systems, or even as part of a larger artistic or intellectual project.

### Conclusion
The most plausible motive behind the crimes is the desire to collect and manipulate unique forms of knowledge or to challenge the concept of security and access. The perpetrator likely has a profession or expertise that combines elements of security, art, and IT, possibly someone like a cybersecurity expert with a passion for art and literature, or a former museum curator with a background in IT. The hidden connection among the three stolen items is their representation of valuable knowledge or information in different forms, which the perpetrator seeks to acquire, manipulate, or highlight for reasons that could range from personal satisfaction to a larger, more complex goal.

## For the next cell, we will use Ollama

Ollama runs a local web service that exposes an OpenAI-compatible endpoint,  
and runs models locally using high-performance C++ code.

If you don't have Ollama, install it by visiting https://ollama.com, pressing Download, and following the instructions.

Once it's installed, you should be able to visit http://localhost:11434 and see the message "Ollama is running".

You might need to restart Cursor (and maybe reboot). Then open a Terminal (control+\`) and run `ollama serve`

Useful Ollama commands (run these in the terminal, or with an exclamation mark in this notebook):

`ollama pull <model_name>` downloads a model locally  
`ollama ls` lists all the models you've downloaded  
`ollama rm <model_name>` deletes the specified model from your downloads
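
The browser check described above can also be run from a notebook cell. This is a minimal sketch using only the standard library; the helper name `ollama_is_running` and the 2-second timeout are illustrative assumptions, not part of Ollama or of this lab's code.

```python
import http.client
import urllib.request


def ollama_is_running(base_url="http://localhost:11434", timeout=2.0):
    """Return True if the local Ollama web service answers at base_url.

    Hypothetical helper: it fetches the root page, which should show
    the message "Ollama is running" once the service is up.
    """
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as reply:
            body = reply.read().decode("utf-8", errors="replace")
    except (OSError, http.client.HTTPException):
        # Connection refused, timeout, or a malformed reply: treat as not running
        return False
    return "Ollama is running" in body


print(ollama_is_running())
```

If this prints `False`, start the service with `ollama serve` in a terminal and try again.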

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Super important - ignore me at your peril!</h2>
            <span style="color:#ff7800;">The model called <b>llama3.3</b> is FAR too large for home computers - it's not intended for personal computing and will consume all your resources! Stick with the nicely sized <b>llama3.2</b> or <b>llama3.2:1b</b> and if you want larger, try llama3.1 or smaller variants of Qwen, Gemma, Phi or DeepSeek. See <a href="https://ollama.com/models">the Ollama models page</a> for a full list of models and sizes.
            </span>
        </td>
    </tr>
</table>

In [None]:
!ollama pull llama3.2

In [None]:
ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
model_name = "llama3.2"

response = ollama.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

In [14]:
# So where are we?

print(competitors)
print(answers)


['gpt-oss-120b', 'Mistral-Large-3', 'grok-4-fast-reasoning', 'DeepSeek-R1-0528', 'Llama-3.3-70B-Instruct']


In [15]:
# It's nice to know how to use "zip"
for competitor, answer in zip(competitors, answers):
    print(f"Competitor: {competitor}\n\n{answer}")


Competitor: gpt-oss-120b

**The short answer**

* **Motive:**  To assemble the three complementary pieces of a single, carefully‑protected secret – a set of coordinates / a decryption key that will unlock a priceless hidden treasure (or a piece of extremely valuable intelligence).  
* **Who the criminal is:**  A former intelligence / cryptanalysis specialist who has turned his knowledge of both physical security (art‑theft, vault‑breaking) and digital security (hacking, steganography) into a professional “high‑value‑information thief.”  
* **How the three stolen objects are linked:**  Each of the objects – the rare book, the painting concealed in a sculpture, and the data server – contains one‑third of the same encoded message.  When the three halves are brought together the message can be decoded and the location of the hidden prize is revealed.

Below is a step‑by‑step reconstruction of how that conclusion is reached from the facts that were presented.

---

## 1.  What the three clu

In [16]:
# Let's bring this together - note the use of "enumerate"

together = ""
for index, answer in enumerate(answers):
    together += f"# Response from competitor {index+1}\n\n"
    together += answer + "\n\n"

In [17]:
print(together)

# Response from competitor 1

**The short answer**

* **Motive:**  To assemble the three complementary pieces of a single, carefully‑protected secret – a set of coordinates / a decryption key that will unlock a priceless hidden treasure (or a piece of extremely valuable intelligence).  
* **Who the criminal is:**  A former intelligence / cryptanalysis specialist who has turned his knowledge of both physical security (art‑theft, vault‑breaking) and digital security (hacking, steganography) into a professional “high‑value‑information thief.”  
* **How the three stolen objects are linked:**  Each of the objects – the rare book, the painting concealed in a sculpture, and the data server – contains one‑third of the same encoded message.  When the three halves are brought together the message can be decoded and the location of the hidden prize is revealed.

Below is a step‑by‑step reconstruction of how that conclusion is reached from the facts that were presented.

---

## 1.  What the three

In [18]:
judge = f"""You are judging a competition between {len(competitors)} competitors.
Each model has been given this question:

{question}

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}}

Here are the responses from each competitor:

{together}

Now respond with the JSON with the ranked order of the competitors, nothing else. Do not include markdown formatting or code blocks."""


In [19]:
print(judge)

You are judging a competition between 5 competitors.
Each model has been given this question:

A seasoned detective is investigating a series of three seemingly unrelated crimes: (1) a rare book is stolen from a locked library vault without any signs of forced entry; (2) a priceless painting is found hidden inside a hollowed-out sculpture in a museum that was under constant surveillance; (3) a high‑security data server is compromised, yet the logs show no unauthorized access attempts. Each crime scene contains a single clue: a red feather, a torn piece of a blue envelope, and a half‑eaten almond. Assuming the perpetrator is the same individual in all three cases and that each clue is intentionally placed to mislead investigators, determine the most plausible motive behind the crimes, the likely profession or expertise of the perpetrator, and the hidden connection among the three stolen items. Explain your reasoning step by step.

Your job is to evaluate each response for clarity and st

In [20]:
judge_messages = [{"role": "user", "content": judge}]

In [24]:
# Judgement time!

response = client.chat.completions.create(
    model="gpt-5.1-chat",
    messages=judge_messages,
)
results = response.choices[0].message.content
print(results)


{"results": ["1", "3", "4", "2", "5"]}


In [25]:
# OK let's turn this into results!

results_dict = json.loads(results)
ranks = results_dict["results"]
for index, result in enumerate(ranks):
    # The judge returns 1-based competitor numbers as strings, so convert and shift to a 0-based index
    competitor = competitors[int(result)-1]
    print(f"Rank {index+1}: {competitor}")

Rank 1: gpt-oss-120b
Rank 2: grok-4-fast-reasoning
Rank 3: DeepSeek-R1-0528
Rank 4: Mistral-Large-3
Rank 5: Llama-3.3-70B-Instruct
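
The judge is asked for JSON only, but a model may still wrap its reply in a Markdown code fence or leave out a competitor. The sketch below is one defensive way to parse the reply. `parse_ranking` is a hypothetical helper, not part of this lab, and the sample strings are invented for illustration.

```python
import json

FENCE = "`" * 3  # a Markdown code fence marker, built this way to keep the cell readable


def parse_ranking(raw, n_competitors):
    """Parse a judge reply into a list of 1-based competitor numbers.

    Hypothetical helper: tolerates a surrounding Markdown code fence and
    checks that every competitor appears exactly once.
    """
    text = raw.strip()
    if text.startswith(FENCE):
        # Drop the opening fence line (which may carry a language tag) and the closing fence
        text = text.split("\n", 1)[1] if "\n" in text else ""
        text = text.rsplit(FENCE, 1)[0]
    ranks = [int(r) for r in json.loads(text)["results"]]
    if sorted(ranks) != list(range(1, n_competitors + 1)):
        raise ValueError(f"Ranking is not a permutation of 1..{n_competitors}: {ranks}")
    return ranks


plain_reply = '{"results": ["1", "3", "4", "2", "5"]}'
fenced_reply = FENCE + "json\n" + '{"results": ["2", "1"]}' + "\n" + FENCE

print(parse_ranking(plain_reply, 5))
print(parse_ranking(fenced_reply, 2))
```

Raising on a malformed ranking is a design choice: it surfaces a bad judge reply immediately rather than printing a partial leaderboard.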


<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/exercise.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Exercise</h2>
            <span style="color:#ff7800;">Which pattern(s) did this use? Try updating this to add another Agentic design pattern.
            </span>
        </td>
    </tr>
</table>

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/business.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#00bfff;">Commercial implications</h2>
            <span style="color:#00bfff;">Patterns like this, where you send a task to multiple models and then evaluate the results,
            are common when you need to improve the quality of your LLM responses. The approach applies broadly
            to business projects where accuracy is critical.
            </span>
        </td>
    </tr>
</table>
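
As a rough sketch of the approach described above, the repeated cells can be folded into one loop that sends the same question to each model and collects the answers for a judge. Everything here is illustrative: `collect_answers`, `combine_for_judge` and `fake_ask` are hypothetical names, and `fake_ask` stands in for a real call such as `client.chat.completions.create` so that the cell runs without any API keys.

```python
def collect_answers(question, model_names, ask):
    """Send one question to every model and gather the replies.

    `ask` is any callable taking (model_name, messages) and returning text;
    in the lab it would wrap a chat completions call.
    """
    messages = [{"role": "user", "content": question}]
    names, replies = [], []
    for model_name in model_names:
        replies.append(ask(model_name, messages))
        names.append(model_name)
    return names, replies


def combine_for_judge(replies):
    """Join the replies under numbered headings, as the lab does for the judge prompt."""
    together = ""
    for index, reply in enumerate(replies):
        together += f"# Response from competitor {index+1}\n\n"
        together += reply + "\n\n"
    return together


def fake_ask(model_name, messages):
    # Stand-in for a real model call (assumption: no network, canned reply)
    return f"{model_name} says: {messages[0]['content'].upper()}"


demo_names, demo_replies = collect_answers("who did it?", ["model-a", "model-b"], fake_ask)
demo_together = combine_for_judge(demo_replies)
print(demo_together)
```

Passing the model call in as a function keeps the loop independent of any one provider, so the same code could serve a hosted endpoint or a local Ollama client.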