## Welcome to the Second Lab - Week 1, Day 3

Today we will work with several models from different providers. It's a good way to get comfortable with their APIs.

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Important point - please read</h2>
            <span style="color:#ff7800;">The way I collaborate with you may be different to other courses you've taken. I prefer not to type code while you watch. Rather, I run Jupyter Labs, like this one, and give you an intuition for what's going on. My suggestion is that you carefully run this yourself, <b>after</b> watching the lecture. Add print statements to understand what's going on, and then come up with your own variations.<br/><br/>If you have time, I'd love it if you submitted a PR for changes in the community_contributions folder - instructions are in the resources. Also, if you have a GitHub account, use it to showcase your variations. Not only is this essential practice, but it demonstrates your skills to others, including perhaps future clients or employers...
            </span>
        </td>
    </tr>
</table>

In [1]:
import os
import json
from dotenv import load_dotenv, find_dotenv
from chat_factory.models import ChatModel

In [2]:
load_dotenv(find_dotenv(), override=True)

True

In [4]:
# Print the key prefixes to help with any debugging
from utils.warning_beep import warning_beep

openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')
deepseek_api_key = os.getenv('DEEPSEEK_API_KEY')
groq_api_key = os.getenv('GROQ_API_KEY')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    warning_beep()
    print("\033[31mOpenAI API Key not set\033[0m")

if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    warning_beep()
    print("\033[31mAnthropic API Key not set\033[0m")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:2]}")
else:
    warning_beep()
    print("\033[31mGoogle API Key not set\033[0m")

if deepseek_api_key:
    print(f"DeepSeek API Key exists and begins {deepseek_api_key[:3]}")
else:
    warning_beep()
    print("\033[31mDeepSeek API Key not set\033[0m")

if groq_api_key:
    print(f"Groq API Key exists and begins {groq_api_key[:4]}")
else:
    warning_beep()
    print("\033[31mGroq API Key not set\033[0m")

OpenAI API Key exists and begins sk-3SZmF
Anthropic API Key exists and begins sk-ant-
Google API Key exists and begins AI
DeepSeek API Key exists and begins sk-
Groq API Key exists and begins gsk_
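
The five near-identical checks above could also be written as a single loop. This is only a sketch: `KEY_PREFIX_LENGTHS` and `key_prefix_report` are illustrative names that are not part of the lab's code, and the sketch returns plain status lines instead of calling `warning_beep`.

```python
import os

# Environment variable name -> number of leading characters to show.
# The lengths mirror the slices used in the cell above.
KEY_PREFIX_LENGTHS = {
    "OPENAI_API_KEY": 8,
    "ANTHROPIC_API_KEY": 7,
    "GOOGLE_API_KEY": 2,
    "DEEPSEEK_API_KEY": 3,
    "GROQ_API_KEY": 4,
}


def key_prefix_report(env=None):
    """Return one status line per key (hypothetical helper, not in the lab)."""
    env = os.environ if env is None else env
    lines = []
    for name, length in KEY_PREFIX_LENGTHS.items():
        value = env.get(name)
        if value:
            # Only a short prefix is shown, never the full key.
            lines.append(f"{name} exists and begins {value[:length]}")
        else:
            lines.append(f"{name} not set")
    return lines


for line in key_prefix_report():
    print(line)
```

Passing a plain dictionary as `env` lets you try the helper without touching your real environment.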


In [5]:
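def print_response(response):
    """Render the response as Markdown in a notebook, or print it elsewhere."""
    try:
        from IPython.display import display, Markdown

        # get_ipython() only exists inside IPython/Jupyter; elsewhere it raises NameError
        get_ipython()
        display(Markdown(response))
    except (ImportError, NameError):
        print(response)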

In [6]:
request = "Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. "
request += "Answer only with the question, no explanation."
messages = [{"role": "user", "content": request}]
messages

[{'role': 'user',
  'content': 'Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. Answer only with the question, no explanation.'}]

In [7]:
question = ChatModel(
    model_name="gpt-5-mini",
    provider="openai",
).generate_response(messages=messages)
print_response(question)


You are the sole decision-maker at a remote research station of 50 people cut off by a storm; current supplies: non-perishable food for 60 person‑days, potable water for 200 person‑days (there is a desalination unit that can produce water for 10 people/day but requires fuel that will run out in 5 days), medical supplies sufficient for routine care but only one ventilator and limited antibiotics, generator fuel for 8 days, and no immediate external support (a helicopter has a 70% chance of reaching the island in 10 days). Two people are moderately injured; one shows symptoms of a contagious respiratory illness. There is also a standing policy from headquarters that “mission-critical personnel” must be prioritized for evacuation if capacity is limited, but the roster of who is mission-critical is ambiguous. Your task: produce (1) a prioritized, day-by-day 30‑day survival and care plan covering resource allocation (food, water, fuel, power), rationing schedules, quarantine and triage rules, and evacuation prioritization with ethical justification for trade-offs; (2) operational steps to extend survivability and increase the probability of successful evacuation; (3) what clarifying questions and data you would request before finalizing the plan; (4) three key assumptions you think are most likely to be wrong and how you would test or mitigate each; (5) five measurable indicators you would track daily to decide whether to change course; and (6) the three most likely failure modes and specific contingency responses for each. Be explicit about the ethical principles you apply, and keep your answer concise.
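
Before reading the answers, it can help to work out the headline numbers in the question yourself. The sketch below is my own reading of the scenario, not something the question states: it assumes one "person-day" is one person's full daily ration, and that the desalination unit runs at its stated rate for all 5 days of fuel. The helper name `days_of_supply` is illustrative.

```python
PEOPLE = 50
FOOD_PERSON_DAYS = 60
WATER_PERSON_DAYS = 200
DESAL_PEOPLE_PER_DAY = 10  # water for 10 people per day of operation
DESAL_FUEL_DAYS = 5        # assumption: the unit runs every day until fuel is gone


def days_of_supply(person_days, people, ration_fraction=1.0):
    """Days a stock lasts when everyone gets `ration_fraction` of a full ration."""
    return person_days / (people * ration_fraction)


food_days_full = days_of_supply(FOOD_PERSON_DAYS, PEOPLE)
desal_bonus = DESAL_PEOPLE_PER_DAY * DESAL_FUEL_DAYS
water_days_full = days_of_supply(WATER_PERSON_DAYS + desal_bonus, PEOPLE)
# Fraction of a full ration that makes the food last exactly 10 days.
ration_for_10_days = FOOD_PERSON_DAYS / (PEOPLE * 10)

print(f"Food at full rations: {food_days_full:.1f} days")
print(f"Water incl. desalination: {water_days_full:.1f} days")
print(f"Ration needed for food to last 10 days: {ration_for_10_days:.0%}")
```

Under these assumptions the food lasts 1.2 days at full rations, the water lasts 5 days, and reaching day 10 on food alone requires a 12% ration. That gives you a baseline for judging the arithmetic in each model's answer.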

In [8]:
competitors = []
answers = []
messages = [{"role": "user", "content": question}]

## Note - update since the videos

I've updated the model names below to use newer models, such as the GPT-5 family and Claude Sonnet 4.5. These models can be quite slow, sometimes taking 1-2 minutes per response, but they do a great job! Feel free to switch to faster models if you'd prefer, like the ones I use in the video.
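
Since response times vary so much between models, it can be useful to time each call. The sketch below wraps any zero-argument callable. `timed_call` is a hypothetical helper name, and `fake_model` merely stands in for a real model call so the example runs without an API key.

```python
import time


def timed_call(fn):
    """Run `fn()` and return (result, elapsed_seconds). Hypothetical helper."""
    start = time.perf_counter()
    result = fn()
    elapsed = time.perf_counter() - start
    return result, elapsed


def fake_model():
    # Stand-in for a slow model call; swap in your own call in the lab.
    time.sleep(0.05)
    return "example answer"


answer_text, seconds = timed_call(fake_model)
print(f"Got {len(answer_text)} characters in {seconds:.2f}s")
```

In the lab you could wrap a real call the same way, for example `timed_call(lambda: ChatModel(model_name=model_name, provider="openai").generate_response(messages=messages))`, and compare the timings across competitors.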

In [9]:
model_name = "gpt-5-nano"
answer = ChatModel(
    model_name=model_name,
    provider="openai",
).generate_response(messages=messages)
print_response(answer)

competitors.append(model_name)
answers.append(answer)

Below is a concise, action-focused plan you can adapt at the station. It covers (1) day-by-day survival and care with resource allocation and evacuation prioritization, (2) steps to extend survivability and increase evacuation probability, (3) clarifying questions/data to request, (4) key assumptions and how to test/mitigate them, (5) five daily measurable indicators to decide on course changes, and (6) likely failure modes with contingencies. Ethical justification is explicit at each decision point.

1) Ethical framework and overarching approach (why)
- Core principles: maximize lives saved (utility), fairness, duty to care for the sick, minimize harm from infectious risk, protect essential operations, transparency in rules, and respect for persons (avoid arbitrary discrimination). In practice this means: prioritize life-sustaining needs, isolate contagious risk, allocate scarce life-support and evacuation to those with the best prognosis and greatest contribution to survival/return of operations, and sequence evacuations by a clear, defensible criterion when capacity is limited.
- Triage rules (operationalized):
  - Contagion control first: isolate the symptomatic/ill to limit spread; strict PPE and cohorting.
  - Ventilator and antibiotics: allocate to those with the best short-term prognosis given the present stock, likelihood of benefiting from treatment, and ability to restore mission-critical functions after treatment.
  - Evacuation: define mission-critical personnel (see Section 1b) and evacuate them first if external capacity is limited; otherwise evacuate to maximize overall survival and future recovery of operations.
  - Food/water: use a utilitarian allocation to preserve those most able to sustain life-support functions or restore operational capacity after evacuation.
- Mission-critical role clarification: since the roster is ambiguous, your plan will define “mission-critical” as those essential to maintaining life-support (water/energy systems, medical care, shelter, communications), safety, and the ability to re-establish full operations. This is revisited daily.

1a) Mission-critical roles (provisional definition, to be refined with HQ)
- Medical lead(s) and staff capable of operating the ventilator, infection control, and antibiotic stewardship.
- Power/water systems lead(s) and technicians maintaining desalination, generators, and siloed water storage.
- Communications/coordination lead to liaise with any external asset (helicopter, ship, etc.) and coordinate triage/evacuation.
- Shelter and basic care leads (to maintain spacing, sanitation, basic nursing care, and isolation logistics).
- If you must evacuate, rank by: impact on life-sustainment now, and impact on re-establishment of operations upon return.

2) Day-by-day survival and care plan (30 days; phases and concrete actions)

Phase A: Immediate stabilization (Day 0–Day 5)
Goals: contain contagious risk, stabilize two injured, protect core functions, secure water/fuel/food balance as long as possible, prepare evacuation if feasible.

Day 0 (now)
- Establish zones: Contagious/Infected Ward (isolated with dedicated staff, PPE), Quiet/Non-infected Ward, Operational Core (power/water, communications, shelter logistics).
- Quarantine and infection control: implement strict isolation for the contagious illness case and anyone with symptoms; designate separate airspace/airflow if possible; use dedicated tools and PPE; establish cleaning/DECON cadence.
- Triaging medical needs: assess the two injured; consult with medical lead to determine prognosis with/without ventilator and antibiotics; decide ventilator allocation based on best short-term survival probability and impact on return of operations.
- Resources snapshot (initial constraints): Food 60 person-days total; Water 200 person-days baseline; Desalination unit available for up to 5 days of operation at 10 people/day (requires fuel); Generator fuel sufficient for 8 days; Antibiotics limited; One ventilator; No external support guaranteed yet.
- Evacuation readiness: begin documenting mission-critical personnel; establish a daily update if helicopter access is viable and the crew could land/relate to the station.

Days 1–2
- Water-focused actions: verify desalination unit status; secure fuel reserves; begin water rationing plan (see 2a). If desalination fuel is available, schedule a 2–3 day window of desalination to maximize potable water while keeping some fuel in reserve for critical power needs.
- Food rationing footprint: set a plan to stretch 60 person-days as far as possible; communicate clearly that food will be very tight; prepare a minimal-need intake protocol for all staff (no nonessential caloric expenditure, standardize sleep/wake and activity to reduce energy demand).
- Medical care: allocate antibiotics to those with highest risk of systemic infection or who have conditions that would be worsened by delays; reserve ventilation for the patient with the highest probability of recovery given current resources; ensure infection control to prevent spread.
- Evacuation logistics: if HQ policy allows limited evacuation, prepare a prioritized list of “likely mission-critical evacuees” and rehearse loading procedures.

Days 3–5
- Water: maximize desalination use while fuel lasts (up to Day 5). If fuel supply dwindles before Day 5, prioritize drinking water for all staff and reduce nonessential uses and hygiene water for noncritical tasks.
- Food: maintain minimal survival ration for all staff; consider shifting to a 2-tier plan if some people are designated essential (e.g., water/power guardians) and can receive marginally higher rations to maintain core function.
- Medical: monitor the ventilated patient as priority; ensure ventilation equipment is functioning; manage antibiotic stewardship to avoid resistance; if clinical status deteriorates, prepare for palliative care and reallocation decisions per triage rules.
- Evacuation: confirm helicopter window and fuel; if arrival looks feasible within the next 2–3 days, finalize dispatch orders for mission-critical evacuees only.

Phase B: Stabilization and resilience (Day 6–Day 10)
Goals: maintain core functions with limited resources; extend survivability window to increase probability of external evacuation; minimize infection risk from the contagious case.

Days 6–8
- Water: rely on desalination while fuel remains; otherwise rely on stored water. If desalination is unsustainable (fuel exhausted), switch to minimal water allocation and enforce strict hygiene to minimize waste.
- Food: continue minimal ration; protect the most essential staff (medical, power, water) with slightly higher rations if possible, but within scarcity.
- Power: manage generator fuel; run only essential loads (shelter climate, critical medical equipment, communications). Consider low-energy modes to extend the generator period.
- Medical: monitor health status of all; provide care in isolation ward; keep antibiotics available where used; monitor for signs of spread; reinforce cohorting and hand hygiene.
- Evacuation: if external recovery window still exists (70% probability in 10 days), keep the plan to evacuate mission-critical personnel first; prepare exit plan for potential extended stay if evacuation fails.

Days 9–10
- If helicopter is en route, ready the most critical evacuees; if not, shift to extended survival with emphasis on core operations and infection control.

Phase C: If evacuation succeeds (Day 11–Day 20, or earlier)
Goals: reconstitute the crew with evacuated personnel; maintain life-sustaining operations; implement longer-term recovery plan.

Days 11–15
- Return of evacuated personnel; reallocate resources to normalize operations; begin phased restoration of food/water balance with any new supply.
- Reassess mission-critical roster; if new external support arrives, adjust roles accordingly.

Days 16–20
- If evac did not occur by Day 10 and external support remains uncertain, transition to a long-term survival posture with strict rationing, continued quarantine, and minimal essential operations.

Phase D: Long-term contingency (Day 21–Day 30)
Goals: maximize survivability under extended isolation; prepare for possible ongoing operations with constrained resources.

Days 21–30
- If still isolated: maintain the core minimal operations; sustain life-support and quarantine; prepare for possible disaster-by-disaster shift (e.g., if water completely exhausted, inventory of contingency measures such as digging, rainwater catching, or alternative water extraction, if feasible).
- Re-evaluate evacuation options; coordinate any potential new support or alternative routes.

2a) Concrete rationing and operational rules (summary)
- Water: primary variable. If desalination runs, add up to 50 person-days of water over days 0–5. If desalination cannot continue, enforce strict water rationing to protect the most essential staff (medical, power, safety) and implement sanitary protocols to extend the available water.
- Food: 60 person-days total; with 50 people, plan for up to ~1.2 days of full provisioning; we must operate under extreme scarcity. Prioritize essential personnel (medical, power/water) with slightly higher rations to sustain them for the duration until evac arrives; otherwise, foods are allocated to minimize mortality.
- Power/Desalination: run desalination only while fuel is available; allocate power to essential life-support and communications only. Preserve a buffer of fuel for critical operations for as long as possible.
- Medical: distribute limited antibiotics to those with serious infections; allocate ventilator to the patient with best prognosis; maintain isolation to protect others.
- Evacuation: prioritize mission-critical personnel; evacuation windows depend on external support.

3) Clarifying questions and data to request (before finalizing)
- Exact roster of staff with roles, ages, comorbidities, and current health status (specifically for the two injured and the contagious case).
- Current stock counts (food, water, antibiotics, ventilator usage, generator fuel, spare parts, PPE inventory) and exact quantities.
- Status and capabilities of the desalination unit (fuel required per day, current fuel on hand, expected daily water yield, maintenance needs).
- Generator capacity, fuel on hand, loads that can be sustained (critical vs. nonessential).
- External support viability: helicopter schedule specifics (fuel needed, load capacity, weather constraints, landing capabilities), alternative evacuation routes.
- The meaning and scope of “mission-critical” personnel as defined by HQ policy (confirm zone to apply the roster).
- Any additional local resources (rainfall, rainwater catchment, scavenged supplies) and any possible salvage options.

4) Three key assumptions likely to be wrong and mitigation
Assumption 1: Evacuation will occur within 10 days with a 70% chance.
- Risk: external support may be delayed or canceled; if wrong, you are left with limited resources for longer than planned.
- Mitigation: build a robust internal survival plan with escalation to in-station care and extended isolation; prepare a fallback to sustain life-support for longer than 10 days; maintain multiple backup evacuation routes or alternative external contact points (e.g., maritime support) if available.

Assumption 2: The contagious illness has manageable transmission and prognosis with isolation and standard precautions.
- Risk: actual contagion could be highly virulent; a mass outbreak could overwhelm isolation; the two injured could worsen.
- Mitigation: treat infection control as absolute; set up a dedicated isolation ward with separate airspace if possible; implement strict PPE protocols; assume high transmission and treat all crew as potential carriers until proven otherwise; consider prophylaxis or clinical management strategies within standard practice.

Assumption 3: Desalination fuel will last as planned (5 days) at the expected rate.
- Risk: actual fuel consumption may be higher due to climate and equipment load; you could lose a critical water source earlier than planned.
- Mitigation: prepare alternative water-saving measures; minimize non-essential water use; identify potential rescue or salvage options to provide more water; keep the desalination unit in reserve to extend its life if needed.

5) Five measurable daily indicators to track and guide decisions
- Indicator 1: Water availability (days of potable water left for all crew; status of desalination unit fuel and water yield).
- Indicator 2: Food availability (days of survival ration left for all crew; status of any new supply).
- Indicator 3: Medical status and infectious risk (number of people in isolation, number of infections, ventilator usage, antibiotic stock levels, patient prognosis).
- Indicator 4: Power resources (generator fuel left; uptime of critical systems; status of desalination and water pumps; battery reserve for communications).
- Indicator 5: Evacuation readiness and external contact (status of helicopter schedule, load capacity, landing feasibility; number of mission-critical personnel identified for evacuation; weather and landing constraints).

6) Three likely failure modes and contingency responses
Failure Mode 1: Water becomes severely constrained before evacuation is possible.
- Contingencies:
  - Enforce strict water rationing for all; prioritize medical staff and essential operations.
  - Accelerate any feasible desalination use; re-evaluate fuel reserves to extend operation.
  - Seek alternative water sources (rain capture if available, or conversion of nonpotable water for non-consumption uses only).
  - If necessary, reduce nonessential activity to cut water waste and reduce exposure risk.

Failure Mode 2: Medical supplies run out or ventilator becomes unavailable just as needs escalate.
- Contingencies:
  - Triage decisions guided by the objective to maximize survival and preserve mission-critical function; shift to palliative care where appropriate for those with poor prognosis.
  - Use antibiotics judiciously; protect the healthcare team and isolation procedures to avoid a broader outbreak.
  - Reallocate ventilator to patient with best prognosis; prepare manual/alternative ventilation approaches if possible, as a last resort; reassign staff to provide noninvasive care if feasible.

Failure Mode 3: Evacuation capacity fails or is delayed beyond your internal plan.
- Contingencies:
  - Secure a robust in-station contingency: sustain core life-support operations with strict rationing; maintain isolation protocols; reinforce shelter and hygiene practices.
  - Reassess and re-prioritize mission-critical roles to sustain the crew until external assistance arrives or alternative transport becomes available.
  - Increase readiness for alternative evacuation routes (sea/air if available); maintain constant communications with HQ or rescue resource providers; maintain a contingency plan schedule.

Bottom line
- This plan accepts extreme scarcity and uses a utilitarian, fairness-based triage to maximize survival, while explicitly protecting the sick and the contagious. It leans on evacuation as the only reasonable path to long-term survival, with a clear, transparent, and revisable criterion for who is evacuated first when capacity is limited.
- If you want, I can convert this into a one-page decision worksheet you can post in the operations room (with checkboxes for the daily indicators, triage criteria, and evacuation priorities) and tailor the numbers to your actual stock counts and the exact roles you confirm as mission-critical.

In [10]:
model_name = "claude-sonnet-4-5"
answer = ChatModel(
    model_name=model_name,
    provider="anthropic",
).generate_response(messages=messages)
print_response(answer)

competitors.append(model_name)
answers.append(answer)

# REMOTE STATION CRISIS MANAGEMENT PLAN

## 1. PRIORITIZED 30-DAY SURVIVAL PLAN

### Ethical Framework
- **Utilitarian baseline**: Maximize total survivors
- **Fairness constraint**: Equal dignity; avoid arbitrary discrimination
- **Triage principle**: Greatest need + greatest chance of benefit
- **Transparency**: All decisions communicated openly to maintain morale/order

### Resource Timeline & Critical Constraints
- **Food**: 60 person-days ÷ 50 = 1.2 days at full rations (CRITICAL)
- **Water**: 200 person-days + (5 days × 10 people/day desal) = 250 person-days ÷ 50 = 5 days baseline
- **Generator fuel**: 8 days (powers desal, medical, comms)

### Day-by-Day Plan

**Days 1-2: IMMEDIATE ACTIONS**
- Convene leadership team; communicate plan transparently
- Implement **strict food rationing**: 40% normal calories (extends to 3 days)
- Quarantine respiratory patient + close contacts (assume 5 people total)
- Inventory all consumables; search for overlooked supplies (storage areas, vehicles, abandoned equipment)
- Assign teams: foraging/fishing, water collection (rainwater, emergency wells), fuel conservation, medical

**Days 3-5: EXTENSION PHASE**
- Food at 40% ration (survival minimum ~1,200 cal/day for sedentary adults)
- Run desal unit 8 hrs/day (not 24) while fuel lasts → save generator fuel, produce water for 5 people/day
- Begin emergency food acquisition: fishing, foraging, inventory non-obvious sources (lab supplies, ornamental plants)
- Prioritize generator use: desal 8h, medical 4h, communications 2h, darkness only
- Triage injured: stabilize with available supplies; reserve ventilator for acute respiratory failure

**Days 6-10: PRE-EVACUATION**
- Food sources: assumed 20% supplement from fishing/foraging = extend to day 4-5 total (STILL CRITICAL)
- Generator fuel exhausted day 8 → no desal → rely on stored water (enough for 5 days at full use)
- **Water rationing begins day 8**: 2L/person/day (survival minimum) extends 200 person-days to 10 days
- Helicopter 70% arrival day 10: prepare LZ, prioritize evacuation list
- **Evacuation priority** (ethical justification below):
  1. Critical medical cases (injured, respiratory patient if unstable)
  2. Personnel with unique survival skills (medical, engineering, comms)
  3. Vulnerable populations (elderly, immunocompromised)
  4. "Mission-critical" only if defensible (researcher with irreplaceable data/expertise)

**Days 11-20: POST-FIRST EVAC**
- Assume helicopter evacuates 10 people day 10 (if arrives) → 40 remain
- Food crisis acute: 40% rations unsustainable beyond day 7-10 without supplementation
- **Aggressive food acquisition**: rationing + fishing/foraging must yield 50% normal calories minimum
- Water: 2L/day rationing continues (10 days for 40 people with 200L reserve post-desal)
- Second evac attempt day 15-17; prioritize next 10

**Days 21-30: ENDURANCE PHASE**
- Assume 20-30 people remain
- Food remains critical constraint; requires sustained foraging success (50+ person-days needed)
- Water manageable at 2L/day
- Medical: manage illness/injury with depleted supplies; palliative care if necessary
- Maintain communications/signaling for rescue

### Quarantine & Triage Rules
- **Quarantine**: Respiratory patient + contacts isolated; separate ventilation if possible; masks mandatory
- **Triage**: 
  - Green: minor injuries, stable
  - Yellow: moderate injuries (current 2), monitor closely
  - Red: life-threatening, immediate care (ventilator if respiratory failure)
  - Black: expectant (comfort care only if resources exhausted)

### Evacuation Prioritization
**Ethical justification**: Balance immediate medical need (non-abandonment principle) with utilitarian goal of preserving survival expertise for remaining population.

1. Acute medical emergencies (prevent death)
2. Contagious patient (reduce transmission risk)
3. Essential personnel for survivors (doctor, engineer, comm specialist)
4. Vulnerable individuals
5. "Mission-critical" reviewed case-by-case (reject if purely bureaucratic)

---

## 2. OPERATIONAL STEPS TO EXTEND SURVIVABILITY

### Immediate (Days 1-3)
- **Food**: Inventory pantries, lab stockpiles, personal stashes; organize fishing/seaweed harvesting/shellfish teams
- **Water**: Deploy rain catchment (tarps, barrels); explore groundwater/natural springs
- **Fuel**: Search for vehicle fuel, generator alternatives (solar if available, wind improvisation)
- **Power**: Implement strict brownout schedule; LED only; shut non-critical systems

### Ongoing (Days 4-30)
- **Morale**: Daily briefings, transparent rationing, shared labor
- **Health**: Hygiene protocols (conserve water but prevent disease); monitor quarantine
- **Signaling**: Maintain emergency beacon, visual signals (mirrors, fires, SOS markings)
- **Innovation**: Repurpose equipment (battery banks, vehicle fuel for generator, medical supplies for food preservation)

---

## 3. CLARIFYING QUESTIONS & DATA NEEDED

1. **Personnel**: Exact skills inventory (medical training, fishing, engineering, foraging knowledge)? Health baseline (chronic conditions, medications needed)?
2. **Environment**: Island size/terrain? Edible flora/fauna? Freshwater sources? Weather forecast beyond 10 days?
3. **Equipment**: Solar panels, batteries, vehicles (fuel?), fishing gear, purification tablets, additional medical supplies?
4. **Communications**: Satellite phone functional? Can we request supply drop before full evac?
5. **Headquarters policy**: Define "mission-critical" clearly; legal/ethical constraints on triage decisions?
6. **Contagious illness**: Confirmed diagnosis (COVID, flu, TB)? Transmission rate? Incubation period?

---

## 4. THREE KEY ASSUMPTIONS LIKELY WRONG & MITIGATION

### Assumption 1: Fishing/foraging will yield 20% caloric supplement
**Why wrong**: Overfishing, weather, lack of expertise, poisonous species  
**Test**: Day 1-2 trial runs; assess yield  
**Mitigate**: Reduce ration baseline to 35%; train teams; diversify sources (birds, insects, algae)

### Assumption 2: Helicopter has 70% chance day 10 and can evacuate in 2-3 trips
**Why wrong**: Weather delays, mechanical failure, capacity overestimated, prioritization disputes  
**Test**: Daily comms check for updated ETA; confirm capacity  
**Mitigate**: Plan for day 20 first arrival; pre-assign evac groups to avoid conflict; prepare for single-trip scenario (10 people max)

### Assumption 3: Respiratory illness is moderate (not COVID-19/TB with high severity)
**Why wrong**: Could incapacitate 50%+ of personnel, overwhelm medical  
**Test**: Symptom monitoring; antibody/rapid test if available  
**Mitigate**: Aggressive quarantine (1/3 of station isolated); mask mandates; designate care teams to limit exposure

---

## 5. FIVE DAILY MEASURABLE INDICATORS

1. **Caloric intake per person**: Target ≥1,200 cal/day; if <1,000 two consecutive days → emergency foraging/ration reserves
2. **Water reserve volume**: Target >100L buffer; if <50L → extreme rationing (1.5L/day), halt non-drinking use
3. **New illness cases**: If >2 new respiratory cases/day → expand quarantine, revise evac priority
4. **Foraging yield (kg/day)**: Target 15kg fish/edibles; if <10kg → shift labor, explore new areas
5. **Morale/conflict incidents**: If >3 disputes/day or refusal to comply → convene assembly, address grievances, adjust transparency

**Decision rule**: Two indicators in red zone for 48h → convene crisis meeting; consider radical options (early evac of non-essential via risky weather window, extreme rationing, splitting population to satellite site)

---

## 6. THREE FAILURE MODES & CONTINGENCIES

### Failure Mode 1: STARVATION (food runs out day 7-10, no supplement)
**Trigger**: Foraging yields <5kg/day by day 5  
**Contingency**:  
- Immediate meeting: voluntary fasting rotation (medically cleared individuals skip meals to extend rations for vulnerable)  
- Slaughter/consume non-essential animals (if any research specimens, pets)  
- Emergency signal for supply drop (flares, radio)  
- If helicopter delayed beyond day 15: consider controlled abandonment of station to forage-rich area if known

### Failure Mode 2: DISEASE OUTBREAK (respiratory illness spreads to 20+ people)
**Trigger**: >5 new cases by day 5  
**Contingency**:  
- Full lockdown: divide station into isolated cohorts (healthy, exposed, sick)  
- Reallocate ventilator via triage protocol (highest survival probability)  
- Evac priority shifts entirely to medical cases if helicopter arrives  
- Palliative care protocols for expectant cases (pain management, comfort)  
- Healthy cohort maintains operations remotely to prevent total collapse

### Failure Mode 3: NO EVACUATION BY DAY 20 (helicopter fails, storm continues)
**Trigger**: Day 18 confirmation of delay beyond day 25  
**Contingency**:  
- **Ration extension**: Drop to 800 cal/day survival minimum (risk long-term health but extend timeline)  
- **Boat option**: If any seaworthy vessels, risk voyage to nearest land (high-risk, only if starvation imminent)  
- **Psychological support**: Prevent panic/suicide; structured routines, hope messaging  
- **Triage shift**: Preserve youngest/healthiest with best long-term survival odds; comfort care for others if resources critical  

---

## SUMMARY RISK ASSESSMENT

**Survival probability**: 
- 70% if helicopter arrives day 10-15 and food supplementation achieves 15%  
- 40% if helicopter delayed to day 20-25  
- 10% if no evac by day 30 and foraging fails

**Greatest risk**: Food scarcity (non-negotiable physics of 60 person-days for 50 people)  
**Mitigation priority**: Fishing/foraging success + transparent rationing + early evac of non-essential personnel

In [11]:
model_name = "gemini-2.5-flash"
answer = ChatModel(
    model_name=model_name,
    provider="google",
).generate_response(messages=messages)
print_response(answer)

competitors.append(model_name)
answers.append(answer)

As the sole decision-maker, my plan prioritizes maximizing overall survival, maintaining order, and preparing for the worst while hoping for the best.

**Ethical Principles Applied:**
1.  **Utilitarianism:** Actions aiming for the greatest good for the greatest number, focusing on overall survival rates.
2.  **Justice/Fairness:** Equitable distribution of burdens and resources where possible, but overridden by utilitarian concerns in medical triage and (potentially) HQ-mandated evacuation.
3.  **Stewardship:** Responsible and conservative management of scarce resources.
4.  **Non-maleficence:** Avoiding preventable harm.
5.  **Beneficence:** Acting in the best interest of the community.

---

**(1) Prioritized, Day-by-Day 30-Day Survival and Care Plan**

*   **Resource Allocation & Rationing (Ethical Principle: Utilitarianism, Stewardship):**
    *   **Food (60 Person-Days Total):**
        *   **Days 1-30:** Initiate immediate, severe rationing. All 50 personnel receive a measured **critically minimal ration equivalent to ~12% of normal daily caloric intake (approx. 240 kcal/day)**. This aims to stretch the 60 person-days of food to cover ~10 days at an absolute survival minimum for 50 people (60/50 = 1.2 days normal; 1.2 / 0.12 = 10 days at 12% ration). For a 30-day plan, this means any food beyond Day 10 is highly improbable without other sources. This pushes the limits of human endurance.
    *   **Water (200 PPD stored + 50 PPD desal = 250 PPD total available for 5 days):**
        *   **Day 1:** Implement strict water conservation.
        *   **Days 1-5:** Activate desalination unit. Ration water to **1.5 L/person/day**. This allows for critical hydration and minimal hygiene while maximizing desalination output. (Consumes ~150 PPD, leaving 100 PPD remaining in storage).
        *   **Days 6-10:** Desalination unit fuel exhausted. Rely on remaining 100 PPD. Ration to **1.0 L/person/day**. (Consumes 50 PPD, leaving 50 PPD).
        *   **Days 11-30:** Desalination unit non-functional. Rely heavily on alternative collection methods (see below). Remaining 50 PPD stored for medical emergencies or distributed at **0.5 L/person/day** in rotation, prioritizing the most vulnerable.
    *   **Generator Fuel (8 Days Total):**
        *   **Day 1:** Prioritize power for: medical (ventilator, refrigeration for meds), critical communications, and the desalination unit (for 5 days). All non-essential systems powered down.
        *   **Days 2-5:** Maintain power for prioritized systems, including desalination. Initiate data backup for all research and operational data onto portable storage.
        *   **Days 6-8:** Desalination unit off. Generator powers only critical medical equipment (ventilator, vital monitoring) and intermittent, scheduled communication attempts.
        *   **Days 9-30:** Generator fuel exhausted. Rely on limited battery backups for critical medical (ventilator, if needed) and emergency lighting/communications. Prepare for complete blackout.

*   **Quarantine & Triage Rules (Ethical Principle: Utilitarianism, Non-maleficence, Beneficence):**
    *   **Quarantine:**
        *   **Day 1:** Immediately isolate the individual with contagious respiratory illness (CRI) in a designated, well-ventilated area. Implement strict barrier nursing (PPE, separate waste, dedicated caregivers). All personnel interacting with this individual must use available PPE.
        *   **Ongoing:** Daily symptom screening for all personnel. Any new symptomatic individuals are immediately isolated. Implement social distancing and mask-wearing (even improvised) for all personnel.
    *   **Triage (Medical):**
        *   **Priority 1 (Red):** Critically ill/injured with a high probability of survival with intensive care (e.g., ventilator candidate with good prognosis, severe injury).
        *   **Priority 2 (Yellow):** Moderately ill/injured, stable but requiring ongoing care. The two moderately injured fall here.
        *   **Priority 3 (Green):** Minor injuries, able to self-care.
        *   **Priority 4 (Black):** Unlikely to survive even with full care, or already deceased. Palliative care only.
        *   **Ventilator Allocation:** If multiple patients require the single ventilator, it will be allocated based on the highest likelihood of full recovery, aiming to maximize overall survival. "Mission-critical" status is secondary to medical prognosis in this specific critical resource allocation.

*   **Evacuation Prioritization (Ethical Principle: Justice, Utilitarianism, HQ Policy Adherence):**
    *   **Day 1-10 (Pre-Evacuation):**
        *   **Clarify Roster:** Immediately contact HQ to clarify the "mission-critical personnel" roster. If no clarification, a local council will define criteria based on immediate station survival (medics, engineers) and long-term research continuity.
        *   **Prioritization Tiers (if limited capacity):**
            1.  **Life-Threatening Injury/Illness (stable enough to move) AND Mission-Critical Status:** E.g., a critical medic needing advanced care.
            2.  **Mission-Critical Personnel (healthy/minor injury):** As per HQ directive.
            3.  **Non-Mission-Critical Personnel with Life-Threatening Injury/Illness (stable enough to move):** E.g., CRI patient requiring external hospital care.
            4.  **Non-Mission-Critical Personnel (healthy/minor injury):** Prioritize based on vulnerability (if present, e.g., pregnant personnel), then by lottery to ensure fairness.

---

**(2) Operational Steps to Extend Survivability & Increase Evacuation Probability**

*   **Extend Survivability:**
    *   **Water:** Implement aggressive rainwater harvesting, dew collection, and solar stills immediately. Minimize physical exertion to reduce water consumption.
    *   **Food:** Strictly enforce minimal activity. Explore local foraging (with expert guidance only, if available) for edible and safe flora/fauna.
    *   **Power:** Conduct a station-wide energy audit. Disconnect all non-essential loads. Implement manual alternatives where possible (e.g., hand tools).
    *   **Disease Prevention:** Enforce rigorous personal hygiene using minimal water, use improvised masks, and maintain strict social distancing.
    *   **Morale:** Conduct regular, transparent briefings. Encourage mutual support, light duties, and structured rest to combat psychological stress.
*   **Increase Evacuation Probability:**
    *   **Communication:** Maintain scheduled, intermittent radio/satellite contact attempts to conserve power. Prepare concise status reports for rescue.
    *   **Landing Zone:** Clear and mark primary and secondary helicopter landing zones (HLZs) for day/night operations. Designate and train an HLZ team.
    *   **Readiness:** Prepare evacuation bags for all personnel (essential documents, minimal personal items). Keep a manifest with health status ready.
    *   **Signals:** Prepare high-visibility markers and distress signals (e.g., signal mirrors, pyrotechnics for when a craft is sighted).

---

**(3) Clarifying Questions and Data to Request**

1.  **"Mission-Critical" Roster:** A definitive list or clear criteria from Headquarters.
2.  **CRI Specifics:** Confirmed pathogen, severity, contagiousness, and any specific treatment guidelines/medications (if not already known).
3.  **Medical Resources:** Exact inventory of all medical supplies, especially antibiotics, IV fluids, and oxygen if available. Number and expertise of medical staff.
4.  **Island Resources:** Detailed knowledge of potential alternative water sources (springs, wells, reliable rainfall patterns) and local edible flora/fauna.
5.  **Communication Capabilities:** Full specifications (range, power) of all communication equipment and specific frequencies/times for contacting HQ/rescue.

---

**(4) Three Key Assumptions Most Likely to Be Wrong & Mitigation**

1.  **Assumption: Helicopter reaches the island on Day 10 (70% probability).**
    *   **Mitigation:** Plan comprehensively for a 30-day (or indefinite) scenario *without* evacuation. Aggressively conserve resources from Day 1. Beyond Day 10, shift to extreme long-term survival protocols: further reduced rations, increased efforts for local resource acquisition, and continuous passive SAR (e.g., visual signals).
2.  **Assumption: The CRI is contained and does not overwhelm medical resources.**
    *   **Mitigation:** Implement aggressive, universal preventative measures (masks, distancing, hygiene) for *all* personnel immediately. Prepare multiple isolation zones. Develop clear protocols for alternative ventilation methods (e.g., manual resuscitation bags, prone positioning) if the ventilator becomes insufficient or unusable. Re-evaluate medical triage prioritizing those with highest chance of survival without advanced support if resources are overwhelmed.
3.  **Assumption: Personnel will maintain morale and strict compliance with severe rationing.**
    *   **Mitigation:** Foster absolute transparency about the situation, the plan, and its ethical basis. Maintain visible, strong leadership. Implement a system for daily check-ins and address any signs of stress, non-compliance, or conflict promptly and fairly. Assign meaningful tasks to maintain engagement and purpose. Emphasize collective survival.

---

**(5) Five Measurable Indicators to Track Daily**

1.  **Resource Levels:** Daily inventory of food, potable water, and fuel against planned consumption.
2.  **Health Status:** Number of new CRI-symptomatic cases, vital signs of isolated/injured personnel, and overall health trends (dehydration, severe fatigue).
3.  **Communication Status:** Number of successful communication attempts, updates received, and outbound messages sent.
4.  **Operational Systems Status:** Functionality of critical systems (generator, desalination, communication gear, medical equipment).
5.  **Morale & Order:** Subjective assessment from leadership, number of reported conflicts, instances of non-compliance, and general psychological well-being.

---

**(6) Three Most Likely Failure Modes & Contingency Responses**

1.  **Failure Mode: Complete Communication Blackout & Failed Evacuation (Station isolated indefinitely).**
    *   **Contingency:** Initiate "Phase 2" of the water plan (0.5 L/person/day, maximum alternative collection). Activate **extreme long-term survival protocol**: further reduce food rations (if any remains), aggressively pursue all possible local food/water sources. Reallocate personnel to survival-focused tasks. Conserve all remaining power for essential medical/emergency signaling.
2.  **Failure Mode: Widespread CRI Outbreak, Overwhelming Medical Capacity.**
    *   **Contingency:** Expand isolation facilities immediately. Implement strict "universal precautions" (everyone acts as if infected) including improvised PPE for all. Re-evaluate medical triage based strictly on survival probability with available minimal resources, potentially reassigning personnel to emergency medical support roles.
3.  **Failure Mode: Significant Breakdown of Discipline/Order (e.g., theft of rations, refusal of quarantine).**
    *   **Contingency:** Reiterate the shared ethical framework and the necessity of rules for collective survival. Address non-compliance firmly but fairly, emphasizing the severe consequences for the entire group. Establish a small, trusted team to ensure security and fair distribution. Conduct frequent, honest briefings to manage expectations and provide emotional support.
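
The person-day arithmetic quoted in the answer above (60/50 = 1.2 days at a normal ration, 1.2 / 0.12 = 10 days at a 12% ration) is easy to check by hand or in a cell. The following is a minimal sketch; the helper name `days_of_supply` is hypothetical and is not part of the notebook or of `chat_factory`.

```python
def days_of_supply(person_days: float, people: int, ration_fraction: float = 1.0) -> float:
    """Days a stock lasts when every person consumes `ration_fraction` of a normal daily ration.

    Hypothetical helper for illustration only.
    """
    if people <= 0 or ration_fraction <= 0:
        raise ValueError("people and ration_fraction must be positive")
    return person_days / (people * ration_fraction)


# Figures taken from the scenario as quoted in the answers above
PEOPLE = 50
FOOD_PERSON_DAYS = 60
WATER_PERSON_DAYS = 200 + 10 * 5  # stored water plus 5 days of desalination at 10 person-days/day

food_full = days_of_supply(FOOD_PERSON_DAYS, PEOPLE)           # normal ration
food_reduced = days_of_supply(FOOD_PERSON_DAYS, PEOPLE, 0.12)  # 12% ration
water_full = days_of_supply(WATER_PERSON_DAYS, PEOPLE)         # normal water ration

print(f"Food at full ration:  {food_full:.1f} days")
print(f"Food at 12% ration:   {food_reduced:.1f} days")
print(f"Water at full ration: {water_full:.1f} days")
print(f"Person-days needed for 30 days: {PEOPLE * 30}")
```

A check like this is a quick way to spot answers whose headline numbers do not match their own working.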

In [12]:
# Ask DeepSeek the same question
model_name = "deepseek-chat"
answer = ChatModel(
    model_name=model_name,
    provider="deepseek",
).generate_response(messages=messages)
print_response(answer)

# Record the model name and its answer alongside the earlier ones
competitors.append(model_name)
answers.append(answer)

**1. 30‑Day Survival & Care Plan (Day‑by‑Day Outline)**  

*Ethical Principles:*  
- **Utilitarianism:** Maximize number of survivors.  
- **Duty of Care:** Protect the vulnerable, treat the sick.  
- **Procedural Justice:** Transparent rationing, no arbitrary favoritism.  
- **Mission Continuity:** Prioritize personnel essential to survival systems.  

**Days 1–5:**  
- **Food:** Ration to 1,200 kcal/person/day (half normal), using 50 person‑days of food over 5 days → extends supply.  
- **Water:** Use desalination (10 people/day capacity) + stored water for others; fuel for desalination drawn from generator fuel reserve (prioritize water over electricity).  
- **Power:** Generator runs 4 hours/day for comms, medical equipment, desalination.  
- **Medical:** Isolate symptomatic respiratory case; treat injured with available antibiotics if infection signs appear; conserve ventilator for extreme respiratory failure.  
- **Evacuation:** Identify 10 “mission‑critical” personnel as those operating comms, desalination, medical, and repair systems.  

**Days 6–10:**  
- **Food:** Continue 1,200 kcal ration.  
- **Water:** Desalination stops when fuel runs out (Day 6); use stored water, ration to 1 L/person/day (200 person‑days supply lasts 50 people ~4 days → until Day 10).  
- **Power:** Generator off except 1 hour/day for emergency comms.  
- **Medical:** If respiratory illness spreads, quarantine zone; triage rule: ventilator only if recovery likely within 10 days and no second critical case.  
- **Evacuation Prep:** Prepare landing zone, prioritize 2 injured + 1 critical medical + 5 mission‑critical for first helicopter lift (if arrives Day 10).  

**Days 11–30 (post‑helicopter chance):**  
- If no rescue by Day 10, assume 70% chance soon after.  
- **Water:** Emergency collection (rain, condensation) using tarps; ration 0.5 L/person/day if necessary.  
- **Food:** Reduce to 800 kcal/person/day to stretch to Day 30.  
- **Power:** No generator; use solar/battery if available.  
- **Medical:** Palliative care if antibiotics exhausted; strict isolation.  
- **Evacuation:** If helicopter arrives, order: (1) most medically urgent (ventilator-dependent if any), (2) injured, (3) mission‑critical, (4) others by lottery.  

---

**2. Operational Steps to Extend Survivability & Increase Evacuation Odds**  
- Form teams: water collection, food rationing/distribution, sanitation, comms watch.  
- Rainwater catchment immediately using all containers/tarps.  
- Attempt to repair/rig alternative power (solar, wind).  
- Daily briefings to maintain morale and compliance.  
- Signal mirrors, fire for visibility.  
- Conserve antibiotics for clear bacterial infections; use antiseptics for wound care.  

---

**3. Clarifying Questions & Data Needed**  
- Exact caloric content of food stores.  
- Inventory of rainwater catchment capability (tarp area, containers).  
- Medical details: Is respiratory illness bacterial/viral? Antibiotic sensitivity?  
- Generator fuel possible to divert to desalination without losing all comms?  
- Does “mission‑critical” roster exist in sealed orders?  

---

**4. Three Key Assumptions Likely Wrong & Mitigations**  
- **Assumption 1:** Water ration of 1 L/day sufficient for moderate work.  
  *Test:* Monitor for dehydration signs; adjust if >10% show symptoms.  
- **Assumption 2:** Respiratory illness will spread slowly.  
  *Mitigation:* Assume faster spread; enforce strict mask‑making from cloth, separate quarters.  
- **Assumption 3:** Helicopter arrival probability independent of weather post‑storm.  
  *Mitigation:* Assume longer delay; intensify water collection and food foraging plans.  

---

**5. Five Measurable Daily Indicators to Change Course**  
1. Number of new febrile/respiratory cases.  
2. Water stores vs. collection rate.  
3. Group morale/cohesion incidents.  
4. Caloric deficit symptoms (weakness, collapse).  
5. Communication status (any contact received).  

---

**6. Three Most Likely Failure Modes & Contingencies**  
- **Failure Mode 1:** Respiratory outbreak overwhelms quarantine.  
  *Response:* Isolate by small groups (cohort quarantine), allocate ventilator by triage committee draw.  
- **Failure Mode 2:** Water runs out before rescue.  
  *Response:* Emergency distillation using fire and fuel reserves, prioritize water for sick.  
- **Failure Mode 3:** Mutiny/ration hoarding.  
  *Response:* Immediate public redistribution, rotate team leaders, involve group in rationing decisions.

In [13]:
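# Ask gpt-oss-120b the same question, served through the Groq provider
model_name = "openai/gpt-oss-120b"
answer = ChatModel(
    model_name=model_name,
    provider="groq",
).generate_response(messages=messages)
print_response(answer)

# Record the model name and its answer alongside the earlier ones
competitors.append(model_name)
answers.append(answer)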

**Situation at a glance**  
- **People:** 50 (including 2 moderately injured, 1 with probable contagious respiratory illness)  
- **Food:** 60 person‑days (≈ 1.2 days for everybody) – **must be supplemented** by local foraging/fishing.  
- **Water:** 200 person‑days + desal unit = 10 p‑days / day for 5 days → **250 p‑days total** – far short of the 1 500 p‑days needed for 30 days.  
- **Fuel:** enough for 8 days of generator (power → desal unit, medical equipment, heating).  
- **Medical:** routine supplies, 1 ventilator, limited antibiotics.  
- **Evacuation:** helicopter 70 % chance of arrival in 10 days, capacity ≈ 20 people (typical medium‑lift).  
- **Policy:** “Mission‑critical personnel” get evacuation priority, but the roster is vague.  

Below is a **concise, day‑by‑day 30‑day survival & care plan** followed by the other requested sections. All decisions are framed by the **four core bio‑ethical principles** – *beneficence, non‑maleficence, justice (fair distribution), and respect for persons* – and by a **dual‑utilitarian/priority‑to‑the‑most‑vulnerable** approach.

---

## 1️⃣ 30‑Day Survival & Care Plan (resource allocation, rationing, quarantine, triage, evacuation)

| Phase (Days) | Core Goal | Food (person‑days) | Water (person‑days) | Fuel use | Power allocation | Ration schedule | Quarantine & infection control | Medical triage | Evacuation priority |
|--------------|-----------|--------------------|---------------------|----------|-------------------|-----------------|--------------------------------|----------------|----------------------|
| **0‑2** (Prep) | Stabilise injuries, isolate sick, start foraging | **All 60 p‑days used** – give each person **0.5 kg** high‑calorie ration (≈ 800 kcal) per day. | **200 p‑days** → 2 L / person / day (max safe). Use desal unit **full‑time** (10 p‑days / day). | **Fuel day 0‑2** → generator on for heating, medical equipment, desal unit (≈ 1 L / day). | Power → heating, water‑purifier, ventilator (if needed). | **Day‑0:** 800 kcal + 2 L water. **Day‑1‑2:** 600 kcal + 1.5 L water (gradual reduction). | • Separate “Sick‑zone” (1‑person cabin + bathroom).<br>• PPE (N‑95, gloves) for all caregivers.<br>• No sharing of utensils, strict hand‑washing with boiled water.<br>• Daily symptom check. | • Immediate care for 2 injured (analgesia, wound care).<br>• Suspected case placed in isolation; start empiric broad‑spectrum antibiotic pending test.<br>• Ventilator held in reserve – only if respiratory failure develops. | • Identify *de‑facto* mission‑critical staff (research leads, power/engineer, medical, communications). |
| **3‑5** (Fuel window) | Maximise water production, build food stocks, decide who stays vs. who can be evacuated (if helicopter arrives early). | **Continue 0.5 kg rations** (≈ 800 kcal). Begin **daily fishing/shore‑line foraging** – aim for **≥ 1 kg protein** per day → adds ~1 200 kcal/person (shared). | Desal unit **10 p‑days / day** (total +30 p‑days) → **230 p‑days** water. Reduce drinking to **1.2 L** per person (still safe). | **Fuel days 3‑5** → keep generator on for desal, heating, ventilator standby. | Same as above. | **Day‑3‑5:** 800 kcal + foraged food (average 500 kcal extra) → **≈ 1 300 kcal** total.<br>Water 1.2 L. | Continue strict isolation of the sick; assign two “clean‑team” members (engineer + medic) to handle waste & PPE. | Same as above; monitor for worsening respiratory signs. | **If helicopter lands (prob ≈ 0.7):**<br>‑ **First wave (≈ 15 people):** mission‑critical (research leads, engineers, medical staff, one injured).<br>‑ **Second wave (≈ 5 people):** high‑risk (elderly, comorbid).<br>‑ Remaining 30 stay for further rescue attempts. |
| **6‑10** (Post‑fuel) | Conserve power, switch to **manual water‑collection** (rain, solar stills), further cut rations, maintain isolation. | **Ration down to 0.4 kg** (≈ 600 kcal). Rely on **daily catch** (≥ 0.8 kg) → **≈ 1 000 kcal** total/person. | **No generator** → desal unit offline. Use **rain‑catch (if any)**, **solar stills** (≈ 0.5 L / person / day) + **grey‑water recycling** (treat & reuse for drinking). Project **≈ 70 p‑days** extra water (total ≈ 300 p‑days). | **Fuel exhausted** – generator off. | Power only from **battery backups** (keep lights & small communications on). | **Day‑6‑10:** 600 kcal + catch = 1 000 kcal.<br>Water 1 L (mix of rain + recycled). | **Cohort quarantine** – keep sick in same cabin; rotate caregivers (2‑person “clean‑team”) no‑contact with rest. | *Triage level‑2*: If ventilator needed, use only for **younger, otherwise healthy** patient; otherwise provide high‑flow O₂ via mask. | **If helicopter delayed (>10 d):** keep the 15 already‑evacuated on board; prepare **second‑wave** (next 10) for possible later rescue (see “operational steps”). |
| **11‑20** (Sustenance stretch) | **Survival mode** – extreme rationing, maximal water recycling, limit activity to conserve calories. | **Ration 0.3 kg** (≈ 450 kcal). **Catch** may drop to 0.5 kg → **≈ 800 kcal** total. | **Water:** 0.8 L per person (minimum for hydration). Use **condensation traps** in sleeping quarters. | No fuel. | Battery‑only for essential comms & ventilator (if in use). | **Day‑11‑20:** 800 kcal, 0.8 L water. | **Isolation continues** – no mixing of zones. PPE limited to masks + gloves. | *Triage level‑3*: Only treat life‑threatening injuries that are reversible; shift to palliative care for those unlikely to survive >48 h without ventilation. | **If second‑wave rescue becomes possible (e.g., weather clears, other aircraft):** evacuate the **most medically stable** (including those with low risk of infection). |
| **21‑30** (Last‑ditch) | **Maintain life** until rescue or inevitable depletion. | **Ration 0.2 kg** (≈ 300 kcal). **Catch** may be negligible → **≈ 500 kcal** total. | **Water:** 0.6 L per person (minimum). Prioritise **children/elderly** for the last water. | No fuel. | Keep **ventilator on only if patient is <30 y and otherwise healthy** (unlikely). | **Day‑21‑30:** 500 kcal, 0.6 L water. | **Strict isolation** of any remaining symptomatic case; if infection spreads, **quarantine by cabin**. | *Triage level‑4*: No invasive interventions; focus on comfort, infection control, and documentation. | **If rescue finally arrives:** Evacuate **all remaining** regardless of mission‑critical label (principle of *rescue* overrides policy when survival is at stake). |

### Key Ration Numbers (per person)

| Resource | Total available | Daily target (30 d) | Adjustment needed |
|----------|----------------|---------------------|-------------------|
| Food (kcal) | 60 p‑days ≈ 48 000 kcal | ≈ 1 600 kcal | **Shortfall** → **+ ≈ 1 200 kcal** from fishing/foraging each day |
| Water (L) | 200 p‑days ≈ 200 L | ≈ 2 L | **Shortfall** → **+ ≈ 1 L** from rain/solar stills, grey‑water recycling |
| Fuel (L) | 8 days generator | Power for desal & heating first 5 d | Then **no power** – manual methods only |

---

## 2️⃣  Operational steps to extend survivability & boost evacuation odds  

1. **Accelerate local food/water production**  
   - Deploy all able‑body (except two injured & isolated sick) to **day‑time fishing, crab traps, shore‑line foraging, and tide‑pool harvesting**. Target ≥ 1 kg protein/day.  
   - Build **solar‑still arrays** (≥ 6 m²) using clear plastic sheeting and salt‑water to produce ~0.5 L / person / day.  
   - Set up **rain‑water catch‑systems** on every roof; store in sanitized barrels (chlorinate).  

2. **Power management**  
   - Use generator only for **critical 4 h/night** (ventilator, heating, communications).  
   - Shut down non‑essential lights; use **LED lanterns + headlamps** on battery.  

3. **Medical preparedness**  
   - Isolate the respiratory case in a **sealed cabin** with a dedicated **exhaust vent** (open to sea).  
   - Implement **“buddy‑check”** every 6 h for vitals, wound status, hydration.  
   - Use antibiotics judiciously: treat only proven bacterial infection or severe sepsis.  

4. **Communication & rescue coordination**  
   - Operate **satellite phone** on a **2‑hour on/off schedule** to conserve battery, reporting daily status.  
   - Pre‑prepare **evacuation manifest** with clear “mission‑critical” list (see below).  
   - Relay **real‑time weather** to rescue pilots; request **additional assets** (e.g., UAV for supply drops).  

5. **Psychosocial care**  
   - Conduct **twice‑daily briefings** (30 min) for transparency, morale, and to reinforce ration discipline.  
   - Assign **mental‑health “check‑in”** duties to the medic (short 5‑min chats).  

6. **Evacuation staging**  
   - Designate **two “ready‑rooms”** near the helipad; keep a **minimum of 5 L water** and **2 kg high‑energy food** per person in each room.  
   - Rotate **clean‑team** to keep these rooms stocked without contaminating them.

---

## 3️⃣  Clarifying questions / data needed before finalizing  

| Category | Specific question |
|----------|-------------------|
| **Personnel** | Exact list of research projects, their criticality, and which staff are indispensable for mission continuity. |
| **Health** | Rapid diagnostic test results for the respiratory case (viral vs. bacterial) and current infection‑control capability (e.g., HEPA filter availability). |
| **Environment** | Weather forecast for the next 14 days (rain probability, wind speed) – crucial for water‑catch and helicopter safety. |
| **Infrastructure** | Capacity of solar‑still/ rain‑catch installations (surface area, storage volume). |
| **Logistics** | Exact payload & passenger capacity of the rescue helicopter, and any alternative assets (e.g., sea‑craft). |
| **Fuel** | Quantity of fuel remaining (liters) and exact generator fuel consumption per hour at load. |
| **Medical** | Stock‑levels of antibiotics, analgesics, IV fluids, and any spare ventilator parts. |
| **Legal / Policy** | Headquarters’ official definition of “mission‑critical” and any standing orders on “life‑saving versus mission‑preserving” decisions. |

---

## 4️⃣  Three assumptions most likely to be wrong & mitigation  

| # | Assumption | Why it may be wrong | Test / Mitigation |
|---|------------|---------------------|-------------------|
| 1 | **Rainfall will be sufficient** to supplement water via catch‑systems. | Local climate can be arid; a storm may bring wind‑driven rain but little collection. | **Monitor weather hourly**; if < 5 mm rain in 48 h, **activate emergency water ration (0.5 L/person)** and consider **melting sea‑ice** (if present) or **condensation traps**. |
| 2 | **Fishing will reliably yield ≥ 1 kg protein/day**. | Overfishing, low tide, or equipment failure may reduce catch. | Conduct **daily catch log**; if < 0.5 kg for two consecutive days, **re‑allocate more people** to foraging, and **prepare edible sea‑weed/kelp** as backup. |
| 3 | **Helicopter will have 70 % chance of arrival in 10 days**. | Weather could ground it longer; mechanical issues could abort. | **Plan for a “no‑rescue” scenario** after day 12: begin **second‑wave evacuation** via **sea‑craft** (inflatable boats) if sea state permits, and **pre‑position a cache** of emergency rations near the helipad for later use. |

---

## 5️⃣  Five measurable daily indicators (go/no‑go triggers)

| Indicator | Target / Threshold | Action if breached |
|-----------|-------------------|---------------------|
| **Water balance** (L available / person‑day) | ≥ 1.0 L (Days 0‑5), ≥ 0.8 L (Days 6‑10), ≥ 0.6 L (Days 11‑30) | Reduce activity, increase condensation, consider **early evacuation** of non‑critical. |
| **Caloric intake** (kcal / person) | ≥ 800 kcal (first 5 d), ≥ 600 kcal (6‑15 d), ≥ 400 kcal (16‑30 d) | Intensify foraging, cut non‑essential personnel, re‑evaluate evacuation priority. |
| **Infection spread** (new symptomatic cases) | 0 new cases per 24 h | If ≥ 1 new case, **tighten quarantine**, possibly **segregate cabins** and **reduce group size** in common areas. |
| **Fuel remaining** (hours of generator) | > 24 h for next 2 days | If < 24 h, shut down non‑essential loads immediately; prioritize **ventilator & heating**. |
| **Rescue probability** (weather & communications) | ≥ 60 % chance of safe landing in next 48 h (based on forecast) | If < 60 %, **delay evacuation** and keep “ready‑rooms” stocked; if < 30 % after day 12, switch to **sea‑craft extraction** plan. |

---

## 6️⃣  Three most likely failure modes & contingency responses  

| Failure Mode | Why it’s likely | Immediate contingency |
|--------------|----------------|------------------------|
| **A. Water shortage leading to dehydration** | Over‑optimistic rain/solar‑still output, leaks, or contamination. | **Activate emergency water ration (0.5 L)**, **stop all non‑essential water use**, **increase grey‑water recycling**, and **prioritize water for vulnerable (children, ill)**. If still insufficient after 24 h, **initiate “water‑sharing protocol”**: rotate 2‑person groups to a single water source, limiting exposure time. |
| **B. Respiratory illness outbreak** | Incomplete isolation, PPE breach, aerosol spread in confined quarters. | **Seal the sick cabin**, **evacuate the isolated patient** on the first available lift (even if not mission‑critical), **implement cohort quarantine** (one cabin per group), and **distribute prophylactic antibiotics** only to those with confirmed bacterial super‑infection. |
| **C. Helicopter abort / no‑rescue** | Weather deterioration, mechanical failure, fuel shortage at base. | **Switch to secondary extraction:** inflate **two 4‑person inflatable boats** (already stored near shoreline) and conduct **day‑time sea‑evac** to the nearest inhabited island (if within 5 km) for **interim shelter & resupply**. Meanwhile, **consolidate survivors** into one shelter to reduce exposure and conserve resources. |

---

### Ethical justification for the trade‑offs

1. **Beneficence & Non‑maleficence:**  All actions aim to **maximise survival** (food, water, medical care) while **minimizing harm** (preventing infection spread, avoiding futile invasive care).  
2. **Justice (fair distribution):**  Rations are **scaled to need** (more water for vulnerable, calories equalized).  The “mission‑critical” list is **transparent**, based on objective roles, not subjective favoritism.  
3. **Respect for Persons:**  Individuals are **informed daily** about the plan, ration levels, and the possibility of evacuation or palliative care.  Consent is obtained for any invasive treatment (ventilator) where possible.  
4. **Utilitarian + Prioritarian blend:**  The plan first seeks **the greatest number of lives saved** (resource stretching, infection control) and then **prioritizes the most vulnerable** (injured, sick, children) when resources become critically scarce.  

*In short, the plan balances mission continuity with the overriding moral duty to preserve human life under extreme scarcity.*
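
The cells above all follow the same pattern: call a model, print the answer, then append to `competitors` and `answers`. One way to factor that out is sketched below. The helper `ask_and_record` and the stub `fake_generate` are hypothetical names introduced only for this illustration, and the demo lists are prefixed with `demo_` so they do not overwrite the notebook's own variables.

```python
from typing import Callable, Dict, List

Messages = List[Dict[str, str]]


def ask_and_record(
    model_name: str,
    generate: Callable[[Messages], str],
    messages: Messages,
    competitors: List[str],
    answers: List[str],
) -> str:
    """Call `generate`, then record the model name and its answer at the same index."""
    answer = generate(messages)
    competitors.append(model_name)
    answers.append(answer)
    return answer


def fake_generate(messages: Messages) -> str:
    """Stand-in for a real model call so the sketch runs without any API key."""
    return f"(stub reply to: {messages[-1]['content']})"


demo_competitors: List[str] = []
demo_answers: List[str] = []
demo_messages: Messages = [{"role": "user", "content": "Example question"}]

ask_and_record("stub-model", fake_generate, demo_messages, demo_competitors, demo_answers)
print(list(zip(demo_competitors, demo_answers)))
```

In this notebook, `generate` could be something like `lambda m: ChatModel(model_name="deepseek-chat", provider="deepseek").generate_response(messages=m)`, which uses only the calls already shown in the cells above.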

## For the next cell, we will use Ollama

Ollama runs a local web service that exposes an OpenAI-compatible endpoint,  
and runs models locally using high-performance C++ code.

If you don't have Ollama, install it by visiting https://ollama.com, pressing Download, and following the instructions.

After it's installed, you should be able to visit http://localhost:11434 and see the message "Ollama is running".
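
The same check can be done from Python instead of the browser. This is a minimal sketch using only the standard library; the function name `ollama_is_running` is hypothetical, and it assumes the default address given above.

```python
import urllib.error
import urllib.request

OLLAMA_URL = "http://localhost:11434"  # default address mentioned in the text


def ollama_is_running(base_url: str = OLLAMA_URL, timeout: float = 2.0) -> bool:
    """Return True if the server at `base_url` replies with the "Ollama is running" message."""
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as response:
            body = response.read().decode("utf-8", errors="replace")
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or similar: treat the server as not running
        return False
    return "Ollama is running" in body


if __name__ == "__main__":
    print("Ollama reachable:", ollama_is_running())
```

If this prints `False`, the server is probably not started yet.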

If that doesn't work, you might need to restart Cursor (and maybe reboot). Then open a Terminal (control+\`) and run `ollama serve`.

Useful Ollama commands (run these in the terminal, or with an exclamation mark in this notebook):

`ollama pull <model_name>` downloads a model locally  
`ollama ls` lists all the models you've downloaded  
`ollama rm <model_name>` deletes the specified model from your downloads

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Super important - ignore me at your peril!</h2>
            <span style="color:#ff7800;">The model called <b>llama3.3</b> is FAR too large for home computers - it's not intended for personal computing and will consume all your resources! Stick with the nicely sized <b>llama3.2</b> or <b>llama3.2:1b</b> and if you want larger, try llama3.1 or smaller variants of Qwen, Gemma, Phi or DeepSeek. See <a href="https://ollama.com/models">the Ollama models page</a> for a full list of models and sizes.
            </span>
        </td>
    </tr>
</table>
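
`ChatModel` hides the HTTP details, and its internals are not shown in this lab. As a rough sketch of what an OpenAI-compatible call to Ollama looks like, the snippet below builds (but does not send) a chat request using only the standard library. The `/v1/chat/completions` path follows Ollama's OpenAI-compatible API. The helper names `build_chat_request` and `send_chat_request` are hypothetical, and the API key is a placeholder because a local Ollama server does not check it.

```python
import json
import urllib.request

OLLAMA_BASE_URL = "http://localhost:11434/v1"  # assumed default local address


def build_chat_request(model, messages, base_url=OLLAMA_BASE_URL, api_key="ollama"):
    """Build an OpenAI-style chat completion request without sending it."""
    payload = {"model": model, "messages": messages}
    return urllib.request.Request(
        url=f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # placeholder key
        },
        method="POST",
    )


def send_chat_request(request, timeout=120.0):
    """Send the request and return the reply text (needs a running Ollama server)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.loads(response.read().decode("utf-8"))
    return body["choices"][0]["message"]["content"]


request = build_chat_request(
    "llama3.2", [{"role": "user", "content": "Say hello in five words."}]
)
print(request.full_url)
```

With Ollama running and the model pulled, `send_chat_request(request)` should return the model's reply as a string.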

In [None]:
# Uncomment the next line to download the model used in the following cell
#!ollama pull deepseek-r1:7b

In [14]:
model_name = "deepseek-r1:7b"

# Ollama runs locally, so the API key is only a placeholder value.
answer = ChatModel(
    model_name=model_name,
    provider="ollama",
    api_key="me",
).generate_response(messages=messages)
print_response(answer)

competitors.append(model_name)
answers.append(answer)

**Comprehensive Plan for Remote Research Station Survival and Care**

**1. Prioritized 30-Day Survival and Care Plan**

- **Resource Allocation:**
  - **Food:** Initially sufficient for two weeks but rationing necessary, prioritizing mission-critical personnel.
  - **Water:** Desalination for five days; generator to supplement with minimal water use.
  - **Fuel:** Ensure desalination runs efficiently while conserving fuel.
  - **Power:** Reliance on the existing 8-day generator supply.

- **Medical Care:**
  - Priority treatment for the symptomatic respiratory illness, then moderately injured.
  - Quarantine and isolation to prevent spread, with strict protocols adhered to.

- **Evacuation Prioritization:**
  - Immediate evacuation of mission-critical personnel (scientists/inachers) within available time frames.

**2. Operational Steps to Extend Survivability**

- Implement a tiered rationing system based on urgency and role.
- Strategically allocate water from the generator to critical individuals first.
- Monitor desalination usage carefully, possibly shutting it down if necessary.

**3. Clarifying Questions/Seeking Data**

- Obtain specifics on remaining desalination capacity post-helicopter arrival.
- Identify exact mission-critical personnel roles and their needs.
- Assess current medical resource availability and distribution.

**4. Key Assumptions and Mitigation Strategies**

- **Assumption:** Desalination can be temporarily reduced without significant water shortages. *Mitigated by monitoring usage closely.*
- **Assumption:**(generator fuel) extends beyond initial 8 days with careful management. * Mitigated by scheduled usage plans.
- **Assumption:**Mission-critical personnel are well-defined on the roster. *R Lori's team mapping will clarify roles.*

**5. Measurable Indicators**

- Daily consumption of non-perishable food, potable water, fuel, and generator runtime.
- Tracking medical cases needing care, number quarantined, movement of critical personnel.

**6. Failure Modes and Contingency Responses**

- **Water Scarcity:** Extend desalination use if possible; prioritize primary用水.
  *Contingency:* Set up temporary water zones using generator output.
- **Missile Evacuation Delays:** Ration strictly by medical condition, prioritize transportation.
  *Contingency:* form multiple medical teams for rapid response.
- **Blackout Extensions:**Use generator to keep essential functions operational.

**Ethical Justification:**
Fairness in rationing based on necessity and health; prioritization of life-saving measures without bias towards roles or affiliations. Medical care directs treatment based on severity, preventing spread through strict protocols.

This plan ensures actionable steps with ethical considerations guiding resource management and personnel care during complex survival challenges.

In [15]:
print(competitors)
print(answers)

['gpt-5-nano', 'claude-sonnet-4-5', 'gemini-2.5-flash', 'deepseek-chat', 'openai/gpt-oss-120b', 'deepseek-r1:7b']
['Below is a concise, action-focused plan you can adapt at the station. It covers (1) day-by-day survival and care with resource allocation and evacuation prioritization, (2) steps to extend survivability and increase evacuation probability, (3) clarifying questions/data to request, (4) key assumptions and how to test/mitigate them, (5) five daily measurable indicators to decide on course changes, and (6) likely failure modes with contingencies. Ethical justification is explicit at each decision point.\n\n1) Ethical framework and overarching approach (why)\n- Core principles: maximize lives saved (utility), fairness, duty to care for the sick, minimize harm from infectious risk, protect essential operations, transparency in rules, and respect for persons (avoid arbitrary discrimination). In practice this means: prioritize life-sustaining needs, isolate contagious risk, allo

In [16]:
# Combine all answers into one string, labelling each by competitor number rather than model name.
together = ""
for index, answer in enumerate(answers):
    together += f"# Response from competitor {index+1}\n\n"
    together += answer + "\n\n"

In [17]:
judge = f"""You are judging a competition between {len(competitors)} competitors.
Each model has been given this question:

{question}

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}}

Here are the responses from each competitor:

{together}

Now respond with the JSON with the ranked order of the competitors, nothing else. Do not include markdown formatting or code blocks."""


In [18]:
judge_messages = [{"role": "user", "content": judge}]

In [19]:
results = ChatModel(
    model_name="gpt-5-mini",
    provider="openai",
).generate_response(messages=judge_messages)
print_response(results)

{"results": ["5", "1", "2", "4", "3", "6"]}

In [20]:
# The judge returns 1-based competitor numbers as strings; map each one back to its model name.
results_dict = json.loads(results)
ranks = results_dict["results"]
for index, result in enumerate(ranks):
    competitor = competitors[int(result)-1]
    print(f"Rank {index+1}: {competitor}")

Rank 1: openai/gpt-oss-120b
Rank 2: gpt-5-nano
Rank 3: claude-sonnet-4-5
Rank 4: deepseek-chat
Rank 5: gemini-2.5-flash
Rank 6: deepseek-r1:7b
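
The parsing cell above trusts the judge to return clean JSON containing valid competitor numbers. Models occasionally wrap their output in a code fence, or repeat or skip a number, so a slightly more defensive version may be useful. This is a sketch rather than part of the original lab, and `parse_ranking` is a hypothetical helper.

```python
import json
import re


def parse_ranking(raw: str, num_competitors: int) -> list[int]:
    """Parse the judge's JSON reply into a list of 1-based competitor numbers."""
    text = raw.strip()
    # Drop a surrounding Markdown code fence (with or without a "json" tag), if present.
    fenced = re.fullmatch(r"```(?:json)?\s*(.*?)\s*```", text, flags=re.DOTALL)
    if fenced:
        text = fenced.group(1)

    ranks = [int(item) for item in json.loads(text)["results"]]

    # Every competitor must appear exactly once.
    if sorted(ranks) != list(range(1, num_competitors + 1)):
        raise ValueError(f"Ranking is not a permutation of 1..{num_competitors}: {ranks}")
    return ranks


sample = '{"results": ["5", "1", "2", "4", "3", "6"]}'
print(parse_ranking(sample, 6))  # → [5, 1, 2, 4, 3, 6]
```

Raising `ValueError` on an invalid ranking makes it easy to catch the problem and re-ask the judge, instead of silently printing a wrong leaderboard.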


<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/exercise.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Exercise</h2>
            <span style="color:#ff7800;">Which pattern(s) did this lab use? Try updating it to add another agentic design pattern.
            </span>
        </td>
    </tr>
</table>

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/business.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#00bfff;">Commercial implications</h2>
            <span style="color:#00bfff;">Patterns like this one, where you send a task to multiple models and then evaluate the results,
            are common when you need to improve the quality of an LLM response. The approach applies broadly
            to business projects where accuracy is critical.
            </span>
        </td>
    </tr>
</table>
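
Stripped of the provider-specific calls, the workflow in this lab reduces to a small fan-out step followed by building the judge's input. The sketch below uses stub functions in place of real model calls so that it runs offline. `fan_out`, `build_judge_input`, and the stub models are hypothetical names, not part of `chat_factory`.

```python
from typing import Callable

# A "model" here is any function that maps a question to an answer.
Model = Callable[[str], str]


def fan_out(question: str, models: dict[str, Model]) -> tuple[list[str], list[str]]:
    """Ask every model the same question; return parallel lists of names and answers."""
    competitors, answers = [], []
    for name, ask in models.items():
        competitors.append(name)
        answers.append(ask(question))
    return competitors, answers


def build_judge_input(answers: list[str]) -> str:
    """Label answers by number only, so the judge does not see model names."""
    return "".join(
        f"# Response from competitor {index}\n\n{answer}\n\n"
        for index, answer in enumerate(answers, start=1)
    )


# Stub models standing in for real API calls.
stub_models = {
    "stub-short": lambda q: "Yes.",
    "stub-long": lambda q: f"A longer answer to: {q}",
}

competitors, answers = fan_out("Is rationing fair?", stub_models)
print(competitors)
print(build_judge_input(answers))
```

Swapping each stub for a real model call gives the same structure as the cells in this lab.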