## Welcome to the Second Lab - Week 1, Day 3

Today we will send the same question to several models from different providers. This is a good way to get comfortable with their APIs.

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Important point - please read</h2>
            <span style="color:#ff7800;">The way I collaborate with you may be different to other courses you've taken. I prefer not to type code while you watch. Rather, I execute Jupyter Labs, like this, and give you an intuition for what's going on. My suggestion is that you carefully execute this yourself, <b>after</b> watching the lecture. Add print statements to understand what's going on, and then come up with your own variations.<br/><br/>If you have time, I'd love it if you submit a PR for changes in the community_contributions folder - instructions in the resources. Also, if you have a GitHub account, use it to showcase your variations. Not only is this essential practice, but it demonstrates your skills to others, including perhaps future clients or employers...
            </span>
        </td>
    </tr>
</table>

In [1]:
# Start with imports - ask ChatGPT to explain any package that you don't know

import os
import json
from dotenv import load_dotenv
from openai import OpenAI
from anthropic import Anthropic
from IPython.display import Markdown, display

In [2]:
# Always remember to do this!
load_dotenv(override=True)

True

In [3]:
# Print the key prefixes to help with any debugging

openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')
deepseek_api_key = os.getenv('DEEPSEEK_API_KEY')
groq_api_key = os.getenv('GROQ_API_KEY')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    print("OpenAI API Key not set")
    
if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    print("Anthropic API Key not set (and this is optional)")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:2]}")
else:
    print("Google API Key not set (and this is optional)")

if deepseek_api_key:
    print(f"DeepSeek API Key exists and begins {deepseek_api_key[:3]}")
else:
    print("DeepSeek API Key not set (and this is optional)")

if groq_api_key:
    print(f"Groq API Key exists and begins {groq_api_key[:4]}")
else:
    print("Groq API Key not set (and this is optional)")

OpenAI API Key exists and begins sk-proj-
Anthropic API Key exists and begins sk-ant-
Google API Key exists and begins AI
DeepSeek API Key exists and begins sk-
Groq API Key not set (and this is optional)
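
The five near-identical `if`/`else` blocks above can be collapsed into one loop. This is a minimal sketch, assuming the same environment variable names and prefix lengths as the cell above; `KEY_SPECS` and `describe_key` are hypothetical names introduced for illustration and are not part of the course code.

```python
import os

# (label, environment variable, prefix length to show, optional?)
# Values mirror the cell above; KEY_SPECS and describe_key are illustrative names.
KEY_SPECS = [
    ("OpenAI", "OPENAI_API_KEY", 8, False),
    ("Anthropic", "ANTHROPIC_API_KEY", 7, True),
    ("Google", "GOOGLE_API_KEY", 2, True),
    ("DeepSeek", "DEEPSEEK_API_KEY", 3, True),
    ("Groq", "GROQ_API_KEY", 4, True),
]


def describe_key(label, value, prefix_len, optional):
    """Return a one-line status message that reveals only the key's prefix."""
    if value:
        return f"{label} API Key exists and begins {value[:prefix_len]}"
    suffix = " (and this is optional)" if optional else ""
    return f"{label} API Key not set{suffix}"


for label, env_var, prefix_len, optional in KEY_SPECS:
    print(describe_key(label, os.getenv(env_var), prefix_len, optional))
```

Only the short prefix is ever printed, so the full key never ends up in the notebook output.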


In [4]:
request = "Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. "
request += "Answer only with the question, no explanation."
messages = [{"role": "user", "content": request}]

In [5]:
messages

[{'role': 'user',
  'content': 'Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. Answer only with the question, no explanation.'}]

In [26]:
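# The call that generates the question is commented out; a previously generated
# question is hard-coded below, so re-running this cell does not call the API.
# openai = OpenAI()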
# response = openai.chat.completions.create(
#     model="gpt-5-mini",
#     messages=messages,
# )
# question = response.choices[0].message.content
question="Suppose you are an independent policy advisor for two neighboring regions that share a river: upstream Region A faces an imminent crop failure and must increase water withdrawals by 40% for the next two years to avoid widespread famine, while downstream Region B depends on steady flow for a hydroelectric plant supplying electricity to 1 million people and for a wetland that would substantially degrade if flow falls by more than 20%; design a legally and ethically defensible temporary agreement and implementation plan that balances immediate human survival in Region A with the long-term ecological and energy needs of Region B—include specific, enforceable allocation rules and quantitative thresholds; monitoring and verification methods; compensation and financing mechanisms; prioritized allocation schedules during shortages; governance and dispute-resolution procedures; measures to minimize perverse incentives and ensure accountability; and contingency plans if one party violates the agreement or if drought conditions persist beyond two years; state all assumptions, provide three plausible alternative solutions with projected benefits and risks for each, and identify the concrete metrics you would use to evaluate success at 6 months, 2 years, and 10 years."
print(question)


Suppose you are an independent policy advisor for two neighboring regions that share a river: upstream Region A faces an imminent crop failure and must increase water withdrawals by 40% for the next two years to avoid widespread famine, while downstream Region B depends on steady flow for a hydroelectric plant supplying electricity to 1 million people and for a wetland that would substantially degrade if flow falls by more than 20%; design a legally and ethically defensible temporary agreement and implementation plan that balances immediate human survival in Region A with the long-term ecological and energy needs of Region B—include specific, enforceable allocation rules and quantitative thresholds; monitoring and verification methods; compensation and financing mechanisms; prioritized allocation schedules during shortages; governance and dispute-resolution procedures; measures to minimize perverse incentives and ensure accountability; and contingency plans if one party violates the ag

In [7]:
competitors = []
answers = []
messages = [{"role": "user", "content": question}]

## Note - update since the videos

I've updated the model names below to use recent models such as GPT-5 and Claude Sonnet 4.5. These models can be quite slow, around 1-2 minutes per call, but they do a great job! Feel free to switch to faster models if you'd prefer, like the ones I use in the video.
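
Since some of these models can take a minute or more, it can help to time each call when deciding whether to switch to a faster model. This is a minimal sketch; `timed_call` is a hypothetical helper, not part of the course code, and `fake_model_call` is only a stand-in so the example runs without any keys.

```python
import time


def timed_call(fn, *args, **kwargs):
    """Run fn(*args, **kwargs) and return (result, elapsed_seconds)."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    elapsed = time.perf_counter() - start
    return result, elapsed


# Stand-in for a slow API call so the sketch runs without any keys
def fake_model_call(prompt):
    time.sleep(0.05)
    return f"answer to: {prompt}"


result, elapsed = timed_call(fake_model_call, "hello")
print(f"{result!r} took {elapsed:.2f}s")
```

In the lab you could wrap a real call the same way, for example `timed_call(openai.chat.completions.create, model=model_name, messages=messages)`.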

In [8]:
# The API we know well
# I've updated this with the latest model, but it can take some time because it likes to think!
# Replace the model with gpt-4.1-mini if you'd prefer not to wait 1-2 mins

model_name = "gpt-5-nano"

# Create the client here: the OpenAI() call in the earlier cell is commented out
openai = OpenAI()
response = openai.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Below is a structured, legally framed, and ethically grounded template for a temporary two-year river-sharing agreement between Upstream Region A and Downstream Region B. It is designed to be implementable with clear allocation rules, monitoring, finance, governance, dispute resolution, and contingency planning. You can adapt the numeric placeholders to your actual hydrological data and legal framework.

1) Executive summary (what this intends to do)
- Purpose: Prevent imminent famine in Region A over two years by allowing a targeted 40% increase in upstream withdrawals, while preserving downstream ecological integrity and the reliability of Region B’s hydroelectric electricity supply and the health of a key wetland.
- Core balance: A’s people must have water for crops/consumption; B’s population must have reliable energy and a protected wetland ecosystem that should not degrade when flows fall by more than 20% relative to baseline.
- Mechanisms: A legally binding interim treaty with explicit water-quantity thresholds, a monitoring/verification regime, a compensation/financing package to support efficiency and drought-resilience, a prioritized allocation schedule during shortages, a dispute-resolution process, and clearly defined contingency plans if drought persists or if either party violates terms.
- Horizon: 24 months (two years) with a formal mid-term review and a ten-year evaluation framework to assess longer-term outcomes and resiliency.

2) Assumptions (the operating context)
- Hydrology and baselines (illustrative placeholders; replace with actual data):
  - Baseline annual river inflow to the confluence regime: Q_base (seasonal and annual variation as measured at the main gauge).
  - Upstream baseline water use by Region A: W_A0 per year (prior to this agreement).
  - Downstream baseline water use for Region B purposes (hydroelectric operation and environmental needs) tied to the river’s outflow: W_B0 per year.
  - Baseline downstream flow at the B gauge (F0): the normal flow without additional interventions.
  - Ecological threshold: Wetland requires flow not to fall more than 20% below F0 (i.e., downstream flow must remain ≥ 0.8 × F0).
  - Hydroelectric plant integrity: A minimum streamflow is required to maintain reliable electricity generation; the plant must be able to operate at or near its planned capacity given the flow regime in this agreement (to be specified with the plant operator).
- Legal and institutional: 
  - There exists or will be created a joint inter-regional body (see Governance section) with binding authority over allocation, monitoring, and dispute resolution.
  - Data-sharing, measurement, and verification (M&V) capacity exists or will be established (gauging stations, telemetry, independent verification).
- Time frame: The policy is temporary (24 months) with a built-in review and milestones; contingency plans extend beyond two years if drought persists.
- Equity and legitimacy: Measures respect human rights to water and food security, protect ecological integrity, and ensure energy security.

3) Overall objective and core principles
- Objective: Provide enough water to Region A for two growing seasons while preserving downstream ecological function and B’s hydroelectric reliability, with a transparent, enforceable, and auditable framework.
- Principles:
  - Humanitarian protection: A’s immediate food/water needs must be met in priority, subject to environmental thresholds.
  - Ecological safeguard: Wetland health and minimum ecological flow are protected to avoid irreversible degradation.
  - Energy reliability: Hydroelectric generation for Region B should not be jeopardized beyond agreed tolerances.
  - Precaution and adaptability: The plan includes monitoring, verification, and contingency adjustments in response to actual hydrology.
  - Accountability: Clear rules, penalties for noncompliance, independent verification, and public reporting.

4) Allocation rules and quantitative thresholds (the meat of the agreement)
- Key quantities and symbols (define in the final document with actual data):
  - W_A0: Baseline annual water withdrawal by Region A (upstream).
  - W_A(t): Actual upstream withdrawal in month t.
  - W_B0: Baseline annual water use for Region B’s downstream needs (hydroelectric + environmental).
  - F0: Baseline downstream flow at the B gauge (average over a representative pre-agreement period).
  - F_min_down(t): Minimum allowable downstream flow at the B gauge to protect the wetland, set at 0.8 × F0 (i.e., no more than a 20% drop from baseline).
  - Q_in(t): Actual river inflow (hydrological input) in month t (observed data to be used in calculations).
  - W_A_target: 1.4 × W_A0 (i.e., a 40% increase over baseline) for months 1–24, subject to hydrological feasibility.
- Core rule for A’s withdrawals:
  - During months 1–24, W_A(t) must satisfy:
    W_A(t) ≤ min(1.4 × W_A0, Q_in(t) − F_min_down(t) − Safety Margin)
  - Safety Margin = a defined buffer (e.g., 5% of Q_in) to cover non-consumptive uses, evaporation losses, and measurement error.
  If Q_in(t) − F_min_down(t) < 1.4 × W_A0, then the agreement triggers the adaptive-shortage protocol (see Section 6) to reconcile A’s humanitarian needs with ecological limits.
- Rule for B’s downstream needs (environment + energy):
  - F_down(t) must satisfy F_down(t) ≥ F_min_down(t) = 0.8 × F0 unless an explicit, pre-agreed deviation is authorized by the Joint River Authority for critical energy generation in emergencies with compensating measures (see contingency, Section 9).
  - The hydroelectric plant’s ability to operate: Ensure that, with F_down(t) ≥ F_min_down(t), the plant can operate at no less than its designed capacity with standard efficiency (i.e., no forced curtailment beyond the agreed minimum). If the plant cannot operate at at least a defined minimum power level P_min (to be specified with the utility), the agreement accommodates remediation steps (see contingency, Section 9).
- Priority allocation order during shortages (monthly):
  1) Essential human drinking water and basic food-production needs for Region A.
  2) Hydroelectric power generation for Region B (to maintain electricity supply for 1,000,000 people) at the agreed reliability level.
  3) Minimum ecological flow to protect the wetland (0.8 × F0) and downstream ecosystem function.
  4) All other uses (nonessential) subject to remaining water availability.
- Time-bound allowances:
  - The 1.4×W_A0 uplift applies only for the 24-month term, unless re-negotiated by mutual written agreement based on hydrological conditions and stakeholder consultations.

5) Monitoring, verification, data sharing, and transparency
- Monitoring framework:
  - Install and maintain calibrated gauging stations at the upstream boundary of A’s withdrawals, at the B gauge downstream, and at key reservoir inlets/outlets if storage is involved.
  - Real-time telemetry and monthly reporting of: inflow Q_in(t), downstream flow F_down(t), upstream withdrawals W_A(t), and reservoir storage levels if applicable.
  - Periodic ecological monitoring of the wetlands (vegetation health, soil moisture, water depth) and hydroelectric plant performance metrics (actual vs. planned generation, plant outages).
- Verification:
  - Independent Verification Body (IVB): A neutral, third-party entity responsible for auditing monthly data, validating compliance with thresholds, and publishing a public compliance report each month.
  - Public dashboard: A transparent, accessible portal showing monthly measurements, allocations, and any deviations with explanations.
- Data sharing and confidentiality:
  - Both regions publish non-sensitive data in the public dashboard; commercially sensitive or critical infrastructure data can be restricted under agreed confidentiality provisions, but must still be available to the IVB and the Joint River Authority.

6) Compensation, financing, and incentives to reduce harm
- Compensation and funding:
  - Establish a Joint Drought-Response and Resilience Fund (JDRF) to finance: water-use efficiency investments in A, crop-switching to drought-tolerant varieties, irrigation modernization (drip, micro-sprinkler), groundwater recharge, and emergency relief for affected farmers.
  - Financing contributions: proportional to baseline water use or economic capacity; e.g., 60% from Region A, 40% from Region B, plus possible external support (international development banks, NGOs, or neighboring states) if needed.
- Incentives and penalties:
  - Positive incentives: Grants or low-interest loans for A’s farmers to upgrade irrigation, insurance subsidies for climate risk, and temporary support payments linked to measurable improvements in water-use efficiency.
  - Penalties for noncompliance: If A exceeds W_A(t) beyond agreed limits without an approved exception, or if B fails to maintain the minimum ecological/hydroelectric thresholds without justification, financial penalties or compensatory adjustments to the sharing ratio may be applied, up to a pre-agreed cap. Any penalties are designed to be proportionate and to reflect actual damages or shortfalls.
- Economic support for adaptation:
  - Technical assistance for A’s agriculture (crop diversification, soil moisture management, rainwater harvesting) funded by the JDRF.
  - Policy support for B to ensure reliability of grid operations (short-term contingency generation plans or energy imports if necessary).

7) Governance, decision-making, and dispute-resolution framework
- Decision-making bodies:
  - Joint River Authority (JRA): A representative body with executive authority for day-to-day administration of the agreement. Composed of equal representation from Regions A and B (e.g., 3 regional appointees each), with rotating presidency and a neutral chair from an independent body for impartial facilitation.
  - Independent Arbitration Panel (IAP): For binding decisions in the event of disputes. Appointment of panel members by mutual consent or by an independent regional arbitration center.
  - Technical Advisory Committee (TAC): Experts in hydrology, ecology, agriculture, energy, and water law who provide non-binding technical recommendations to the JRA.
- Dispute-resolution process (stepwise):
  1) Informal negotiation within 15 days of a dispute.
  2) If unresolved, escalate to the TAC for technical assessment within 30 days.
  3) If still unresolved, refer to the IAP for binding decision within 45–60 days.
  4) If urgent, a fast-track emergency ruling can be requested with a capped decision period (e.g., 14 days) on specific operational issues.
- Enforcement and compliance:
  - Compliance reporting to the public dashboard; the IVB conducts audits; the IAP can impose penalties or require corrective actions with enforceable timelines.

8) Measures to minimize perverse incentives and ensure accountability
- Clear rules and predictable triggers:
  - The 1.4×W_A0 target is only feasible if Q_in(t) supports it; if not, triggers for adaptive management minimize incentives to “game” the data or over-report.
- Independent verification and transparency:
  - Regular third-party audits; public data sharing; independent IAP decisions.
- Alignment with local livelihoods:
  - The compensation and resilience fund funds practical investments that reduce long-term dependency on high water withdrawals (drip irrigation, crop-switch, soil moisture management) rather than simply paying for water.
- Penalty structure:
  - Sanctions are proportionate, time-bound, and include a pathway to cure (e.g., temporary reduction in W_A until the next hydrological assessment).
- Sunset and review:
  - A formal mid-term review and a post-implementation evaluation at 2 years, with a lessons-learned report and, if warranted, renegotiation or extension with updated data.

9) Contingency plans (violation, non-compliance, or persistent drought beyond two years)
- If a party violates the agreement:
  - Immediate notification and temporary suspension of corresponding rights until compliance is verified by the IVB.
  - Implementation of sanctions or remediation steps defined in the Penalties/Remedies schedule.
  - Possible escalation to the IAP for binding resolution within a shortened timeline.
- If drought conditions persist beyond two years:
  - Re-evaluate the baseline W_A0 and re-calculate 1.4×W_A0 in light of actual hydrology; adjust thresholds via a formal amendment approved by both regions.
  - Activate additional resilience measures funded by JDRF (e.g., large-scale water-use efficiency programs, expansion of stored water in reservoirs if feasible, import of energy via alternative sources if hydro limits are stressed).
  - Consider longer-term multi-year arrangements with a phased reduction in A’s withdrawal share, supported by enhanced drought risk management and climate adaptation investments.
- Emergency provisions:
  - In extreme events, allow up to a predefined temporary deviation from the 0.8×F0 ecological floor if such deviation is medically or ecologically mitigated by other measures (e.g., expedited restoration activities, temporary habitat management) and only with IAP approval.

10) Implementation plan (phases and milestones)
- Phase 0: Baseline data collection, stakeholder consultation, and establishment of the JRA, IVB, and TAC; finalize measurement protocols and the data-sharing agreement.
- Phase 1 (months 1–6): Install/verify monitoring infrastructure; initiate compensation/funding mechanisms; begin capacity-building for water-use efficiency in A; implement the 40% increase with ongoing monitoring and mid-course adjustment if needed.
- Phase 2 (months 7–12): Mid-term review; adjust allocation rules if hydrological reality diverges from assumptions; continue resilience investments; publish monthly compliance reports.
- Phase 3 (months 13–24): Maintain operations; monitor ecological and energy indicators; prepare a formal assessment report at the end of the two-year term for potential renegotiation or extension.
- Phase 4 (post-24 months): Long-term decision-making about a longer-term agreement informed by the 6-month, 2-year, and 10-year metrics (see section 12).

11) Evaluation metrics (success indicators)
- 6 months:
  - A household water security indicators: percentage of households with assured drinking water and essential irrigation.
  - Water-use efficiency improvements in A: percentage reduction in nonessential losses or drip irrigation adoption rate.
  - Compliance rate with W_A(t) and F_down(t) thresholds (percentage of months compliant).
  - Public access to the monitoring dashboard and number of stakeholders participating in monthly briefings.
- 2 years:
  - Sustained A crop yields and food security indicators; reduction in famine-risk indicators.
  - Hydroelectric reliability: percentage of months achieving targeted generation; number of outages due to water constraints.
  - Wetland health indicators: vegetation status, hydrology metrics (depth, soil moisture), and water-quality indicators within acceptable ranges.
  - Financial balance: JDRF fund utilization vs. planned investments; measured return on efficiency investments (energy saved per unit water used).
- 10 years:
  - Net ecological status of the wetland (restoration progress, biodiversity indices).
  - Long-term sustainability of A’s agriculture under drought resilience measures (income stability, crop diversification uptake).
  - Energy security: reliable supply to 1 million people with minimal interruptions attributed to river flow variability.
  - Governance effectiveness: number of disputes resolved, time to resolution, and stakeholder satisfaction with the governance process.

12) Three plausible alternative solutions (with benefits and risks)
- Alternative A: Demand-side transformation and agricultural resilience (priority to efficiency and drought-tolerant farming)
  - Description: Rapid deployment of irrigation efficiency upgrades, crop-switching to drought-tolerant varieties, soil moisture management, rainwater harvesting, and crop insurance for farmers in Region A; use savings to satisfy the 40% increase with less pressure on the downstream system.
  - Benefits: Reduces actual water withdrawal needs, increases resilience to climate variability, potentially reduces energy demand for pumping.
  - Risks: High upfront costs, adoption lag, need for farmer retraining, possible short-term yield reductions during transition.
- Alternative B: Storage and water-security investments (increase resilience through storage and managed releases)
  - Description: Build or expand storage capacity (surface reservoirs, underground aquifers) and create a managed release plan to meet A’s needs while preserving B’s ecological and energy thresholds; include conjunctive use and groundwater augmentation where feasible.
  - Benefits: Improves reliability under drought, creates buffer against hydrological variability, may reduce ecological stress during dry periods.
  - Risks: High capital costs, permitting and land-use challenges, long lead times before storage becomes operational; environmental footprint of new storage could be a concern.
- Alternative C: Energy and water trading/offsets (temporary water-trading mechanism with cross-subsidies)
  - Description: Introduce a limited, time-bound water-rights trading mechanism where A can procure permits/priority rights from B’s nonessential uses or from an external water rights pool; paired with temporary energy offsets (imported energy or capacity payments) to maintain B’s reliability during the trading window.
  - Benefits: Improves flexibility, reduces forced over-withdrawal by letting A buy offsets, aligns incentives for efficiency.
  - Risks: Legal complexity, potential inequities in rights trading, possible increased volatility in allocation; depends on external liquidity of rights and energy options.

13) Concrete steps for operationalizing the plan (sample checklist)
- Draft the legal instrument: treaty language, definitions, schedules, annexes on thresholds, penalties, and dispute-resolution timelines.
- Establish the Joint River Authority, IVB, and TAC; appoint members; define terms of reference.
- Install/upgrade monitoring infrastructure; finalize data-sharing protocols.
- Finalize the compensation package and JDRF funding contributions; set interim disbursement rules.
- Develop the compliance dashboard; publish initial baseline reports.
- Initiate capacity-building programs for A’s farmers; begin crop diversification and irrigation modernization.
- Schedule the first formal mid-term review and the second-year evaluation.
- Prepare contingency operations: emergency response protocols, trigger conditions for escalation, and amendment procedures.

14) Assumptions you should confirm or replace with local data
- Actual baseline water use by Region A and the downstream needs of Region B.
- Precise ecological thresholds for the wetland and the minimum hydroelectric flow required to meet grid reliability.
- The physical capacity for monitoring and enforcement (gauging networks, telemetry, independent verification capability).
- The existence of, or ability to establish, a joint governance body with binding authority and funding mechanisms.

15) Accountability and transparency measures
- Public monthly compliance reports and an online dashboard of real-time metrics.
- Independent audits of metering, reporting, and ecological indicators.
- Public stakeholder meetings and accessible briefing documents.

What to provide to finalize this plan for your jurisdictions
- Replace all placeholders with real numbers:
  - W_A0, W_B0, F0, Q_in(t) distributions, and the exact ecological and plant-reliability thresholds.
  - The exact 40% uplift is kept but always constrained by the Q_in − F_min_down condition.
  - The specific compensation amounts, JDRF funding shares, and efficiency-improvement targets.
- Adjust legal language to align with existing treaties, water rights, and transboundary or inter-regional law applicable to your jurisdictions.
- Incorporate sector-specific technical details (irrigation technologies, crop regimes, hydroelectric plant capacity, and wetland ecological indicators) in annexes.

If you want, I can tailor this into a formal draft treaty with:
- Clause-by-clause language (definitions, rights, duties, enforcement, dispute resolution).
- A quantitative annex with your actual hydrology data and a concrete monitoring plan.
- An operational calendar with phased steps and escalation triggers.
- A ready-to-use governance charter for the Joint River Authority and the independent verifier.
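
The call, extract, record pattern used for this model repeats for each OpenAI-compatible provider later in the lab. Here is a sketch of one way to factor it out, assuming a client that exposes `chat.completions.create` as the `OpenAI` class does. `ask_and_record` is a hypothetical name, and the stub client exists only so the example runs without an API key.

```python
from types import SimpleNamespace


def ask_and_record(client, model_name, messages, competitors, answers):
    """Call an OpenAI-compatible client, then record the model name and its answer."""
    response = client.chat.completions.create(model=model_name, messages=messages)
    answer = response.choices[0].message.content
    # Append only after a successful call so the two lists stay aligned
    competitors.append(model_name)
    answers.append(answer)
    return answer


# Stub that mimics the response shape used in this lab (illustration only)
def _fake_create(model, messages):
    text = f"[{model}] echo: {messages[-1]['content']}"
    return SimpleNamespace(choices=[SimpleNamespace(message=SimpleNamespace(content=text))])


fake_client = SimpleNamespace(
    chat=SimpleNamespace(completions=SimpleNamespace(create=_fake_create))
)

demo_competitors, demo_answers = [], []
ask_and_record(fake_client, "demo-model", [{"role": "user", "content": "hi"}],
               demo_competitors, demo_answers)
print(demo_competitors, demo_answers)
```

Because both appends happen after the response is parsed, a failed call leaves `competitors` and `answers` the same length.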

In [9]:
# Anthropic has a slightly different API, and max_tokens is required
# Note: long answers are cut off once they reach the max_tokens limit

model_name = "claude-sonnet-4-5"

claude = Anthropic()
response = claude.messages.create(model=model_name, messages=messages, max_tokens=1000)
answer = response.content[0].text

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

# Transboundary Water Crisis: Comprehensive Policy Framework
## Region A (Upstream) - Region B (Downstream) Temporary Water Sharing Agreement

---

## EXECUTIVE SUMMARY

This framework addresses an acute crisis requiring immediate action while preserving long-term sustainability. The core approach balances Region A's survival needs against Region B's critical infrastructure and ecological thresholds through time-limited enhanced withdrawals, compensatory mechanisms, efficiency mandates, and robust governance.

---

## FOUNDATIONAL ASSUMPTIONS

### Baseline Conditions
1. **Current flow regime**: 1,000 million cubic meters (MCM) annually reaching Region B
2. **Region A current withdrawal**: 200 MCM/year (20% of river flow)
3. **Region A proposed increase**: +80 MCM/year (40% increase = 280 MCM total)
4. **Region B hydroelectric requirements**: Minimum 800 MCM/year for full operation
5. **Wetland critical threshold**: ≥800 MCM/year (20% reduction tolerance)
6. **Crisis duration**: 24 months (with contingency provisions)
7. **Population at risk in Region A**: 500,000 people facing food insecurity
8. **Population dependent on Region B electricity**: 1,000,000 people
9. **Wetland ecosystem services value**: $15 million/year
10. **Both regions have functioning legal systems and can enforce contracts**

### Hydrological Assumptions
- River basin experiences seasonal variation (±30% from mean)
- Climate projections indicate 15% probability of continued drought beyond 24 months
- Groundwater extraction could supplement 10-15% of surface water needs
- Water quality remains within acceptable parameters for all uses

---

## SOLUTION 1: TIERED ALLOCATION WITH PROGRESSIVE COMPENSATION (RECOMMENDED)

### Core Principles
1. **Human life primacy**: Survival needs trump economic interests
2. **Proportionate burden-sharing**: Both regions accept graduated sacrifices
3. **Temporary deviation, permanent cooperation**: Crisis measures don't establish precedent
4. **Compensatory justice**: Those harmed are made whole
5. **Efficiency mandate**: Crisis accelerates conservation investment

---

### SPECIFIC ALLOCATION RULES & QUANTITATIVE THRESHOLDS

#### Phase 1: Months 1-6 (Emergency Stabilization)

**Allocation Schedule:**

| Flow Condition | Region A Withdrawal | Region B Receives | Trigger Mechanism |
|----------------|---------------------|-------------------|-------------------|
| Normal (>1000 MCM/yr) | 250 MCM (+25%) | 850 MCM (-15%) | Baseline |
| Moderate shortage (800-1000 MCM) | 230 MCM (+15%) | 770-870 MCM (-17% to -13%) | Flow monitoring stations report <1000 MCM for 2 consecutive weeks |
| Severe shortage (<800 MCM) | 200 MCM (baseline) | <700 MCM | Emergency protocol triggered |

**Operational Rules:**
- Region A receives 62.5% of requested increase initially (50 of 80 MCM)
- Region B operates hydroelectric at 85-94% capacity depending on seasonal flow
- Wetland receives 850 MCM minimum (15% reduction from baseline 1000 MCM)

#### Phase 2: Months 7-18 (Sustained Support with Mitigation)

**Adjusted Allocation:**

| Flow Condition | Region A Withdrawal | Region B Receives | Required Efficiency Gains |
|----------------|---------------------|-------------------|---------------------------|
| Normal | 270 MCM (+35%) | 830 MCM (-17%) | A: 10% reduction in losses; B: 5% efficiency gain |
| Moderate shortage | 240 MCM (+20%) | 760-860 MCM | A: 15% reduction; B: 8% gain |
| Severe shortage | 210 MCM (+5%) | <690 MCM | Emergency protocol |

**Conditions for increased allocation:**
- Region A demonstrates documented efficiency improvements (see Accountability section)
- Region B has received compensation for first 6 months
- Joint monitoring confirms no permanent ecological damage

#### Phase 3: Months 19-24 (Transition to Normalcy)

**
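
The answer above breaks off mid-section, which is consistent with the response reaching the `max_tokens=1000` limit. The Anthropic Messages API reports why generation stopped in the response's `stop_reason` field. This is a minimal sketch of checking it; `was_truncated` is a hypothetical helper, and the objects below are stand-ins rather than real API responses.

```python
from types import SimpleNamespace


def was_truncated(response):
    """True when a Messages response stopped because it hit max_tokens."""
    return response.stop_reason == "max_tokens"


# Stand-ins shaped like the fields used here (illustration only, no API call)
cut_off = SimpleNamespace(stop_reason="max_tokens", content=[SimpleNamespace(text="**")])
complete = SimpleNamespace(stop_reason="end_turn", content=[SimpleNamespace(text="Done.")])

for resp in (cut_off, complete):
    status = "truncated" if was_truncated(resp) else "complete"
    print(status, repr(resp.content[0].text))
```

If the check reports truncation, raising `max_tokens` in the `claude.messages.create` call is the usual remedy, at the cost of a longer and more expensive response.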

In [25]:
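# Gemini offers an OpenAI-compatible endpoint, so we reuse the OpenAI client with a different base_url
gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")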
model_name = "gemini-2.5-flash"

response = gemini.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

InternalServerError: Error code: 503 - [{'error': {'code': 503, 'message': 'The model is overloaded. Please try again later.', 'status': 'UNAVAILABLE'}}]
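
A 503 like this one is usually transient, so a common response is to wait and try again with an increasing delay. The following is a minimal sketch of that idea, not the course's own approach. `call_with_retry`, `Overloaded`, and `flaky` are hypothetical names, and the stand-in function fails twice before succeeding so the example runs without any API key.

```python
import time


def call_with_retry(fn, retry_on=(Exception,), attempts=4, base_delay=1.0, sleep=time.sleep):
    """Call fn(); on a retryable error wait base_delay * 2**n seconds and try again."""
    for attempt in range(attempts):
        try:
            return fn()
        except retry_on:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(base_delay * 2 ** attempt)


# Stand-in for an overloaded endpoint: fails twice, then succeeds
class Overloaded(Exception):
    pass


calls = {"count": 0}


def flaky():
    calls["count"] += 1
    if calls["count"] < 3:
        raise Overloaded("The model is overloaded. Please try again later.")
    return "ok"


waits = []  # record the delays instead of actually sleeping
result = call_with_retry(flaky, retry_on=(Overloaded,), sleep=waits.append)
print(result, waits)
```

With the real client you could pass `retry_on=(openai.InternalServerError,)` after `import openai`, together with `fn=lambda: gemini.chat.completions.create(model=model_name, messages=messages)`. The `OpenAI` client also accepts a `max_retries` argument if you would rather rely on its built-in retries.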

In [12]:
deepseek = OpenAI(api_key=deepseek_api_key, base_url="https://api.deepseek.com/v1")
model_name = "deepseek-chat"

response = deepseek.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Of course. As an independent policy advisor, here is a comprehensive proposal for a temporary water-sharing agreement between Region A (upstream) and Region B (downstream).

### **Executive Summary**

This agreement establishes a two-year, crisis-driven "Water Preservation and Famine Prevention Compact" between Region A and Region B. It authorizes a temporary, managed increase in upstream withdrawals by Region A, coupled with a mandatory, compensated reduction in hydropower generation in Region B. The core principle is **shared sacrifice for human survival**, mitigated by financial and in-kind compensation to protect Region B's economy and ecology. The agreement is structured with strict thresholds, independent monitoring, and robust enforcement to prevent abuse and build trust.

**Core Assumptions:**
1.  **Baseline Data:** A reliable, shared baseline for current water flow, withdrawals, and ecological health exists or can be immediately established.
2.  **Good Faith:** Both regions are acting in good faith to resolve the crisis, despite conflicting interests.
3.  **Legal Authority:** Both regional governments have the legal authority to enter into binding agreements and regulate water use within their jurisdictions.
4.  **Financial Capacity:** Region A has or can secure the financial resources (or equivalent in-kind value) to provide compensation to Region B.
5.  **Climate Uncertainty:** The two-year timeframe is a best estimate; conditions may worsen or improve.

---

### **The Proposed Agreement: The Managed Flow & Compensation Compact**

This plan is designed as the primary recommended solution.

#### **1. Specific, Enforceable Allocation Rules & Quantitative Thresholds**

*   **Flow Thresholds at the Regional Border (X):**
    *   **Normal Flow (F_N):** The 10-year average flow for the given season.
    *   **Crisis Trigger (F_C):** Flow at which Region B must begin reducing hydropower. Set at **85% of F_N**.
    *   **Ecological Redline (F_E):** The absolute minimum flow to prevent catastrophic wetland degradation. Set at **80% of F_N**. No further reductions are permitted once this level is reached.
*   **Allocation Rules:**
    *   Region A is permitted to increase its withdrawals by up to **40%** from its baseline, **but only if** the flow at border station X remains above F_E.
    *   If flow falls below F_C, Region B's hydropower operator is legally obligated to begin scaling down generation to maintain flow above F_E.
    *   If flow falls to F_E, Region A's additional withdrawals must be immediately curtailed to pre-crisis levels, regardless of the 40% target.

#### **2. Monitoring and Verification Methods**

*   **Technology:** Installation of two redundant, telemetric flow gauges at the border (X), with data transmitted in real-time to a joint command center and a publicly accessible online portal.
*   **Verification:** A **Joint Water Commission (JWC)**, with equal representation from both regions and one independent, internationally recognized hydrologist, will oversee data collection.
*   **Audits:** Quarterly unannounced audits of Region A's agricultural water usage (via satellite imagery and ground inspection) and Region B's hydropower operations logs.

#### **3. Compensation and Financing Mechanisms**

*   **Compensation Fund:** Region A will establish a "Downstream Impact Mitigation Fund" managed by the JWC.
*   **Payments:** Region B's hydropower company will be compensated for 100% of its verifiable lost revenue due to reduced generation, calculated based on the regional energy spot price.
*   **Ecological Trust:** A separate payment will be made into an ecological trust fund to finance future wetland restoration projects in Region B. The amount will be based on a pre-agreed "environmental flow deficit" model.
*   **Financing:** Region A may finance this through emergency international loans (e.g., World Bank), drawing from sovereign funds, or through a one-time "famine aversion" tax.

#### **4. Prioritized Allocation Schedules During Shortages**

If water levels are critically low and the F_E threshold is threatened, the JWC will enact the following priority schedule:
1.  **Human Survival & Drinking Water:** Basic human needs in both regions.
2.  **Critical Ecosystem Function:** Maintaining minimum flows at F_E to prevent irreversible wetland collapse.
3.  **Famine-Prevention Agriculture:** Region A's core crops for caloric sufficiency.
4.  **Hydropower for Critical Infrastructure:** Power for hospitals, water treatment, and emergency services in Region B.
5.  **All Other Uses:** Including non-essential agriculture and industrial consumption in both regions.

#### **5. Governance and Dispute-Resolution Procedures**

*   **Governance:** The **Joint Water Commission (JWC)** has operational authority.
*   **Dispute Resolution:**
    1.  **Technical Panel:** Initial disputes are referred to the JWC's independent technical experts.
    2.  **Mediation:** If unresolved, a panel of three mediators (one chosen by each region, one independent) has 30 days to broker a solution.
    3.  **Binding Arbitration:** If mediation fails, the dispute goes to a pre-selected international arbitration body (e.g., Permanent Court of Arbitration) whose decision is final and legally binding under the treaty.

#### **6. Measures to Minimize Perverse Incentives & Ensure Accountability**

*   **No Precedent Setting:** The agreement explicitly states it is a *sui generis* (one-of-a-kind) response to an existential crisis and does not alter long-term water rights.
*   **Anti-Profiteering:** Compensation to Region B is for *verified lost revenue only*, not profit. Audits ensure the hydropower company does not exaggerate losses.
*   **Sunset Clause:** The agreement automatically terminates after 24 months. Any extension requires a completely new negotiation.
*   **Transparency:** All data, JWC meeting minutes, and financial transactions are public.

#### **7. Contingency Plans**

*   **If Region A Violates (e.g., exceeds withdrawals):**
    1.  Financial penalties are levied against the Compensation Fund.
    2.  If persistent, the agreement is suspended, and Region B can pursue arbitration for damages and seek international sanctions.
*   **If Region B Violates (e.g., blocks flow data):**
    1.  Compensation payments are halted.
    2.  If persistent, the agreement is suspended, and Region A is no longer bound by its withdrawal limits.
*   **If Drought Persists Beyond 2 Years:**
    *   The sunset clause triggers a return to the pre-crisis water-sharing agreement.
    *   However, a 6-month "glide path" is built-in, where withdrawals and compensation are gradually phased out, providing a buffer for both regions to adapt.
    *   This forces a re-negotiation under the new, permanent drought reality, likely leading to a more sustainable long-term solution.

---

### **Plausible Alternative Solutions**

| Alternative | Description | Projected Benefits | Projected Risks |
| :--- | :--- | :--- | :--- |
| **1. The Water Banking & Transfer Scheme** | Region A's farmers are paid by Region B (or international donors) to fallow a portion of their land and "sell" the saved water downstream. | - Market-based and efficient.<br>- Directly links compensation to action.<br>- Less disruptive to Region B. | - Logistically complex to administer.<br>- May not free up enough water to prevent famine.<br>- Could inequitably impact smallholder farmers in Region A. |
| **2. The Energy Swap Agreement** | Region A compensates Region B by financing and facilitating the immediate import of an equivalent amount of electricity from a third region to make up for the hydropower shortfall. | - Directly addresses Region B's core need (energy).<br>- Could build regional energy infrastructure.<br>- Avoids large cash transfers. | - High cost and complexity of arranging power imports.<br>- Grid stability issues.<br>- Does not compensate for ecological damage to wetlands. |
| **3. The Strict Proportional Sharing Model** | Both regions share the pain equally by taking a proportional cut from a shared baseline. If the total water deficit is 20%, both Region A and B reduce usage by 20%. | - Perceived as highly fair and equitable.<br>- Simple to understand and monitor. | - **Ethically indefensible in the short term:** It forces famine in Region A to maintain slightly higher power generation in Region B.<br>- Politically volatile. |

---

### **Concrete Evaluation Metrics**

| Timeframe | Metrics for Success |
| :--- | :--- |
| **6 Months** | 1. **Flow Compliance:** Flow at border station X remains >F_E for 95% of the period.<br>2. **Famine Prevention:** Acute malnutrition rates in Region A do not increase from pre-crisis baseline.<br>3. **Compensation:** First compensation payment to Region B is made on schedule and is fully verifiable.<br>4. **Data Transparency:** Real-time flow data is publicly available with >99% uptime. |
| **2 Years** | 1. **Humanitarian:** No famine-related mortality in Region A.<br>2. **Economic:** Region B's verifiable lost hydropower revenue is compensated at >95%.<br>3. **Ecological:** Wetland area and a key species health index in Region B show no statistically significant decline beyond predicted models.<br>4. **Institutional:** No disputes required escalation to binding arbitration. |
| **10 Years** | 1. **Relationship:** A permanent, robust river basin treaty is in place, building on the lessons of the crisis.<br>2. **Sustainability:** Both regions have diversified their water and energy sources, reducing vulnerability.<br>3. **Ecology:** The wetland in Region B has recovered to its pre-crisis health index or better, aided by the ecological trust fund.<br>4. **Precedent:** The agreement is cited as a successful model of crisis management, not as a precedent for water rights. |

This comprehensive plan prioritizes immediate human life without sacrificing the long-term viability of the downstream region's economy and ecology, creating a defensible, actionable, and measurable path through a profound crisis.

In [None]:
# Updated with the latest Open Source model from OpenAI

# groq = OpenAI(api_key=groq_api_key, base_url="https://api.groq.com/openai/v1")
# model_name = "openai/gpt-oss-120b"

# response = groq.chat.completions.create(model=model_name, messages=messages)
# answer = response.choices[0].message.content

# display(Markdown(answer))
# competitors.append(model_name)
# answers.append(answer)


## For the next cell, we will use Ollama

Ollama runs a local web service that exposes an OpenAI-compatible endpoint,  
and runs models locally using high-performance C++ code.

If you don't have Ollama, install it by visiting https://ollama.com, pressing Download, and following the instructions.

After it's installed, you should be able to visit http://localhost:11434 and see the message "Ollama is running".
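
The same check can be done from Python instead of the browser. This is a small sketch using only the standard library; the function name `ollama_is_running` is made up for illustration, and it assumes the default address mentioned above.

```python
import urllib.request


def ollama_is_running(url="http://localhost:11434", timeout=2.0):
    """Return True if the URL answers with the text 'Ollama is running'."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as reply:
            body = reply.read().decode("utf-8", errors="replace")
            return "Ollama is running" in body
    except OSError:  # covers connection refused, timeouts and HTTP errors
        return False
```

Calling `ollama_is_running()` before the Ollama cell gives a quick True/False answer, which can be easier to debug than a stack trace from the client.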

If the page doesn't load, you might need to restart Cursor (and maybe reboot). Then open a Terminal (control+\`) and run `ollama serve`.

Useful Ollama commands (run these in the terminal, or with an exclamation mark in this notebook):

`ollama pull <model_name>` downloads a model locally  
`ollama ls` lists all the models you've downloaded  
`ollama rm <model_name>` deletes the specified model from your downloads

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Super important - ignore me at your peril!</h2>
            <span style="color:#ff7800;">The model called <b>llama3.3</b> is FAR too large for home computers - it's not intended for personal computing and will consume all your resources! Stick with the nicely sized <b>llama3.2</b> or <b>llama3.2:1b</b> and if you want larger, try llama3.1 or smaller variants of Qwen, Gemma, Phi or DeepSeek. See <a href="https://ollama.com/models">the Ollama models page</a> for a full list of models and sizes.
            </span>
        </td>
    </tr>
</table>

In [14]:
!ollama pull llama3.2

pulling manifest
pulling dde5aa3fc5ff:   0% ▕               

In [15]:
ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
model_name = "llama3.2"

response = ollama.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

**Temporary Agreement: River Water Sharing Mechanism**

Between Region A (upstream) and Region B (downstream)

1. **Allocating Shares:** Divide the river's shareable water volume between Region A and Region B based on their population needs:
   - Region A: 60% of total allocation, prioritizing crop use
   - Region B: 40% of total allocation, balancing hydroelectric plant operation, wetland maintenance, and reduced flow impact

2. **Initial Allocation:**
Region A: May increase water withdrawals to 140% of pre-crisis levels (70/100 of allocated volume per annum for the next two years) due to crop failure risks.
Regional B: Continues operating within allowed maximum allowable minimum allocations to ensure steady hydroelectric supply and ecological balance.

3.**Shortage Prioritization and Allocation Schedules**
   - During water shortages exceeding projected baslines:
     – For first 100 days of drought: Ensure full operation rate from water allocation rules for Region A's needs.
     –  Following drought’s first period: If Regional B’s electricity use does not approach the sustainable limits, allow additional portions of water allocation proportional to demand needs but ensuring that both parties satisfy all obligations.

4.**Compensation and Financing Mechanism:**
Implement the ‘Dispute Resolution and Emission Adjustment Agreement'. Any Party breaching agreed-upon rules must compensate affected counterpart. This adjustment can happen via financial transfers between Parties.

5. **Monitoring, Verification and Dispute Resolution:**
Each period involves monitoring water output within a specified range to allow parties correct their respective allocations.
Due to conflict or misunderstandings arise the following dispute resolution protocols are available:

    - For non-violative breaches - Arbitration by mutually agreed court decisions
   - For major violative breaches where one region must make larger allocation adjustments - The River Water Sharing Commission will oversee and mediate disputes with assistance from international organizations.

**Assumptions:** 
1. Population needs of both regions are estimated as 60%, for Region A and 40% of total shares between them.
2.Relevant baseline water withdrawal amounts can be identified.
3.Informative, publicly sourced scientific assessments on sustainable minimum allocations and short-term effects in each region.
4. Both regions commit at 50/50 level during agreement execution
5. Parties' full co-operation for this deal is required with no less than two-year commitment.

Alternative Solutions

1 Alternative: **Hybrid System:** Divide water between crop supply, irrigation of public agricultural fields and public facilities supporting public hygiene services, with a 60-30-10 ratio on Regional A's shares used to address needs during both pre-agree period and short periods in case droughts occur.
Benefits: More equitable split; ensures basic human health standards can be consistently met.
Risks: Possibility of lower capacity water use from upstream region during critical time causing loss of regional production and stability

2 Alternative: **Flexible Reserve:** Divide each share by the available reserve share for potential surplus or under-represented years. The excess amount will then serve a specific purpose – as set in agreements.
Benefits: Adjustments can be added to either side depending on their current needs, ensuring stability and equitable distribution without extreme inefficiencies caused during one party's priority at an other side’s expense.
Risks: In case of noncompliance by any one party, they must purchase the reserve water for both parties which is a major financial challenge.

3 Alternative: **Joint Water Management Facility:** Construct and jointly operate new infrastructure to store & distribute water from river water between Region A and B without sharing natural volume (including the right to withdraw it when needed), with no agreement set on amount or priority level. Regional water storage units allow for flexible capacity sharing.
Benefits: Easier access for all parties through a centralized facility, ensures flexibility in managing supply during periods of drought; promotes shared responsibility and trust amongst parties
Risks: May generate new operational costs & complexity

Concrete Metrics to Evaluate Success:
1. **Six Months**: 
   - Regional Crops Growth Yield Ratios after increasing water availability.
   - Region B Water Availability as a percentage beyond predetermined thresholds (allowing flexibility).

2.**Two Years**: 
   - Total crop yield by Region A and adjusted agricultural production capacity at the end of two years.
   - Electricity supply rate maintained during the entire time with specified limits

3. **Ten Years:**
   - Finalized total adjustment based allocation from drought periods to replenish stored amounts.
   - Regional Water Supply Stability Rate (% of sustainable max volumes, if possible):

In [16]:
# So where are we?

print(competitors)
print(answers)


['gpt-5-nano', 'claude-sonnet-4-5', 'deepseek-chat', 'llama3.2']
['Below is a structured, legally framed, and ethically grounded template for a temporary two-year river-sharing agreement between Upstream Region A and Downstream Region B. It is designed to be implementable with clear allocation rules, monitoring, finance, governance, dispute resolution, and contingency planning. You can adapt the numeric placeholders to your actual hydrological data and legal framework.\n\n1) Executive summary (what this intends to do)\n- Purpose: Prevent imminent famine in Region A over two years by allowing a targeted 40% increase in upstream withdrawals, while preserving downstream ecological integrity and the reliability of Region B’s hydroelectric electricity supply and the health of a key wetland.\n- Core balance: A’s people must have water for crops/consumption; B’s population must have reliable energy and a protected wetland ecosystem that should not degrade when flows fall by more than 20% rela

In [17]:
# It's nice to know how to use "zip"
for competitor, answer in zip(competitors, answers):
    print(f"Competitor: {competitor}\n\n{answer}")


Competitor: gpt-5-nano

Below is a structured, legally framed, and ethically grounded template for a temporary two-year river-sharing agreement between Upstream Region A and Downstream Region B. It is designed to be implementable with clear allocation rules, monitoring, finance, governance, dispute resolution, and contingency planning. You can adapt the numeric placeholders to your actual hydrological data and legal framework.

1) Executive summary (what this intends to do)
- Purpose: Prevent imminent famine in Region A over two years by allowing a targeted 40% increase in upstream withdrawals, while preserving downstream ecological integrity and the reliability of Region B’s hydroelectric electricity supply and the health of a key wetland.
- Core balance: A’s people must have water for crops/consumption; B’s population must have reliable energy and a protected wetland ecosystem that should not degrade when flows fall by more than 20% relative to baseline.
- Mechanisms: A legally bindi

In [18]:
# Let's bring this together - note the use of "enumerate"

together = ""
for index, answer in enumerate(answers):
    together += f"# Response from competitor {index+1}\n\n"
    together += answer + "\n\n"

In [19]:
print(together)

# Response from competitor 1

Below is a structured, legally framed, and ethically grounded template for a temporary two-year river-sharing agreement between Upstream Region A and Downstream Region B. It is designed to be implementable with clear allocation rules, monitoring, finance, governance, dispute resolution, and contingency planning. You can adapt the numeric placeholders to your actual hydrological data and legal framework.

1) Executive summary (what this intends to do)
- Purpose: Prevent imminent famine in Region A over two years by allowing a targeted 40% increase in upstream withdrawals, while preserving downstream ecological integrity and the reliability of Region B’s hydroelectric electricity supply and the health of a key wetland.
- Core balance: A’s people must have water for crops/consumption; B’s population must have reliable energy and a protected wetland ecosystem that should not degrade when flows fall by more than 20% relative to baseline.
- Mechanisms: A legally

In [20]:
judge = f"""You are judging a competition between {len(competitors)} competitors.
Each model has been given this question:

{question}

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}}

Here are the responses from each competitor:

{together}

Now respond with the JSON with the ranked order of the competitors, nothing else. Do not include markdown formatting or code blocks."""


In [21]:
print(judge)

You are judging a competition between 4 competitors.
Each model has been given this question:

Suppose you are an independent policy advisor for two neighboring regions that share a river: upstream Region A faces an imminent crop failure and must increase water withdrawals by 40% for the next two years to avoid widespread famine, while downstream Region B depends on steady flow for a hydroelectric plant supplying electricity to 1 million people and for a wetland that would substantially degrade if flow falls by more than 20%; design a legally and ethically defensible temporary agreement and implementation plan that balances immediate human survival in Region A with the long-term ecological and energy needs of Region B—include specific, enforceable allocation rules and quantitative thresholds; monitoring and verification methods; compensation and financing mechanisms; prioritized allocation schedules during shortages; governance and dispute-resolution procedures; measures to minimize pe

In [22]:
judge_messages = [{"role": "user", "content": judge}]

In [23]:
# Judgement time!

openai = OpenAI()
response = openai.chat.completions.create(
    model="gpt-5-mini",
    messages=judge_messages,
)
results = response.choices[0].message.content
print(results)


{"results": ["1", "3", "2", "4"]}


In [24]:
# OK let's turn this into results!

results_dict = json.loads(results)
ranks = results_dict["results"]
for index, result in enumerate(ranks):
    competitor = competitors[int(result)-1]
    print(f"Rank {index+1}: {competitor}")

Rank 1: gpt-5-nano
Rank 2: deepseek-chat
Rank 3: claude-sonnet-4-5
Rank 4: llama3.2
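
The parsing cell above trusts the judge to return clean JSON with every competitor listed once. A slightly more defensive version might look like the sketch below. `parse_ranking` is a hypothetical helper, and the assumption that a model may wrap its JSON in extra text or a code fence comes from general experience, not from this run.

```python
import json


def parse_ranking(raw, competitors):
    """Turn the judge's JSON reply into competitor names, best first."""
    # Assumption: keep only the outermost {...} in case the reply has extra text
    text = raw[raw.find("{"): raw.rfind("}") + 1]
    ranks = [int(rank) for rank in json.loads(text)["results"]]
    expected = list(range(1, len(competitors) + 1))
    if sorted(ranks) != expected:
        raise ValueError(f"Expected each of {expected} exactly once, got {ranks}")
    return [competitors[rank - 1] for rank in ranks]


names = ["gpt-5-nano", "claude-sonnet-4-5", "deepseek-chat", "llama3.2"]
for position, name in enumerate(parse_ranking('{"results": ["1", "3", "2", "4"]}', names)):
    print(f"Rank {position + 1}: {name}")
```

With the judge reply shown earlier, this prints the same four ranks as the cell above. The extra check matters most when a competitor failed (as Gemini did here), since a judge that miscounts would otherwise produce an `IndexError` or a silently wrong ranking.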


<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/exercise.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Exercise</h2>
            <span style="color:#ff7800;">Which pattern(s) did this use? Try updating this to add another Agentic design pattern.
            </span>
        </td>
    </tr>
</table>

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/business.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#00bfff;">Commercial implications</h2>
            <span style="color:#00bfff;">Patterns like this one, where you send a task to multiple models and then evaluate the results,
            are common when you need to improve the quality of your LLM response. This approach can be applied
            to many business projects where accuracy is critical.
            </span>
        </td>
    </tr>
</table>