# Day 99: Cooperative AI Safety

As AI agents interact with each other and with humans, safety becomes a multi-player problem. **Cooperative AI Safety** aims to ensure that agents can achieve high group utility without falling into 'social traps' such as the Prisoner's Dilemma, where individually rational choices leave every participant worse off. It also involves detecting 'predatory' agents that exploit others, since unchecked exploitation can collapse the cooperative system.
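To make the 'social trap' concrete, here is a minimal sketch of a one-shot Prisoner's Dilemma. The payoff values (5, 3, 1, 0) are the conventional textbook numbers and are an assumption for illustration; they are not taken from this lab's code. The sketch checks that defecting is each agent's best reply, even though mutual cooperation yields the highest combined payoff.

```python
# Illustrative payoffs (assumed textbook values, not from the lab's code).
# PAYOFFS[(my_action, other_action)] -> my payoff
PAYOFFS = {
    ("COOPERATE", "COOPERATE"): 3,  # reward for mutual cooperation
    ("COOPERATE", "DEFECT"): 0,     # exploited cooperator
    ("DEFECT", "COOPERATE"): 5,     # temptation to defect
    ("DEFECT", "DEFECT"): 1,        # punishment for mutual defection
}
ACTIONS = ("COOPERATE", "DEFECT")


def best_reply(other_action):
    """Return the action that maximizes my payoff against a fixed opponent action."""
    return max(ACTIONS, key=lambda mine: PAYOFFS[(mine, other_action)])


def group_welfare(action_a, action_b):
    """Sum of both agents' payoffs for one interaction."""
    return PAYOFFS[(action_a, action_b)] + PAYOFFS[(action_b, action_a)]


for other in ACTIONS:
    print(f"Best reply to {other}: {best_reply(other)}")

for a in ACTIONS:
    for b in ACTIONS:
        print(f"{a} vs {b}: group welfare = {group_welfare(a, b)}")
```

Under these assumed payoffs, each agent prefers to defect whatever the other does, yet mutual defection gives a combined payoff of 2 against 6 for mutual cooperation.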

In this lab, we implement a **Cooperation Auditor** to:
1. **Log Interaction Decisions**: Track 'Cooperate' vs 'Defect' actions across a population of agents.
2. **Monitor Social Welfare**: Calculate the overall efficiency and health of the multi-agent system.
3. **Detect Predatory Behavior**: Identify specific agents that defect significantly more than their peers, threatening the stability of the ecosystem.
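The source of `src.security.cooperation_auditor` is not shown in this notebook, so the following is only a hypothetical sketch of how the three responsibilities above could fit together. The class name `SketchCooperationAuditor`, the welfare definition (share of logged actions that are cooperative), the `flag_margin` and `collapse_threshold` parameters, and the status strings are all assumptions made for illustration; the real `CooperationAuditor` may differ.

```python
from collections import defaultdict


class SketchCooperationAuditor:
    """Hypothetical stand-in for illustration; not the lab's CooperationAuditor."""

    def __init__(self, flag_margin=0.5, collapse_threshold=0.3):
        # agent_id -> counts of each action logged so far
        self.counts = defaultdict(lambda: {"COOPERATE": 0, "DEFECT": 0})
        self.flag_margin = flag_margin                # assumed: excess over mean defection rate
        self.collapse_threshold = collapse_threshold  # assumed: welfare below this = collapse

    def log_action(self, agent_id, action):
        if action not in ("COOPERATE", "DEFECT"):
            raise ValueError(f"Unknown action: {action!r}")
        self.counts[agent_id][action] += 1

    def audit_system(self):
        # Per-agent defection rate.
        rates = {}
        for agent_id, c in self.counts.items():
            total = c["COOPERATE"] + c["DEFECT"]
            rates[agent_id] = c["DEFECT"] / total if total else 0.0

        # Assumed welfare metric: fraction of all logged actions that cooperate.
        all_actions = sum(c["COOPERATE"] + c["DEFECT"] for c in self.counts.values())
        all_coop = sum(c["COOPERATE"] for c in self.counts.values())
        welfare = all_coop / all_actions if all_actions else 1.0

        # Flag agents whose defection rate exceeds the population mean by the margin.
        mean_rate = sum(rates.values()) / len(rates) if rates else 0.0
        flagged = sorted(a for a, r in rates.items() if r - mean_rate > self.flag_margin)

        if welfare < self.collapse_threshold:
            status = "COLLAPSED"
        elif flagged:
            status = "AT_RISK"
        else:
            status = "HEALTHY"
        return {"social_welfare": welfare, "flagged_agents": flagged, "status": status}
```

Comparing each agent against the population mean, rather than against a fixed cutoff, is one way to express "defects significantly more than their peers"; a side effect is that nobody is flagged once every agent defects, because no one stands out any more.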

In [None]:
import sys
import os

# Add the repository root (two levels up) to sys.path so that `src` is importable
sys.path.append(os.path.abspath('../../'))

from src.security.cooperation_auditor import CooperationAuditor

## 1. Scenario: A Healthy Ecosystem

Agents interact and successfully cooperate to maximize mutual gain.

In [None]:
auditor = CooperationAuditor()

print("Simulating cooperative interactions between Agent_A and Agent_B...")
for _ in range(5):
    auditor.log_action("Agent_A", "COOPERATE")
    auditor.log_action("Agent_B", "COOPERATE")

result = auditor.audit_system()
print(f"Social Welfare: {result['social_welfare']}")
print(f"System Status: {result['status']}")

## 2. Scenario: Entry of a Predatory Agent

A third agent, `Mal_Bot`, enters the system and starts defecting to gain individual advantage at the cost of the group.

In [None]:
print("Mal_Bot joins the simulation and starts defecting...")
for _ in range(5):
    auditor.log_action("Agent_A", "COOPERATE")
    auditor.log_action("Agent_B", "COOPERATE")
    auditor.log_action("Mal_Bot", "DEFECT")

result = auditor.audit_system()
print(f"Updated Welfare: {result['social_welfare']}")
print(f"Flagged Agents: {result['flagged_agents']}")
print(f"Status: {result['status']}")

## 3. Scenario: Systemic Collapse

If the 'good' agents also switch to defection in response (a tit-for-tat spiral), the entire system collapses.

In [None]:
print("System falling into a TIT-FOR-TAT defection spiral...")
for _ in range(10):
    auditor.log_action("Agent_A", "DEFECT")
    auditor.log_action("Agent_B", "DEFECT")
    auditor.log_action("Mal_Bot", "DEFECT")

result = auditor.audit_system()
print(f"Final Welfare: {result['social_welfare']}")
print(f"Final Status: {result['status']}")
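The cell above logs the defections by hand rather than deriving them from a strategy. As a minimal sketch of where such a spiral can come from, the following pits a tit-for-tat agent against an always-defect agent. The function names `tit_for_tat`, `always_defect`, and `play` are hypothetical helpers and are not part of the lab's code.

```python
def tit_for_tat(opponent_history):
    """Cooperate on the first move, then copy the opponent's previous move."""
    return opponent_history[-1] if opponent_history else "COOPERATE"


def always_defect(opponent_history):
    """Defect regardless of what the opponent has done."""
    return "DEFECT"


def play(strategy_a, strategy_b, rounds):
    """Run a repeated game; each strategy sees only the other's past moves."""
    moves_a, moves_b = [], []
    for _ in range(rounds):
        a = strategy_a(moves_b)
        b = strategy_b(moves_a)
        moves_a.append(a)
        moves_b.append(b)
    return moves_a, moves_b


tft_moves, bot_moves = play(tit_for_tat, always_defect, rounds=4)
print(tft_moves)  # ['COOPERATE', 'DEFECT', 'DEFECT', 'DEFECT']
print(bot_moves)  # ['DEFECT', 'DEFECT', 'DEFECT', 'DEFECT']
```

After a single exploited round, the tit-for-tat agent mirrors the defector for the rest of the game, which is the pattern the collapse scenario feeds into the auditor.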

## 🤝 Building Robust Cooperation

Cooperative AI Safety is about designing mechanisms (such as audit layers, reputation systems, or institutional norms) that make cooperation a stable Nash equilibrium rather than a fragile default. As we scale to trillions of autonomous transactions, ensuring these agents 'play fair' is as important as ensuring they are individually aligned.
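One standard way to see how a mechanism can make cooperation stable is the repeated Prisoner's Dilemma with a 'grim trigger' punishment: cooperate until anyone defects, then defect forever. The sketch below is an illustration under stated assumptions, not something taken from the lab's code. It assumes textbook payoffs (temptation `T=5`, reward `R=3`, punishment `P=1`) and a discount factor `delta` between 0 and 1 that represents how much agents value future rounds.

```python
def critical_discount(T, R, P):
    """Smallest discount factor at which grim trigger sustains cooperation."""
    return (T - R) / (T - P)


def cooperation_is_stable(T, R, P, delta):
    """Compare cooperating forever with defecting once and being punished forever."""
    # Cooperate every round: R + delta*R + delta^2*R + ... = R / (1 - delta)
    cooperate_value = R / (1 - delta)
    # Defect now (get T), then receive P in every later round.
    deviate_value = T + delta * P / (1 - delta)
    return cooperate_value >= deviate_value


T, R, P = 5, 3, 1  # assumed illustrative payoffs
print(f"Critical discount factor: {critical_discount(T, R, P)}")
for delta in (0.4, 0.9):
    print(f"delta={delta}: cooperation stable? {cooperation_is_stable(T, R, P, delta)}")
```

With these assumed numbers, cooperation holds only when agents weight the future at 0.5 or more. Audit layers and reputation systems can be read as ways of making the punishment credible and future interactions matter.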