A comprehensive three-layer AI system designed to protect children online through intelligent content analysis, decision-making, and coordinated response actions. KidShield combines advanced LLM capabilities with robust safety mechanisms to provide real-time protection against digital threats.
This system receives incoming messages that have already been classified as suspicious. It:
- Detects Threats: uses advanced agents to analyze text and images for potential dangers
- Makes Smart Decisions: an AI-enhanced decision engine determines appropriate protective actions
- Coordinates Responses: manages communications with parents, children, and relevant authorities
- Provides Education: delivers age-appropriate safety resources and guidance
- Ensures Safety: maintains comprehensive audit trails and fallback mechanisms
KidShield implements a clean three-layer architecture for maximum modularity and scalability:
```
┌───────────────────────────────────────────────────────────────┐
│                           APP LAYER                           │
│     User Interfaces • APIs • Authentication • Dashboards      │
├───────────────────────────────────────────────────────────────┤
│                         GUARDIAN LAYER                        │
│  Agent Analysis • Threat Detection • Content Classification   │
├───────────────────────────────────────────────────────────────┤
│                          AGENT LAYER                          │
│    Decision Making • Communication • Action Coordination      │
└───────────────────────────────────────────────────────────────┘
```
🖥️ App Layer: User-facing interfaces, authentication, and application logic
🛡️ Guardian Layer: AI-powered content analysis and threat detection
🤖 Agent Layer: Intelligent decision-making and response coordination
The system integrates with BlackBox AI to provide enhanced capabilities:
- Enhanced Decision Reasoning: AI-generated explanations for all decisions
- Personalized Communications: Context-aware messages for parents, children, and senders
- Age-Appropriate Content: Automatically adapted messaging based on child's age and situation
- Fallback Safety: Graceful degradation to template-based responses if LLM is unavailable
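The fallback behavior can be pictured as a try/except around the LLM call. The following is a minimal sketch of that pattern only; the function and template names are illustrative, not the actual KidShield API:

```python
# Minimal sketch of graceful degradation: try the LLM first, fall back
# to a static template if the call fails. All names here are assumed.

TEMPLATES = {
    "bullying": "We detected a bullying incident and have taken protective action.",
}

def llm_generate(threat_type: str) -> str:
    # Stand-in for a real LLM call; raises to simulate an outage.
    raise ConnectionError("LLM unavailable")

def generate_parent_alert(threat_type: str) -> str:
    """Return an LLM-written alert, degrading to a template on failure."""
    try:
        return llm_generate(threat_type)
    except Exception:
        return TEMPLATES.get(threat_type, "We detected a safety incident.")

print(generate_parent_alert("bullying"))  # falls back to the template
```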
- Message Processor - Handles incoming suspicious messages with metadata
- Decision Engine - Analyzes context and selects appropriate actions
- Communication Generator - Creates tailored messages for different stakeholders
- Action Manager - Coordinates and executes chosen actions
- Educational Content Library - Age-appropriate safety resources
- Logging & Audit System - Tracks decisions and actions for transparency
- ✅ Multi-stakeholder communication (parents, children, senders)
- ✅ Age-appropriate content generation
- ✅ Severity assessment algorithms
- ✅ Educational resource matching
- ✅ Action justification and audit trails
- ✅ Configurable response templates
- ✅ Comprehensive logging and monitoring
The three layers work together in a coordinated flow:
```
1. 📱 App Layer receives user input (message, image, etc.)
          ↓
2. 🛡️ Guardian Layer analyzes content using LLM models
          ↓ (threat detected)
3. 🤖 Agent Layer makes decisions and coordinates responses
          ↓
4. 📱 App Layer delivers notifications and educational content
```
- App → Guardian: Content submission for analysis
- Guardian → Agent: Threat detection results and risk scores
- Agent → App: Action plans and communication content
- Cross-Layer: Comprehensive logging and audit trails
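The inter-layer handoffs above can be sketched as plain dataclasses passed between placeholder functions. Field names and the example threshold are assumptions for illustration, not the real schema:

```python
# Illustrative sketch of the App -> Guardian -> Agent -> App data flow.
from dataclasses import dataclass, field

@dataclass
class AnalysisRequest:   # App -> Guardian
    content: str

@dataclass
class ThreatResult:      # Guardian -> Agent
    threat_type: str
    risk_score: float    # 0.0 (benign) to 1.0 (critical)

@dataclass
class ActionPlan:        # Agent -> App
    actions: list = field(default_factory=list)

def guardian_analyze(req: AnalysisRequest) -> ThreatResult:
    # Placeholder for the Guardian Layer's LLM-based analysis.
    return ThreatResult(threat_type="bullying", risk_score=0.8)

def agent_decide(result: ThreatResult) -> ActionPlan:
    # Placeholder for the Agent Layer's decision engine.
    actions = ["notify_parent"] if result.risk_score >= 0.7 else ["log_only"]
    return ActionPlan(actions=actions)

plan = agent_decide(guardian_analyze(AnalysisRequest(content="example")))
print(plan.actions)  # the high risk score triggers a parent notification
```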
- Python 3.8+
- BlackBox AI API key
- Required packages (see requirements.txt)
- Clone the repository:

```shell
git clone https://github.com/jermiah/kidshield_app.git
cd kidshield_app
```

- Install dependencies:

```shell
python -m venv myenv
source myenv/bin/activate
pip install -r requirements.txt
```

- Set up your BlackBox API key:

```shell
# Create a .env file with your API key
touch .env
echo "BLACKBOX_API_KEY=your_api_key_here" > .env
```

- Run the FastAPI server:

```shell
cd guardian_layer/api
fastapi run guardian_api.py
```

- Run the Node.js server to connect to WhatsApp:

```shell
cd app_layer/nodejs
npm i
npm run dev
```

```python
from src.agents.ai_agent import AIAgent
from src.models.message import SuspiciousMessage, ChildProfile, MessageMetadata, ThreatType, SeverityLevel

# Initialize the AI agent with LLM enhancement
agent = AIAgent(use_llm=True)

# Create a suspicious message (normally received from the detection system)
message = SuspiciousMessage(
    message_id="msg_001",
    content="Suspicious content here",
    threat_type=ThreatType.BULLYING,
    severity=SeverityLevel.HIGH,
    child_profile=child_profile,
    metadata=metadata,
)

# Process the message
action_plan = agent.process_suspicious_message(message)

# Review the action plan with AI-enhanced reasoning
print(f"Generated {len(action_plan.decisions)} decisions")
print(f"Created {len(action_plan.communications)} communications")

# View enhanced reasoning
for decision in action_plan.decisions:
    print(f"Action: {decision.action_type.value}")
    print(f"AI Reasoning: {decision.reasoning}")
```

The system can be configured to use or disable LLM features:
```python
# Enable LLM (default)
agent = AIAgent(use_llm=True)

# Disable LLM (use templates only)
agent = AIAgent(use_llm=False)

# Configure via config file
config = {
    "llm": {
        "enabled": True,
        "fallback_to_templates": True,
        "temperature": {
            "decision_reasoning": 0.3,
            "parent_messages": 0.4,
            "child_messages": 0.5
        }
    }
}
agent = AIAgent(config, use_llm=True)
```

The system uses BlackBox AI for enhanced capabilities:
```python
from src.utils.blackbox_client import BlackBoxClient

# Direct LLM usage
client = BlackBoxClient()

# Generate enhanced decision reasoning
reasoning = client.generate_decision_reasoning(
    message_content="Inappropriate message",
    threat_type="bullying",
    severity="high",
    child_age=12,
    context={"sender_type": "stranger"},
)

# Generate a personalized parent message
parent_msg = client.generate_parent_message(
    child_name="Emma",
    threat_type="bullying",
    severity="high",
    action_taken="Sender blocked",
    tone="urgent",
)
```

The system handles various types of suspicious content:
- Bullying/Harassment - Cyberbullying and online harassment
- Inappropriate Requests - Requests from strangers or inappropriate contacts
- Sexual/Violent Content - Explicit or harmful content
- Manipulation/Scams - Attempts to manipulate or defraud
- Stranger Contact - Unsolicited contact from unknown individuals
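The categories above correspond to the `ThreatType` and `SeverityLevel` values imported in the quick-start example. The real definitions live in `src.models.message`; the enums below are a sketch with assumed member names:

```python
# Sketch of the threat and severity enums; member names are assumptions
# based on the categories listed above, not the shipped definitions.
from enum import Enum

class ThreatType(Enum):
    BULLYING = "bullying"
    INAPPROPRIATE_REQUEST = "inappropriate_request"
    EXPLICIT_CONTENT = "explicit_content"
    MANIPULATION = "manipulation"
    STRANGER_CONTACT = "stranger_contact"

class SeverityLevel(Enum):
    LOW = "low"
    MEDIUM = "medium"
    HIGH = "high"
    CRITICAL = "critical"

print(ThreatType.BULLYING.value)  # "bullying"
```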
```json
{
    "decision_engine": {
        "severity_thresholds": {
            "critical": 0.9,
            "high": 0.7,
            "medium": 0.4,
            "low": 0.0
        },
        "immediate_action_triggers": [
            "sexual_content",
            "violent_content",
            "manipulation"
        ]
    }
}
```

Contains age-appropriate safety tips, educational modules, and crisis resources.
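Reading the `severity_thresholds` config as lower bounds, a risk score maps to the highest band it reaches. A minimal sketch of that lookup (the function name is illustrative):

```python
# Sketch of applying severity_thresholds: each value is a lower bound,
# and a score maps to the highest band whose threshold it meets.
SEVERITY_THRESHOLDS = {"critical": 0.9, "high": 0.7, "medium": 0.4, "low": 0.0}

def score_to_severity(score: float) -> str:
    # Check bands from highest threshold to lowest.
    for level, threshold in sorted(SEVERITY_THRESHOLDS.items(),
                                   key=lambda kv: kv[1], reverse=True):
        if score >= threshold:
            return level
    return "low"

print(score_to_severity(0.75))  # "high": 0.75 clears 0.7 but not 0.9
```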
The AI agent follows a structured decision-making process:
1. Risk Assessment - Calculates a risk score based on multiple factors
2. Action Selection - Determines appropriate actions based on threat type and severity
3. Communication Planning - Generates targeted messages for stakeholders
4. Timeline Creation - Schedules actions based on priority
5. Follow-up Planning - Determines whether ongoing monitoring is needed

Risk scores are calculated from factors including:
- Threat type and severity level
- Child's age and vulnerability
- Sender history and behavior patterns
- Message frequency and context
- Previous incidents involving the child
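A hypothetical sketch of how such factors could combine into a single score. The weights, age cutoff, and clamping are illustrative only; the real engine's formula is internal to `src.agents`:

```python
# Hypothetical risk-score sketch: weighted factors, clamped to [0, 1].
# All weights and thresholds here are illustrative assumptions.
def risk_score(severity: float, child_age: int,
               prior_incidents: int, is_stranger: bool) -> float:
    score = severity                            # base: classifier severity in [0, 1]
    if child_age < 13:
        score += 0.1                            # younger children are more vulnerable
    score += min(prior_incidents, 3) * 0.05     # repeat incidents raise risk (capped)
    if is_stranger:
        score += 0.1                            # unknown senders raise risk
    return min(score, 1.0)                      # clamp to the valid range

print(risk_score(severity=0.7, child_age=12, prior_incidents=1, is_stranger=True))
```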
For parents:
- Urgent Notifications - Immediate alerts for high-risk situations
- Standard Notifications - Regular updates on incidents
- Educational Resources - Guidance on digital safety and communication

For children:
- Age-Appropriate Alerts - Gentle notifications tailored to age group
- Educational Content - Safety tips and awareness information
- Supportive Messages - Reassurance and guidance

For senders:
- Warning Messages - Firm warnings about inappropriate behavior
- Educational Information - Guidelines on acceptable online conduct
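One way to picture the stakeholder-specific messaging is a template table keyed by recipient and tone. This is a sketch with made-up wording and keys, not the shipped templates:

```python
# Sketch of per-stakeholder message templates; keys and wording are
# illustrative assumptions, not KidShield's actual template library.
TEMPLATES = {
    ("parent", "urgent"): ("Immediate attention needed: {child} received a "
                           "{threat} message. The sender has been {action}."),
    ("child", "supportive"): "You did nothing wrong. Here are some tips to stay safe online.",
    ("sender", "warning"): "Your message violated our safety guidelines. Further contact will be reported.",
}

def render(stakeholder: str, tone: str, **kwargs) -> str:
    # Look up the template and fill in incident-specific details.
    return TEMPLATES[(stakeholder, tone)].format(**kwargs)

print(render("parent", "urgent", child="Emma", threat="bullying", action="blocked"))
```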
Run the test suite:

```shell
# Run all tests
python -m pytest tests/

# Run specific test file
python -m pytest tests/test_decision_engine.py

# Run with coverage
python -m pytest tests/ --cov=src
```

The system includes comprehensive logging and auditing:
- Decision Tracking - All decisions are logged with reasoning
- Communication Logs - Record of all messages sent
- Action Execution - Tracking of action implementation
- Performance Metrics - System performance and effectiveness
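An audit trail like the one described above is often stored as structured JSON lines, one record per decision. A sketch with assumed field names:

```python
# Sketch of a structured audit record for decision tracking.
# Field names are assumptions, not the system's actual log schema.
import json
import datetime

def audit_entry(message_id: str, action: str, reasoning: str) -> str:
    """Serialize one decision as a JSON line for the audit trail."""
    return json.dumps({
        "timestamp": datetime.datetime.now(datetime.timezone.utc).isoformat(),
        "message_id": message_id,
        "action": action,
        "reasoning": reasoning,
    })

print(audit_entry("msg_001", "notify_parent", "High-severity bullying incident"))
```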
- Child Privacy - Age-appropriate communication that respects privacy
- Data Protection - Secure handling of sensitive information
- Transparency - Clear reasoning for all decisions and actions
- Non-Stigmatizing - Supportive approach that doesn't blame victims
- Input: Bullying message to 14-year-old
- Actions: Notify parent, educate child, warn sender, provide resources
- Communications: Supportive message to teen, informative alert to parent
- Input: Stranger requesting personal meeting with 12-year-old
- Actions: Block sender, immediate parent notification, educate child
- Communications: Age-appropriate safety reminder, urgent parent alert
- Input: Adult attempting to manipulate vulnerable child
- Actions: Block sender, escalate to authorities, comprehensive support
- Communications: Crisis resources, immediate intervention
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests for new functionality
- Submit a pull request
This project is licensed under the MIT License - see the LICENSE file for details.
For questions or support:
- Check the examples in the `examples/` directory
- Review the test cases in `tests/`
- Consult the configuration files in `config/`
- Machine learning integration for improved decision-making
- Real-time threat intelligence integration
- Advanced natural language processing for content analysis
- Integration with external safety platforms
- Mobile app for parent notifications
- Dashboard for monitoring and analytics
Note: This system is designed to work with pre-classified suspicious messages. It focuses on response coordination rather than initial threat detection.
