Every engineer has shipped a breaking change. Tests passed. Linter was clean. And then production went down.
PRism exists because "tests pass" β "safe to ship."
π¬ Watch the 2-minute Demo on YouTube
| ποΈ Architecture | architecture.mermaid |
| π Setup Platform | prism-dev-platform.orangemushroom-cc646ad1.eastus2.azurecontainerapps.io |
| π VS Code Extension | marketplace.visualstudio.com |
| π¬ Demo Video | youtube.com/watch?v=3jAxC7I3zYk |
Current CI/CD pipelines are binary and stateless. They ask one question: "Did the tests pass?"
They never ask:
- Has this file caused production incidents before?
- Did test coverage actually drop for these new code paths?
- Are we deploying at 4:58 PM on a Friday before a long weekend?
- Was retry logic silently removed from a payment-critical path?
This makes deployment decisions feel like guesswork β and sometimes, they are. The result is incidents that were entirely preventable, post-mortems written at 2 AM, and engineers afraid to merge.
PRism changes that. Instead of a binary pass/fail gate, PRism gives every PR a Deployment Confidence Score (0β100) β a multi-agent risk assessment that considers code quality, historical incidents, test coverage, and operational timing, all in real time.
PRism triggers automatically when a PR is opened or updated. Four specialized AI agents analyze the change in parallel, each returning a structured JSON payload via a shared Data Contract. The Verdict Agent ingests all four payloads and converges on a single governed decision.
Developer opens PR on GitHub
β
βΌ
prism-gate.yml (GitHub Actions)
auto-installed by Setup Wizard
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββ
β ORCHESTRATOR AGENT β
β (Microsoft Agent Framework) β
β Governed by Microsoft Foundry β
ββββ¬βββββββββββ¬βββββββββββ¬βββββββββββ¬ββββββββββ
β β β β
βΌ βΌ βΌ βΌ
[Diff [History [Coverage [Timing β Parallel execution
Analyst] Agent] Agent] Agent]
β β β β
ββββββββββββ΄βββββββββββ΄βββββββββββ
β
π Data Contract
(unified JSON schema)
β
βΌ
VERDICT AGENT
Deployment Confidence Score
+ Risk Brief
+ Rollback Playbook
β
ββββββββββββ΄βββββββββββ
βΌ βΌ
Score β₯ 70 Score < 70
β
Greenlight β Block Deploy
+ Auto-generate
missing tests
via Copilot
+ Rollback plan
See architecture.mermaid for the detailed system diagram (view on GitHub).
| Agent | Signal | Hero Tech |
|---|---|---|
| Diff Analyst | Scans PR diff for dangerous patterns β removed retry logic, missing error handlers, schema changes, hardcoded secrets | GitHub MCP Server + Azure OpenAI |
| History Agent | Correlates changed files with past production incidents via semantic search β "payment_service.py involved in 4 of last 8 incidents" | Azure MCP Server + Azure AI Search |
| Coverage Agent | Detects test coverage regression; triggers GitHub Copilot Coding Agent to auto-write missing tests and open a new PR | GitHub Copilot Coding Agent |
| Timing Agent | Flags high-risk deployment windows β Friday deploys, after-hours merges, pre-release proximity, US federal holidays | Microsoft Agent Framework |
| Verdict Agent | Ingests all four JSON payloads, computes Deployment Confidence Score (0β100), generates risk brief and rollback playbook via GPT-4o-mini with Azure Content Safety guardrails | Microsoft Foundry |
Agent Weights: Diff Analyst 30% Β· History Agent 25% Β· Coverage Agent 25% Β· Timing Agent 20%
Every specialist agent returns a standardized JSON payload. This enables true parallel execution β the Orchestrator dispatches all four agents simultaneously and the Verdict Agent aggregates without sequential handoffs.
{
"agent_name": "string",
"risk_score_modifier": 25,
"status": "warning",
"findings": [
"Specific finding 1",
"Specific finding 2"
],
"recommended_action": "Plain-English recommendation for the Verdict Agent."
}| Field | Type | Description |
|---|---|---|
agent_name |
string | Identifier for the agent |
risk_score_modifier |
integer 0β100 | 0 = perfectly safe, 100 = critical failure |
status |
enum | "pass" Β· "warning" Β· "critical" |
findings |
string[] | Specific, actionable findings |
recommended_action |
string | Plain-English recommendation for aggregation |
A PR comment posted automatically by PRism:
π¬ PRism Deployment Risk Assessment
Confidence Score: 21 / 100 β HIGH RISK β Deploy Blocked
Risk Brief:
β’ payment_service.py linked to 4 of the last 8 production incidents
β’ Test coverage dropped 9% (3 new functions have no tests)
β’ Deployment window: Friday 4:47 PM β historically high incident rate
β’ retry logic removed from a payment-critical path
Action Taken:
β
Missing tests auto-generated by Copilot β PR #47 opened for review
π Rollback playbook generated β attached to this PR comment
To override, a maintainer with write access must manually approve.
| Category | Technology |
|---|---|
| Agent Orchestration | Microsoft Agent Framework (Semantic Kernel) |
| AI Models | Azure OpenAI GPT-4o-mini (Sweden Central) via Microsoft Foundry |
| Platform & Governance | Microsoft Azure AI Foundry (azure-ai-projects) |
| Observability | OpenTelemetry + Azure Monitor / Application Insights |
| Content Safety | Azure Content Safety |
| Tool Connectivity | Azure MCP Server Β· GitHub MCP Server |
| Semantic Incident Search | Azure AI Search (BM25 + semantic ranking) |
| Incident Ingestion | Azure Functions (Timer + Event Grid + HTTP triggers) |
| Code Generation | GitHub Copilot Coding Agent (auto-generates missing tests) |
| Cloud Infrastructure | Azure Container Apps Β· Azure PostgreSQL Β· Azure Container Registry |
| Infrastructure as Code | Bicep + PowerShell deploy scripts |
| Backend | Python 3.12 / FastAPI / Uvicorn |
| Database | SQLite (dev) Β· PostgreSQL via asyncpg (production) |
| Authentication | GitHub OAuth2 Β· JWT (PyJWT) Β· Fernet AES encryption |
| IDE Integration | VS Code Extension (TypeScript) |
| CI/CD | GitHub Actions (self-dogfooding with prism-gate.yml) |
PRism/
βββ .github/
β βββ workflows/
β β βββ ci.yml # PRism dogfeeds its own CI gate
β β βββ prism-gate.yml # Workflow auto-installed in customer repos
β β βββ deploy-azure.yml # Deploy orchestrator to Azure Container Apps
β β βββ deploy-platform.yml # Deploy platform to Azure Container Apps
β βββ CODEOWNERS
βββ agents/
β βββ orchestrator/ # Parallel dispatch, FastAPI webhook server (:8000)
β βββ diff_analyst/ # Dangerous pattern scanner (heuristics + LLM)
β βββ history_agent/ # Azure AI Search incident correlator
β βββ coverage_agent/ # Test regression detector + Copilot trigger
β βββ timing_agent/ # Deploy window risk (pure deterministic)
β βββ verdict_agent/ # Score aggregator + Foundry governance
β βββ shared/ # AgentResult + VerdictReport data contracts
βββ platform/
β βββ server/ # FastAPI onboarding backend (:8080)
β β βββ routers/ # auth, github_setup, azure_setup, registrations
β β βββ services/ # auth_service, github_service, azure_service, db
β βββ frontend/ # Setup wizard (vanilla HTML/CSS/JS)
βββ mcp_servers/
β βββ azure_mcp_server/ # Azure AI Search wrapper + incident ingestion
β βββ github_connector/ # GitHub MCP server configuration
βββ foundry/
β βββ deployment_config/ # Bicep IaC templates + deploy/cleanup scripts
βββ function_deploy/ # Azure Function app (incident ingestion triggers)
βββ vscode_extension/ # PRism Confidence Sidebar (TypeScript)
βββ tests/ # Unit + integration test suite
βββ architecture.mermaid # System architecture diagram
βββ requirements.txt
βββ .env.example
AI Dev Days Hackathon β Microsoft, February 10 β March 15, 2026
Challenge: Automate and Optimize Software Delivery β Agentic DevOps
Target prizes: Grand Prize (Agentic DevOps) Β· Best Multi-Agent System Β· Best Enterprise Solution Β· Best Azure Integration
π¬ Watch our 2-minute Demo Video on YouTube
PRism directly addresses the challenge criteria: intelligent CI/CD pipelines with agent orchestration, automated incident response, and real-time reliability monitoring β with a pre-deployment risk gate that tests against real-world production state, not just isolated code.
For Judges: Download our VS Code extension and experience PRism from your own workspace. We cover up to 500 analysis runs using PRism's own Azure OpenAI model deployed on Microsoft Foundry β no Azure subscription required on your end. You can also try the live Setup Platform to onboard a repo in under 3 minutes.
Built by The Good Data Lab for the Microsoft AI Dev Days Hackathon 2026.
| Member | Owns |
|---|---|
| Ishan Soni | Orchestrator Β· Verdict Agent Β· Timing Agent Β· Foundry Governance Β· VS Code Extension Β· GitHub Actions & CI/CD Β· PR Comment Posting |
| Simarpreet Purba | History Agent Β· Azure MCP Server Β· Landing/Setup Platform Β· Incident Ingestion Pipeline Β· OAuth Flows Β· Bicep IaC |
| Gurinayat Mangat | Diff Analyst Agent Β· LLM Analysis Β· GitHub MCP Client Β· Heuristic Pattern Detection Β· PR Comment CI Risk Brief Β· Demo Video |
| Favour Ejike | Coverage Agent Β· Copilot Integration Β· Test Coverage Detection Β· Demo Video |
MIT β see LICENSE