BLUEPRINT — AI Property Due Diligence

A 7-agent adversarial AI pipeline that turns scattered public records into a sourced Buyer Risk Score — complete with a live OptimistAgent vs PessimistAgent debate before every verdict.

BLUEPRINT answers the question every homebuyer should ask but rarely gets a straight answer to: "What am I actually buying?"

Type any US address. BLUEPRINT's pipeline retrieves deed history, building permits, flood zone data, earthquake exposure, environmental hazards, and neighbourhood intelligence — cross-references everything via Elasticsearch hybrid search and ES|QL — then runs an adversarial AI debate to deliver a confidence-adjusted BUY / NEGOTIATE / AVOID verdict and a ranked Escape Plan for risk reduction.

What it does

A buyer types any US residential address. BLUEPRINT's 7-agent Google ADK pipeline runs in sequence, streaming live progress to the browser via Server-Sent Events:

#	Agent	What it does
1	GeocoderAgent	Normalises the address, geocodes to lat/lng, identifies county and state, creates the Elasticsearch case file
2	DeedAgent	Fetches deed and sale history from public county APIs. Flags price drops >30%, rapid flips, and quitclaim deeds
3	PermitAgent	Queries city building permit databases across 65+ US cities. Flags every open/unresolved permit — buyers inherit the liability
4	ClimateAgent	Checks FEMA National Flood Hazard Layer (flood zones AE/X) and USGS Earthquake Catalog within 75 km
5	NeighborhoodAgent	Queries EPA EJSCREEN (PM2.5, Superfund proximity, traffic pollution) and OSM Overpass (schools, parks, transit within 500m)
6	SynthesisAgent	Hybrid ELSER semantic + BM25 search over all Elasticsearch events + five ES
7	DebateAgent	OptimistAgent argues the score is too high. PessimistAgent argues it's too low. VerdictAgent adjudicates → confidence-adjusted BUY / NEGOTIATE / AVOID

Features

Live 7-agent pipeline with real-time SSE streaming — watch each agent's progress as it runs
Adversarial AI debate — OptimistAgent vs PessimistAgent → confidence-adjusted final verdict
Buyer Risk Score (0–100) — composite score from 7 data sources, stress-tested before delivery
Animated SVG gauge — score animates from 0 to final value with an easeOutCubic curve; re-animates when the debate updates it
Escape Plan — ranked, actionable steps to lower your risk score before or after purchase
Interactive property map — Leaflet.js map with risk-colored pin, 500m analysis radius, and FEMA flood zone overlay
Neighbourhood Intelligence — EPA air quality index, Superfund proximity score, school/park/transit access
Property Timeline — chronological deed/permit/climate event history with source citations, filterable by type
Flip fraud detection — ES|QL detects ≥3 deed transfers on the same property (rapid flip pattern)
ES|QL semantic reranking — top-risk events surfaced via .rerank-v1-elasticsearch before Gemini synthesis
Auto-watch — properties scoring ≥75 are automatically added to the watchlist for 24h monitoring
Cross-property intelligence — /api/similar/{hash} queries the Elasticsearch memory layer for other analysed properties with the same risk profile
Property comparison — run two full 7-agent pipelines in parallel, get a head-to-head "which should I buy?" verdict
Share links — generate a shareable report URL (90-day expiry, Elasticsearch-backed)
Watchlist — manually or automatically monitor properties; re-analysed every 24 hours
Q&A chat — ask Gemini 3 follow-up questions about any open report
HTML export — professional standalone buyer brief with gauge, timeline, debate results, and escape plan
Slack alerts — automatic webhook notification when risk score ≥ configurable threshold
Dark/light theme — persisted in localStorage

Tech stack

Layer	Technology
Agent framework	Google Cloud ADK 2.0 — `SequentialAgent` + `LlmAgent` + `FunctionTool` + `Runner`
Primary model	Gemini 3 Flash Preview (`gemini-3-flash-preview`) via AI Studio API
Fallback model	Gemini 2.5 Flash via Vertex AI — automatic if primary is unavailable
Search & memory	Elastic Cloud Serverless — ELSER hybrid retrieval, Agent Builder MCP, ES
Backend	FastAPI + Uvicorn — async Python, SSE streaming, 18 REST endpoints
Frontend	Vanilla JS + Leaflet.js — single-page app, all data from `/api/*` endpoints
Geocoding	OpenStreetMap Nominatim — no API key required
Permit data	65+ US cities via Socrata open data portals
Climate data	FEMA NFHL, USGS Earthquake Catalog, EPA EJSCREEN — all 50 states
Neighbourhood	OSM Overpass API — schools, parks, transit, amenities within 500m
Alerts	Slack Incoming Webhooks
Hosting	Google Cloud Run — Docker, 2 vCPU / 2 GiB, scales to zero

Elasticsearch integration

BLUEPRINT uses five Elasticsearch indices as a persistent intelligence layer:

Index	Purpose
`blueprint_cases`	Geocoded property case files — one document per address analysed
`blueprint_events`	Property events — permits, deeds, climate, neighbourhood findings (`semantic_text` field for ELSER)
`blueprint_reports`	Synthesised reports — risk scores, escape plans, debate verdicts (permanently searchable)
`blueprint_shared`	Share links — public report access with expiry timestamps
`blueprint_watched`	Watchlist — monitored properties re-analysed every 24 hours

Retrieval capabilities used:

ELSER hybrid search — semantic_text + BM25 over heterogeneous property records via Agent Builder MCP
text_similarity_reranker — .rerank-v1-elasticsearch inference endpoint to rerank BM25 results by semantic relevance; graceful BM25 fallback if the endpoint is unavailable
Agent Builder MCP — Streamable HTTP at {KIBANA_URL}/api/agent_builder/mcp; platform.core.esql tool used for ES|QL execution
ES|QL queries — five cross-reference queries per analysis:
1. Event type distribution with value aggregates
2. Permit-sale timing cross-reference (undisclosed construction detection)
3. High-confidence events filter (confidence ≥ 0.9)
4. Semantic RERANK — top 5 risk events via .rerank-v1-elasticsearch
5. Flip fraud detection — rapid deed transfer pattern (≥2 sales, flagged at ≥3)
Memory layer write-back — every agent writes findings to Elasticsearch before the next agent reads them; synthesised reports accumulate permanently and power cross-property intelligence

Permit coverage

Building permit data is pulled from Socrata open data portals across 65+ US cities:

Northeast: New York City, Philadelphia, Baltimore, Washington DC, Boston, Newark, Hartford, Providence, Pittsburgh
Southeast: Atlanta, Miami, Tampa, Orlando, Jacksonville, Charlotte, Raleigh, Richmond, Virginia Beach, New Orleans, Memphis, Nashville
Midwest: Chicago, Columbus, Cincinnati, Cleveland, Detroit, Indianapolis, Milwaukee, Minneapolis, Kansas City, St. Louis, Omaha, Wichita, Des Moines
South: Houston, Dallas, San Antonio, Austin, Fort Worth, El Paso, Lubbock, Oklahoma City
West: Los Angeles, San Diego, San Francisco, San Jose, Sacramento, Oakland, Fresno, Long Beach, Phoenix, Tucson, Mesa, Denver, Colorado Springs, Las Vegas, Portland, Seattle, Spokane, Albuquerque, Louisville, Honolulu, Anchorage
Other: Columbia SC, Manchester NH

All other US addresses still receive full climate, flood, earthquake, and environmental intelligence via FEMA + USGS + EPA + OSM.

Local setup

Prerequisites

Python 3.11+
Google Cloud project with Vertex AI API enabled
Elastic Cloud Serverless account (free trial available)
Gemini API key (paid tier recommended — free tier: 15 req/min)

1. Elastic Cloud setup

Go to cloud.elastic.co → Start free trial
Create a Serverless Elasticsearch project → choose Google Cloud as cloud provider
Open Kibana → navigate to Agent Builder (Search or Management section)
Enable Agent Builder — the built-in MCP server starts automatically
Go to Agent Builder → Tools → MCP → copy the MCP endpoint URL
Go to Stack Management → API keys → create a key with read + write + manage index privileges on blueprint_*, plus monitor_inference cluster privilege
Copy your Elasticsearch URL from the Connection details page

2. Configure environment

cd blueprint
cp .env.example .env

Edit .env:

# Google Cloud
GOOGLE_CLOUD_PROJECT=your-gcp-project-id
GOOGLE_CLOUD_REGION=us-central1
GEMINI_API_KEY=your-ai-studio-api-key
GEMINI_MODEL=gemini-3-flash-preview
VERTEX_MODEL=gemini-2.5-flash

# Elastic
ELASTIC_URL=https://your-deployment.es.us-central1.gcp.cloud.es.io
ELASTIC_API_KEY=your_api_key_here
ELASTIC_MCP_URL=https://your-deployment.kb.us-central1.gcp.cloud.es.io/api/agent_builder/mcp

# Slack alerts (optional — leave blank to disable)
SLACK_WEBHOOK_URL=https://hooks.slack.com/services/T.../B.../...
SLACK_ALERT_THRESHOLD=60

# App
APP_URL=http://localhost:8080
PORT=8080

3. Install and run

pip install -r requirements.txt
uvicorn backend.main:app --reload --port 8080

Open http://localhost:8080

Try the demo addresses:

363 Van Brunt St, Brooklyn, NY — Flood Zone AE (Sandy damage history), open DOB permits
2121 Airline Dr, Houston, TX — Superfund proximity, hurricane zone, high PM2.5
2000 E Olympic Blvd, Los Angeles, CA — Traffic pollution, earthquake zone, unpermitted additions

4. Verify

curl http://localhost:8080/api/health

Expected response includes "elasticsearch": "connected", "agents": 7, "gemini_model": "gemini-3-flash-preview".

elastic_mcp may show "unavailable (direct SDK fallback)" if your API key lacks Kibana privileges — the full pipeline still works using the Elasticsearch Python client directly.

Slack alerts

Go to api.slack.com/apps → Create New App → From Scratch
Name it anything (e.g. BLUEPRINT Alerts) → select your workspace → Create App
Left sidebar: Incoming Webhooks → toggle ON → Add New Webhook to Workspace
Pick a channel (e.g. #property-alerts) → Allow
Copy the webhook URL (https://hooks.slack.com/services/...)
Set SLACK_WEBHOOK_URL and SLACK_ALERT_THRESHOLD in .env
Restart the server — every analysis where the score meets or exceeds the threshold posts a formatted alert

Deploy to Google Cloud Run

# One-time setup
gcloud auth login
gcloud auth application-default login
gcloud config set project YOUR_PROJECT_ID

# Store secrets in Secret Manager
echo -n "your-api-key" | gcloud secrets create GEMINI_API_KEY --data-file=-
echo -n "https://..."  | gcloud secrets create ELASTIC_URL --data-file=-
echo -n "your-key"     | gcloud secrets create ELASTIC_API_KEY --data-file=-
echo -n "https://..."  | gcloud secrets create ELASTIC_MCP_URL --data-file=-

# Deploy
chmod +x deploy.sh
./deploy.sh

The script builds via Cloud Build, deploys to Cloud Run (2 vCPU / 2 GiB, scales to zero), and prints the live URL. Set that URL as APP_URL in .env for correct Slack alert links.

API reference

Method	Path	Description
`GET`	`/api/analyze/stream`	SSE real-time streaming analysis
`POST`	`/api/analyze`	One-shot JSON analysis (full 7-agent pipeline)
`POST`	`/api/compare`	Compare two properties in parallel
`POST`	`/api/ask`	Q&A chat about a stored report
`GET`	`/api/report/{hash}`	Retrieve stored report from Elasticsearch
`GET`	`/api/export/{hash}`	Download standalone HTML buyer brief
`POST`	`/api/share/{hash}`	Create shareable link (90-day expiry)
`GET`	`/api/share/{share_id}`	Retrieve report via share link
`POST`	`/api/watch`	Add property to watchlist
`GET`	`/api/watch`	List watched properties
`DELETE`	`/api/watch/{hash}`	Remove from watchlist
`GET`	`/api/similar/{hash}`	Similar-risk properties from Elasticsearch memory layer
`GET`	`/api/elastic/status`	Live Elastic index counts, MCP tools, and capability flags
`GET`	`/api/coverage`	All cities with permit data + nationwide climate sources
`GET`	`/api/health`	Service health and configuration
`GET`	`/api/about`	Educational content — methodology, glossary, agent descriptions
`GET`	`/api/stats`	Live platform statistics
`GET`	`/api/recent`	Recent analyses

Interactive API docs at /docs (Swagger) and /redoc.

Data sources

Source	Data	Coverage
OpenStreetMap Nominatim	Address geocoding	Worldwide
OSM Overpass API	Schools, parks, transit, amenities	Worldwide
NYC Open Data — DOB Permits & Sales	Building permits + rolling property sales	New York City (5 boroughs)
65+ Socrata city portals	Building permits	65+ US cities
FEMA National Flood Hazard Layer	Flood zone classification (AE, X, VE, AO, etc.)	All 50 states + territories
USGS Earthquake Catalog	Seismic events within 75 km (M2.5+)	Worldwide
EPA EJSCREEN	PM2.5, Superfund proximity, traffic pollution, diesel PM	All 50 states (block-group level)

All sources are public domain or openly licensed. No personally identifiable information about property occupants is collected or stored.

Project structure

blueprint/
├── backend/
│   ├── config.py                # All settings loaded from environment variables
│   ├── main.py                  # FastAPI app, lifespan, health/about/stats/similar/elastic-status
│   ├── routes/
│   │   ├── analyze.py           # /api/analyze, /api/analyze/stream (SSE), /api/ask, /api/recent
│   │   ├── compare.py           # /api/compare — parallel dual-pipeline comparison
│   │   ├── export.py            # /api/export — Gemini-generated standalone HTML report
│   │   ├── share.py             # /api/share — shareable links with 90-day expiry
│   │   └── watch.py             # /api/watch — watchlist CRUD + 24h background re-analysis
│   └── services/
│       ├── adk_runner.py        # 7-agent ADK SequentialAgent pipeline + SSE queue
│       ├── elastic_client.py    # Elasticsearch SDK + Agent Builder MCP, ELSER, ES|QL, reranking
│       ├── gemini.py            # Gemini 3 Flash + Vertex AI fallback
│       ├── geocoder.py          # Nominatim geocoding
│       ├── data_fetchers.py     # NYC, 65+ Socrata cities, FEMA, USGS, EPA, OSM Overpass
│       └── slack.py             # Slack Incoming Webhook alerts
├── frontend/
│   ├── index.html               # Single-page app — zero hardcoded data, everything from /api/*
│   ├── style.css                # Dark/light theme, mobile-responsive, Leaflet overrides
│   └── app.js                   # SSE client, animated gauge, Leaflet map, all report rendering
├── tests/
│   ├── conftest.py              # Shared fixtures
│   ├── test_health.py           # /api/health, /api/about, /api/stats
│   ├── test_analyze.py          # Full pipeline, SSE streaming, share, export, Q&A
│   ├── test_compare.py          # Parallel comparison
│   ├── test_watch.py            # Watchlist CRUD
│   └── test_validate.py         # Input validation (422/400/404)
├── Dockerfile                   # Python 3.11-slim, Cloud Run ready
├── deploy.sh                    # Cloud Build + Cloud Run deploy script
├── requirements.txt
├── .env.example                 # Environment variable template
└── LICENSE                      # Apache 2.0

Caveats

Permit coverage — NYC (5 boroughs) and Austin have the most detailed permit history; other cities use the Socrata generic schema. Addresses outside the 65 covered cities still receive full climate and environmental analysis.
Gemini free tier — 15 requests/minute; the 7-agent pipeline makes several model calls per analysis. A paid AI Studio key is recommended for sustained use.
Elastic MCP — requires a Kibana API key with feature_agentBuilder.read privilege. If unavailable, the pipeline falls back to the Elasticsearch Python client with identical functionality.
Not professional advice — BLUEPRINT provides property intelligence for informational purposes. Always consult licensed professionals before making purchasing decisions.

License

Apache 2.0 — see LICENSE

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BLUEPRINT — AI Property Due Diligence

What it does

Features

Tech stack

Elasticsearch integration

Permit coverage

Local setup

Prerequisites

1. Elastic Cloud setup

2. Configure environment

3. Install and run

4. Verify

Slack alerts

Deploy to Google Cloud Run

API reference

Data sources

Project structure

Caveats

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
backend		backend
frontend		frontend
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
deploy.sh		deploy.sh
pytest.ini		pytest.ini
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

BLUEPRINT — AI Property Due Diligence

What it does

Features

Tech stack

Elasticsearch integration

Permit coverage

Local setup

Prerequisites

1. Elastic Cloud setup

2. Configure environment

3. Install and run

4. Verify

Slack alerts

Deploy to Google Cloud Run

API reference

Data sources

Project structure

Caveats

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages