GeoSentinel — Live OSINT Intelligence Dashboard

◈ CATEGORY: LIVE AGENTS 🗣️
(Gemini Live Agent Challenge UX/UI Submission)

GeoSentinel is a next-generation, God's-Eye multimodal AI agent. It fuses real-time global telemetry (flights, maritime, satellite tracking) with Open-Source Intelligence (OSINT) to create a glowing, interactive cyber-physical digital twin of Earth.

By integrating Google's Gemini 2.0 Flash (Multimodal Live API), users can verbally interrogate the globe, ask for live sit-reps, and instantly visualize geopolitical hotspots using 3D thermographic hexbins and financial prediction layers.

🏆 Hackathon Requirements Fulfilled

Live Agents (Audio/Vision): Voice-activated OSINT dashboard that handles natural language interruptions and real-time data streaming.
Gemini Live API: Powered by the gemini-2.0-flash-exp model via WebSocket, hosted on Google Cloud.
Google Cloud Services: Backend (FastAPI) and Frontend (Vite/React) are both deployed as containers on Google Cloud Run.
Bonus Points (IaC): Fully automated containerization and deployment via the included deploy.sh script.

🏗 Architecture Diagram

(See architecture.md for the Mermaid source code)

🧠 Findings & Learnings

Development of GeoSentinel during the contest led to several key insights regarding the Gemini Live API and Multimodal Agent architecture:

Multimodal Grounding is Critical: One of our key findings was that relying solely on Gemini's internal knowledge for geography led to occasional coordinate hallucinations. By implementing a Nominatim Geocoding Tool, we successfully "grounded" the agent's navigation. The agent now validates every location name through OpenStreetMap before executing a FLY_TO command.
Asynchronous UI/Agent Synchronization: Handling interruptions (barge-in) naturally requires tight state synchronization between the React frontend and the FastAPI backend. We found that a single WebSocket tunnel carrying both audio and control directives provided the lowest latency and the most immersive experience.
OSINT Fusion at Scale: We learned that visual density (inspired by Palantir's aesthetic) actually improves agentic debugging. By having 8+ data layers visible on the 3D globe, it became easier to verify if the Gemini agent's spatial understanding matched the rendered reality.
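GCP Cloud Run for WebSockets: Initially, we faced timeout issues with standard load balancers. Cloud Run treats a WebSocket as a long-running HTTP request, so the stream is closed when the service's request timeout expires (5 minutes by default, configurable up to 60 minutes). We learned that listening on the port supplied through the PORT environment variable and raising the request timeout are both essential for long-running multimodal WebSocket streams.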
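
The grounding step from the first finding can be sketched as follows. This is a minimal illustration, not the project's actual tool: the function names and the shape of the FLY_TO directive (type, lat, lng, label) are assumptions made for the example. Only the Nominatim search endpoint and its JSON response fields (lat, lon, display_name) come from the public OpenStreetMap API.

```python
import json
from typing import Optional
from urllib.parse import urlencode
from urllib.request import Request, urlopen

NOMINATIM_SEARCH_URL = "https://nominatim.openstreetmap.org/search"


def build_search_url(place: str) -> str:
    """Build a Nominatim search URL that returns at most one JSON result."""
    query = urlencode({"q": place, "format": "json", "limit": 1})
    return f"{NOMINATIM_SEARCH_URL}?{query}"


def to_fly_to(results: list) -> Optional[dict]:
    """Turn a parsed Nominatim response into a FLY_TO directive.

    Returns None when the place is unknown, so the caller can refuse to
    move the camera instead of trusting model-generated coordinates.
    The directive shape is hypothetical.
    """
    if not results:
        return None
    top = results[0]
    return {
        "type": "FLY_TO",
        "lat": float(top["lat"]),  # Nominatim returns lat/lon as strings
        "lng": float(top["lon"]),
        "label": top.get("display_name", ""),
    }


def geocode(place: str) -> Optional[dict]:
    """Query Nominatim over the network and return a directive or None."""
    # Nominatim's usage policy asks for an identifying User-Agent;
    # the value below is a placeholder.
    request = Request(
        build_search_url(place),
        headers={"User-Agent": "geosentinel-example/0.1"},
    )
    with urlopen(request, timeout=10) as response:
        return to_fly_to(json.load(response))
```

Keeping the parsing in to_fly_to separate from the network call makes the validation rule easy to test offline: an empty result list never produces a camera move.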

🚀 Features

3-Column Intelligence Layout: Inspired by Palantir and Glint.trade, featuring a Live GDELT News Feed (updated every 5 min via backend proxy) and a reactive Planetary Intel panel.
Organic News-Driven Heatmaps: Real-time "Sentinel Scores" (0-100) are derived entirely from current news events extracted from GDELT headlines, with no mock hotspots.
8 Interactive 3D Data Layers: Flights (Arcs), Warships, Recon Satellites, Conflict Zones (Pulsing Rings), GPS Jamming, No-Fly Zones, Extreme Weather, and Prediction Markets.
Agentic Voice Control: Click the Mic button to talk natively to the Gemini Agent, which interprets intents (like FLY_TO) and autonomously orbits the camera to the spoken country.
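
To make the 0-100 "Sentinel Score" idea concrete, here is one way headline text could be turned into a bounded per-country score. This README does not describe the real extraction logic, so everything below is an assumption for illustration: the keyword weights, the saturation constant, and the (country, headline) input format are hypothetical.

```python
import math
from collections import defaultdict

# Hypothetical keyword weights; not taken from the GeoSentinel codebase.
KEYWORD_WEIGHTS = {
    "airstrike": 5.0,
    "missile": 5.0,
    "attack": 4.0,
    "sanctions": 2.0,
    "protest": 1.5,
}


def sentinel_scores(headlines, saturation=20.0):
    """Map (country, headline) pairs to an integer 0-100 score per country.

    The summed keyword weight is squashed with 1 - exp(-raw / saturation),
    so scores climb quickly at first and can never exceed 100.
    Countries with no matching keywords are omitted.
    """
    raw = defaultdict(float)
    for country, text in headlines:
        for word in text.lower().split():
            # Strip surrounding punctuation before the keyword lookup.
            raw[country] += KEYWORD_WEIGHTS.get(word.strip(".,:;!?\"'"), 0.0)
    return {
        country: round(100 * (1 - math.exp(-total / saturation)))
        for country, total in raw.items()
        if total > 0
    }


if __name__ == "__main__":
    sample = [
        ("UA", "Missile attack reported overnight"),
        ("UA", "New airstrike near the border"),
        ("FR", "Protest in Paris"),
        ("JP", "Cherry blossom season begins"),
    ]
    print(sentinel_scores(sample))
```

A saturating curve is used here rather than a linear scale so that a single busy news cycle cannot push every active region to the maximum.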

👨‍💻 Quick Start & Spin-Up Instructions

Prerequisites

Node.js 20+
Python 3.10+
A Gemini API Key from Google AI Studio.

Local Development

Start the Backend (FastAPI/Gemini)

```
cd backend
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
export GEMINI_API_KEY="your-api-key-here"
uvicorn agent:app --reload --port 8000
```

Start the Frontend (React/Vite & react-globe.gl)

```
cd frontend
npm install
npm run dev
# Runs on http://localhost:5173
```

🧪 Reproducible Testing

Judges can verify the core functionality of GeoSentinel using the following steps:

1. Verification of the Immersive UI

Action: Load the Live Frontend.
Expectation: The 3D globe should initialize in a dark "Command Room" atmosphere. The left panel should populate with live news from the GDELT stream.

2. Multi-Layer Intelligence Toggle

Action: Open the Planetary Intel panel on the right. Toggle the 'Aviation' and 'Conflicts' layers.
Expectation: The globe should render 3D arcs for flights and pulsing red rings for conflict zones. Hovering over an arc should reveal a tactical aircraft tooltip.

3. Gemini Live Agent (The "Beyond Text" Test)

Action: Click the Microphone button at the bottom center.
Voice Command: Say "GeoSentinel, fly to France."
Expectation: The agent should respond with audio via the Gemini Live API, geocode the location, and the camera should autonomously orbit to France. The intelligence panels should refresh to show France-specific data.
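
In this test, the spoken reply and the camera move travel over the same WebSocket. One plausible way to separate them is by frame type: binary frames carry audio and text frames carry JSON control directives. The sketch below assumes that convention; the helper names and the message fields are hypothetical and may differ from the project's actual wire format.

```python
import json
from typing import Union


def encode_directive(kind: str, **payload) -> str:
    """Serialize a control directive (for example FLY_TO) as a JSON text frame."""
    return json.dumps({"type": kind, **payload})


def route_frame(frame: Union[str, bytes]) -> tuple:
    """Classify one incoming WebSocket frame.

    Binary frames are treated as raw audio; text frames must be JSON
    objects with a 'type' field and are treated as control directives.
    """
    if isinstance(frame, (bytes, bytearray)):
        return ("audio", bytes(frame))
    message = json.loads(frame)
    if not isinstance(message, dict) or "type" not in message:
        raise ValueError("control frame is missing a 'type' field")
    return ("control", message)


if __name__ == "__main__":
    print(route_frame(b"\x00\x01\x02"))
    print(route_frame(encode_directive("FLY_TO", lat=46.6, lng=1.88)))
```

Under this convention the frontend can hand audio straight to playback while dispatching directives to the globe, without a second connection.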

4. Zero-Click Command Center

Action: Click the [ 📊 COMMAND CENTER ] button.
Expectation: A high-density tactical overlay (Masonry grid) should appear, displaying widgets like the Pentagon Pizza Index and AI Strategic Posture.

☁️ Google Cloud Deployment (Bonus)

We automate containerizing the entire stack and shipping it to Google Cloud Run.

Ensure you have the gcloud CLI installed and authenticated.
Edit deploy.sh to include your PROJECT_ID and GEMINI_API_KEY.
Run the Infrastructure-as-Code script:
```
chmod +x deploy.sh
./deploy.sh
```

This will build and deploy both containers, outputting the live public URLs for grading.

🏢 Proof of Google Cloud Deployment

GeoSentinel is fully architected for the Google Cloud ecosystem.

Live Backend: https://geosentinel-backend-1044499038422.us-central1.run.app
Live Frontend: https://geosentinel-frontend-1044499038422.us-central1.run.app
Automated IaC: Deployment is handled via deploy.sh, which automates the build and ship process to Cloud Run.
Cloud-Native SDK: The backend leverages the google-generativeai SDK to connect directly to Gemini 2.0 Flash endpoints.

📹 Demonstration Video

Watch the GeoSentinel OSINT Demo on YouTube

🏗 Full Architecture

See architecture.md for the detailed Mermaid system flow.
