Methodic

Autonomous research operations agent that turns B2B business decisions into governed, evidence-linked data.

Track 1: Build — Net-New Agents | Google for Startups AI Agent Challenge

What Methodic Does

Static B2B surveys produce shallow answers like "price." Methodic replaces them with an autonomous multi-agent workflow:

Plans the study — organizer + methodology agents structure objectives and push back on sampling bias
Conducts adaptive interviews — probes vague answers into specific decision variables in real time
Triangulates against CRM context — MCP tools pull deal and telemetry data mid-interview
Tracks coverage and autonomously re-plans — identifies gaps across 8 canonical research variables
Exports structured data — evidence-linked rows with confidence scores directly to BigQuery

Result: +0.692 composite quality improvement over static surveys (same rubric, same participants; coverage from 16.7% to 100% across 8 research variables).

Live Demo

Cloud Run: https://methodic-2030382823.us-central1.run.app

Demo UI: /static/demo.html — watch the full pipeline run autonomously
Interactive mode: /static/interactive.html — participate as the interview subject
Agent card: /.well-known/agent-card.json

Quick Start (Local)

pip install -r requirements.txt
uvicorn methodic.server:app --port 8080
# Open http://localhost:8080/static/demo.html

Architecture

SequentialAgent (root)
├── plan_phase (SequentialAgent)
│   ├── session_init (BaseAgent)
│   ├── organizer (LlmAgent)
│   ├── methodology (LlmAgent)
│   └── question_design (LlmAgent)
├── fieldwork (LoopAgent, max_iterations=3)
│   └── fieldwork_body (SequentialAgent)
│       ├── interviewer (LlmAgent)
│       ├── participant_sim (LlmAgent, demo mode)
│       ├── extractor_step (BaseAgent)
│       ├── turn_checker (BaseAgent)
│       ├── coverage_assessment (BaseAgent)
│       └── replan_step (BaseAgent)
└── finalize (SequentialAgent)
    ├── quality_reviewer (LlmAgent)
    ├── bigquery_export (BaseAgent)
    └── completion (LlmAgent)

Stack: Google ADK · Gemini 3.1 Pro Preview · MCP (stdio JSON-RPC 2.0) · FastAPI · Cloud Run · BigQuery · Vertex AI

Project Structure

methodic/                  # ADK agent package
  __init__.py              # Model configuration (Gemini 3.1 Pro / Flash)
  agent.py                 # Agent graph (SequentialAgent + LoopAgent)
  server.py                # FastAPI server (SSE streaming, interactive API)
  schemas.py               # Pydantic models (ParticipantResponse, 8 canonical fields)
  agents/                  # Custom BaseAgent steps (extraction, coverage, re-plan, BQ export)
  tools/                   # MCP client, BigQuery export, quality scoring
  static/                  # Demo UI (demo.html) + Interactive UI (interactive.html)
scripts/                   # MCP server, validators, recording, legacy WP scripts
fixtures/                  # Deterministic test data (CRM, telemetry, participants)
tests/                     # 73 unit/integration + 60 Playwright E2E tests
docs/                      # Submission materials, evidence, reviews

Tests

# All tests
python3 -m pytest tests/ -v

# Unit/integration only
python3 -m pytest tests/ --ignore=tests/e2e -v

# E2E (requires running server)
python3 -m pytest tests/e2e/ -v

Evidence

Evidence	Status	Source
Cloud Run health	PASS	`curl /health`
133 tests passing	PASS	`pytest --collect-only`
Live SSE stream (25–34 events)	PASS	live-run-2026-05-14.md
BigQuery live export (3 rows)	PASS	`bq query` on `methodic_demo.win_loss_responses`
Agent card	PASS	`/.well-known/agent-card.json`
Quality delta (+0.692)	PASS	Fixture benchmark

Honest Labels

See docs/limitations.md for what the prototype proves and what it does not.

Name		Name	Last commit message	Last commit date
Latest commit History 155 Commits
.claude		.claude
docs		docs
fixtures		fixtures
methodic		methodic
scripts		scripts
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
AGENTS-PROTOCOL.md		AGENTS-PROTOCOL.md
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
Dockerfile.cloudrun		Dockerfile.cloudrun
README.md		README.md
additionaldetails.md		additionaldetails.md
challenge.md		challenge.md
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Methodic

What Methodic Does

Live Demo

Quick Start (Local)

Architecture

Project Structure

Tests

Evidence

Honest Labels

Submission Materials

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Methodic

What Methodic Does

Live Demo

Quick Start (Local)

Architecture

Project Structure

Tests

Evidence

Honest Labels

Submission Materials

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages