Intelligent regulatory compliance through agentic AI.
Convert dense regulatory documents into traceable business and technical requirements using multi‑agent retrieval, open‑web enrichment, and structured reporting.
ING-AgenticAI is an open‑source system for regulatory intelligence. It takes portfolios of regulations, directives, and guidance and produces a consolidated requirements bundle with citations and a full decision trail. The focus is practical: convert legal text into implementable obligations while preserving evidence and traceability.
Regulatory compliance teams face a recurring bottleneck:
- Long, complex regulatory documents with interdependent requirements
- Manual interpretation and translation into business and technical actions
- Fragmented sources, slow updates, and limited auditability
This process is slow, expensive, and error‑prone. Missing a single obligation can lead to significant compliance risk.
ING-AgenticAI automates regulatory analysis with a multi‑agent RAG pipeline:
- Agentic reasoning for document understanding and requirement synthesis
- Semantic retrieval over ingested PDFs with Milvus and SentenceTransformers
- Open‑web enrichment behind a controlled MCP boundary
- Full audit logging of every retrieval, decision, and source used
- Structured outputs designed for both machines (JSON) and stakeholders (PDF)
The system is composed of five core parts:
- Ingestion pipeline: parse PDFs, chunk content, embed, and persist vectors.
- MCP tool server: secure boundary for retrieval and web tools.
- LangGraph orchestration: multi‑agent pipeline across discovery, context assessment, and requirements generation.
- Decision logging: JSONL event stream for complete traceability.
- Orchestration runner: portfolio‑level execution and report generation.
The end‑to‑end workflow is purpose‑built for regulatory text:
- Query agent analyzes structure and produces targeted retrieval queries.
- Retrieval node performs deterministic vector search with de‑duplication.
- Context assessor determines gaps and triggers controlled enrichment.
- Requirements agent synthesizes obligations into structured bundles with citations.
The system uses three specialized agents with explicit inputs and outputs. Each agent operates on a structured state object and emits a typed result that is logged for traceability.
- Regulatory discovery agent: consumes grouped document text and extracted headings, classifies the document type, produces a concise summary, and emits a small set of retrieval queries optimized for vector search.
- Context assessment agent: evaluates retrieval results for coverage, identifies missing information, and produces a list of follow‑up queries that may trigger controlled web enrichment.
- Requirements synthesis agent: merges document summaries, retrieval chunks, and approved web context into a structured requirements bundle with business/data requirements, rationale text, assumptions, and citation links to chunk IDs or URLs.
When gaps are detected, the system follows a strict triage:
- Metadata search (DuckDuckGo snippets only).
- Candidate screening with rationale.
- Selective fetching through MCP with boilerplate removal.
- Content vetting to ensure concrete, relevant obligations.
Only approved sources are added to the context, and every decision is logged.
- Machine‑readable JSON requirements bundle with citations
- Business‑ready PDF summary
- Full decision log in JSONL for audit and replay
- Portfolio‑level aggregation across multiple documents
- Deterministic retrieval with chunk‑level citations
- Traceable reasoning with a complete event trail
- Configurable persistence for vector stores and reports
documentation/Architecture.d2->documentation/Architecture.pngdocumentation/BusinessFlow.d2->documentation/BusinessFlow.pngdocumentation/SystemSequenceAgents.d2->documentation/SystemSequenceAgents.pngdocumentation/UseCase.d2->documentation/UseCase.pngdocumentation/Architecture.drawio->documentation/Architecture.drawio.png
-
Configure environment
cp .env.example .env export GEMINI_API_KEY="<your_gemini_key>"
-
Start the stack
docker compose -f docker-compose.local.yml up --build
-
Use the UI + API
- Frontend:
http://localhost:3000 - Backend health:
http://localhost:8000 - Generate requirements:
POST http://localhost:8000/api/pipeline
- Frontend:
-
Install dependencies
pip install -e . -
Configure environment
cp .env.example .env # if you keep one export GEMINI_API_KEY="<your_gemini_key>"
The same key is forwarded to
GOOGLE_API_KEYfor the LangChain Gemini client. Optionally setVECTOR_STORE_DIRif you want a custom persistence folder. -
Run the pipeline
python -m AgenticAI.agentic.pipeline_runner --data-dir data --vector-dir artifacts/vector_store
The runner will:
- rebuild the Milvus collection (unless it already exists and
--rebuild-storeis omitted), - start the MCP regulation server automatically,
- execute the LangGraph pipeline per document,
- persist the JSON requirements bundle to
artifacts/requirements.jsonand a formatted PDF toartifacts/requirements.pdf(configurable via--pdf-output).
- rebuild the Milvus collection (unless it already exists and
-
Inspect results
Open
artifacts/requirements.json(machine readable) orartifacts/requirements.pdf(business friendly) to review requirements with citations to document chunks and web sources.
--rebuild-store– force ingestion even if a vector store already exists.--top-k– number of chunks fetched per query (default 15).--server-script– run a custom MCP server implementation if needed.--output– change the output JSON path.--pdf-output– change where the PDF rendering is stored (defaultartifacts/requirements.pdf).
AgenticAI/pipeline/ingestion.py– ingestion and Milvus persistence helpers.AgenticAI/mcp_servers/regulation_server.py– FastMCP server exposing retrieval, metadata search, and fetch tools.AgenticAI/mcp/client.py– lightweight stdio MCP client for the LangGraph runner.AgenticAI/agentic/*– document grouping utilities, Pydantic schemas, and LangGraph orchestration.AgenticAI/agentic/pipeline_runner.py– main entrypoint that ties everything together.visualizer/index.html– UI for browsing requirement bundles and cited sources.
MIT. See LICENSE.

