Your AI Agent Doesn't Need Neo4j — Lightweight Ontology in Practice #20

xg-gh-25 · 2026-05-19T03:07:26Z

xg-gh-25
May 19, 2026
Maintainer

Your AI Agent Doesn't Need Neo4j — Darwinian Knowledge Management

Your agent's knowledge base will bloat, decay, and contradict itself. You might reach for Neo4j.
Stop. Ask first: does your knowledge need better storage, or a brain that can forget?

You've Seen This Problem

Your AI Agent has been running for 3 months. It's accumulated 200+ "lessons learned." Then you notice:

A 6-month-old best practice is now an anti-pattern in the new framework version
The agent cited a 2024 workaround to make a 2026 decision — and introduced a bug
Three contradictory entries exist in the knowledge base. Nobody remembers which is current
Every session injects ALL knowledge into the prompt. Token costs spike. Most of it is irrelevant

Knowledge bloat isn't a storage problem. It's a lifecycle problem.

The Obvious Solution: Graph Database

Neo4j. Amazon Neptune. Stardog. Define ontology in OWL, query with SPARQL, traverse with Cypher. Structured storage + semantic queries + relationship reasoning. Sounds perfect.

We considered it. Then hit three fatal issues:

1. Maintenance tax is brutal. Every code change requires syncing triples. Renaming a relationship type in Neo4j is a MATCH + CREATE + DELETE chain, because Cypher cannot change a relationship's type in place. In Markdown it's sed.

2. Query cost > injection cost. A 1M-token context window holds roughly 750K words. Your entire knowledge base might be 12K tokens. Injecting it directly into the system prompt costs far less than maintaining a RAG pipeline + embeddings + retrieval + re-ranking.

3. Graph databases don't manage death. Neo4j keeps every node until something explicitly deletes it. Usage-based forgetting means custom cron jobs + business logic + versioned cleanup, and "when to forget" is the hardest decision.

The Core Thesis: Darwinism vs Encyclopedia

Most knowledge management systems follow the encyclopedia model: store everything, never delete, filter at query time.

We chose the Darwinian model: knowledge has a lifecycle. Used = strengthened. Unused = decays. Eventually = dies.

Encyclopedia: knowledge → store → immortal (until someone manually cleans)
Darwinian:    knowledge → use → strengthen ←→ unused → decay → archive → forget

Why Darwinian is better for AI Agents:

Agents don't need "all knowledge." They need "knowledge relevant right now." A 6-month-old Python 3.9 workaround in a 3.12 environment isn't just useless — it's harmful.
Forgetting is a feature, not a bug. Human brains forget because forgetting frees cognitive resources for what matters. An agent's context window is finite, so useless knowledge occupying space crowds out useful knowledge.
References = natural selection. If a knowledge entry hasn't been referenced by any pipeline, any decision, any conversation in 90 days — it's probably not important. No human judgment needed. The system knows.

Implementation: Three Layers (~1,000 Lines of Python)

Don't let the theory scare you. The core is: a schema + entries with lifecycles + a relation file.

Layer 1: Define How Knowledge Is Organized (Schema)

You need to answer: how many types? What does each look like?

We use 5 types (MECE — Mutually Exclusive, Collectively Exhaustive):

| Type | When to use | Example |
| --- | --- | --- |
| guideline | "Do this" | "Atomic commits: one logical change per commit" |
| pitfall | "Don't do this" | "fcntl.flock on symlinked path = arbitrary file lock" |
| decision | "We chose A over B" | "Chose Amazon Transcribe over Whisper: existing SSO" |
| model | "This thing's structure is" | "Session states: COLD→STREAMING→IDLE→DEAD" |
| process | "Steps for this task" | "Release: build→verify→tag→publish" |

Why not free-form tags? Because tags have no query contract. You can't say "BUILD stage needs all guidelines + pitfalls" if every entry has arbitrary tags. Fixed types = programmable query interface.

Schema doesn't need OWL. A Markdown file's section structure IS schema. The agent just Reads it.
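To make the "programmable query interface" concrete, here is a minimal sketch. It assumes entries are written in the `- [type] **Title**` line format used elsewhere in this post. The `STAGE_TYPES` mapping and the helper names `parse_entries` and `entries_for_stage` are hypothetical, chosen for illustration, and are not taken from the SwarmAI code.

```python
import re
from dataclasses import dataclass

# The five fixed types from the table above.
TYPES = {"guideline", "pitfall", "decision", "model", "process"}

# Hypothetical mapping: which types each pipeline stage wants injected.
STAGE_TYPES = {"BUILD": {"guideline", "pitfall"}}

# Matches lines such as: - [pitfall] **Some title** ...
ENTRY_RE = re.compile(r"^- \[(\w+)\] \*\*(.+?)\*\*")


@dataclass
class Entry:
    type: str
    title: str


def parse_entries(markdown: str) -> list[Entry]:
    """Collect typed entries, rejecting any type outside the fixed set."""
    entries = []
    for line in markdown.splitlines():
        m = ENTRY_RE.match(line)
        if not m:
            continue  # not an entry line (heading, metadata comment, prose)
        if m.group(1) not in TYPES:
            raise ValueError(f"unknown type: {m.group(1)}")
        entries.append(Entry(m.group(1), m.group(2)))
    return entries


def entries_for_stage(entries: list[Entry], stage: str) -> list[Entry]:
    """The query contract: a stage name maps to a fixed set of types."""
    wanted = STAGE_TYPES[stage]
    return [e for e in entries if e.type in wanted]
```

Because the type set is closed, a typo such as `[pitfal]` fails loudly at parse time instead of silently creating a sixth category.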

Layer 2: Each Entry Is a Living Entity

- [pitfall] **Lock fd leaked on exception path** — ... (2026-05-19)
  <!-- ref:3 | last:2026-05-19 | decay:active -->

Three fields determine life and death:

| Field | Meaning | Who updates |
| --- | --- | --- |
| `ref:N` | Times referenced | System (title match in pipeline output) |
| `last:date` | Last reference date | System |
| `decay:state` | active / dormant / archived | System (daily assessment) |
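Reading and writing these three fields needs nothing more than a regular expression. The sketch below assumes the comment always has exactly the shape shown in the example entry (`ref`, then `last`, then `decay`, separated by `|`). The function names `parse_meta` and `format_meta` are hypothetical.

```python
import re
from datetime import date

META_RE = re.compile(
    r"<!--\s*ref:(\d+)\s*\|\s*last:(\d{4}-\d{2}-\d{2})\s*\|\s*decay:(\w+)\s*-->"
)


def parse_meta(line: str) -> dict:
    """Parse '<!-- ref:N | last:YYYY-MM-DD | decay:state -->' into a dict."""
    m = META_RE.search(line)
    if m is None:
        # Absent metadata is reported, not silently skipped.
        raise ValueError(f"no lifecycle metadata in: {line!r}")
    return {
        "ref": int(m.group(1)),
        "last": date.fromisoformat(m.group(2)),
        "decay": m.group(3),
    }


def format_meta(meta: dict) -> str:
    """Inverse of parse_meta, for writing the comment back after an update."""
    return (
        f"<!-- ref:{meta['ref']} | last:{meta['last'].isoformat()}"
        f" | decay:{meta['decay']} -->"
    )
```

Keeping the metadata in an HTML comment means it stays invisible when the Markdown is rendered but is still plain text for the agent and for `grep`.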

Decay rules:

ref:0 + 90 days unreferenced → dormant (flagged ⚠️, visible but not actively injected)
ref:0 + 180 days unreferenced → archived (moved to archive file, removed from active set)
ref:≥10 → double grace period (180d before dormant) — classics are more durable
Created < 30 days → immune — give new knowledge time to prove itself

Key: no human involvement. The decay engine runs daily. You don't need "quarterly knowledge reviews" — the system knows who lives and who dies.
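The decay rules above can be sketched as a single pure function. Several details here are assumptions, since the rules leave them open: the 90/180-day thresholds are applied to every entry with fewer than 10 references, entries with `ref >= 10` get both thresholds doubled (the post only states the 180-day dormant threshold), and "unreferenced for N days" is read as strictly more than N days. A missing date is treated as infinitely old, in line with Mistake 3 below. The name `assess` and its parameters are hypothetical.

```python
from datetime import date
from typing import Optional

DORMANT_DAYS = 90
ARCHIVE_DAYS = 180
IMMUNITY_DAYS = 30
CLASSIC_REFS = 10


def assess(ref: int, created: Optional[date], last: Optional[date], today: date) -> str:
    """Return 'active', 'dormant', or 'archived' for one entry."""
    # New entries are immune; a missing creation date earns no immunity.
    if created is not None and (today - created).days < IMMUNITY_DAYS:
        return "active"
    # A missing reference date counts as infinitely old, never as an exemption.
    idle = (today - last).days if last is not None else 10**9
    # Assumption: frequently referenced entries get both thresholds doubled.
    factor = 2 if ref >= CLASSIC_REFS else 1
    if idle > ARCHIVE_DAYS * factor:
        return "archived"
    if idle > DORMANT_DAYS * factor:
        return "dormant"
    return "active"
```

Passing `today` in as an argument, instead of calling `date.today()` inside, keeps the function deterministic and easy to test against fixed dates.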

Layer 3: Relations Between Entries

# .knowledge-graph.yaml (142 relations, self-growing)
- s: "Lock fd leaked on exception path"
  p: applies_to
  o: "ddd_orchestrator.py"
  c: "2026-05-19"
  u: "2026-05-19"

10 relation types (motivated_by, supersedes, extends, applies_to, conflicts_with, etc.), all in one YAML file.

The killer feature: relations grow automatically. When the system processes a file and references a knowledge entry, it auto-creates an applies_to relation. Next time it processes the same file, that entry gets a priority boost.

More use → richer relations → better recommendations → more use. Flywheel.
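A minimal sketch of that flywheel, using plain dicts in the same shape as the YAML example. It assumes `c` and `u` mean "created" and "updated" dates, which the YAML example suggests but does not state. Loading and saving the YAML file is left out, and the function names `record_applies_to` and `rank_for_file` are hypothetical.

```python
from datetime import date

# Each relation mirrors the YAML shape: s, p, o, c, u.
Relation = dict


def record_applies_to(relations: list[Relation], title: str, path: str, today: date) -> None:
    """Create an applies_to relation, or refresh 'u' if it already exists."""
    stamp = today.isoformat()
    for r in relations:
        if r["s"] == title and r["p"] == "applies_to" and r["o"] == path:
            r["u"] = stamp  # assumed meaning: last updated
            return
    relations.append({"s": title, "p": "applies_to", "o": path, "c": stamp, "u": stamp})


def rank_for_file(titles: list[str], relations: list[Relation], path: str) -> list[str]:
    """Put entries already linked to this file first, keeping the original order otherwise."""
    linked = {r["s"] for r in relations if r["p"] == "applies_to" and r["o"] == path}
    # sorted() is stable, so entries within each group keep their incoming order.
    return sorted(titles, key=lambda t: t not in linked)
```

The boost here is a simple two-tier ordering. A weighted score that also considers `ref` counts or recency would fit in the same place.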

Mistakes We Made (So You Don't Have To)

Mistake 1: Signal words for type classification can't be common words.

First version used "pipeline", "step", "→" as process signals. Result: 37/179 entries misclassified — those words appear in ALL types of technical text. Fix: signal words must be unique to the type ("race condition" → pitfall, "workflow:" → process).
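One way to enforce "unique to the type" is to classify only when exactly one type's signals match. The sketch below is an illustration, not the SwarmAI classifier: only the phrases "race condition" and "workflow:" come from this post, and the `SIGNALS` table and `classify` function are hypothetical.

```python
from typing import Optional

# Assumed signal phrases. Only these two examples come from the post.
SIGNALS = {
    "pitfall": ["race condition"],
    "process": ["workflow:"],
}


def classify(text: str) -> Optional[str]:
    """Return a type only when exactly one type's signals match, else None."""
    lowered = text.lower()
    hits = {t for t, phrases in SIGNALS.items() if any(p in lowered for p in phrases)}
    # Zero hits or conflicting hits are both left for the caller to resolve.
    return hits.pop() if len(hits) == 1 else None
```

Returning `None` on ambiguity forces the caller to decide explicitly what to do with an unclassified lesson, instead of guessing from a common word.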

Mistake 2: Title matching for reference tracking has false positives.

Title "Build" matches any text containing "Build". Fix: skip titles < 8 chars, use word boundary regex for 8-20 chars, substring for 20+.
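The three tiers can be sketched as follows. The post does not say which tier a title of exactly 20 characters falls into; this sketch assumes it gets substring matching. The function name `title_referenced` is hypothetical.

```python
import re


def title_referenced(title: str, text: str) -> bool:
    """Tiered matching: skip short titles, word-boundary for medium, substring for long."""
    t = title.strip().lower()
    body = text.lower()
    if len(t) < 8:
        return False  # too short to match reliably
    if len(t) < 20:
        # Lookarounds instead of \b so titles ending in punctuation still work.
        pattern = r"(?<!\w)" + re.escape(t) + r"(?!\w)"
        return re.search(pattern, body) is not None
    return t in body
```

For example, the 13-character title "Atomic commit" does not match the text "use atomic commits here", because the trailing "s" breaks the word boundary.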

Mistake 3: The decay system can't have "exemption paths."

First version: if an entry has no date, skip decay assessment. Result: manually added knowledge never decays. Fix: no date = treat as infinitely old → immediate decay trigger.

Mistake 4 (most expensive): If a gate can pass because data is "absent," the gate doesn't exist.

Our pipeline validator checked "is the adversarial review tier correct?" But what if the entire field is missing? A None value is neither wrong nor right; it skips the check entirely. We hit the same bug class four times before we learned: absent = violation, not exemption.
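A gate that treats absence as a violation might look like this sketch. The field name `review_tier` and the tier values in `ALLOWED_TIERS` are hypothetical placeholders; the real validator's schema is not shown in this post.

```python
ALLOWED_TIERS = {"light", "standard", "deep"}  # hypothetical tier names


def check_review_tier(record: dict) -> list[str]:
    """Return a list of violations; an empty list means the gate passes."""
    tier = record.get("review_tier")  # hypothetical field name
    if tier is None:
        # Absent data fails the gate instead of skipping the check.
        return ["review_tier is missing"]
    if tier not in ALLOWED_TIERS:
        return [f"review_tier has unexpected value: {tier!r}"]
    return []
```

The important line is the first branch: a missing key and an explicit `None` both produce a violation, so there is no path through the function that passes without a valid value.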

The Compound Loop: Why Knowledge Gets Better Over Time

Pipeline runs → produces lessons → tagged by type → references old knowledge (ref+1)
                                                         ↓
                                         auto-creates applies_to relation
                                                         ↓
                                         next time same module → relevant knowledge first
                                                         ↓
                                         better suggestions → fewer bugs → better lessons
                                                         ↓
                                         unused knowledge → auto-decay → knowledge stays lean

Evidence:

179 active entries (2 months accumulated), expected steady state: 80-100
142 relations (auto-extracted, 53% coverage, ~2 new relations per pipeline run)
Token overhead: 12K (1.2% of 1M context)
Human maintenance cost: 0

Start Today: 3 Steps (10 Minutes to 1 Hour)

You don't need our system. The core idea works with any agent:

Step 1: Add type labels to your lessons (10 minutes)

# Before
- Don't call subprocess.run in async functions

# After
- [pitfall] **Don't call subprocess.run in async functions** — blocks event loop (2026-05-02)

Just [guideline] / [pitfall] / [decision] is enough to start.

Step 2: Add reference counting (30 minutes)

In your agent's post-execution hook:

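from datetime import date

output = agent_output.lower()  # lowercase once, not once per entry
for entry in knowledge_entries:
    # Plain substring match; skip or tighten short titles (see Mistake 2)
    if entry.title.lower() in output:
        entry.ref_count += 1
        entry.last_referenced = date.today()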

Step 3: Write a daily cron for decay (1 hour)

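from datetime import date

today = date.today()
for entry in entries:
    # No date means infinitely old (see Mistake 3), never an exemption
    last = entry.last_referenced or date.min
    days_since_ref = (today - last).days
    if days_since_ref > 90 and entry.state == "active":
        entry.state = "dormant"  # Stop injecting, still searchable
    if days_since_ref > 180 and entry.state == "dormant":
        archive(entry)  # Move out of the active set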

That's it. The relation layer (Layer 3) is advanced — add it when you have 100+ entries and discover "globally popular ≠ currently relevant."

When You DO Need a Real KG

Honest boundaries. Consider Neo4j/Neptune when ANY of these hold:

| Condition | Why lightweight isn't enough |
| --- | --- |
| Knowledge > 10,000 entries | Exceeds context window effective injection range |
| Need 3-hop+ queries | "All decisions indirectly affecting module X" is O(n³) in list comprehension |
| Multiple concurrent writers | File locks aren't enough, need transaction isolation |
| Need automatic reasoning | "If A applies_to B and B requires C, then A indirectly requires C" |

For 1-3 person + AI teams, these conditions won't hold for the foreseeable future. When 179 entries scan in 0.1ms, you don't need an index.
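For scale, here is what an occasional multi-hop lookup looks like over the flat relation list while you are still below those thresholds. This is a sketch under assumptions: relations are dicts with `s` and `o` keys as in the YAML example, edges are followed from `s` to `o` regardless of relation type, and the function name `within_hops` is hypothetical.

```python
from collections import deque


def within_hops(relations: list[dict], start: str, max_hops: int) -> set[str]:
    """Breadth-first walk over s -> o edges, up to max_hops from start."""
    edges: dict[str, list[str]] = {}
    for r in relations:
        edges.setdefault(r["s"], []).append(r["o"])
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, depth = queue.popleft()
        if depth == max_hops:
            continue  # do not expand past the hop limit
        for nxt in edges.get(node, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, depth + 1))
    return seen - {start}
```

Once queries like this become routine, or need to filter and reason over relation types at each hop, that is the signal the table describes: the work belongs in a graph database.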

Principles (Take These, Forget the Implementation Details)

Forgetting is a feature, not a bug. When designing knowledge systems, think "how to delete" before "how to store."
References = natural selection. Used knowledge survives. Unused knowledge dies. No "knowledge audits" needed.
Fixed types > free tags. 3-5 MECE types give you a programmable query contract.
Schema should be agent-readable. If the agent needs a special parser for your knowledge structure, the structure is too complex.
Context window IS your database. Under 100K tokens, full injection > RAG.
Mechanical > aspirational. "Review knowledge quarterly" gets ignored. "Daily auto-decay" doesn't.

Conclusion

The problem with AI Agent knowledge management isn't "how to store more" — it's "how to ensure everything alive is useful."

A best practice nobody has referenced in 90 days has the same value to the agent as a deleted entry — zero. The difference is the former still occupies tokens, pollutes context, and consumes attention.

Kill it. Let the living knowledge breathe.

~1,000 lines of Python + Markdown + YAML. Zero external dependencies. Start today.

Author: XG | SwarmAI — Human directs, AI delivers
Code: github.com/xg-gh-25/SwarmAI
Discussion: github.com/xg-gh-25/SwarmAI/discussions/20
