
# 🤖 Why AI Agents Fail (And How to Fix Them)


Research-backed solutions to the three critical failure modes that break AI agents in production: hallucinations, timeouts, and memory loss.

⭐ Star this repository


## 🎯 Learning Path: Understand → Prevent → Scale

This repository demonstrates research-backed techniques for preventing AI agent failures with working code examples.

| 🚨 Failure Mode | 💡 Solution Approach | 📊 Projects | ⏱️ Total Time |
|---|---|---|---|
| Hallucinations | Detection and mitigation through 4 techniques | 4 demos | 2 hours |
| Timeouts | Context management and async patterns | Coming soon | - |
| Memory Loss | Persistent memory and context retrieval | Coming soon | - |

## 🎭 Stop AI Agent Hallucinations

The Problem: Agents fabricate statistics, choose wrong tools, ignore business rules, and claim success when operations fail.

The Solution: 4 research-backed techniques that detect, contain, and mitigate hallucinations before they cause damage.

### 📓 Hallucination Prevention Demos

| 📓 Demo | 🎯 Focus & Key Learning | ⏱️ Time | 📊 Level |
|---|---|---|---|
| 01 - Graph-RAG vs Traditional RAG | Structured data retrieval: compare RAG vs Graph-RAG on 300 hotel FAQs, Neo4j knowledge graph with automatic entity extraction, eliminate statistical hallucinations | 30 min | Intermediate |
| 02 - Semantic Tool Selection | Intelligent tool filtering: filter 31 tools down to the top 3 relevant ones, reduce errors and token costs, dynamic tool swapping | 45 min | Intermediate |
| 03 - Multi-Agent Validation Pattern | Cross-validation workflows: an Executor → Validator → Critic pattern catches hallucinations, Strands Swarm orchestration | 30 min | Intermediate |
| 04 - Neurosymbolic Guardrails for AI Agents | Symbolic validation: compare prompt engineering vs symbolic rules, business rule compliance the LLM cannot bypass | 20 min | Intermediate |
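The tool-filtering idea from Demo 02 can be sketched without any of the demo's dependencies: embed each tool description and the query, then expose only the top-k most similar tools to the agent. The toy vectors and tool names below are invented for illustration; the actual demo uses SentenceTransformers embeddings and FAISS over 31 tools.

```python
import math

# Hypothetical toy embeddings standing in for SentenceTransformers vectors.
TOOL_EMBEDDINGS = {
    "book_room":      [0.9, 0.1, 0.0],
    "cancel_booking": [0.8, 0.2, 0.1],
    "get_weather":    [0.0, 0.1, 0.9],
    "send_invoice":   [0.1, 0.9, 0.2],
}

def cosine(a, b):
    """Cosine similarity between two dense vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def select_tools(query_embedding, k=3):
    """Return the k tools most similar to the query; the rest stay hidden from the agent."""
    ranked = sorted(TOOL_EMBEDDINGS,
                    key=lambda name: cosine(query_embedding, TOOL_EMBEDDINGS[name]),
                    reverse=True)
    return ranked[:k]

# A booking-flavored query vector surfaces the booking tools first.
print(select_tools([0.85, 0.15, 0.05], k=2))
```

Restricting the agent to a short, relevant tool list is what cuts both wrong-tool hallucinations and per-query token cost.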

### 📊 Key Results

| 🎯 Technique | 📈 Improvement | 🔍 Metric |
|---|---|---|
| Graph-RAG | Accuracy | Precise queries on 300 hotel FAQs via knowledge graph |
| Semantic Tool Selection | Reduced errors and token costs | Tool selection hallucination detection (research validated), token cost per query |
| Neurosymbolic Rules | Compliance | Business rule enforcement the LLM cannot bypass |
| Multi-Agent Validation | Error detection | Invalid operation detection before reaching users |
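The multi-agent validation result can be sketched with plain functions standing in for Strands Swarm agents: the Executor's claimed result must pass a Validator check and a Critic gate before anything reaches the user. The refund scenario and the limit are invented for illustration, not taken from the demo.

```python
# Minimal Executor -> Validator -> Critic sketch (illustrative, not the demo's API).

def executor(task):
    # Pretend the agent "executed" the task and produced a claimed result.
    return {"task": task, "result": "refund issued", "amount": 120}

def validator(output):
    # Cross-check the claimed operation against a hard constraint.
    if output["amount"] > 100:
        return {"valid": False, "reason": "amount exceeds refund limit"}
    return {"valid": True, "reason": ""}

def critic(output, verdict):
    # Final gate: only validated results are released to the user.
    if not verdict["valid"]:
        return f"BLOCKED: {verdict['reason']}"
    return output["result"]

out = executor("refund order #42")
print(critic(out, validator(out)))  # the invalid refund never reaches the user
```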

→ Explore hallucination prevention demos
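In the same spirit, the neurosymbolic guardrail of Demo 04 reduces to a small idea: the agent's proposed action is checked against hard symbolic rules that no prompt wording can talk around. The rule set and action format below are illustrative assumptions, not the demo's actual API.

```python
# Symbolic business rules as (name, predicate) pairs; hypothetical examples.
RULES = [
    ("discount <= 20", lambda a: a.get("discount", 0) <= 20),
    ("checkout not before checkin", lambda a: a.get("checkout", 0) >= a.get("checkin", 0)),
]

def enforce(action):
    """Return (allowed, violated_rule_names). Runs outside the LLM,
    so no amount of prompt engineering can bypass it."""
    violated = [name for name, check in RULES if not check(action)]
    return (len(violated) == 0, violated)

# An over-generous discount is rejected regardless of how it was phrased.
print(enforce({"discount": 35, "checkin": 3, "checkout": 5}))
```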


## Why Your Agent Times Out

(Coming soon)


## Your Agent Doesn't Remember You

(Coming soon)


## 🔧 Technologies Used

| 🔧 Technology | 🎯 Purpose | ⚡ Key Capabilities |
|---|---|---|
| Strands Agents | AI agent framework | Dynamic tool swapping, multi-agent orchestration, conversation memory, hooks system |
| Amazon Bedrock | LLM access | Claude 3 Haiku/Sonnet for agent reasoning and tool calling |
| Neo4j | Graph database | Relationship-aware queries, precise aggregations, multi-hop traversal |
| FAISS | Vector search | Semantic similarity, tool filtering, efficient nearest-neighbor search |
| SentenceTransformers | Embeddings | Text embeddings for semantic tool selection and memory retrieval |
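To see why a graph store earns its place in this stack, here is a toy adjacency-list "knowledge graph" (hotel names and amenities are made up): aggregate questions are answered by exact edge traversal rather than fuzzy text matching, which is the property Neo4j provides at scale.

```python
# Edges keyed by (source node, relationship), values are target nodes.
GRAPH = {
    ("Hotel A", "HAS_AMENITY"): ["pool", "gym"],
    ("Hotel B", "HAS_AMENITY"): ["gym"],
    ("Hotel C", "HAS_AMENITY"): ["pool", "spa"],
}

def hotels_with(amenity):
    """Exact traversal: follow HAS_AMENITY edges instead of matching FAQ text."""
    return sorted(hotel for (hotel, rel), targets in GRAPH.items()
                  if rel == "HAS_AMENITY" and amenity in targets)

print(hotels_with("pool"))      # exact answer set, no statistical guessing
print(len(hotels_with("gym")))  # precise aggregation
```

A vector store would retrieve FAQ passages that *mention* pools; the graph answers "which hotels have a pool" exactly, which is why the Graph-RAG demo eliminates statistical hallucinations on counting-style questions.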

## Prerequisites

Before You Begin:

- Python 3.9+ installed locally
- LLM access: OpenAI (default), AWS Bedrock, Anthropic, or Ollama
- `OPENAI_API_KEY` environment variable (for the default setup)
- AWS CLI configured if using Bedrock (`aws configure`)
- Basic understanding of AI agents and tool calling

Model Configuration: All demos use OpenAI with GPT-4o-mini by default. You can swap to any provider supported by Strands; see Strands Model Providers for configuration.

AWS Credentials Setup (if using Bedrock): Follow the AWS credentials configuration guide to configure your environment.


## 🚀 Quick Start Guide

### 1. Clone Repository

```bash
git clone https://github.com/aws-samples/sample-why-agents-fail
cd sample-why-agents-fail
```

### 2. Start with Hallucinations

```bash
cd stop-ai-agent-hallucinations
```

### 3. Explore All Techniques

Each demo folder contains detailed README files and working code examples.


## 💰 Cost Estimation

| 💰 Service | 💵 Approximate Cost | 📊 Usage Pattern | 🔗 Pricing Link |
|---|---|---|---|
| OpenAI GPT-4o-mini | ~$0.15 per 1M input tokens | Agent reasoning and tool calling | OpenAI Pricing |
| Amazon Bedrock (Claude) | ~$0.25 per 1M input tokens | Alternative LLM provider | Bedrock Pricing |
| Neo4j (local) | Free | Graph database for demos | Neo4j Community |
| FAISS (local) | Free | Vector search library | Open source |
| SentenceTransformers | Free | Local embeddings | Open source |

💡 All demos can run locally at minimal cost. OpenAI GPT-4o-mini is the most cost-effective option for testing.
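For a back-of-envelope check against the per-million-token prices above (the token counts below are an assumption about a typical demo run, not a measurement):

```python
# USD per 1M input tokens, taken from the cost table above.
PRICE_PER_M_INPUT = {"gpt-4o-mini": 0.15, "bedrock-claude": 0.25}

def input_cost(model, tokens):
    """Approximate input-token cost in USD for a run consuming `tokens` tokens."""
    return tokens / 1_000_000 * PRICE_PER_M_INPUT[model]

# e.g. a hypothetical demo pass consuming ~200k input tokens:
print(round(input_cost("gpt-4o-mini", 200_000), 4))  # about $0.03
```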


## 📖 Additional Learning Resources


⭐ Star this repository • 📖 Start Learning


## 🤝 Contributing

Contributions are welcome! See CONTRIBUTING for more information.


## 📄 License

This library is licensed under the MIT-0 License. See the LICENSE file for details.
