Lightweight behavior control layer for LLM using latent state, reward, and self-evaluation (no training required)
reinforcement-learning autonomous-agents ai-agents self-evaluation llm-agent local-llm hallucination-detection behavior-system
-
Updated
Mar 29, 2026 - Python