Lightweight pairwise evaluator for relational signals in Ouro-2.6B-Thinking loop-state trajectories.
-
Updated
Apr 26, 2026 - Python
Lightweight pairwise evaluator for relational signals in Ouro-2.6B-Thinking loop-state trajectories.
An air-gapped AI contemplation loop. A local model thinks, reflects, and builds a corpus of philosophical thought over time. No internet. No chat interface. Just a mind alone with ideas.
The RCP Experiment is the first completed work in what will become a series of experiments in how LLMs make decisions on morality and values.
Progressive Trust Framework: AI Agent Safety Evaluation Benchmark with 290 scenarios testing Intelligent Disobedience
Add a description, image, and links to the ai-alignment-research topic page so that developers can more easily learn about it.
To associate your repository with the ai-alignment-research topic, visit your repo's landing page and select "manage topics."