Aether Engine & Operating System (AetherOS)

AetherOS is a proprietary Rust inference engine and autonomous AI operating system middleware. It augments small offline GGUF models (1.2B–8B parameters) with multi-model agentic loops, continuous latent reasoning, dual parallel inference, and an active OS-control tool registry.

Software Architecture

Goal / Directive
       │
       ▼
┌────────────────────────────────────────────────────────┐
│  AGENTIC LOOP (src/agent.rs)                           │
│  - Stateless perceive → think → act → observe loop     │
│  - Pluggable personas: Aether, Claude, Arena           │
│  - Speculative thought search (MCTS) + self-verification│
└────────────────────────────────────────────────────────┘
       │
       ▼
┌────────────────────────────────────────────────────────┐
│  14 INFERENCE MIDDLEWARES (src/*.rs)                   │
│  - Structural evaluation (ATD: Likelihood vs Entropy)  │
│  - Text/latent recurrence loops (CLT)                  │
│  - L1/L2 byte ring-buffer streaming (Duet Parallel Run)│
│  - Associative FFT fixed-size state matrix (HCM Arena) │
└────────────────────────────────────────────────────────┘
       │
       ▼
┌────────────────────────────────────────────────────────┐
│  ACTIVE 24-TOOL REGISTRY (src/tools.rs)                │
│  - Real Filesystem IO (Sandboxed in /home/user)        │
│  - Process shell execution (std::process::Command)     │
│  - Active in-memory window manager & autonomous plans  │
│  - Runtime dynamic tool registration (skill_register)  │
└────────────────────────────────────────────────────────┘
       │
       ▼
OpenAI-Compatible Inference Backend (Ollama, llama.cpp, vLLM)

24 Active Agent Tools

AetherOS dispatches actions through 24 typed tool executors implemented in src/tools.rs. The execution engine actively supports real filesystem modifications, sandboxed Python/Bash process execution, and dynamic capability authoring at runtime:

Filesystem: file_read · file_write · file_list · file_delete
Execution: exec · sandbox_eval · code_analyze · git_orchestrate · net_probe
Window Management: window_open · window_close
Memory (Akasha): memory_add · memory_search · web_search
Autonomous Planning: plan_create · plan_update
Cognitive Middleware: mcts_speculate · hypnos_sleep · genesis_toggle · siren_phase_sync · duet_parallel_run · l1l2_buffer_flush · autopoiesis_full_engage
Extensibility: skill_register (Allows the agent to write and register custom scripts in Python/Bash at runtime)

The 14 Inference Middlewares

Aether Engine augments every downstream LLM completion through 14 interconnected software innovations designed to reduce token quantization errors and eliminate dynamic memory saturation:

#	Innovation	Description
1	Semantic Memory Graph	TF-IDF sparse vectors, cosine similarity edges, multi-hop retrieval
2	Cognitive Decomposer	Recursively breaks complex queries into dependency-ordered sub-questions
3	Self-Verification Loop	Output quality auditing with automated self-correcting retry passes
4	Knowledge Distillation	Long-term caching store capturing highly successful decomposition graphs
5	Context Compressor	Two-tier 40K → 4K token signal-preserving context compactor
6	Action Cache	Instant exact/semantic similarity return fast-path (Cosine > 0.95)
7	Speculative Prefetch	Detached async worker that warms adjacent memory graph caches
8	HCM Arena	FFT holographic state matrix absorbing infinite context into fixed 16KB
9	CLT Reasoning	N-step latent/text recurrent loops checking cosine stability
10	ATD Contestation	Dual-graph verification colliding Likelihood against Structural Entropy
11	MCTS Speculation	Monte Carlo Thought Search rollouts using UCT scores for complex paradigms
12	Hypnos Slumber	Asynchronous dream/sleep consolidation abstracting daily scattered log nodes
13	Nano-SIREN Cap	Periodic sinusoidal representation network (`sin(wWx + b)`) for phase projection
14	Duet Parallel Run	Two parallel 1.2B models communicating via an L1/L2 byte ring-buffer

Key Operational Subsystems

The Genesis Reactor (`src/genesis.rs`)

A permanent asynchronous Tokio background loop that executes continuously every 12 seconds. It reflects on the active tool catalog, audits holographic context interference, warms adjacent memory networks, and automatically authors and registers diagnostic capabilities.

Aether Autocoder (`src/autocoder.rs`)

An offline edge specialization engine tuned for 1.2B–3B code models (e.g., qwen2.5-coder-1.5b.gguf). It leverages the high raw generation speed of small models (135+ tok/sec) to run multiple fast self-correcting compilation and evaluation loops inside the sandboxed workspace.

The Masterpiece Flagship Redesign (`GET /desktop`) — "The Claude / GPT / Gemini Killer"

A completely reimagined, ultra-premium, professional 3-column integrated split workspace served directly by the engine at http://localhost:3004/desktop. It natively disrupts classical overlapping OS window GUI metaphors:

Left VFS & Capabilities Explorer: Interactive Virtual File System (VFS) tree to instantly browse, read, create, and delete files in the sandboxed host environment, combined with a collapsible 24 God-Mode Tools catalog.
Central AI Workspace: An elite flagship chat HUD with distinctive styling for user turns, internal MCTS speculative rollouts, SIREN phase logs, active tool execution blocks, and multi-modal quick prompts.
Right File Editor & Visual Overlay Panel: Real-time code editor linked directly to the VFS plus interactive force-directed Akasha memory network visualizers.

API Reference

The server exposes an OpenAI-compatible HTTP service listening on port 3004. Trailing slashes are handled cleanly across all endpoints:

Method	Endpoint	Purpose
`POST`	`/v1/agent/run`	Execute autonomous multi-persona 24-tool agent control loop
`GET`	`/desktop`	AetherOS v5.0 Unified Interactive Operating System Desktop GUI
`POST`	`/v1/duet/run`	Run Duet twin parallel zero-storage collaborative inference
`POST`	`/v1/siren/sync`	Project candidate logic through Nano-SIREN periodic hat
`POST`	`/v1/duet/flush`	Wipes simulated CPU L1/L2 cache ring buffer
`POST`	`/v1/autocoder/run`	Run Offline 1.2B Edge Autocoding & Sandboxed Self-Healing loop
`POST`	`/v1/mcts/speculate`	Launch speculative branching Monte Carlo thought rollouts
`POST`	`/v1/hypnos/sleep`	Execute Hypnos Slumber Protocol for graph node consolidation
`GET`	`/v1/genesis/logs`	Retrieve active 24/7 Genesis autopoietic evolution logs
`POST`	`/v1/genesis/toggle`	Toggle active state of Chronos 24/7 background evolution reactor
`GET`	`/v1/skills`	List all dynamically registered runtime skills
`POST`	`/v1/skills/register`	Author and register new runtime tools (Bash/Python)
`GET`	`/v1/tools`	Enumerate available standard and dynamic tools + JSON schemas
`POST`	`/v1/chat/completions`	OpenAI-compatible 14-stage augmented inference pipeline
`POST`	`/v1/chat/stream`	Server-Sent Events (SSE) streaming chat completions
`GET`	`/health`	Core telemetry HUD counters (Baseline + HCM + CLT + ATD + Duet)
`GET`	`/dashboard`	Monospace tactical ASCII system monitoring dashboard
`GET`	`/metrics`	Prometheus exposition format exposition metrics
`GET`	`/graph/export`	Downloadable semantic memory graph JSON snapshot

Quick Start

# 1. Build optimized release binary
cargo build --release

# 2. Launch AetherOS HTTP middleware (Forwarding to local Ollama / GGUF target)
AETHER_BACKEND=http://localhost:11434/v1 ./target/release/aether-engine

# 3. Open the Live Web OS Desktop GUI
open http://localhost:3004/desktop

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
models		models
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
RELEASE_NOTES.md		RELEASE_NOTES.md
desktop_screenshot.png		desktop_screenshot.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Aether Engine & Operating System (AetherOS)

Software Architecture

24 Active Agent Tools

The 14 Inference Middlewares

Key Operational Subsystems

The Genesis Reactor (`src/genesis.rs`)

Aether Autocoder (`src/autocoder.rs`)

The Masterpiece Flagship Redesign (`GET /desktop`) — "The Claude / GPT / Gemini Killer"

API Reference

Quick Start

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Aether Engine & Operating System (AetherOS)

Software Architecture

24 Active Agent Tools

The 14 Inference Middlewares

Key Operational Subsystems

The Genesis Reactor (src/genesis.rs)

Aether Autocoder (src/autocoder.rs)

The Masterpiece Flagship Redesign (GET /desktop) — "The Claude / GPT / Gemini Killer"

API Reference

Quick Start

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

The Genesis Reactor (`src/genesis.rs`)

Aether Autocoder (`src/autocoder.rs`)

The Masterpiece Flagship Redesign (`GET /desktop`) — "The Claude / GPT / Gemini Killer"

Packages