A rail gun for AI.
ARAIL is a blueprint, not a product. Clone it, pick a tier, and you have a local AI research bench with real visibility into what it's doing: a RAG-backed Chat tab that can run on your machine or connect to any cloud vendor, an agent-driven knowledge base, and a loop that runs experiments while you sleep.
ARAIL evolves with Autoresearch from Karpathy: draft, review, and approve a plan to begin experimenting and researching.

A learn-by-doing AI research lab for friends, family, and the curious. The default name is Autoresearch AI Lab. Rename it to whatever you want in one line of `.env` — "Sam's AI Lab", "gentoofoo's ai lab", "PeanutLab". It's your lab.
Inference from an AI Engineer built by Nucleus

New here? *The lab, end-to-end* is the 12-minute runbook tour — what ARAIL is, every surface, and how to set it up.
```sh
git clone https://github.com/qukaizen/arail.git
cd arail
./arailctl setup   # pick a tier, install deps, download a starter model
./arailctl start   # open http://127.0.0.1:8080
```

Three keystrokes: `./arailctl setup`, `./arailctl start`, and a browser. (If you prefer a shorter alias, `./qkz` is symlinked to `./arailctl`.)
Two tiers. Pick one; upgrade later.
| Tier | What you get | Good for |
|---|---|---|
| minamalist | Dashboard · Chat · Autoresearch · Knowledge Base · Agents · LanceDB vectors · AeroLLM 70B (Llama-3.1-70B) | The everyday lab. Real models on small hardware. |
| maximum | + Admin · Docs · Notebooks · AirLLM 405B (Llama-3.1-405B) · Anthropic SDK · LangChain · full cloud SDKs | Frontier-scale local inference, full bench. |
Upgrade any time:

```sh
./arailctl upgrade maximum
```

Knowledge Base and Agents are part of minamalist on purpose — research needs memory to work. Both tiers ship with the embedded LanceDB-backed KB; maximum adds the heavier operator surfaces and orchestration extras.
For the list of models that have been validated against ARAIL's correctness harness — what's Certified, Compatible, Beta, or has a Known Issue — see docs/CERTIFIED_MODELS.md.
External providers (Claude, NVIDIA NIM, OpenRouter, HuggingFace) are
reachable in both tiers via plain HTTP — maximum just adds the official SDKs
and LangChain/LangGraph for heavier orchestration. Airgapped mode is the
default: agents cannot collect information from the public internet.
Local services on this machine and your private network (loopback,
RFC1918, link-local) stay reachable so a LAN GPU box keeps working. Flip
LAB_MODE=hybrid in .env to allow agent fetches to cloud vendors.
See docs/INSTALL.md for the long walkthrough.
The lab's home screen. Shows the current mission (set from Autoresearch), what the agents are doing, the simulated cloud cost of today's work, and an activity stream so nothing happens out of view. Mission text and breakdown update live when a goal is set or changed elsewhere.
The Chat tab is built for crappy machines and beefy ones alike. A Compute Source row at the top lets you choose in one click:
- My Machine — runs the model locally (MLX on Apple Silicon, CUDA on NVIDIA GPUs, llama.cpp on CPU-only boxes). No key needed.
- Claude — Anthropic's API.
- NVIDIA NIM — free credits at build.nvidia.com.
- OpenRouter — paid catalogue of hosted open-weight models.
- HuggingFace — the HF Inference API.
- Custom endpoint — any OpenAI-compatible URL (LM Studio, Ollama, your own vLLM server).
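As a sketch of what "any OpenAI-compatible URL" means in practice, the snippet below builds and sends a standard `/v1/chat/completions` request with only the standard library. The base URL, port, and model name are illustrative assumptions, not ARAIL defaults:

```python
import json
import urllib.request

def build_chat_request(base_url, model, prompt, api_key=None):
    """Assemble a request for any OpenAI-compatible /v1/chat/completions endpoint."""
    url = f"{base_url.rstrip('/')}/v1/chat/completions"
    headers = {"Content-Type": "application/json"}
    if api_key:  # local servers like LM Studio or Ollama usually need no key
        headers["Authorization"] = f"Bearer {api_key}"
    body = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(url, data=json.dumps(body).encode(), headers=headers)

def chat(base_url, model, prompt, api_key=None):
    """Send the request and return the assistant's reply text."""
    with urllib.request.urlopen(build_chat_request(base_url, model, prompt, api_key)) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]

# Against a hypothetical local server:
# chat("http://127.0.0.1:1234", "llama-3.1-8b-instruct", "hello")
```

The same function works unchanged against a cloud vendor's OpenAI-compatible endpoint once a key is saved — which is all "reachable via plain HTTP" requires.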
The ⚙ Manage providers button opens a modal where you save each key,
test it (the lab pings the vendor's /models endpoint), list the
available models, or remove the token. Tokens persist to
lab/data/secrets.env with chmod 0600 and are git-ignored. Tokens are
never echoed back to the UI after saving and never printed to logs.
Airgapped guard. By default LAB_MODE=airgapped. Agent-originated
outbound calls through requests and urllib are denied unless the
destination resolves to loopback, RFC1918, or link-local. Denials raise
EgressBlocked and append one line to lab/data/egress.jsonl for audit.
The Compute Source row shows a banner, cloud radios grey out, and the
save/test/models endpoints refuse. Click the Airgapped badge in the
nav to see the operational definition and the most recent blocks.
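A minimal sketch of that allow-list check using only the standard library. The real guard in ARAIL also hooks `requests`/`urllib` and appends the audit line; the function name here is illustrative:

```python
import ipaddress
import socket

def airgap_allows(host: str) -> bool:
    """True only if every address the host resolves to is loopback,
    RFC1918-private, or link-local — the airgapped allow-list."""
    try:
        infos = socket.getaddrinfo(host, None)
    except socket.gaierror:
        return False  # unresolvable destinations are denied
    for *_rest, sockaddr in infos:
        ip = ipaddress.ip_address(sockaddr[0])
        if not (ip.is_loopback or ip.is_private or ip.is_link_local):
            return False
    return True
```

Note the deny-on-failure default: a host that doesn't resolve, or resolves to even one public address, is blocked.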
Toggling LAB_MODE from the UI. When the lab is bound to loopback
(the default, BIND_ADDR=127.0.0.1), the Network Policy modal shows a
toggle button. Click it, read the confirmation copy, wait 3 seconds for
the confirm button to enable, then click Confirm. The portal rewrites
.env atomically and updates the running process immediately — no
restart needed. Each toggle appends one line to
lab/data/airgap_audit.jsonl (chmod 0600). The toggle is disabled when
the portal is bound to a non-loopback address (BIND_ADDR=0.0.0.0, for
example) — in that case the modal shows a static note to edit .env
directly, because a UI toggle on a LAN-exposed portal would be a
CSRF attack surface.
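"Rewrites `.env` atomically" is the classic write-temp-then-rename pattern; a hedged sketch of it (not ARAIL's actual code — the function name and behavior details are assumptions):

```python
import os
import tempfile

def set_env_key(path: str, key: str, value: str) -> None:
    """Replace (or append) KEY=value in a .env file via temp file + rename,
    so a crash mid-write never leaves a half-written file behind."""
    with open(path) as f:
        lines = f.readlines()
    out, found = [], False
    for line in lines:
        if line.split("=", 1)[0].strip() == key:
            out.append(f"{key}={value}\n")
            found = True
        else:
            out.append(line)
    if not found:
        out.append(f"{key}={value}\n")
    # Temp file in the same directory → same filesystem → rename is atomic.
    fd, tmp = tempfile.mkstemp(dir=os.path.dirname(os.path.abspath(path)))
    with os.fdopen(fd, "w") as f:
        f.writelines(out)
    os.replace(tmp, path)  # atomic on POSIX; readers see old or new, never partial
```

This is why no restart is needed: the running process can re-read a file that is always in a consistent state.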
This is where you set the goal. Tell the lab a measurable goal — "make our chat responses land under 400 ms on my MacBook" — and it runs an experiment loop: propose a change, measure, compare against the baseline, write up what it learned. The goal is lab-wide: once set here, the Dashboard mirrors it live. You watch from the dashboard and steer from the Autoresearch cockpit when the direction drifts.
The lab's long-term memory. Agents ingest papers, notes, web pages, and
your uploaded PDFs into a LanceDB vector index. Searchable from Chat and
Autoresearch — the answers you get start referring back to your own
materials. Ingest with ./arailctl pkb ingest <file>.
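Under the hood, "searchable" means nearest-neighbour lookup over embedding vectors. LanceDB does this at scale; the toy sketch below shows the same idea with plain cosine similarity (the embeddings are made-up numbers, not real model output):

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def top_k(query_vec, index, k=2):
    """index: list of (doc_id, embedding). Return the k closest doc ids."""
    scored = sorted(index, key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [doc_id for doc_id, _ in scored[:k]]

index = [("paper.pdf", [0.9, 0.1, 0.0]),
         ("notes.md",  [0.1, 0.9, 0.1]),
         ("webpage",   [0.0, 0.2, 0.9])]
top_k([1.0, 0.0, 0.1], index, k=1)  # the doc whose embedding is nearest the query
```

Ingestion is the other half: each file is chunked, embedded, and stored as rows like the tuples above, which is what `./arailctl pkb ingest <file>` does for you.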
Three built-in personalities:
- Buddy — your lab partner. Observes, nudges, writes warm two-line summaries.
- SRE — the crash watcher. Surfaces recurring errors so you don't miss a broken loop.
- Researcher — the engine behind Autoresearch. Reads goals, writes code, runs experiments, commits the winners.
Use the Agent Forge, or add lab/pkb/agents/<id>/AGENT.md plus
lab/pkb/agents/<id>/<id>.py; the loader discovers them on start.
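The loader contract above — drop a directory in place, it gets discovered on start — can be sketched with `importlib`. The directory layout is from the text; the function name and return shape are assumptions:

```python
import importlib.util
from pathlib import Path

def discover_agents(agents_root):
    """Find every <agents_root>/<id>/<id>.py and import it as a module,
    mirroring the lab/pkb/agents/<id>/<id>.py layout."""
    agents = {}
    for agent_dir in sorted(Path(agents_root).iterdir()):
        entry = agent_dir / f"{agent_dir.name}.py"
        if agent_dir.is_dir() and entry.exists():
            spec = importlib.util.spec_from_file_location(agent_dir.name, entry)
            module = importlib.util.module_from_spec(spec)
            spec.loader.exec_module(module)
            agents[agent_dir.name] = module
    return agents
```

A new agent is therefore just a folder whose entry file matches its id — no registration step, no config edit.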
System health, plugin manager, tier status, diagnostics. It's where the lab admits what's broken.
Curated operator docs rendered inside the lab. Use it for setup notes, agent architecture, design specs, and the platform-porting manifest without leaving the local UI.
By default LAB_MODE=airgapped — agents in the lab cannot collect
information from the public internet. Calls to loopback and your private
network still work, so a LAN GPU box (Ollama, vLLM, an aerollm node)
keeps inferring without changes. Cloud-provider APIs are blocked at the
HTTP layer. The dashboard's Airgapped badge is clickable — it shows
what is and isn't enforced, the recent blocks, and the known gaps
(httpx, raw sockets, subprocess curl) the Python-level guard
doesn't cover. The threat model is well-meaning agent code, not an
adversary on this host — for that, run a host firewall.
Flip to hybrid and cloud vendors become fallbacks when the local model
isn't enough. The Chat tab makes this explicit: you see which provider is
active, and you can pivot back with a click.
Edit `.env`:

```sh
LAB_NAME="Sam's AI Lab"
LAB_TAGLINE="Our family AI bench"
```

Restart, and every banner, nav logo, activity line, and wiki landing page now says "Sam's AI Lab". The Python package stays named `arail` internally so imports don't break — only the display rebrands.
- docs/INSTALL.md — tier choice, platform setup, upgrades.
- docs/PUBLISH.md — publishing your lab to the public internet (nginx/Caddy, Cloudflare Access, hardening checklist).
- docs/MACOS.md — Apple Silicon specifics (MLX).
- docs/LINUX.md — Linux + CUDA.
- docs/WSL.md — Windows via WSL2 with GPU passthrough.
- docs/PRIVACY.md — exactly what data leaves the box.
- docs/TROUBLESHOOTING.md — first-run gotchas.
- docs/agents-explained.md — the quick tour of Buddy, SRE, Researcher, and custom agents.
- docs/agents.md — the agent architecture and loader contract.
- AGENTS.md — the platform-porting manifest for coding agents.
- BLUEPRINTS.md — how this repo thinks of itself as a blueprint.
- CONTRIBUTING.md — contribute back upstream.
- SECURITY.md — threat model, how to report issues.
ARAIL exists because AI research shouldn't feel like a closed club. If you have a computer and curiosity, you can run your own experiments, curate your own knowledge, and talk to real frontier models — on terms you understand, with code you can read.
Every tab teaches something. Every loop you run leaves notes behind. That's the lab.
— start with ./arailctl setup.
MIT. See LICENSE.