Skip to content

itigges22/ATLAS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

366 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ATLAS TUI in action
The ATLAS TUI live, 10× sped up, running the V3 pipeline on a file creation.

A.T.L.A.S.

Adaptive Test-time Learning and Autonomous Specialization

Version License Model-agnostic

简体中文 日本語 한국어

🌎 What is ATLAS?

ATLAS is a local coding agent for open models. It runs on your own hardware and works inside real repositories: reading files, editing code, running commands, and checking the result in an isolated sandbox.

Loading a model locally is only half the problem. Getting a compact open model to stay on task across a real code change is harder. ATLAS puts an agent loop around the model that can plan, generate alternatives, enforce tool calls, run tests, and repair failures. Simple edits take the short path; harder tasks get more compute and verification.

atlas init selects a compatible registry model for the machine, or you can bring another GGUF and build the matching Lens and ASA artifacts locally. Model identity and context sizing are runtime configuration, not baked-in family assumptions. There is no hosted API or per-token bill, and your source code and prompts stay on the machine running ATLAS.

The published 74.6% LiveCodeBench result belongs to the frozen 14B reference build. Formal results for the current registry models are still in progress.


🔥 Latest News

Star History Chart

🧱 What ATLAS Does

  1. atlas-tui - native Bubbletea terminal UI; the canonical chat client (PC-062). Type atlas in any project directory to launch it.

  2. atlas-proxy - Go agent loop that orchestrates the system.

    • Tool-call routing - classifies file operations by complexity tier
    • Grammar enforcement - GBNF schemas keep JSON output valid
    • BiasBusters - four composed mitigations (descriptions, grammar bans, system notes, ASA steering) that push the model toward ast_edit for structural code edits
    • Safety limits - turn caps, token budgets, timeouts
  3. V3 Pipeline - multi-phase code generation; turns a single prompt into a verified candidate.

  4. Geometric Lens - energy-based scoring over the model's own embeddings, no external oracle. (What is a "Geometric Lens"?)

  5. Sandbox - isolated execution for build verification.

    • Multi-language execution: Python, Rust, Go, C, Shell, others
    • Compilation and linting before scoring
    • Runs both generated and existing test suites
  6. llama-server - local LLM inference on one consumer GPU.

    • GPU-accelerated quantized inference (Q6_K / Q4_K_M) — NVIDIA CUDA, AMD ROCm, Apple Metal (macOS hybrid), and Vulkan; Intel SYCL on the roadmap
    • Grammar-constrained decoding at the token level
    • Self-embeddings, so the lens doesn't need a second model

Full documentation (setup, architecture, configuration, troubleshooting, benchmark reports, and the research behind each component) lives in the docs/ directory.


🚀 Get Started

One-shot install:

curl -fsSL https://raw.githubusercontent.com/itigges22/ATLAS/main/scripts/atlas-bootstrap.sh | bash

The script detects your distro (Ubuntu, Debian, RHEL, Fedora, Rocky, Alma) and your GPU vendor (NVIDIA → nvidia-container-toolkit; AMD → ROCm device passthrough), installs the appropriate runtime, downloads the model weights, builds the ASA steering vector, and starts the stack. Expect 10-30 minutes; the model download is the bottleneck.

Then in any project directory, run atlas.

Requirements

GPU 16 GB+ VRAM. NVIDIA (CUDA), AMD (ROCm), or Apple Silicon (Metal, macOS hybrid); Vulkan covers most other GPUs. See SETUP.md § Supported GPUs. To size a specific model to your card, see What fits on my GPU?.
Runtime Docker (NVIDIA: + nvidia-container-toolkit; AMD: standalone Docker is enough) or Podman
Python 3.9+
Disk ~20 GB CUDA / ~22 GB ROCm (model weights + container images)

Apple Silicon runs natively through the macOS hybrid Metal path (native llama-server + Docker for the rest — see SETUP_MACOS.md); Intel Arc (SYCL) is on the roadmap. For the manual install path (Docker Compose, bare-metal, K3s) and the full set of bootstrap flags, see SETUP.md.


⚠️ Known Limitations

  • Linux Docker stack, plus a native macOS path. NVIDIA, AMD ROCm, and Vulkan Docker paths ship today; Apple Silicon runs via the native macOS hybrid Metal path (#32). Intel Arc / SYCL is on the roadmap.
  • Current registry models are not formally benchmarked yet. The canonical 74.6% LiveCodeBench score is from the frozen 14B reference build. New model-specific numbers are tracked in #28. The reference methodology and ablations live in docs/reports/V3_ABLATION_STUDY.md; raw traces are on HuggingFace.
  • Complex feature additions can be inconsistent. Compact models sometimes spend agent turns exploring an unfamiliar codebase before writing code. Reliability has improved through the V3.1.2 agent-reliability pass; fresh model-specific numbers are tracked in #28.
  • Grammar-constrained decoding is slow. Around 51 tok/s on llama-server.

🗺️ Roadmap

V3.1.2 "Maia" - Current release. Broader hardware reach, bring-your-own-model training, and an agent-reliability pass on top of the V3.1.0 base (TUI, one-command install, streaming Lens + ASA).

  • Hardware reach: AMD ROCm via llama.cpp incl. RDNA4 / RX 9070 (gfx1200/gfx1201) (#26); Apple Silicon native macOS hybrid Metal path (#32, see SETUP_MACOS.md); Vulkan universal fallback covering AMD / Intel / Snapdragon / Apple-via-MoltenVK / CPU (#114).
  • Bring-your-own-model: local Lens training pipeline (atlas lens build / retrain, #100) and ASA per-model calibration parity (atlas asa check/build/publish, #113) — train Lens + ASA artifacts for additional GGUFs, with per-model operating thresholds that ship with the lens.
  • In-the-loop lens training: rate passes in the TUI (/good · /bad · /review · /deny) → collected, weighted samples → atlas lens retrain on your own workloads.
  • Agent reliability: tool-result visibility fix, read-dedup, traceback → directed-edit, move_file, pip-install / case-mismatch steers, sandbox shell policy + host-sized cgroup limits.
  • Structural call-graph reasoning (#39 / #125, thanks @yogthos); ARCHITECTURE.md translated to zh-CN / ja / ko (#25).

V3.2 - Next milestone: deeper code reasoning and planning.

  • Architecture-first planning phase — RPG-style plan-then-fill: plan at module scope, then implement at function scope (#120, PR #124).
  • Structural code reasoning (tail) — solver-backed reachability + syntax-agnostic wavelet decomposition for multi-resolution "which files matter" retrieval (#39).
  • Reasoning with sampling — efficiency and quality gains (#9).
  • Deferred infra: automated HuggingFace submission pipeline (#102); ROCm on K3s / Kubernetes; formal registry-model benchmarks — LiveCodeBench, GPQA Diamond, SciCode (#28).

Backlog / help wanted

  • Hardware: ARM64 multi-arch builds (#115), multi-GPU for larger models (#34), Intel oneAPI / SYCL (#27).
  • Tooling: VS Code / JetBrains extension (#35).
  • Sandbox languages: Java / Kotlin (#29), Ruby / PHP (#30).
  • Architecture: model-agnostic platform (#66), SQLite over Redis (#57).

💖 Support ATLAS

ATLAS is built by a single college student in his free time on a single consumer GPU. If the project has been useful to you and you want to help keep it sustainable, please consider sponsoring on GitHub.

Sponsorship directly funds:

  • Compute & hardware — more GPUs for faster benchmark iteration, access to architectures the maintainer can't afford (AMD ROCm, higher VRAM cards, cloud rentals for larger-model experiments).
  • Contributor bounties — meaningful compensation for external contributors who put real time into substantive PRs, so ATLAS can grow faster than a single-person pace allows.
  • Research — continued academic engagement around the architecture, from future workshop and conference submissions to paper writing and collaborations that validate and extend the approach.
  • Community — continued support for the community and platforms ATLAS runs on, including documentation, user-facing channels, and educational content that help ATLAS reach more developers and better serve the ones already using it.

Every sponsor is credited in the release notes of the version they helped fund.


🤝 Contributing

We're building ATLAS in the open and we're actively looking for contributors and core maintainers. Whether you're fixing a bug, adding accelerator support, or rethinking a whole subsystem - there's a place for you here. If you believe open models deserve better infrastructure, come build with us.

Found a bug or hit a wall? Open an issue - you don't need to submit a fix. Bug reports and feedback help just as much as code.

See CONTRIBUTING.md for guidelines.


📄 License

Licensed under the GNU Affero General Public License v3.0 (AGPL-3.0).

About

Adaptive Test-time Learning and Autonomous Specialization

Resources

License

Code of conduct

Contributing

Stars

Watchers

Forks

Sponsor this project

 

Packages

 
 
 

Contributors