feat: revise atlas quickstart by jbarnes850 · Pull Request #114 · Arc-Computer/atlas-sdk

jbarnes850 · 2025-11-01T22:19:41Z

This pull request introduces a new atlas quickstart CLI command that replaces the old examples/quickstart.py script, making it easier for users to run a demonstration of Atlas learning capabilities. It also updates documentation and code to support a new offline mode (ATLAS_OFFLINE_MODE) for mock LLM responses, deprecating the previous ATLAS_FAKE_LLM environment variable. The changes improve usability, provide better error handling, and clarify the workflow for running and testing Atlas. Below are the most important changes grouped by theme:

CLI Improvements and Quickstart Command

Added a new atlas quickstart CLI command that runs a demonstration with 3 security review tasks, visualizes metrics, estimates costs, and supports both online and offline modes. This command is now registered in the CLI parser and replaces the old script. [1] [2] [3] [4] [5]
Updated documentation (README.md, docs/sdk/quickstart.mdx, docs/guides/pypi.md, examples/mcp_tool_learning/README.md) to reference atlas quickstart instead of the deprecated script, and provide step-by-step instructions, expected output, troubleshooting, and feature highlights. [1] [2] [3] [4] [5]

Offline Mode and Environment Variable Updates

Introduced ATLAS_OFFLINE_MODE=1 as the preferred way to run Atlas in mock mode for offline testing, with clear warnings for users still using the legacy ATLAS_FAKE_LLM variable. All code and documentation references now use ATLAS_OFFLINE_MODE. [1] [2] [3] [4] [5]

Usability and Error Handling

Improved error handling and messaging for missing API keys, configuration files, and unavailable storage, with troubleshooting steps and documentation updates. [1] [2]

Deprecation Notices

Deprecated the old examples/quickstart.py script and the ATLAS_FAKE_LLM environment variable, with warnings and documentation updates to guide users to the new workflow. [1] [2] [3] [4] [5]

Documentation and Example Updates

Updated all related documentation, guides, and example references to reflect the new CLI command and offline mode, ensuring consistency and clarity for users. [1] [2] [3] [4] [5]

Fixes: #109 #110

- Created new 'atlas quickstart' CLI command with 3 security review tasks - Implemented metrics visualization table showing learning progression - Added offline mode support (ATLAS_OFFLINE_MODE) for smoke testing - Implemented backward compatibility for ATLAS_FAKE_LLM with deprecation warnings (Issue #110) - Added graceful storage fallback when Postgres unavailable - Enhanced quickstart with cost estimation and better error handling - Added comprehensive test coverage (20+ test cases) - Updated documentation: README.md, docs/sdk/quickstart.mdx, pypi.md - Deprecated examples/quickstart.py in favor of CLI command - Updated Docker entrypoint to use new CLI command Features: - 3 progressive security review tasks demonstrating learning - Metrics table with improvement indicators (↑/↓) - Learning insights generation - Configurable task count (1-3 tasks) - Storage integration with graceful fallback - Offline mode for testing without API calls Closes: Replace legacy quickstart with CLI command implementation

The script has been fully replaced by the 'atlas quickstart' CLI command. All functionality has been migrated to the new command with additional features: - 3 progressive tasks instead of 2 passes - Metrics visualization - Better error handling - Offline mode support

- Fix test_quickstart_command_registered to use --offline instead of --help - Fix test_quickstart_missing_config_file to properly handle SystemExit Both tests were failing due to incorrect test setup: - First test used --help which triggers SystemExit - Second test didn't account for sys.exit() raising SystemExit instead of returning

- Increase text response truncation from 500 to 1500 characters - Add smart JSON structure detection and display: * Shows nested structure with key counts * Displays truncated JSON snippet (first 500 chars) * Better visualization of complex JSON responses - Save run artifacts for each task with full answers - Add note pointing to run artifacts for full answers - Display artifact directory path in completion message This provides better developer experience: - More content visible in CLI (1500 chars vs 500) - JSON responses show structure overview + snippet - Full answers always available in .atlas/runs/ artifacts

- Add documentation comment explaining what's saved in artifacts - Verify playbook entries are displayed via _render_learning_summary - Add helpful note about deeper learning analysis when playbook entries exist - Point users to scripts/eval_learning.py and docs/evaluation/learning_eval.md Playbook entries (the key learning signal) are: - Saved in artifacts: metadata.learning_state.metadata.playbook_entries - Displayed in CLI: Active Playbook Entries section with cue hits/adoptions - Full structure includes: cue, action, scope, provenance, impact metrics This ensures users understand where to find learning data and how to analyze it.

jbarnes850

LGTM

Copilot

Pull Request Overview

This PR replaces the standalone examples/quickstart.py script with a new atlas quickstart CLI command and renames the environment variable ATLAS_FAKE_LLM to ATLAS_OFFLINE_MODE while maintaining backward compatibility. The new quickstart command demonstrates Atlas learning capabilities through 3 progressive security review tasks with improved metrics visualization and user experience.

Key changes:

New atlas quickstart CLI command with offline mode, configurable task count, and storage options
Environment variable renamed from ATLAS_FAKE_LLM to ATLAS_OFFLINE_MODE with deprecation warning for the old name
Comprehensive test coverage for offline mode functionality and CLI command features

Reviewed Changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 13 comments.

Show a summary per file

File	Description
`atlas/cli/quickstart.py`	New CLI command implementation with 3 security tasks, metrics table, and learning insights
`atlas/utils/llm_client.py`	Updated to support both `ATLAS_OFFLINE_MODE` (new) and `ATLAS_FAKE_LLM` (deprecated) with warning
`tests/unit/utils/test_llm_client.py`	New test suite for offline mode variable handling and deprecation warnings
`tests/unit/cli/test_quickstart.py`	Comprehensive tests for quickstart command functionality
`atlas/cli/main.py`	Registered new quickstart subcommand
`docs/sdk/quickstart.mdx`	Updated documentation for new CLI command usage
`examples/quickstart.py`	Removed old script (replaced by CLI command)
`docs/guides/pypi.md`	Updated to reference `ATLAS_OFFLINE_MODE` with backward compatibility note
`docker/entrypoint.sh`	Updated to use new CLI command
`README.md`	Updated quickstart instructions
`examples/mcp_tool_learning/README.md`	Updated reference to new CLI command

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

- Extract _has_playbook_entries() helper function to eliminate code duplication - Add test_format_final_answer_json() - Test JSON structure display - Add test_format_final_answer_text_truncation() - Test text truncation at 1500 chars - Add test_artifact_saving() - Test that artifacts are saved per task - Add test_learning_analysis_note_shown() - Test learning note appears when playbook entries exist - Add test_has_playbook_entries() - Test helper function with various metadata structures Addresses feedback from code review: adds comprehensive test coverage for the smart JSON handling and artifact saving features introduced in commits 8d4fe00 and 72c20f8.

- Remove unused imports (os, sys) from test files - Add comment to empty except clause explaining intent - Fix JSON serialization to use _ensure_jsonable helper - Ensure .env file is loaded early for API keys - Fix cost estimate: reduce from $0.05 to $0.01 per task (actual ~$0.001-0.005)

- Convert _check_storage_available to async function - Convert _ensure_storage to async function - Fixes issue where storage was not detected when called from async context

… persistence - Set learning_key_override in session_metadata for all tasks - Ensures learning state persists across all 3 quickstart tasks - Users can now see learning progression and persistence in action

- Standardize learning config across all example and template configs - Add commented common options for discoverability (history_limit, pruning_config, apply_to_prompts) - Remove redundant defaults (enforce_generality, provisional_acceptance, pruning_config) - all have sensible defaults - Improve configuration documentation for ML engineers: - Add Key Concepts section with clear definitions - Add 'How it works' explanation for learning system - Define all terms before use (capability probe, empirical validation, playbook entries) - Add real-world tuning examples with context - Remove redundancy and improve flow - Add 'Why this matters' context - Improve SQL query explanations - Fix terminology consistency (playbook entries, not pamphlets) - Add schema constraints documentation - All configs validated and passing tests

- Fix ExecutionContext metadata timing bug: capture metadata immediately after arun() completes - Fix empty exception handler: add logging for serialization errors - Fix storage check cleanup: add proper finally block and logging - Fix learning key override precedence: override now takes precedence over existing key - Remove cost estimation functionality from quickstart command Addresses critical issues #1-4 from PR review (#114)

- Extract offline mode check utility to atlas/utils/env.py - Remove all code comments from quickstart.py - Extract magic numbers as constants - Add comprehensive error handling in _format_final_answer() - Standardize type hints (Dict -> dict) in llm_client.py - Add explicit recursion depth guard constant

jbarnes850 added 5 commits November 1, 2025 18:01

jbarnes850 self-assigned this Nov 1, 2025

Copilot AI review requested due to automatic review settings November 1, 2025 22:19

jbarnes850 added the bug Something isn't working label Nov 1, 2025

jbarnes850 added this to Arc Project Board & Issue Tracker Nov 1, 2025

jbarnes850 linked an issue Nov 1, 2025 that may be closed by this pull request

Rename ATLAS_FAKE_LLM toggle to customer-friendly offline mode #110

Closed

jbarnes850 commented Nov 1, 2025

View reviewed changes

Copilot AI reviewed Nov 1, 2025

View reviewed changes

jbarnes850 moved this to In review in Arc Project Board & Issue Tracker Nov 1, 2025

jbarnes850 force-pushed the fix/quickstart branch from a57d6f8 to 48a94f8 Compare November 1, 2025 22:45

jbarnes850 force-pushed the fix/quickstart branch from 48a94f8 to 6a1c5c0 Compare November 1, 2025 22:45

jbarnes850 added 6 commits November 1, 2025 18:47

fix: Improve storage availability check to handle event loop edge cases

a9d818a

fix: Make storage check async to work properly in async context

7ad432f

- Convert _check_storage_available to async function - Convert _ensure_storage to async function - Fixes issue where storage was not detected when called from async context

fix: Use consistent learning_key across quickstart tasks for learning…

0ff127f

… persistence - Set learning_key_override in session_metadata for all tasks - Ensures learning state persists across all 3 quickstart tasks - Users can now see learning progression and persistence in action

jbarnes850 merged commit ef1b47e into main Nov 2, 2025
1 check passed

jbarnes850 deleted the fix/quickstart branch November 2, 2025 01:17

github-project-automation Bot moved this from In review to Done in Arc Project Board & Issue Tracker Nov 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: revise atlas quickstart#114

feat: revise atlas quickstart#114
jbarnes850 merged 13 commits intomainfrom
fix/quickstart

jbarnes850 commented Nov 1, 2025

Uh oh!

jbarnes850 left a comment

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jbarnes850 commented Nov 1, 2025

Uh oh!

jbarnes850 left a comment

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants