deep-gvr is a verification-oriented research system for Hermes Agent, Codex local, packaged Codex plugin workflows, Codex automation templates, Codex subagent workflows, Codex SSH/devbox execution, and direct CLI use. It uses a generator-verifier-reviser loop to answer difficult technical questions with explicit analytical, computational, and formal verification.
It is built for people who want more than a polished answer. deep-gvr is designed to show its work: what it claimed, how it checked the claim, what evidence it produced, and where it could not verify enough to be confident.
Most agent systems optimize for fluency. deep-gvr optimizes for adversarial checking.
Its core idea is simple:
- generate a candidate answer
- try to break it with an isolated verifier
- escalate into analysis or formal proof when the claim requires it
- revise, branch, or stop with an explicit failure mode
That makes it useful for research-style questions where correctness matters more than style.
- A structured generator-verifier-reviser workflow
- Three verification tiers:
- Tier 1 analytical review
- Tier 2 OSS-backed computational analysis
- Tier 3 formal verification
- Explicit artifacts: checkpoints, evidence logs, analysis outputs, and proof transport records
- A domain-agnostic adapter architecture with strong support for math, optimization, dynamics, and open-source quantum tooling
- Supported local operator surfaces through Hermes, Codex local, the packaged Codex plugin bundle, a checked-in Codex automation pack, and
uv run deep-gvr - An explicit orchestrator-backend boundary in the runtime, with both
hermesandcodex_localsupported today - A native Codex backend path that runs Generator, Verifier, and Reviser as separate Codex role calls over the same typed loop
/deep-gvr "Explain why the surface code is understood to have a threshold."
Codex local can drive the same runtime from a local checkout:
codex exec -C /path/to/deep-gvr "Use the deep-gvr skill to answer: Explain why the surface code is understood to have a threshold."A successful run typically:
- grounds itself in known literature and domain context
- produces a candidate explanation
- checks the explanation adversarially at Tier 1
- requests Tier 2 or Tier 3 only if the claim actually needs them
- writes evidence and artifacts under
${DEEP_GVR_HOME:-${HERMES_HOME:-~/.hermes}/deep-gvr}/sessions/<session_id>/
deep-gvr is built for Python 3.12 and uv.
Pick one environment sync path first:
uv syncuv sync --all-extrasuv sync --extra analysis
uv sync --extra quantum_ossuv sync is the minimal local path. uv sync --all-extras is the validated
full-portfolio path for the shipped Tier 2 families plus docs/dev tooling. If
you only want a narrower Tier 2 subset, use the targeted extra sync paths
instead of the full-portfolio sync.
Then install the surface or surfaces you actually want.
Hermes slash-command and direct CLI path:
bash scripts/install.sh
uv run python scripts/release_preflight.py --operator --config ${DEEP_GVR_HOME:-${HERMES_HOME:-~/.hermes}/deep-gvr}/config.yaml
uv run deep-gvr run "Explain why the surface code is understood to have a threshold."Codex-local path, with the Hermes surface kept available on the same machine:
bash scripts/install_codex.sh
uv run python scripts/codex_preflight.py --operatorIf you want the Codex-native backend path without installing the Hermes surface as well:
bash scripts/install_codex.sh --skip-hermes-installIf you want a standalone exported local Codex plugin marketplace root as well:
bash scripts/install_codex.sh --plugin-root /tmp/deep-gvr-codex-pluginIf you want the checked-in Codex automation pack exported for review as well:
bash scripts/install_codex.sh --automation-root /tmp/deep-gvr-codex-automationsIf you want the Codex review and visual-QA prompt pack exported for review as well:
bash scripts/install_codex.sh --review-qa-root /tmp/deep-gvr-codex-review-qaIf you want a repo-owned review-evidence bundle for Codex before live review or browser inspection:
uv run python scripts/codex_review_qa_execute.py pull_request_review --output-root /tmp/deep-gvr-codex-review-qa-evidence/review --force
uv run python scripts/codex_review_qa_execute.py public_docs_visual_qa --output-root /tmp/deep-gvr-codex-review-qa-evidence/docs --forceIf you want the Codex subagent prompt pack exported for review as well:
bash scripts/install_codex.sh --subagents-root /tmp/deep-gvr-codex-subagentsIf you want the Codex ssh/devbox remote-operator prompt pack exported for review as well:
bash scripts/install_codex.sh --ssh-devbox-root /tmp/deep-gvr-codex-ssh-devboxIf you want to run the native Codex backend from a remote Codex SSH/devbox session:
uv run python scripts/codex_remote_bootstrap.py --json
uv run python scripts/codex_preflight.py --ssh-devbox --operator
uv run python scripts/codex_ssh_devbox_run.py run "Explain why the surface code is understood to have a threshold."Once installed into Hermes, the same system is available as:
/deep-gvr <question>
/deep-gvr resume <session_id>
For the full operator path, see Quickstart and Release Workflow.
Start here:
- Docs Home
- Start Here
- Codex Local
- Codex Plugin
- Codex Automations
- Codex Review and Visual QA
- Codex Subagents
- Codex SSH Devbox
- Quickstart
- Concepts
- Domain Portfolio
- Examples
- FAQ
Technical reference:
- System Overview
- Backend Parity Matrix
- Tier 2 and Tier 3 Support Matrix
- Architecture and Design
- Release Workflow
Release history:
deep-gvris a verification-oriented research system, not a general-purpose chatbot.- Tier 2 and Tier 3 are selective. They are used when the claim warrants them, not on every run.
- Local operation is the default path. Some optional backends depend on external tools or remote infrastructure.
- The shipped Tier 3 backends are Aristotle, MathCode, and OpenGauss. Aristotle is the submission/poll/resume path; MathCode and OpenGauss are bounded local CLI paths.
- The current shipped support boundary for Tier 2 families and Tier 3 backends is summarized in Tier 2 and Tier 3 Support Matrix.
- Codex local can now act as a real orchestrator backend when
runtime.orchestrator_backend=codex_localis selected, and that backend now runs Generator, Verifier, and Reviser as separate native Codex role calls over the typed loop. - Hermes remains the default backend in the checked-in config template, and the Hermes
/deep-gvrsurface still requires Hermes to be installed. - The repo ships Codex automation templates and export helpers, not direct registration into Codex's live automation runtime state.
- The repo also ships an exportable Codex review/QA prompt pack plus a repo-owned evidence helper for pull-request review and browser-driven docs QA, including from SSH/devbox sessions when the Codex product supports them.
- The repo also ships an exportable Codex subagent prompt pack for safe multi-agent fanout and parallel surface review over the same runtime and git/worktree discipline; that pack complements the native Codex backend and is not the backend itself.
- The repo also ships an explicit Codex
ssh/devboxbundle, readiness path, and runtime-backed remote execution helper over the nativecodex_localbackend. - The repo also ships a rerunnable
scripts/codex_remote_bootstrap.pyhelper that syncs remote config, installs the Codex surface, ensures the evidence directory exists, and then reports the remaining SSH/devbox readiness deltas. - Some advanced Hermes-native capabilities, especially true per-subagent routing and delegated MCP inheritance, still depend on upstream Hermes support.
- Not a fine-tuned model
- Not a single proprietary stack
- Not limited to quantum computing
- Not a system that always claims success
When deep-gvr cannot verify something to the required standard, it is expected to say so.