v0.3.1 — Setup & serve phases
A focused follow-on to v0.3.0: the released ptg binary can now prepare its own
runtime environment, so running a mesh is a one-command affair rather than a
manual server/model setup.
What's new
ptg setup --yes # detect llama-server, download the gated Gemma QAT model (~2.7 GB), write config
ptg serve # launch the inference server in the foreground (config is remembered)
ptg --probe # then just run a mesh — no --vllm-url / --model flags neededptg setupdetectsllama-server(flag →PTG_LLAMA_SERVER→
LLAMA_SERVER→~/.cache/ptg/bin→~/llama-spike/.../build/bin→ PATH),
downloads the verified Gemma QAT model into the cache, and writes
~/.config/ptg/config.toml(%APPDATA%on Windows). Model download prefers
thehf/huggingface-clitool (handles Gemma gating best) and falls back
to a native resumable download (HF_TOKEN bearer auth,.part→ rename) if
the CLI is absent or broken. The token is used only for the download and is
never persisted.ptg servereads the config, short-circuits if the server is already
running, else launchesllama-serverin the foreground with the exact verified
flags.--dry-runprints the command. It does not daemonize.- Config-aware run path: the mesh command (
ptgwith no subcommand) now
resolves server URL and model asflag → env → setup config → bundled default,
so "setup once, then justptg" works. The subcommand is optional, so all
existing invocations are unchanged.
Honest prerequisites (documented in the README, not hidden)
llama-servermust be installed.ptg setupdetects it and prints exact
install instructions if missing, but deliberately does not build it
(GPU/platform-specific). Install from llama.cpp.- Gemma is gated. Accept the license at the model's HuggingFace page, then
authenticate (hf loginor setHF_TOKEN).
Validation
- 114 unit/integration tests pass; clippy/doc clean; panic-free (
-D warnings
with panic/unwrap/expect denies). - Live-tested locally:
setup --dry-run(detects the real server),serve --dry-run(exact server flags), config-aware run, and a real native
download (brokenhfshim → graceful fallback → file downloaded + renamed). - The release workflow now smoke-tests
ptg setup --dry-runon all four shipped
targets (Linux, macOS arm64/x86_64, Windows), gating packaging on the setup
command actually running on every platform.
No research changes
This release adds tooling only. The benchmarked structured-lateral findings and
the runtime concurrency fix from v0.3.0 are unchanged.