Review SCM files and security updates by hyperpolymath · Pull Request #1 · hyperpolymath/hypatia

hyperpolymath · 2025-12-18T08:44:57Z

No description provided.

…r pinning - Add SSH known_hosts verification for GitLab, Codeberg, and Bitbucket to prevent MITM attacks during git push operations - Pin runner version to ubuntu-22.04 for reproducible builds - Add concurrency controls to prevent race conditions on parallel pushes

- GitLab: Use hardcoded official SSH host keys (ed25519, rsa, ecdsa) - Codeberg: Use ssh-keyscan with fingerprint verification against official SHA256:mIlxA9k46MmM6qdJOdMnAQpzGxF4WIVVL+fj+wZbw0g - Bitbucket: Use hardcoded official SSH host keys from /site/ssh This eliminates the MITM attack window that existed when fetching keys at runtime without verification. Sources: - https://docs.gitlab.com/ee/user/gitlab_com/#ssh-host-keys-fingerprints - https://docs.codeberg.org/security/ssh-fingerprint/ - https://bitbucket.org/site/ssh

DOGFOODING FIX #1: Hypatia fixes itself! BEFORE: - 8 HIGH severity unwrap() calls on Regex::new() - Would panic at runtime if regex compilation failed - No error messages on failure AFTER: - 8 MEDIUM severity expect() calls with descriptive messages - Static Lazy<Regex> compiled at startup - Clear error messages if compilation fails - Proper expect() messages explain WHY failure is a bug CHANGES: - Added once_cell dependency for Lazy statics - Converted Scanner struct fields to static REGEX constants - Replaced unwrap() with expect() + descriptive messages - Removed unused struct fields (Scanner is now zero-sized) SEVERITY REDUCTION: - HIGH → MEDIUM (8 findings downgraded) - Pattern: unwrap_without_check → expect_without_check - This is acceptable: expect() with messages is Rust best practice LEARNING ENGINE UPDATE: - Pattern observed: unwrap_on_regex_compilation - Fix applied: Use Lazy<Regex> with expect() - Outcome: SUCCESS (compiles, tests pass, severity reduced) This proves the dogfooding loop works: 1. ✅ Scanner found real issues in its own code 2. ✅ Applied recommended fix (static Lazy patterns) 3. ✅ Verified fix works (cargo check passes) 4. ✅ Re-scanned to confirm improvement Next: Fix CLI unwraps, then test/adapter code. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

…188) Promotes the benchmark harness at `fixer/benches/hypatia_bench.rs` from "scaffolded" to "partial" per the Six Sigma classification in TESTING-AUDIT.md: * Runs criterion on every PR touching `fixer/**` or the bench infrastructure itself (and on main). * Compares results against a committed baseline in `.machine_readable/benchmarks/baselines.json`. * Fails CI if any benchmark regresses by more than the configured threshold (default 50% per the Acceptable / Unacceptable boundary in TESTING-TAXONOMY.adoc). ## Files * `.github/workflows/bench.yml` — criterion run + gate + artefact upload. Supports `workflow_dispatch` with a `regenerate-baseline` mode that produces an artefact for human review (no auto-commit). * `.machine_readable/benchmarks/baselines.json` — seed file with empty `baselines` map. Until populated the gate runs in "advisory mode" (logs current numbers, passes). * `.machine_readable/benchmarks/README.md` — full convention documentation: schema, regeneration flow, what is and isn't in scope. * `scripts/check-bench-regression.py` — compares current bench output against baselines, writes Markdown summary for `$GITHUB_STEP_SUMMARY`, exits 1 on regression. * `scripts/update-bench-baselines.py` — parses bencher-format output and rewrites baselines.json preserving metadata keys. ## Design choices * **Quick mode:** `--warm-up-time 1 --measurement-time 2 --sample-size 10` caps total runtime at ~2 minutes. Good enough to catch >20% regressions. * **Bencher text output, not JSON:** criterion's JSON lives in `target/criterion/<name>/new/estimates.json` and needs a walker. The bencher line format is stable and Python-friendly. * **Baselines in git, not cache:** cache-based baselines are brittle under branch isolation and eviction. Committed JSON survives reviews and history. * **No auto-commit of regenerated baselines:** baseline updates are meaningful and deserve human review. The regenerate mode emits an artefact; a person downloads and commits. ## Local usage cargo bench --bench hypatia_bench -- \ --warm-up-time 1 --measurement-time 2 --sample-size 10 \ --output-format bencher > /tmp/bench.txt python3 scripts/check-bench-regression.py \ /tmp/bench.txt \ .machine_readable/benchmarks/baselines.json ## What this does NOT do * Does not yet benchmark startup, build time, energy, FFI overhead — those remain in the punch list. * Does not benchmark the Elixir engine. Elixir benching via `Benchee` is a separate follow-up. Scripts validated end-to-end with synthetic bencher output: * advisory mode (empty baselines) → exit 0, clean summary. * 1.5% change → ✅ pass. * 35% change → ⚠️ warning, still pass. * 147% change → ❌ fail, exit 1, `::error::` annotation emitted. Co-authored-by: Claude <noreply@anthropic.com>

…ion (#265) (#271) Refs #260 / #263 / #265 — **do not auto-close**. Follow-up increment to merged #264 (which explicitly deferred this). ### Problem (natsci-studio live calibration) `BranchProtectionID` / `CodeReviewID` were `:open_escalate` — correct (never silently dropped) but suboptimal: they are *repository-configuration-actionable* via the GitHub settings API, not code fixes and not non-actionable. natsci-studio dry-run: alerts #1 BranchProtection / #3 CodeReview should auto-remediate under full-auto. ### Change Third, narrow action class between `:fix` (code) and `:open_escalate`: - `classify/2`: `BranchProtectionID`/`CodeReviewID` → `:fix_settings`. - `reconcile/3`: new `:policy` opt (default `:full_auto`); `do_fix_settings/3` is an idempotent branch-protection PUT requiring ≥1 PR review on the default branch (satisfies both checks). `:conservative` escalates instead. - `mix hypatia.reconcile … --conservative`; summary gains `settings_remediated`/`settings_actionable`. ### Verification `mix compile` clean. Full suite **809/809** (+3 new tests). 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>

claude added 2 commits December 17, 2025 23:34

hyperpolymath merged commit c8fc984 into main Dec 18, 2025

hyperpolymath deleted the claude/review-scm-security-t3eCl branch December 18, 2025 08:57

This was referenced May 16, 2026

Reconciler refinement: add :fix_settings action class (BranchProtection/CodeReview) #265

Closed

feat(reconciler): :fix_settings action class — natsci-studio calibration (#265) #271

Merged

hyperpolymath mentioned this pull request May 18, 2026

fix(governance)!: banned_language ban is total — no exceptions #280

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Review SCM files and security updates#1

Review SCM files and security updates#1
hyperpolymath merged 2 commits into
mainfrom
claude/review-scm-security-t3eCl

hyperpolymath commented Dec 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

hyperpolymath commented Dec 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants