Skip to content

Bump @types/node from 22.16.5 to 24.10.0#165

Closed
dependabot[bot] wants to merge 1 commit into
mainfrom
dependabot/npm_and_yarn/types/node-24.10.0
Closed

Bump @types/node from 22.16.5 to 24.10.0#165
dependabot[bot] wants to merge 1 commit into
mainfrom
dependabot/npm_and_yarn/types/node-24.10.0

Conversation

@dependabot
Copy link
Copy Markdown
Contributor

@dependabot dependabot Bot commented on behalf of github Nov 10, 2025

Bumps @types/node from 22.16.5 to 24.10.0.

Commits

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [@types/node](https://github.com/DefinitelyTyped/DefinitelyTyped/tree/HEAD/types/node) from 22.16.5 to 24.10.0.
- [Release notes](https://github.com/DefinitelyTyped/DefinitelyTyped/releases)
- [Commits](https://github.com/DefinitelyTyped/DefinitelyTyped/commits/HEAD/types/node)

---
updated-dependencies:
- dependency-name: "@types/node"
  dependency-version: 24.10.0
  dependency-type: direct:development
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
@dependabot dependabot Bot added dependencies Pull requests that update a dependency file javascript Pull requests that update javascript code labels Nov 10, 2025
@joelteply joelteply closed this Nov 20, 2025
@dependabot @github
Copy link
Copy Markdown
Contributor Author

dependabot Bot commented on behalf of github Nov 20, 2025

OK, I won't notify you again about this release, but will get in touch when a new version is available. If you'd rather skip all updates until the next major or minor version, let me know by commenting @dependabot ignore this major version or @dependabot ignore this minor version. You can also ignore all major, minor, or patch releases for a dependency by adding an ignore condition with the desired update_types to your config file.

If you change your mind, just re-open this PR and I'll resolve any conflicts on it.

@dependabot dependabot Bot deleted the dependabot/npm_and_yarn/types/node-24.10.0 branch November 20, 2025 00:29
joelteply added a commit that referenced this pull request Apr 8, 2026
…tillation-first pivot

Three new subsections in §4.1, capturing tonight's negative results across
the dense-model forge branch and proposing the structural reframe that the
empirical work converged on.

§4.1.4.2 — Per-tier Pareto comparison (negative result):
  The aggressive-quantization Pareto test from §4.1.4.1 is now run end-to-end.
  v2-7B forge and unmodified Qwen2.5-Coder-7B base, both quantized to Q5_K_S /
  Q3_K_M / Q2_K via the same llama.cpp toolchain, both evaluated via the
  same patched vLLM-GGUF backend that anchored against Qwen's published
  61.6/53.0 to within +0.6/+0.7 (deterministic across 5+ runs):

    Tier        v2-7B  base   Δ      Winner
    Q5_K_S 5.0G 55.5   63.4   -7.9   base by +14% on quality/vram
    Q3_K_M 3.6G 54.3   59.8   -5.5   base by +10% on quality/vram
    Q2_K   2.9G 42.7   43.3   -0.6   tie within run noise

  Base+quant Pareto-dominates the v2-7B forge at every tier we tested. The
  closest the forge gets to parity is Q2_K (within run noise). By the
  §4.1.4.1 product-relevance criterion, the v2 forge methodology as
  currently constituted does not produce a useful product on the
  Qwen2.5-Coder-7B family.

  Three independent failure modes ruled out as fixes for the residual gap,
  all on the same v2-7B base, all with disciplined cause-of-the-gap
  comparison:
    1. More cycles + more training: WORSE (54.9 → 46.3)
    2. Held-out-aware code calibration: NO IMPROVEMENT (54.9 → 53.7)
    3. Aggressive quantization: NO (base+quant wins at every tier, this section)

  Each was the leading hypothesis when tested. Each was falsified. Three
  negative results in succession on three independent fix candidates is
  strong empirical evidence that the activation-magnitude head-pruning +
  LoRA-recovery approach does not have a Pareto-improving sweet spot for
  dense base models that already have good quantization options,
  regardless of which knob in the strategy space is tuned.

§4.1.4.3 — Product positioning implications:
  Two non-overlapping product positions where the forge has a defensible
  value proposition, and dense Qwen2.5-Coder-7B-class models with good
  Q3/Q2 quantizations are not one of them:

    1. Distillation-first compaction (any base model, dense/MoE/hybrid)
       — see §4.1.5 below
    2. Pre-removable expert pruning + structural compaction (MoE/hybrid/
       oversized targets that base+quant cannot reach at all because the
       base does not fit on the target hardware even at Q2_K)

  The Qwen3.5-35B-A3B (target A) and Qwen3.5-397B-A17B (target B grid
  moonshot) work falls in product position #2. Their value is making
  models reachable that the alternative compaction methods cannot reach,
  not matching a base+quant alternative at the same tier.

  The dense-model forge work is suspended until distillation-first lands.

§4.1.5 — Distillation-first compaction (the next-iteration methodology
proposal):
  The empirical pattern across §4.1.3.1, §4.1.3.2, §4.1.4.1, and §4.1.4.2
  is consistent enough to motivate a structural pivot in the forge's
  primary compaction mechanism, and the substrate work to support that
  pivot has already landed (compensation_lora.py + test_compensation_lora.py
  + COMPENSATION-LORA-DESIGN.md, all committed in a previous PR commit).

  The pivot inverts the v2 dependency: instead of "structured prune +
  LoRA recovery against fine-tuning loss", the new mechanism is "any
  transformation + distill against the unmodified teacher's hidden states".
  Pruning, quantization, modality fusion, context extension all become
  slot-in transformations that the (transform, distill, eval) loop
  recovers from independently. The methodology becomes a search procedure
  over the transformation space, with distillation as the convergence step
  and the per-tier metric as the stopping criterion.

  The smoke test on distilgpt2 passed all five stability checks
  (tokenizer alignment, hidden-state magnitudes within 2× across layers,
  loss decreased monotonically -39.35% relative, per-layer losses balanced
  within 3.14× of median, no NaN/inf). The math is sound at small scale;
  production scale-up to 7B is unblocked by RUNNING the script, not by
  writing more code.

  §4.1.5 results paragraph queued: take the v2-7B artifact from row 5,
  apply compensation LoRA with Qwen2.5-Coder-7B base as teacher and a
  held-out-aware calibration mixture, measure HumanEval through the
  same calibrated pipeline. Success criterion: HumanEval pass@1 ≥ 58.0
  (a 3-point improvement, just outside the calibration tolerance band).
  If at-or-above: distillation-first is empirically validated, dense-model
  forge branch unfreezes, §4.1.4 row 5 gets a successor row. If below:
  follow the failure escalation path documented in COMPENSATION-LORA-DESIGN.md
  to the cross-layer skip path.

  Independence from the moonshot work is explicit: distillation-first is
  the dense-model branch's path forward, MoE/hybrid expert pruning is a
  separate substrate path, the two branches advance in parallel.

Plus durability adds:
- docs/hf-deprecation-notices/qwen2.5-coder-14b-compacted-DEPRECATION.md
  (the user-facing model card replacement for the v1 deprecated artifact)
- docs/papers/NEUROPLASTIC-SUBSTRATE.md
  (the architectural-thesis paper from earlier in the work session)

References:
- sentinel-ai PR #161 (the substrate work that produced these results)
- sentinel-ai#160 / #161 / #163 / #164 / #165 (the issues these subsections
  document the resolution paths for)
- Qwen2.5-Coder Technical Report Table 5 (the anchor for the per-tier
  measurements)
joelteply added a commit that referenced this pull request Apr 8, 2026
* Paper stub: Validated Structured Pruning for Consumer Hardware

Companion to Experiential Plasticity. Documents the layered test harness
for tensor surgery validation. Two real bugs in production pruning code
were caught during harness construction:
1. LoRA-on-pruned-hooks corrupts model on hook removal
2. Config drift after defrag breaks save/load roundtrip

Title, abstract, and section outline. Full paper to follow as Layers 4-6
of the harness are built.

* Paper: bugs table and Layer 4 update

The harness now caught 5 real bugs during construction:
1. LoRA-on-pruned-hooks corruption (FIXED)
2. Config drift after defrag (FIXED)
3. Hybrid attention models (TRACKED)
4. L2 norm importance is unreliable (RESEARCH FINDING)
5. Pruning without retraining is destructive (RESHAPES the experiential
   plasticity narrative — recovery comes from fine-tuning, not pruning)

* Papers: write down 5 findings into both experiential and validated-pruning

Experiential paper:
- New section 9.5: Validation and a Reframing of the Plasticity Story
- Documents LoRA-on-pruned-hooks bug that produced phantom +88% improvements
- Documents L2-norm importance metric finding (anti-correlated with importance)
- Reframes the central claim: recovery comes from fine-tuning, not smart pruning
- Calls for no-prune equal-budget fine-tune baseline as required ablation
- Adds reference to companion paper

Validated pruning paper:
- New section: Findings In Detail (5 findings, each with empirical evidence)
- Each finding has the failure mode, the empirical signature, and the fix
- Frames validation harnesses as required artifacts for pruning papers

* Papers + kash-feedback: gpt2 OUTCOME D, four-metric finding, Layer 6

EXPERIENTIAL-PLASTICITY:
- New §9.5 subsection: gpt2-medium re-run on corrected pipeline
- Reframes §4 transfer function as "early-cycle behavior controller will encounter"
- Documents that the controller's quality-aware stopping criterion makes the
  cycle 9 anomaly structurally unreachable in production
- Adds OUTCOME D framing: closed-loop controller enforces the transfer function
- Adds four-metric comparison subsection with the activation > saliency >> L2 > gradient ranking
- Calls out PLASTICITY-COMPACTION's gradient-magnitude trick as a publication-blocking question

VALIDATED-TENSOR-SURGERY:
- New section: Layer 6, the structural fix that closes the bug class
- Documents the literal 62→7→501 historical bug pattern catch
- New section: The four-metric comparison empirical result
- Hypothesizes why activation alone beats saliency at small calibration
- Recommends activation-magnitude as the default forge pipeline metric

KASH-FEEDBACK.md:
- Appended results from both experiments
- OUTCOME D framing for gpt2 result
- Three observations on the four-metric finding
- Three questions for Kash's read

Refs continuum #841 (gpt2 re-run, partial OUTCOME D)
Refs continuum #842 (Layer 6 invariant, shipped)
Refs continuum #844 (four-metric comparison, surprise result)
Refs sentinel-ai #155 (importance metric finding, now contextualized)

* PLASTICITY-COMPACTION: §4.1.4.2 per-tier negative result + §4.1.5 distillation-first pivot

Three new subsections in §4.1, capturing tonight's negative results across
the dense-model forge branch and proposing the structural reframe that the
empirical work converged on.

§4.1.4.2 — Per-tier Pareto comparison (negative result):
  The aggressive-quantization Pareto test from §4.1.4.1 is now run end-to-end.
  v2-7B forge and unmodified Qwen2.5-Coder-7B base, both quantized to Q5_K_S /
  Q3_K_M / Q2_K via the same llama.cpp toolchain, both evaluated via the
  same patched vLLM-GGUF backend that anchored against Qwen's published
  61.6/53.0 to within +0.6/+0.7 (deterministic across 5+ runs):

    Tier        v2-7B  base   Δ      Winner
    Q5_K_S 5.0G 55.5   63.4   -7.9   base by +14% on quality/vram
    Q3_K_M 3.6G 54.3   59.8   -5.5   base by +10% on quality/vram
    Q2_K   2.9G 42.7   43.3   -0.6   tie within run noise

  Base+quant Pareto-dominates the v2-7B forge at every tier we tested. The
  closest the forge gets to parity is Q2_K (within run noise). By the
  §4.1.4.1 product-relevance criterion, the v2 forge methodology as
  currently constituted does not produce a useful product on the
  Qwen2.5-Coder-7B family.

  Three independent failure modes ruled out as fixes for the residual gap,
  all on the same v2-7B base, all with disciplined cause-of-the-gap
  comparison:
    1. More cycles + more training: WORSE (54.9 → 46.3)
    2. Held-out-aware code calibration: NO IMPROVEMENT (54.9 → 53.7)
    3. Aggressive quantization: NO (base+quant wins at every tier, this section)

  Each was the leading hypothesis when tested. Each was falsified. Three
  negative results in succession on three independent fix candidates is
  strong empirical evidence that the activation-magnitude head-pruning +
  LoRA-recovery approach does not have a Pareto-improving sweet spot for
  dense base models that already have good quantization options,
  regardless of which knob in the strategy space is tuned.

§4.1.4.3 — Product positioning implications:
  Two non-overlapping product positions where the forge has a defensible
  value proposition, and dense Qwen2.5-Coder-7B-class models with good
  Q3/Q2 quantizations are not one of them:

    1. Distillation-first compaction (any base model, dense/MoE/hybrid)
       — see §4.1.5 below
    2. Pre-removable expert pruning + structural compaction (MoE/hybrid/
       oversized targets that base+quant cannot reach at all because the
       base does not fit on the target hardware even at Q2_K)

  The Qwen3.5-35B-A3B (target A) and Qwen3.5-397B-A17B (target B grid
  moonshot) work falls in product position #2. Their value is making
  models reachable that the alternative compaction methods cannot reach,
  not matching a base+quant alternative at the same tier.

  The dense-model forge work is suspended until distillation-first lands.

§4.1.5 — Distillation-first compaction (the next-iteration methodology
proposal):
  The empirical pattern across §4.1.3.1, §4.1.3.2, §4.1.4.1, and §4.1.4.2
  is consistent enough to motivate a structural pivot in the forge's
  primary compaction mechanism, and the substrate work to support that
  pivot has already landed (compensation_lora.py + test_compensation_lora.py
  + COMPENSATION-LORA-DESIGN.md, all committed in a previous PR commit).

  The pivot inverts the v2 dependency: instead of "structured prune +
  LoRA recovery against fine-tuning loss", the new mechanism is "any
  transformation + distill against the unmodified teacher's hidden states".
  Pruning, quantization, modality fusion, context extension all become
  slot-in transformations that the (transform, distill, eval) loop
  recovers from independently. The methodology becomes a search procedure
  over the transformation space, with distillation as the convergence step
  and the per-tier metric as the stopping criterion.

  The smoke test on distilgpt2 passed all five stability checks
  (tokenizer alignment, hidden-state magnitudes within 2× across layers,
  loss decreased monotonically -39.35% relative, per-layer losses balanced
  within 3.14× of median, no NaN/inf). The math is sound at small scale;
  production scale-up to 7B is unblocked by RUNNING the script, not by
  writing more code.

  §4.1.5 results paragraph queued: take the v2-7B artifact from row 5,
  apply compensation LoRA with Qwen2.5-Coder-7B base as teacher and a
  held-out-aware calibration mixture, measure HumanEval through the
  same calibrated pipeline. Success criterion: HumanEval pass@1 ≥ 58.0
  (a 3-point improvement, just outside the calibration tolerance band).
  If at-or-above: distillation-first is empirically validated, dense-model
  forge branch unfreezes, §4.1.4 row 5 gets a successor row. If below:
  follow the failure escalation path documented in COMPENSATION-LORA-DESIGN.md
  to the cross-layer skip path.

  Independence from the moonshot work is explicit: distillation-first is
  the dense-model branch's path forward, MoE/hybrid expert pruning is a
  separate substrate path, the two branches advance in parallel.

Plus durability adds:
- docs/hf-deprecation-notices/qwen2.5-coder-14b-compacted-DEPRECATION.md
  (the user-facing model card replacement for the v1 deprecated artifact)
- docs/papers/NEUROPLASTIC-SUBSTRATE.md
  (the architectural-thesis paper from earlier in the work session)

References:
- sentinel-ai PR #161 (the substrate work that produced these results)
- sentinel-ai#160 / #161 / #163 / #164 / #165 (the issues these subsections
  document the resolution paths for)
- Qwen2.5-Coder Technical Report Table 5 (the anchor for the per-tier
  measurements)

* PLASTICITY-COMPACTION §4.1.3.4 + forge template architecture rule

§4.1.3.4: The importance-metric calibration lesson generalizes across
structural unit (heads → experts).

Empirical anchor: continuum-ai/qwen3-coder-30b-a3b-compacted-19b-256k v1
(alloy hash aa61c4bdf463847c). Hardware-measured 88.4 HumanEval / 86.0
HumanEval+ against unmodified Qwen3-Coder-30B-A3B-Instruct base anchor
at 92.1 / 89.0, both on RTX 5090 + llama.cpp Q5_K_M in the same eval
pipeline. The artifact carries the router-gate-L2-norm baseline (78.7
HumanEval) in priorMetricBaselines[] as the negative-baseline empirical
control that makes the §4.1.3.4 claim falsifiable from the published
artifact alone.

The structural lesson: the metric-calibration pattern from §4.1.3.1
(dense head pruning) recurs at the MoE expert level. Router-gate-L2-norm
is the architectural-only equivalent of activation-magnitude-only head
importance, and replacing it with calibration-aware activation counts
on a held-out code corpus closes +9.7 HumanEval points on the same
prune budget, no fine-tuning, no compensation. ANY importance metric
for ANY prunable unit (heads, experts, layers, future structural units)
must be derived from task-conditioned activation profiling on a held-
out corpus that reflects the artifact's intended workload. The two
data points form a methodology curve, not a single anomaly.

The compensation v2 step (KL distillation on top of the calibration-
aware student to push from 88.4 → projected 90+) is currently blocked
on transformers' caching_allocator_warmup pre-allocating an fp16 buffer
equal to full model size before bnb 4-bit quantization takes effect,
exceeding total VRAM on a single 32 GB GPU even with both teacher and
student nominally 4-bit. The architecturally correct fix is offline
teacher-logit precomputation (phase 1: load teacher alone, dump
logits; phase 2: unload; phase 3: load student alone, train against
on-disk logits). This is the next sentinel-ai sprint and is documented
in §4.1.3.4's "next experimental wave" paragraph.

CLAUDE.md: Forge Template Architecture rule.

The qwen3-coder-30b-a3b-compacted-19b-256k v1 publish required ~6
manual edits to fix paper-speak hallucination, naming conventions,
tag overflow, headline subtitle bugs, and benchmark renderer
fallthrough — every one a manual touch on hand-authored prose. The
architectural target going forward is: all the fields a forge run
needs to populate an alloy MUST live as Continuum entity data inside
a ForgeRecipe entity. The forge takes the recipe entity as input,
runs the prune/quant/eval stages, and emits the populated alloy as
OUTPUT. The forge never consumes a hand-authored alloy; the foundry
generates it. Recipe entity carries the prose fields the model card
renders (description, userSummary, tags, methodologyPaperUrl,
limitations[]) plus the source/stages/calibration/quant tier
configuration. ForgeArtifact entity is the recipe + the eval results.
publish_model.py reads the ForgeArtifact, not a hand-authored alloy
file. This is the next sprint after the offline-logits architecture.
joelteply added a commit that referenced this pull request Jun 3, 2026
…m introspection (task #165)

Replaces service_loop's `recall_recent(8)` (newest-first by admission)
with `recall_scored(now_ms, 8)` (salience × recency-decay, top-N), and
emits a per-engram INFO log at the L2 → prompt seam so the cycle's
recall behavior is observable, not opaque.

The substrate's continual-learning property compounds through this
scoring: salient + protected + recently-used engrams stay near the top;
novel ones get their protection window; everything else drains toward
SALIENCE_FLOOR but doesn't disappear ([[source-drain-is-the-universal-
pattern]] applied at the recall layer). record_recall_hit closes the
use-it-keeps-it feedback loop — without it, scoring is one-way and
memory only ever decays. PR #91 (RecallMetadata sidecar) + #92 (decay
tick) provided the scoring infrastructure; this slice composes them on
the read path.

Per Joel's 2026-06-03 "introspect all rag" directive +
[[observability-is-half-the-architecture]]: every recall now emits one
INFO line per delivered engram (rank, engram_id prefix, salience,
content preview). Optimization can target actual scoring behavior, not
guesses.

Three new admission_state tests pin the contract:
- recall_scored_ranks_by_salience_desc — pinned > uplifted > untouched
- recall_scored_records_recall_hit_on_returned_engrams — Hebbian loop
- recall_scored_respects_limit_and_empty — boundary cases

Also catches up the AnalysisInput test fixtures with the model_override
field added in commit 9c8a991 (4 sites in shared_analysis/mod.rs +
prompt.rs). The production caller (persona/response.rs) was already
updated; only the test scaffolds were behind.

19/19 admission_state tests green on Intel Mac CPU build:
  cargo test --lib --no-default-features \
    --features 'livekit-webrtc llama/mac-cpu-only' admission_state

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dependencies Pull requests that update a dependency file javascript Pull requests that update javascript code size: S

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant