v0.2.2

@avifenesh released this 26 Jun 17:57
1583ce6

Patch release — fixes GLM (and any fallback-wrapped reasoning model) showing no thoughts and no effort control.

  • Fallback wrapper now forwards streaming + capabilities. The daemon wraps the primary provider in a quota-freeze fallback chain. That wrapper previously implemented only Complete, so a wrapped GLM was not a Streamer (→ no streamed text/reasoning, no "thoughts") and did not expose EffortSetter/Searcher/FastModer (→ the GUI hid the effort dropdown and SetEffort was a no-op). The wrapper now implements Stream and forwards effort/search/fast to the active provider.
  • Failover is surfaced, never silent. A primary→fallback switch emits a visible note with the cause (" unavailable (…) — falling back to for this turn"). The note is carried per-turn on the context, which keeps it thread-safe across concurrent sessions.
  • Fixed a data race on the fallback freeze timestamp.

Combined with 0.2.1 (GLM reasoning_effort high|max + the seq-reassembly streaming fix), thoughts + the off|high|max effort selector should now appear for GLM-5.2.