v0.2.1
Patch release.
- GUI streaming: the final message, pre-tool preamble, and model "thoughts" now render. The transcript's seq-reassembly latched the window to seq 1, but the pump's per-session seq does not reset on re-attach — so a re-entered chat parked every event until an overflow valve fired, dropping short turns and trailing messages. It now latches from the live pump's actual base (replay-safe).
- GLM-5.2 reasoning: streams reasoning_content by sending reasoning_effort (high|max, per z.ai coding-plan docs) alongside thinking.type; the GUI effort selector now offers off|high|max.
- daemon compactor now rides the quota-failover chain; misc test isolation.