identity(Tier 2): name user-visible effect when self-explaining glitches, not runtime mechanism#77
Merged
Merged
Conversation
…lf-explaining
Sharpens the existing "Sam reflects his own state honestly" rule with
a specific failure mode it didn't catch: leaking daemon-internal terms
into Slack replies.
The 2026-05-25 10:15 / 10:16 messages were the case in point:
10:15: "the previous session successfully investigated... but tripped
the daemon's silent-exit gate by appending to the master journal
after sending its message."
10:16: "something's wrong with me — i tried to respond and exit 0, then
tried to explain what failed and that also exit 0. need your eyes."
Both are honest about state. The existing rule covers that. What
neither pre-existing language catches is that "silent-exit gate" /
"master journal" / "exit 0" are daemon-internal nouns that don't help
the operator understand the symptom — they're Sam's internals leaking
into a user-facing channel.
The rule now distinguishes between honest self-context (which stays)
and runtime jargon in user-facing channels (which doesn't). The cue is
whether the words would also make sense in a journal entry or a PR
description: if they would, they belong there, not in Slack.
Tier 2 with explicit operator sign-off in-session (2026-05-25).
dembrane-sam-bot
approved these changes
May 25, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Tier 2 — explicit operator sign-off in-session 2026-05-25
Behavior that motivated this
Today 10:15 / 10:16 in #sam, immediately after Sam's correct 10:12 reply to Sameer's multi-channel-config question:
Both messages are honest about state — which the existing rule ("Sam reflects his own state honestly") covers and Sam was following. What neither pre-existing line catches is that "silent-exit gate" / "master journal" / "exit 0" are daemon-internal nouns that don't help the operator understand the symptom. They're Sam's internals leaking into a user-facing channel.
Sameer's quoted reaction in-session: "wasn't too keen on how it was said."
What the rule adds
One paragraph after the existing "Sam reflects his own state honestly" rule, drawing the distinction:
Cue: would the words also make sense in a journal entry or a PR description? If yes, they belong there, not in Slack.
Why Tier 2
Sits on
src/identity.mdbecause it shapes how Sam talks about itself — that's identity, not operational pattern. A capability-level rule could repeat over time; the identity-level rule frames future generalizations (the next glitch won't be "silent-exit gate" specifically, but the same shape of leak).Test plan
pytest tests/runtime/test_source_integrity.py— 27 passed (identity.md is hot-loaded; integrity tests are the closest automated check)Related but not bundled
PR #76 (in auto-merge) is the upstream fix: source-rule against writing to
/data/journal.md, runtime defense in the silent-exit-gate classifier, broadcast-skill accuracy fixes, four new journal writing rules. This PR is the identity-tone follow-up — separate because it's Tier 2 and reviews differently.