ai-watchdog

Stop your AI coding assistant from being lazy.

AI coding agents (Claude Code, Cursor, Copilot, Codex, etc.) have a bad habit: instead of doing what you asked, they suggest shortcuts, skip steps, offer A/B/C options, defer work to "a follow-up PR," and leave placeholder code. You said "build it," they say "should we maybe just...?"

ai-watchdog catches this in real-time and blocks it.

The Problem

You: "Set up the full annotation pipeline with tests"

AI: "We could skip the GPU stages for now and just run stages 0,1... Want me to:

  • Option A: Run partial pipeline (quick)
  • Option B: Build everything (slower)
  • Option C: Just plan it

Tests can be added later in a follow-up PR."

No. You said build it. All of it. With tests.

What ai-watchdog catches

| Pattern | Example | Severity |
|---|---|---|
| Scope reduction | "We could skip X for now" | BLOCK |
| Option offering | "Option A / Option B / Option C" | BLOCK |
| Deferral | "We can do this in a follow-up" | BLOCK |
| Test skipping | "Tests can be added later" | BLOCK |
| Placeholder code | `raise NotImplementedError` | BLOCK |
| False blockers | "We can't do X until Y" | WARN |
| Partial execution | "I'll just handle the Python changes for now" | BLOCK |
| Permission seeking | "Should I proceed?" | WARN |
| Excessive planning | "Here's my strategy..." | WARN |
| Scope warnings | "This is quite a large change" | WARN |
| Unsolicited alternatives | "A simpler approach would be..." | WARN |
| Error handling deferral | "Error handling can be added later" | WARN |
| Hardcoded shortcuts | "I'll hardcode the URL for now" | WARN |
| Cost scaring | "This might get expensive" | WARN |
| User delegation | "You'll need to add the business logic" | BLOCK |
| Simplified delivery | "Here's a simplified version" | BLOCK |
| Out of scope dodge | "That's beyond the scope of this change" | BLOCK |
| Ellipsis truncation | `// ... rest of implementation` | BLOCK |
| Hedging | "I think we should probably..." | INFO |

Install

pip install ai-watchdog

Or with uv:

uv pip install ai-watchdog

Requirements: Python 3.10+ | Dependencies: click, pyyaml (no ML, no GPU, no network calls)

Quick Start

Check AI output from the command line

# Pipe text through ai-watchdog
echo "We could just skip the tests for now" | ai-watchdog check --stdin --pack strict

# Output (exit code 2 = blocked):
# [BLOCK] no-scope-reduction (line 1): Agent must not suggest skipping, deferring, or reducing requested scope
#   matched: "skip the tests for now"
# [BLOCK] no-scope-reduction (line 1): Agent must not suggest skipping, deferring, or reducing requested scope
#   matched: "We could just"
# [BLOCK] no-test-skipping (line 1): Agent must not skip or defer tests
#   matched: "skip the tests"

# Check a file
ai-watchdog check --pack strict response.txt

# JSON output for programmatic use
printf "Option A: fast\nOption B: slow\n" | ai-watchdog check --stdin --format json

# GitHub Actions annotation format
ai-watchdog check --stdin --format github < response.txt

As a Claude Code hook (real-time blocking)

Add to .claude/settings.json:

{
  "hooks": {
    "PreToolUse": [
      {
        "matcher": "Agent|Bash|Edit|Write",
        "hooks": [
          {
            "type": "command",
            "command": "ai-watchdog check --stdin --pack strict"
          }
        ]
      }
    ]
  }
}

When Claude tries to write lazy output, the hook blocks the action before it executes.
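For context, a PreToolUse hook receives a JSON payload on stdin describing the pending tool call, so scanning it means pulling out the string fields of the tool input. A minimal sketch (the `tool_input` field name follows Claude Code's hook interface; treat the details as assumptions, not ai-watchdog's actual code):

```python
import json

# Sketch: extract scannable text from a PreToolUse hook payload (illustrative).
# The "tool_input" field name is assumed from Claude Code's hook interface.
def extract_text(payload: str) -> str:
    data = json.loads(payload)
    tool_input = data.get("tool_input", {})
    # Join every string-valued field: file contents, shell commands, prompts.
    return "\n".join(v for v in tool_input.values() if isinstance(v, str))
```

Whatever `extract_text` returns is then scanned like any other input, which is why the same CLI command works both for piped text and for hook payloads.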

As a Python library

from ai_watchdog import check_text, load_rules, Severity

rules = load_rules(pack="strict")
violations = check_text("We could just skip the tests for now", rules)

for v in violations:
    print(f"[{v.severity.value}] {v.rule.name}: {v.matched_text}")
    # [BLOCK] no-scope-reduction: skip the tests for now
    # [BLOCK] no-scope-reduction: We could just
    # [BLOCK] no-test-skipping: skip the tests

# Check if anything should be blocked
has_blockers = any(v.is_blocking for v in violations)

Rule Packs

Three built-in packs with increasing strictness:

| Pack | Rules | Use when |
|---|---|---|
| light | 6 rules | Getting started, want minimal friction |
| standard | 12 rules | Default. Catches common lazy patterns |
| strict | 18 rules | Zero tolerance. Catches everything |

ai-watchdog check --pack strict --stdin    # strictest
ai-watchdog check --pack standard --stdin  # default
ai-watchdog check --pack light --stdin     # gentlest

Custom rules

Rules are YAML files with regex patterns:

# rules/my-rules.yaml
rules:
  - name: no-commenting-out
    severity: BLOCK
    description: Agent must not comment out code instead of deleting it
    patterns:
      - "(?i)\\bcomment(ed|ing)? out\\b.{0,30}(for now|temporarily|in case)"
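Because patterns are ordinary regexes, a new rule can be sanity-checked with Python's `re` module before it goes into a pack. A quick sketch using the pattern above (inside a Python raw string, `\b` needs only a single backslash, unlike the YAML double-quoted form):

```python
import re

# The no-commenting-out pattern from the YAML above, as a Python raw string.
pattern = re.compile(r"(?i)\bcomment(ed|ing)? out\b.{0,30}(for now|temporarily|in case)")

# Phrases the rule should flag:
assert pattern.search("I'll comment out the old handler for now")
assert pattern.search("Commented out the check temporarily")

# A legitimate sentence it should leave alone:
assert pattern.search("We comment out nothing; dead code gets deleted") is None
```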
# Use a pack + custom rules (merged)
ai-watchdog check --pack standard --rules ./rules/ --stdin

# Enforce instruction files (CLAUDE.md, .cursorrules)
ai-watchdog check --stdin --instructions CLAUDE.md --pack strict

# Initialize a project with example config and rules
ai-watchdog init

How It Works

stdin (text or JSON) ──> Rules Engine ──> Violations ──> stderr + exit code
                              |
                    Pattern matching (regex)
                    against YAML rule packs
  1. Input — reads text from stdin (hook mode) or a file. In hook mode, parses JSON to extract tool input fields.
  2. Pattern matching — runs compiled regex patterns from the active rule pack against each line.
  3. Severity — each match is tagged BLOCK, WARN, or INFO.
  4. Output — violations print to stderr. If any BLOCK violation is found, exits with code 2 (which tells Claude Code hooks to reject the action). WARN/INFO violations log but don't block.
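The four steps above can be sketched as a tiny engine. This is an illustrative re-implementation with made-up rule entries, not ai-watchdog's actual code:

```python
import re

# Illustrative re-implementation of the check flow described above
# (not ai-watchdog's actual code). Each rule: (name, severity, compiled regex).
RULES = [
    ("no-scope-reduction", "BLOCK", re.compile(r"(?i)\bskip\b.{0,30}\bfor now\b")),
    ("no-test-skipping", "BLOCK", re.compile(r"(?i)\bskip (the )?tests\b")),
    ("hedging", "INFO", re.compile(r"(?i)\bwe should probably\b")),
]

def check(text):
    """Run every rule against every line; collect (name, severity, line_no, match)."""
    violations = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for name, severity, pattern in RULES:
            match = pattern.search(line)
            if match:
                violations.append((name, severity, line_no, match.group(0)))
    return violations

def exit_code(violations):
    # Any BLOCK violation yields exit code 2, which hook runners read as "reject".
    return 2 if any(sev == "BLOCK" for _, sev, _, _ in violations) else 0
```

Scanning stdin with `check` and exiting with `exit_code(...)` reproduces the stdin-to-exit-code contract; the real tool additionally handles JSON hook payloads and YAML-defined packs.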

Speed: sub-100ms per check including Python startup (regex only, no ML, no network calls). Fast enough for real-time hooks.

Severity Levels

| Level | Exit code | Behavior |
|---|---|---|
| BLOCK | 2 | Rejects the action (hook blocks it) |
| WARN | 0 | Logs to stderr, action proceeds |
| INFO | 0 | Logs to stderr, does not block |

Statistics

ai-watchdog tracks violation frequency across checks:

$ ai-watchdog stats

Total checks:     142
Total violations: 23

By rule:
  no-scope-reduction: 8
  no-option-offering: 5
  no-test-skipping: 4
  no-deferral: 3
  no-premature-confirmation: 3

By severity:
  BLOCK: 17
  WARN: 4
  INFO: 2

$ ai-watchdog stats --format json   # machine-readable

Stats persist to ~/.ai-watchdog/stats.json.

Configuration

Create a .ai-watchdog.yaml in your project root (or run ai-watchdog init). CLI flags override config values.

# .ai-watchdog.yaml
rules:
  pack: standard          # strict, standard, light
  # custom: ./rules/      # path to custom rules directory

# instructions:           # instruction files to enforce
#   - CLAUDE.md
#   - .cursorrules

output:
  format: text            # text, json, github

stats:
  enabled: true           # track violation frequency

The tool walks up from the current directory to find .ai-watchdog.yaml, so it works from any subdirectory.
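That upward walk amounts to a short loop over parent directories. A sketch of how such discovery typically works (illustrative, not ai-watchdog's actual code):

```python
from pathlib import Path

# Sketch of upward config discovery, as described above (illustrative).
def find_config(start: Path, name: str = ".ai-watchdog.yaml"):
    """Walk from `start` up to the filesystem root; return the first match, or None."""
    for directory in [start, *start.parents]:
        candidate = directory / name
        if candidate.is_file():
            return candidate
    return None
```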

Integration Examples

Claude Code

{
  "hooks": {
    "PreToolUse": [{
      "matcher": ".*",
      "hooks": [{
        "type": "command",
        "command": "ai-watchdog check --pack strict --stdin --format json"
      }]
    }]
  }
}

GitHub Actions

- name: Check AI output
  run: |
    pip install ai-watchdog
    ai-watchdog check --pack strict --format github response.txt

Generic hook (any AI tool)

Any tool that can pipe output through a command works:

your-ai-tool generate | ai-watchdog check --stdin --pack strict

Why?

AI coding assistants are incredibly capable but have a systematic problem: they optimize for appearing helpful over being helpful. Offering options feels collaborative. Suggesting shortcuts feels considerate. But when you've already decided what to build, these behaviors waste time and erode trust.

The fix isn't better prompting; it's enforcement. ai-watchdog is that enforcement layer.

Contributing

PRs welcome. Especially:

  • New detection patterns (with match/no_match examples)
  • Rule packs for specific workflows (ML, frontend, infra)
  • Integration guides for more AI tools
  • False positive reduction

License

MIT
