cloakFetch — pass Cloudflare-protected URLs through Claude Code 🛡️

English · 中文

External references: CloakBrowser · trafilatura · Claude Code hooks

Two paths for the same idea: when a web fetch is blocked by Cloudflare (or similar bot protection), route the URL through CloakBrowser — a stealth Chromium that passes the JS challenge — and return clean markdown via trafilatura.

Path A: PostToolUse hook — fully automatic on Claude Code. Every blocked WebFetch silently falls back; the agent never sees the failure.
Path B: SKILL.md skill — reactive fallback for any SKILL.md-aware agent (Codex, OpenCode, OpenClaw, SkillsMP). Agent decides to invoke it after seeing a 403/CF pattern.

Why

Claude Code's built-in WebFetch (and curl, and requests, and most HTTP clients) can't pass Cloudflare's JS challenge — so any request to a CF-protected site (science.org, many publishers, lots of news sites) comes back as:

The server returned HTTP 403 Forbidden.

CloakBrowser is a real Chromium with anti-bot patches at the C++ level that does pass those challenges. cloakFetch wires CloakBrowser + trafilatura into Claude Code (and other agents) so the agent never has to tell the user "this page is unfetchable."

Two activation paths

	Path A: Hook	Path B: Skill
Trigger	Automatic — fires on every WebFetch result	Reactive — agent decides after seeing a failed fetch
Agent cognition	Zero — invisible upgrade	Has to notice 403/CF pattern + recall skill
Runtime support	Claude Code only (needs `PostToolUse` hook system)	Any SKILL.md-aware agent: Claude Code, OpenClaw, Codex, OpenCode, SkillsMP
Latency on hit	~25–40 s	~25–40 s
Latency on miss	~milliseconds (regex check, no browser)	None (skill not invoked)
Install	Copy 2 scripts + edit `~/.claude/settings.json`	Drop skill folder into the agent's skills dir
Files	`hooks/cloak_fetch.py` + `hooks/webfetch_cloak_fallback.sh`	`skills/cloak-fetch/SKILL.md` + `cloak_fetch.py` + `cloak_fetch.sh`

Same cloak_fetch.py underneath both — the difference is just how it gets activated.

Repo layout

cloakFetch/
├── hooks/                          # Path A — Claude Code PostToolUse
│   ├── cloak_fetch.py              #   headless CloakBrowser → rendered HTML
│   └── webfetch_cloak_fallback.sh  #   payload matcher + orchestrator
├── skills/cloak-fetch/             # Path B — SKILL.md skill
│   ├── SKILL.md                    #   pushy description + trigger heuristics
│   ├── cloak_fetch.py              #   (same script, env-python shebang)
│   └── cloak_fetch.sh              #   wrapper: locate python, fetch, extract
├── settings.snippet.json           #   PostToolUse JSON block to paste into ~/.claude/settings.json
└── README.md

Path A — Claude Code PostToolUse hook

Architecture

┌───────────────────┐    fails (CF 403)
│  WebFetch (built- │ ─────────────────┐
│  in Claude tool)  │                  │
└───────────────────┘                  ▼
                          ┌──────────────────────────────┐
                          │ webfetch_cloak_fallback.sh   │
                          │ (PostToolUse hook)           │
                          │                              │
                          │ 1. read payload from stdin   │
                          │ 2. regex-match failure       │
                          │ 3. extract tool_input.url    │
                          │ 4. call cloak_fetch.py       │
                          │ 5. emit additionalContext    │
                          └──────────────────────────────┘
                                         │
                                         ▼
                          ┌──────────────────────────────┐
                          │ cloak_fetch.py               │
                          │ (CloakBrowser headless)      │
                          │                              │
                          │ launch → goto → wait for CF  │
                          │ to clear → wait for content  │
                          │ → trafilatura → markdown to  │
                          │ stdout                       │
                          └──────────────────────────────┘

Two independent files so failure-detection regex (bash) and browser logic (Python) evolve separately.

Install (hook)

# 1. Copy the hook scripts into Claude Code's hook directory
mkdir -p ~/.claude/hooks
cp hooks/cloak_fetch.py hooks/webfetch_cloak_fallback.sh ~/.claude/hooks/
chmod +x ~/.claude/hooks/cloak_fetch.py ~/.claude/hooks/webfetch_cloak_fallback.sh

# 2. Tell the hook where your CloakBrowser-enabled Python lives. Add to your
#    shell rc (~/.zshrc, ~/.bashrc, etc.):
#      export CLOAKBROWSER_PYTHON="$HOME/path/to/CloakBrowser/.venv/bin/python"
#    The hook also auto-tries $HOME/github/CloakBrowser/.venv/bin/python and
#    `python3` as fallbacks, so you can skip this if either of those works.

# 3. Register the hook in ~/.claude/settings.json — add the contents of
#    settings.snippet.json as a new entry inside the "PostToolUse" array.
#    Example final shape:

{
  "hooks": {
    "PostToolUse": [
      {
        "matcher": "WebFetch",
        "hooks": [
          {
            "type": "command",
            "command": "$HOME/.claude/hooks/webfetch_cloak_fallback.sh"
          }
        ]
      }
    ]
  }
}

The hook becomes active on the next tool call — no Claude Code restart required.

Tip — symlink instead of copy

If you keep this repo as a git checkout (recommended), symlink the hook files into ~/.claude/hooks/ instead of copying them. Edits in the repo are then live immediately, and git pull updates your hooks:

mkdir -p ~/.claude/hooks
ln -sf "$(pwd)/hooks/cloak_fetch.py"             ~/.claude/hooks/cloak_fetch.py
ln -sf "$(pwd)/hooks/webfetch_cloak_fallback.sh" ~/.claude/hooks/webfetch_cloak_fallback.sh

Tradeoff: if you move or delete the repo checkout, the hook silently stops firing (the script's [ ! -f "$CLOAK_FETCH" ] check trips and exits 0). For a stable checkout under ~/github/, that's a fine bet.

Test (hook)

Simulate the harness by piping a fake failed-WebFetch payload to the hook:

echo '{
  "tool_name": "WebFetch",
  "tool_input": {"url": "https://www.science.org/content/page/information-authors-research-articles", "prompt": "x"},
  "tool_response": "The server returned HTTP 403 Forbidden."
}' | ~/.claude/hooks/webfetch_cloak_fallback.sh

Expected: {"hookSpecificOutput": {"hookEventName": "PostToolUse", "additionalContext": "WebFetch was blocked... <markdown>"}} on stdout, exit code 0.

For a live test, ask Claude inside a Claude Code session to fetch any Cloudflare-protected URL. You should see the WebFetch 403 immediately followed by a PostToolUse:WebFetch hook additional context: ... block in the conversation.

Configuration knobs (hook)

Inside hooks/webfetch_cloak_fallback.sh:

Variable	Default	Purpose
`CLOAK_FETCH`	`$HOME/.claude/hooks/cloak_fetch.py`	Path to the Python fetcher. Override with an env var of the same name.
`CLOAKBROWSER_PYTHON`	`$HOME/github/CloakBrowser/.venv/bin/python` → `python3`	Python interpreter that runs the fetcher. Must have `cloakbrowser` importable.
`FAILURE_REGEX`	`403\|429\|forbidden\|cloudflare\|just a moment\|enable javascript and cookies\|resource was not loaded\|access denied\|blocked\|datadome\|akamai\|please verify you are a human\|incapsula\|pardon our interruption\|kasada\|aws-waf\|sucuri`	Case-insensitive regex against `tool_response`. Covers Cloudflare + the major bot-protection vendors. Widen / narrow to taste.

Path B — SKILL.md skill (for hookless agents)

The hook approach only works on Claude Code. For agents that don't have a PostToolUse system — Codex CLI, OpenCode, OpenClaw, SkillsMP — install cloakFetch as a SKILL.md-format skill. The agent reads the SKILL.md when relevant and invokes the wrapper script after recognising a Cloudflare failure pattern.

Install (skill)

Agent	Install path
Claude Code (global)	`cp -r skills/cloak-fetch ~/.claude/skills/cloak-fetch`
Claude Code (project)	`cp -r skills/cloak-fetch .claude/skills/cloak-fetch`
OpenClaw (global)	`cp -r skills/cloak-fetch ~/.openclaw/skills/cloak-fetch`
OpenClaw (project)	`cp -r skills/cloak-fetch skills/cloak-fetch`
SkillsMP	search for `cloak-fetch` on skillsmp.com

If your CloakBrowser venv isn't at the default path, set the env var (in your shell rc or per-invocation):

export CLOAKBROWSER_PYTHON=/path/to/your/cloakbrowser/.venv/bin/python

Invoke (skill)

The agent runs this single command after a normal fetcher returns a 403/CF pattern:

~/.claude/skills/cloak-fetch/cloak_fetch.sh "<URL>"

The wrapper handles everything: finds a cloakbrowser-importable Python, launches the headless browser, runs trafilatura, prints clean markdown on stdout. Stderr carries progress messages; exit non-zero on any failure.

Test (skill)

~/.claude/skills/cloak-fetch/cloak_fetch.sh "https://www.science.org/content/page/information-authors-research-articles"

Expected: ~20–40 s, then ~25 KB of clean markdown on stdout (page title is "Information for Authors-Research Articles").

For a sanity check on a non-Cloudflare site:

~/.claude/skills/cloak-fetch/cloak_fetch.sh "https://example.com"
# → "This domain is for use in documentation examples..."

Configuration knobs (skill)

Env var	Default	Purpose
`CLOAKBROWSER_PYTHON`	(auto-detect: `~/github/CloakBrowser/.venv/bin/python`, then `python3`)	Python interpreter with `cloakbrowser` importable

Inside skills/cloak-fetch/cloak_fetch.py:

Knob	Default	Purpose
`headless=True`	`True`	Flip to `False` to see the browser window for debugging
Selector wait list	`main, article, .article__body, .core-container, .pb-page-body`	Selectors that signal SPA content has rendered. Extend if a target site needs something more specific.
`time.sleep(2)` settle	2 s	Extra wait for late-loading JS.

Prerequisites (both paths)

CloakBrowser installed with the cloakbrowser Python package importable (a venv with pip install cloakbrowser works)
trafilatura installed into the same Python env (pip install trafilatura) — used for HTML → markdown extraction
Path A only: jq (for parsing the hook payload)

Behaviour & safety

Fail-closed. Both paths leave the original failure intact if something inside cloakFetch breaks (no Python with cloakbrowser, network down, CloakBrowser can't pass the challenge). The agent is never tricked into thinking a fetch succeeded when it didn't.
Silent on the happy path. The hook does nothing when the regex doesn't match; the skill is simply not invoked when there's no failure to recover from.
Cost. A triggered fallback runs a real browser — ~20–40 s wall clock, non-trivial memory. The hook's regex check on the happy path costs ~milliseconds.
Trust boundary. Both paths act only on URLs the agent already chose to send to its fetch tool. They do not introduce a new way for the agent to reach the internet — same URL surface, just a more capable backend.

Limitations

Cloudflare's hardest challenges (interactive Turnstile, etc.) may still defeat headless mode — flip headless=False in cloak_fetch.py if you need full CF coverage.
The hook reads tool_response as a string for regex matching. If a future Claude Code version changes the payload shape, the matcher's jq selector needs updating.
additionalContext size is bounded by Claude Code's hook output handling — very large pages are persisted to disk and only previewed inline (the persisted file path is shown so the agent can Read it).
The skill is reactive: it works only when the agent recognises the failure and recalls the skill. If the agent gives up after the first 403 without trying again, the skill doesn't help. The SKILL.md description is intentionally pushy to combat this — review and tweak if your agent under-triggers.

💬 Community

Discord: https://discord.gg/79JF5Atuk
WeChat: scan the QR code below

❤️ Support

If cloakFetch saves you from one more "HTTP 403 Forbidden", consider supporting the author:

WeChat Pay

Alipay

Buy Me a Coffee

Give a Reward

👤 Author

Agents365-ai

GitHub: https://github.com/Agents365-ai
Bilibili: https://space.bilibili.com/441831884

📄 License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

cloakFetch — pass Cloudflare-protected URLs through Claude Code 🛡️

Why

Two activation paths

Repo layout

Path A — Claude Code PostToolUse hook

Architecture

Install (hook)

Tip — symlink instead of copy

Test (hook)

Configuration knobs (hook)

Path B — SKILL.md skill (for hookless agents)

Install (skill)

Invoke (skill)

Test (skill)

Configuration knobs (skill)

Prerequisites (both paths)

Behaviour & safety

Limitations

💬 Community

❤️ Support

👤 Author

📄 License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
hooks		hooks
skills/cloak-fetch		skills/cloak-fetch
LICENSE		LICENSE
README.md		README.md
README_CN.md		README_CN.md
settings.snippet.json		settings.snippet.json

Folders and files

Latest commit

History

Repository files navigation

cloakFetch — pass Cloudflare-protected URLs through Claude Code 🛡️

Why

Two activation paths

Repo layout

Path A — Claude Code PostToolUse hook

Architecture

Install (hook)

Tip — symlink instead of copy

Test (hook)

Configuration knobs (hook)

Path B — SKILL.md skill (for hookless agents)

Install (skill)

Invoke (skill)

Test (skill)

Configuration knobs (skill)

Prerequisites (both paths)

Behaviour & safety

Limitations

💬 Community

❤️ Support

👤 Author

📄 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages