Strip `data-*` attributes in `stripDangerousAttributes` by Copilot · Pull Request #31714 · github/gh-aw

Copilot · 2026-05-12T14:47:57Z

Bug Fix

What was the bug?

data-* attributes on GFM-allowed tags (e.g. <span data-x="...">) silently passed through stripDangerousAttributes unchanged. They produce zero visible output in rendered Markdown, so adversarial content embedded in them is undetectable by human reviewers but delivered verbatim to the agent — a structurally equivalent hidden channel to HTML comments and zero-width-space splits, both of which are already neutralized.

How did you fix it?

sanitize_content_core.cjs — Extended the stripDangerousAttributes regex to include data-[a-z0-9_-]+ alongside the existing on\w+ and style clauses. The existing /gi flag handles case-insensitivity, making the A-Z range in the character class redundant. Updated JSDoc and call-site comment to document the rationale.

// Before
/[\s/]+(?:on\w+|style)(?:\s*=\s*(?:"[^"]*"|'[^']*'|[^\s>"'`]*))?/gi

// After
/[\s/]+(?:on\w+|style|data-[a-z0-9_-]+)(?:\s*=\s*(?:"[^"]*"|'[^']*'|[^\s>"'`]*))?/gi

Testing

sanitize_content.test.cjs — Added 8 regression tests covering: double-quoted, single-quoted, unquoted, and bare (valueless) data-* attributes; multiple data-* attributes on one tag; data-* combined with on*/style; data-* alongside preserved safe attributes; and case-insensitive matching (DATA-X).

Co-authored-by: szabta89 <1330202+szabta89@users.noreply.github.com>

data-* attributes on GFM-allowed tags (e.g. <span data-x="...">) are invisible in rendered GitHub Markdown but passed through the sanitizer verbatim, making them a hidden injection channel. Add data-[a-z0-9_-]+ (with /gi flag for case-insensitivity) to the stripDangerousAttributes regex alongside the existing on* and style clauses. Also update JSDoc and inline comments to document the rationale, and add 8 regression tests. Co-authored-by: szabta89 <1330202+szabta89@users.noreply.github.com>

Copilot

Pull request overview

Fixes a hidden prompt-injection channel by removing data-* HTML attributes from otherwise-allowed GFM tags during sanitization, and adds regression tests to prevent reintroduction.

Changes:

Extend stripDangerousAttributes to strip data-* attributes in addition to on* and style.
Add regression tests for data-* stripping across quoting forms, valueless attributes, and mixed safe/dangerous attributes.
Update Dependabot workflows to derive the Docker socket path from DOCKER_HOST when building the MCP gateway docker run command.

Show a summary per file

File	Description
actions/setup/js/sanitize_content_core.cjs	Extends dangerous-attribute stripping logic to remove `data-*` attributes and updates inline documentation/comments.
actions/setup/js/sanitize_content.test.cjs	Adds regression tests ensuring `data-*` attributes are removed from allowed tags while preserving safe attributes.
.github/workflows/dependabot-worker.lock.yml	Uses `DOCKER_HOST` (when it’s a unix socket/path) to select the socket file mounted into the MCP gateway container.
.github/workflows/dependabot-campaign.lock.yml	Same `DOCKER_HOST`-aware socket-path logic for the campaign workflow’s MCP gateway container.

Copilot's findings

Tip

Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comments suppressed due to low confidence (1)

actions/setup/js/sanitize_content_core.cjs:689

This note says the pattern intentionally uses \s+ to require whitespace before attribute names, but the actual regex uses [\s/]+, which can also match a bare / (as described a few lines below). To avoid confusion for future maintainers, consider updating this note to refer to [\s/]+ / “whitespace-or-slash” instead of just \s+.

   * Note: `\s+` (requiring at least one whitespace before the attribute name) is
   * intentional — HTML attributes are always separated from the tag name and from
   * each other by at least one whitespace character. Using `\s*` would risk false
   * matches inside tag names (e.g. matching "ong" inside "strong").

Files reviewed: 4/4 changed files
Comments generated: 2

    // Using [\s/]+ (instead of \s+) also strips dangerous attributes that are immediately
    // preceded by a "/" with no space — e.g. the malformed <img/onerror=alert(1) src=x>.
-    return tagContent.replace(/[\s/]+(?:on\w+|style)(?:\s*=\s*(?:"[^"]*"|'[^']*'|[^\s>"'`]*))?/gi, "");
+    return tagContent.replace(/[\s/]+(?:on\w+|style|data-[a-z0-9_-]+)(?:\s*=\s*(?:"[^"]*"|'[^']*'|[^\s>"'`]*))?/gi, "");


+      const result = sanitizeContent('<span data-x="INJECT">text</span>');
+      expect(result).toBe("<span>text</span>");
+    });
+


Initial plan

8cafd0b

Copilot AI assigned Copilot and szabta89 May 12, 2026

Copilot started work on behalf of szabta89 May 12, 2026 14:51 View session

Copilot AI linked an issue May 12, 2026 that may be closed by this pull request

stripDangerousAttributes does not strip data-* custom attributes — silently-permitted hidden injection channel on GFM-allowe [Content truncated due to length] #31713

Closed

Copilot AI and others added 2 commits May 12, 2026 14:56

Plan: strip data-* attributes in stripDangerousAttributes

4e6e0c0

Co-authored-by: szabta89 <1330202+szabta89@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Fix stripDangerousAttributes to strip data-* custom attributes~~ Strip data-* attributes in stripDangerousAttributes May 12, 2026

Copilot AI requested a review from szabta89 May 12, 2026 15:02

Copilot finished work on behalf of szabta89 May 12, 2026 15:02

pelikhan marked this pull request as ready for review May 12, 2026 15:42

Copilot AI review requested due to automatic review settings May 12, 2026 15:42

Copilot AI reviewed May 12, 2026

View reviewed changes

pelikhan closed this May 12, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Strip `data-*` attributes in `stripDangerousAttributes`#31714

Strip `data-*` attributes in `stripDangerousAttributes`#31714
Copilot wants to merge 3 commits into
mainfrom
copilot/fix-data-attributes-stripping

Copilot AI commented May 12, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

Copilot AI commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Bug Fix

What was the bug?

How did you fix it?

Testing

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Copilot's findings

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Copilot AI commented May 12, 2026 •

edited

Loading