Security: Prevent HTML/CSS injection in markdown rendering by ValwareIRC · Pull Request #87 · obbyworld/obby

ValwareIRC · 2025-10-16T15:57:15Z

This PR implements comprehensive security fixes for markdown rendering to prevent HTML and CSS injection attacks.

Security Fixes

1. Raw HTML Prevention

Strip all HTML tags from input before markdown processing
Block dangerous HTML injection like <div style="background:black">
Only allow markdown-generated HTML through

2. HTML Tag Whitelisting

Restrict allowed HTML tags to markdown-specific elements only
Block potentially harmful tags like div, span, etc.
Allow only: p, br, strong, b, em, i, h1-h6, ul, ol, li, blockquote, code, pre, a, img, hr, table, thead, tbody, tr, th, td, del, ins

3. Style Attribute Sanitization

Remove all style attributes from HTML tags
Prevent CSS injection attacks
Maintain controlled styling only for images (max-height: 150px)

4. Enhanced Sanitization

Remove dangerous tags: script, iframe, object, embed, style, link, meta
Strip event handlers and javascript: URLs
Comprehensive security filtering

Impact

✅ Blocks HTML/CSS injection attacks
✅ Maintains markdown functionality (bold, italic, links, images, etc.)
✅ Preserves image size constraints
✅ All tests passing
✅ No breaking changes

Testing

All 250 tests passing
Linting clean
Security validation complete

Summary by CodeRabbit

Bug Fixes
- Enhanced content filtering in markdown rendering to remove harmful elements, event handlers, and javascript protocols.
- Normalized image display with consistent styling and dimensions.
- Added input validation and length limits to prevent rendering issues.

- Keep forced styling on images with max-height constraint - Remove dangerous HTML tags (style, link, meta) from markdown output - Improve security sanitization for markdown rendering

- Force all img tags to have style='max-height: 150px;' regardless of source - Override any user-provided styles on images - Maintain consistent image sizing across markdown and raw HTML

- Prevent raw HTML injection by removing all HTML tags before parsing - Block dangerous HTML like <div style="..."> from being processed - Ensure only markdown-generated HTML can be rendered

coderabbitai · 2025-10-16T15:57:39Z

Caution

Review failed

The pull request is closed.

Walkthrough

The PR implements a security-focused sanitization pipeline for markdown rendering in ircUtils.tsx. Input text is stripped of HTML tags, markdown is parsed, then output is filtered to allow only whitelisted HTML elements. Dangerous content (scripts, iframes, styles, links) is removed, event handlers are stripped, image styles are normalized, and input validation with length caps is added to prevent XSS and DoS attacks.

Changes

Cohort / File(s)	Summary
Markdown Sanitization Pipeline `src/lib/ircUtils.tsx`	Implements a security-hardened markdown rendering function: strips HTML tags before parsing, enforces an allowlist of safe HTML tags (`allowedTags` whitelist), removes dangerous elements (scripts, iframes, objects, embeds, styles, links, meta tags), strips event handlers and `javascript:` protocols, normalizes image styling with controlled dimensions and fallback for disabled external content, and adds input validation with length cap to prevent DoS attacks.

Sequence Diagram

sequenceDiagram
    participant User
    participant renderMarkdown
    participant Sanitization
    participant MarkdownParser
    participant Validator

    User->>renderMarkdown: Input markdown text
    
    rect rgb(230, 240, 250)
    Note over renderMarkdown: New Security Pipeline
    renderMarkdown->>Validator: Check length & validity
    Validator-->>renderMarkdown: Valid/Invalid
    end
    
    alt Invalid Input
        renderMarkdown-->>User: Return text as-is
    else Valid Input
        renderMarkdown->>Sanitization: Strip HTML tags
        Sanitization->>MarkdownParser: Clean input
        MarkdownParser->>Sanitization: Parse to HTML
        
        rect rgb(240, 230, 250)
        Note over Sanitization: Filter & Normalize
        Sanitization->>Sanitization: Apply allowlist
        Sanitization->>Sanitization: Remove dangerous tags
        Sanitization->>Sanitization: Strip event handlers
        Sanitization->>Sanitization: Normalize img styles
        end
        
        Sanitization-->>renderMarkdown: Sanitized HTML
        renderMarkdown-->>User: Safe rendered content
    end

Estimated Code Review Effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Reasoning: The change introduces security-critical sanitization logic with moderate complexity. While focused to a single file, the logic involves multiple interconnected security concerns (whitelist validation, dangerous content removal, event handler stripping, input validation), each requiring careful scrutiny. The implementation pattern is consistent but security-sensitive, necessitating thorough review of edge cases and bypass prevention.

Possibly Related PRs

Remove style attributes from markdown-rendered HTML and enhance security #86: Modifies markdown-to-HTML sanitization and image styling in src/lib/ircUtils.tsx with similar removal of dangerous tags and controlled image presentation.

Suggested Reviewers

matheusfillipe

Poem

🐰 Hop through the code with a sanitized glow,
Stripping the danger and scripts down below,
Whitelists dance where the safe HTML plays,
Images styled in controlled, trusted ways,
Security bundled in each blessed line! ✨

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch fix/markdown

📜 Recent review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between b33b814 and 33cdf56.

📒 Files selected for processing (1)

src/lib/ircUtils.tsx (1 hunks)

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

github-actions · 2025-10-16T15:58:18Z

Preview URL: https://fix-markdown.obsidianirc.pages.dev

Automated deployment preview for the PR in the Cloudflare Pages.

) * Remove style attributes from markdown-rendered HTML and enhance security - Keep forced styling on images with max-height constraint - Remove dangerous HTML tags (style, link, meta) from markdown output - Improve security sanitization for markdown rendering * Ensure all img tags have controlled max-height styling - Force all img tags to have style='max-height: 150px;' regardless of source - Override any user-provided styles on images - Maintain consistent image sizing across markdown and raw HTML * lint * Strip all HTML tags from input before markdown processing - Prevent raw HTML injection by removing all HTML tags before parsing - Block dangerous HTML like <div style="..."> from being processed - Ensure only markdown-generated HTML can be rendered

ValwareIRC added 4 commits October 16, 2025 16:39

Remove style attributes from markdown-rendered HTML and enhance security

05232f2

- Keep forced styling on images with max-height constraint - Remove dangerous HTML tags (style, link, meta) from markdown output - Improve security sanitization for markdown rendering

Ensure all img tags have controlled max-height styling

7c93728

- Force all img tags to have style='max-height: 150px;' regardless of source - Override any user-provided styles on images - Maintain consistent image sizing across markdown and raw HTML

lint

ab62c65

Strip all HTML tags from input before markdown processing

33cdf56

- Prevent raw HTML injection by removing all HTML tags before parsing - Block dangerous HTML like <div style="..."> from being processed - Ensure only markdown-generated HTML can be rendered

ValwareIRC merged commit 7687b19 into main Oct 16, 2025
3 of 4 checks passed

coderabbitai Bot mentioned this pull request Mar 11, 2026

fix/markdown rendering issues and xss #158

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Security: Prevent HTML/CSS injection in markdown rendering#87

Security: Prevent HTML/CSS injection in markdown rendering#87
ValwareIRC merged 4 commits into
mainfrom
fix/markdown

ValwareIRC commented Oct 16, 2025 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Oct 16, 2025 •

edited

Loading

Review failed

Uh oh!

github-actions Bot commented Oct 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ValwareIRC commented Oct 16, 2025 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Security Fixes

1. Raw HTML Prevention

2. HTML Tag Whitelisting

3. Style Attribute Sanitization

4. Enhanced Sanitization

Impact

Testing

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review failed

Walkthrough

Changes

Sequence Diagram

Estimated Code Review Effort

Possibly Related PRs

Suggested Reviewers

Poem

Uh oh!

github-actions Bot commented Oct 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ValwareIRC commented Oct 16, 2025 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Oct 16, 2025 •

edited

Loading