Extract class candidates from inside parenthesized expressions by mvanhorn · Pull Request #19775 · tailwindlabs/tailwindcss

mvanhorn · 2026-03-10T08:10:40Z

Summary

Fixes #19458

Tailwind's content scanner doesn't extract class names from inside function-call-like syntax in data attributes such as data-loading="addClass(opacity-50)" (used by Symfony Live Components, Alpine.js, HTMX, etc.).

The fix adds ( and ) as boundary characters in the Rust extractor. This allows the scanner to recognize class names inside parenthesized expressions like addClass(opacity-50) since the parentheses now act as valid candidate boundaries.

This may produce a few additional harmless false-positive candidates (function names like addClass itself), but Tailwind ignores tokens that don't match any utility class, so there's no functional impact.

Test plan

Added Rust test: data-loading="addClass(opacity-50)" extracts opacity-50
Added Rust test: data-loading="addClass(opacity-50) removeClass(hidden)" extracts both candidates
Updated existing test expectations to account for new boundary behavior
cargo test passes (all Rust tests)
pnpm build && pnpm test passes (all 4620 Vitest tests)

This contribution was developed with AI assistance (Claude Code).

Add `(` and `)` as Common boundary characters so the content scanner extracts class names from inside function-call-like syntax in data attributes (e.g. `data-loading="addClass(opacity-50)"`). This pattern is used by Symfony Live Components, Alpine.js, and similar frameworks. Previously, parentheses broke candidate extraction because they were not recognized as valid boundaries. Fixes tailwindlabs#19458 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

coderabbitai · 2026-03-10T08:20:39Z

Walkthrough

The change expands the boundary detection logic in the extractor to recognize parentheses '(' and ')' as boundary delimiters alongside null bytes. This enables the tokenizer to properly handle function-call-like syntax within data attributes (e.g., data-loading="addClass(opacity-50)"). The production change is minimal—updating the Common variant of the Class enum in boundary.rs. The remaining changes are test updates across multiple files to reflect the new extraction behavior, including tokens previously ignored in parenthetical contexts and nested function calls. Whitespace adjustments in a HAML test fixture align indentation without functional changes.

🚥 Pre-merge checks | ✅ 4

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title accurately describes the main change: adding parentheses as boundary characters to enable class extraction from parenthesized expressions.
Description check	✅ Passed	The description clearly relates to the changeset by explaining the fix for extracting classes from parenthesized data attributes and detailing test coverage.
Linked Issues check	✅ Passed	The PR successfully addresses issue `#19458` by adding parentheses as boundary characters to extract class names from function-call-like syntax in data attributes.
Out of Scope Changes check	✅ Passed	All changes are scoped to the stated objectives: boundary character additions, test updates, and fixture adjustments for supporting parenthesized expression extraction.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@crates/oxide/src/extractor/boundary.rs`:
- Around line 60-66: Change the paren boundary mapping so '(' is a Before
boundary and ')' is an After boundary instead of both being mapped to Common: in
the enum where Common, Before and After are annotated with #[bytes(...)] replace
#[bytes(b'(', b')')] on Common with #[bytes(b'\0')], add #[bytes(b'(')] to the
Before variant, and add #[bytes(b')')] to the After variant so candidates start
after '(' and end before ')'.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 908fecb2-5648-41d0-a8b3-6eb84a398b80

📥 Commits

Reviewing files that changed from the base of the PR and between c586bd6 and d957ba3.

📒 Files selected for processing (6)

crates/oxide/src/extractor/boundary.rs
crates/oxide/src/extractor/candidate_machine.rs
crates/oxide/src/extractor/mod.rs
crates/oxide/src/extractor/named_utility_machine.rs
crates/oxide/src/extractor/pre_processors/test-fixtures/haml/dst-17051.haml
crates/oxide/src/extractor/utility_machine.rs

coderabbitai · 2026-03-10T08:20:42Z

crates/oxide/src/extractor/boundary.rs

+    // Function-call-like syntax in data attributes, e.g.:
+    //
+    // ```
+    // <div data-loading="addClass(opacity-50)">
+    //                            ^           ^
+    // ```
+    #[bytes(b'(', b')')]


⚠️ Potential issue | 🟠 Major

Split ( and ) into directional boundaries instead of Common.

Line 66 makes ( valid as an after-boundary and ) valid as a before-boundary, which broadens extraction beyond the regression you’re fixing. You can already see that downstream: crates/oxide/src/extractor/mod.rs Line 446 now extracts bg-[red] from bg-[red](--my-color), and the updated tests start accepting wrapper names like addClass, fn, and variant. The issue only needs candidates to start after ( and end before ), so ( should be Before and ) should be After.

♻️ Narrow the paren boundary scope

@@ - #[bytes(b'\0')] - // Function-call-like syntax in data attributes, e.g.: - // - // ``` - // <div data-loading="addClass(opacity-50)"> - // ^ ^ - // ``` - #[bytes(b'(', b')')] + #[bytes(b'\0')] Common, @@ - #[bytes(b'>')] + #[bytes(b'>')] + #[bytes(b'(')] Before, @@ - #[bytes(b'<')] + #[bytes(b'<')] + #[bytes(b')')] After,

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

// Function-call-like syntax in data attributes, e.g.:

//

// ```

// <div data-loading="addClass(opacity-50)">

// ^ ^

// ```

#[bytes(b'(', b')')]

#[bytes(b'\0')]

Common,

#[bytes(b'>')]

#[bytes(b'(')]

Before,

#[bytes(b'<')]

#[bytes(b')')]

After,

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@crates/oxide/src/extractor/boundary.rs` around lines 60 - 66, Change the paren boundary mapping so '(' is a Before boundary and ')' is an After boundary instead of both being mapped to Common: in the enum where Common, Before and After are annotated with #[bytes(...)] replace #[bytes(b'(', b')')] on Common with #[bytes(b'\0')], add #[bytes(b'(')] to the Before variant, and add #[bytes(b')')] to the After variant so candidates start after '(' and end before ')'.

RobinMalfait · 2026-03-10T11:07:18Z

Hey! Thanks for the PR but going to say no to this solution.

This may produce a few additional harmless false-positive candidates

While they are false-positives, they are often not just a few candidates, these can be hundreds of additional candidates. It's true that Tailwind's core will skip them most of the time, but we still have to pass them from Rust to the Node.js world and process them to even know that they are valid classes or not.

I tried it on a few repo's, small and big.

3318 → 3380 +62 candidates (one of our templates)
1732 → 1862 +130 candidates (my personal website)
13532 → 13861 +329 candidates (tailwindcss.com)
10309 → 10711 +402 candidates (tailwindcss.com/plus)

In the grand scheme of things, these are relatively low numbers, but they add up over time especially knowing that these additional candidates are all false positives because this was tested on repo's that don't have this issue as described in the original issue.

mvanhorn requested a review from a team as a code owner March 10, 2026 08:10

mvanhorn mentioned this pull request Mar 10, 2026

Source classes are not detected correctly when specified as data-loading="addClass(opacity-50)" #19458

Open

coderabbitai bot reviewed Mar 10, 2026

View reviewed changes

RobinMalfait closed this Mar 10, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Extract class candidates from inside parenthesized expressions#19775

Extract class candidates from inside parenthesized expressions#19775
mvanhorn wants to merge 1 commit intotailwindlabs:mainfrom
mvanhorn:osc/19458-fix-data-loading-class-extraction

mvanhorn commented Mar 10, 2026

Uh oh!

coderabbitai bot commented Mar 10, 2026

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Mar 10, 2026

Uh oh!

RobinMalfait commented Mar 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

mvanhorn commented Mar 10, 2026

Summary

Test plan

Uh oh!

coderabbitai bot commented Mar 10, 2026

Walkthrough

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

RobinMalfait commented Mar 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants