Skip to content

[fix](doc) v4.x: remove stray control characters that break HTML minifier#3843

Merged
dataroaring merged 1 commit into
apache:masterfrom
dataroaring:docs/fix-v4x-control-chars
May 29, 2026
Merged

[fix](doc) v4.x: remove stray control characters that break HTML minifier#3843
dataroaring merged 1 commit into
apache:masterfrom
dataroaring:docs/fix-v4x-control-chars

Conversation

@dataroaring
Copy link
Copy Markdown
Contributor

Summary

  • STRUCT.md (semi-structured data types) had a trailing 0x03 byte after DISTRIBUTED BY HASH(id) BUCKETS 1 on line 147.
  • sequence-count.md and sequence-match.md (aggregate functions) had 0x19 standing in for the apostrophe in "isn't" (three occurrences across the two files).
  • The Docusaurus HTML minifier rejects control characters in the input stream, so these pages fail the build for both /cloud/26.x/... and /enterprise/4.x/... URL paths.
  • Applied the same fix in docs/ (next) and versioned_docs/version-4.x/. Chinese (i18n) copies and other version trees were already clean.

Test plan

  • yarn build (or the project's site build) completes without the previous Control character in input stream errors on the three affected pages.
  • Spot-check the rendered pages for STRUCT, sequence-count, sequence-match in both enterprise/4.x and cloud/26.x and confirm "isn't" displays with a normal apostrophe and the DISTRIBUTED BY ... BUCKETS 1 line renders cleanly.

🤖 Generated with Claude Code

…fier

STRUCT.md had a trailing 0x03 after `DISTRIBUTED BY HASH(id) BUCKETS 1`;
sequence-count.md and sequence-match.md had 0x19 in place of the
apostrophe in "isn't". The HTML minifier rejects control characters in
the input stream, failing the build for these pages.

Fixed in both docs/ (next) and versioned_docs/version-4.x/.
Copilot AI review requested due to automatic review settings May 29, 2026 02:59
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Removes stray control characters from a small set of SQL manual Markdown pages that were causing Docusaurus’ HTML minifier to fail during site builds (notably affecting the 4.x versioned docs and “next” docs).

Changes:

  • Replaced the invalid control character used in “isn't” with a normal ASCII apostrophe in sequence-count and sequence-match docs.
  • Removed a trailing 0x03 control byte from a SQL example line in the STRUCT docs.
  • Applied the same cleanup in both docs/ (next) and versioned_docs/version-4.x/ (4.x).

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated no comments.

Show a summary per file
File Description
versioned_docs/version-4.x/sql-manual/sql-functions/aggregate-functions/sequence-match.md Replaces control-character apostrophes to prevent minifier failures in 4.x docs.
versioned_docs/version-4.x/sql-manual/sql-functions/aggregate-functions/sequence-count.md Replaces control-character apostrophes to prevent minifier failures in 4.x docs.
versioned_docs/version-4.x/sql-manual/basic-element/sql-data-types/semi-structured/STRUCT.md Removes trailing control byte from a SQL snippet line in 4.x docs.
docs/sql-manual/sql-functions/aggregate-functions/sequence-match.md Replaces control-character apostrophes to prevent minifier failures in “next” docs.
docs/sql-manual/sql-functions/aggregate-functions/sequence-count.md Replaces control-character apostrophes to prevent minifier failures in “next” docs.
docs/sql-manual/basic-element/sql-data-types/semi-structured/STRUCT.md Removes trailing control byte from a SQL snippet line in “next” docs.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@dataroaring dataroaring merged commit 83b3e13 into apache:master May 29, 2026
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants