Fixed Purifier breaks UTF8 chars #77 (#78)

Open
yarkm13 wants to merge 1 commit into bumpcore:1.x from yarkm13:77-Purifier-breaks-UTF8-chars
Conversation

yarkm13 commented May 15, 2026

By default, DOMDocument::loadHTML() treats input strings as single-byte ISO-8859-1, which breaks multi-byte UTF-8 characters.

In Purifier::stripAttributes(), the string passed to $domDocument->loadHTML() is now prefixed with <?xml encoding="utf-8" ?> so that it is parsed as UTF-8, and the unnecessary re-encoding after saveHTML() has been removed.
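
For reviewers who want to try the idea in isolation, here is a minimal sketch of the prefix technique described above. It is not the actual Purifier::stripAttributes() code: the helper name loadUtf8Fragment() and the sample string are hypothetical, and the exact parsing behavior may vary with the libxml2 version PHP is built against.

```php
<?php
declare(strict_types=1);

/**
 * Hypothetical helper for illustration only (not part of the library).
 * Parses an HTML fragment with an XML encoding declaration prepended,
 * so that libxml reads the bytes as UTF-8.
 */
function loadUtf8Fragment(string $html): DOMDocument
{
    $dom = new DOMDocument();

    // Collect parser warnings internally instead of emitting PHP warnings.
    $previous = libxml_use_internal_errors(true);

    // The prefix is the encoding hint the PR adds before the markup.
    $dom->loadHTML('<?xml encoding="utf-8" ?>' . $html);

    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    return $dom;
}

// Sample input containing multi-byte UTF-8 characters (assumed for the demo).
$dom = loadUtf8Fragment('<p>Привіт, світ</p>');

// Read the text back from the parsed tree.
$text = $dom->getElementsByTagName('p')->item(0)->textContent;

echo $text, PHP_EOL;
```

The sketch reads the text back through textContent rather than saveHTML(), because how saveHTML() serializes non-ASCII characters depends on the document's encoding state.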

Fixes #77

Development

Successfully merging this pull request may close these issues.

Purifier breaks UTF8 chars

1 participant