@marcphilipp
Member

  • Add test task using Woodstox for XML serialization
  • Delete duplicate test relying on implementation details
  • Use Unicode replacement character for illegal characters
  • Add test for whitespace escaping
  • Implement workaround for escaping whitespace chars in attribute values

Rather than double-escaped character references.
While XML attribute values may contain whitespace such as line breaks,
the XML spec [1] requires XML processors to replace each such character
with a space, which causes downstream tools to misrepresent the original value.

[1] w3.org/TR/xml#AVNormalize
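
For illustration, here is a minimal sketch of the normalization problem and one way to work around it, using only the JDK's built-in DOM parser. It is not the code from this PR: the class `AttributeWhitespaceDemo` and the helpers `escapeAttributeValue`, `isAllowedInXml10`, and `parseMessageAttribute` are hypothetical names, and the `failure` element is just an example. The sketch assumes the workaround amounts to writing tab, line feed, and carriage return as numeric character references, which attribute-value normalization leaves intact, and replacing characters that XML 1.0 does not allow with U+FFFD.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical demo class; not part of JUnit.
final class AttributeWhitespaceDemo {

    // Escapes a string for use inside a double-quoted XML attribute value.
    // Whitespace other than the plain space is written as a numeric character
    // reference so that a parser does not normalize it to a space.
    static String escapeAttributeValue(String value) {
        StringBuilder sb = new StringBuilder();
        value.codePoints().forEach(cp -> {
            switch (cp) {
                case '&': sb.append("&amp;"); break;
                case '<': sb.append("&lt;"); break;
                case '>': sb.append("&gt;"); break;
                case '"': sb.append("&quot;"); break;
                case '\n': sb.append("&#10;"); break;
                case '\r': sb.append("&#13;"); break;
                case '\t': sb.append("&#9;"); break;
                default:
                    // Characters outside the XML 1.0 Char production cannot be
                    // represented at all, so substitute U+FFFD.
                    sb.appendCodePoint(isAllowedInXml10(cp) ? cp : 0xFFFD);
            }
        });
        return sb.toString();
    }

    // The Char production of XML 1.0.
    static boolean isAllowedInXml10(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
                || (cp >= 0x20 && cp <= 0xD7FF)
                || (cp >= 0xE000 && cp <= 0xFFFD)
                || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    // Parses the given XML and returns the root element's "message" attribute.
    static String parseMessageAttribute(String xml) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
        return doc.getDocumentElement().getAttribute("message");
    }

    public static void main(String[] args) throws Exception {
        String original = "line 1\nline 2";

        // A literal line break in the attribute is normalized to a space.
        String raw = parseMessageAttribute(
                "<failure message=\"" + original + "\"/>");

        // The same line break written as &#10; survives parsing.
        String escaped = parseMessageAttribute(
                "<failure message=\"" + escapeAttributeValue(original) + "\"/>");

        System.out.println("raw preserved:     " + raw.equals(original));
        System.out.println("escaped preserved: " + escaped.equals(original));
    }
}
```

With a conforming parser, the first comparison fails and the second succeeds, which is the behavior a report consumer would need in order to show multi-line messages faithfully.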

Resolves #4174.
@marcphilipp self-assigned this on Dec 8, 2024
@marcphilipp merged commit 432d556 into main on Dec 8, 2024
15 checks passed
@marcphilipp deleted the marc/4174-xml-attribute-whitespace-workaround branch on Dec 8, 2024, 18:43
Linked issue: Legacy XML reports should preserve new line characters in message attributes

2 participants