Should explicitly support `xml:space="preserve"` #74

TPS · 2015-12-28T17:36:03Z

For details on why this is needed, see leethomason/tinyxml2#242 and JayXon/Leanify#3 (comment), along with the comments preceding it.

zeux · 2016-01-10T02:08:45Z

So it seems that the cost of this feature outweighs the benefits.

The linked issue in Leanify would be resolved either by `xml:space="preserve"` support or by using `parse_ws_pcdata_single` (or by `parse_ws_pcdata` plus some more advanced client code that strips unused whitespace, since `parse_ws_pcdata_single` is inadequate for e.g. XHTML parsing).
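The "client code that strips unused whitespace" option could look roughly like the sketch below. It is only an illustration under stated assumptions: `Node` is a hypothetical stand-in for a tree that was parsed with whitespace-only PCDATA kept, not pugixml's own node type, and `strip_ignorable_whitespace` is a made-up name. The pass removes whitespace-only text nodes except inside elements where `xml:space="preserve"` is in effect.

```cpp
#include <algorithm>
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for a parsed tree; this is NOT pugixml's node type.
struct Node {
    bool is_text = false;        // true for PCDATA, false for elements
    std::string name;            // element name (unused for text nodes)
    std::string text;            // PCDATA content (unused for elements)
    std::string xml_space;       // value of xml:space on this element, "" if absent
    std::vector<Node> children;  // child nodes in document order
};

// True if the string is empty or consists only of XML whitespace characters.
static bool is_blank(const std::string& s) {
    return s.find_first_not_of(" \t\r\n") == std::string::npos;
}

// Remove whitespace-only text nodes below `element`, except where
// xml:space="preserve" applies. `preserve` is the setting inherited from
// the nearest ancestor that specified xml:space.
void strip_ignorable_whitespace(Node& element, bool preserve = false) {
    assert(!element.is_text);  // only elements have children to clean up

    if (element.xml_space == "preserve") preserve = true;
    else if (element.xml_space == "default") preserve = false;

    std::vector<Node>& kids = element.children;
    if (!preserve) {
        kids.erase(std::remove_if(kids.begin(), kids.end(),
                                  [](const Node& n) {
                                      return n.is_text && is_blank(n.text);
                                  }),
                   kids.end());
    }

    // Recurse into the remaining child elements with the inherited setting.
    for (Node& child : kids) {
        if (!child.is_text) strip_ignorable_whitespace(child, preserve);
    }
}
```

Because the pass runs after parsing, it keeps the attribute lookup out of the parser's hot paths, at the price of first building text nodes that are then thrown away.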

Interestingly enough, the document in the tinyxml2 issue does not contain `xml:space="preserve"`, so the attribute is irrelevant there; that case is already served by the `parse_ws_*` parsing flags that pugixml provides. In general, `xml:space="preserve"` won't help for XHTML documents because it is almost never present in the documents themselves.

Now, there's still some value in automatically preserving whitespace where the document asks for it, and the XML recommendation seems to suggest that supporting this is preferred. The problem is that the implementation is very involved:

- Requires tracking this flag during parsing for the entire subtree. Since pugixml uses a stackless parser, it's not obvious how to cleanly implement this without additional storage and without the need to examine the presence of the attribute whenever you close the node.
- Requires analyzing the attribute names during parsing, which adds overhead to one of the hottest paths in the parser for some documents,
- or requires inspecting the attribute state during PCDATA parsing, which is once again one of the hottest paths in the parser.
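To make the bookkeeping in the first point concrete, here is a minimal sketch of the per-element state a parser would have to carry. `SpaceScope` is a hypothetical helper, not part of pugixml, and it assumes the parser can call it on every start tag, end tag, and PCDATA run. The explicit stack is exactly the kind of additional storage a stackless parser does not otherwise need.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical tracker (not pugixml API): one entry per currently open element,
// recording whether xml:space="preserve" is in effect inside it.
class SpaceScope {
public:
    // Call on every start tag; `xml_space` is the attribute value, "" if absent.
    void open(const std::string& xml_space) {
        bool inherited = preserve();  // default: inherit from the parent
        if (xml_space == "preserve") inherited = true;
        else if (xml_space == "default") inherited = false;
        stack_.push_back(inherited);
    }

    // Call on every end tag; restores the parent's setting.
    void close() {
        assert(!stack_.empty());
        stack_.pop_back();
    }

    // Would be consulted for every PCDATA run to decide whether to keep whitespace.
    bool preserve() const { return !stack_.empty() && stack_.back(); }

    // Number of currently open elements.
    std::size_t depth() const { return stack_.size(); }

private:
    std::vector<bool> stack_;
};
```

Even in this simplified form, every start tag needs an attribute-name comparison and every text run needs a state lookup, which corresponds to the hot-path overhead described in the other two points.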

Given these implementation challenges, and the fact that in most cases applications don't seem to benefit from it, supporting this is not worth the trouble.

TPS · 2016-01-10T04:43:29Z

Thanks for investigating this!

TPS mentioned this issue Dec 28, 2015

some spaces in docx is removed JayXon/Leanify#3

Closed

zeux added the enhancement label Dec 29, 2015

zeux closed this as completed Jan 10, 2016

zeux added the wontfix label Jan 10, 2016

TPS mentioned this issue Jan 10, 2016

Pedantic white space preservation not supported. leethomason/tinyxml2#242

Closed

halx99 mentioned this issue Jun 11, 2018

xml:space='preserve' was ignored? #213

Closed
