Escape characters on xml attributes #217

dt-jean-baptiste-lemee · 2023-08-18T19:26:36Z

Issue: #216

dt-jean-baptiste-lemee · 2023-08-18T19:30:16Z

cjs/shared/mime.js

@@ -7,26 +7,31 @@ const Mime = {
  'text/html': {
    docType: '<!DOCTYPE html>',
    ignoreCase: true,
+    isXml: false,
    voidElements: /^(?:area|base|br|col|embed|hr|img|input|keygen|link|menuitem|meta|param|source|track|wbr)$/i
  },
  'image/svg+xml': {
    docType: '<?xml version="1.0" encoding="utf-8"?>',
    ignoreCase: false,


I think ignoreCase could be removed in favor of isXml.

So this MR is just about using isXml instead of ignoreCase? I am not sure what I am looking at here, but I am sure the MR could be much simpler without adding oddly cased fields (XML is XML, not Xml) and slow, XSS-prone transformers, as commented.

Nope, this PR is about fixing issue #216; I think the test speaks for itself. I only commented here to say that we could remove the ignoreCase boolean: case only has to be ignored for non-XML documents, so it is redundant here.

I think I named it that way because of XHTML, but again I am not sure why we need to change it ... it could be a breaking change if anyone out there brand-checks that property.
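To make the exchange above concrete: in the entries visible in the diff, ignoreCase is simply the negation of isXml, so one could be derived from the other while still exposing ignoreCase for code that reads it. The following is only a minimal sketch under that assumption. `defineMime` is a hypothetical helper that does not exist in the library, `voidElements` is omitted for brevity, and `isXml: true` for the SVG entry is an assumption because the hunk is cut off before that line.

```javascript
// Hypothetical helper (not part of linkedom): builds a MIME entry in which
// ignoreCase is derived from isXml instead of being declared separately.
function defineMime({docType, isXml}) {
  return {
    docType,
    isXml,
    // Still exposed, so code that reads ignoreCase keeps working.
    ignoreCase: !isXml
  };
}

// Only the two entries visible in the diff hunk are reproduced here.
const Mime = {
  'text/html': defineMime({
    docType: '<!DOCTYPE html>',
    isXml: false
  }),
  'image/svg+xml': defineMime({
    docType: '<?xml version="1.0" encoding="utf-8"?>',
    isXml: true // assumption: not shown in the truncated hunk
  })
};
```

This keeps a single source of truth without removing a property that existing consumers may check, which is the compatibility concern raised in the reply.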

I've renamed isXml to isXML and XmlAttr to XMLAttr

Tell me if there's anything else I could improve on this MR

There is a regression on the " character. I've pushed a solution, but again I'm chaining escape and a replace. I'm wondering why we don't escape all HTML entities (https://www.w3schools.com/html/html_entities.asp) in Attr.toString through the escape function? That way we would need neither XMLAttr nor Mime.isXML.
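As an illustration of the alternative to chaining an escape call with an extra replace, the sketch below escapes the characters that matter inside a double-quoted XML attribute value in a single pass. It is not the code from this PR or from linkedom: `escapeXMLAttr`, `serializeAttr`, and the entity table are hypothetical names chosen for the example, and the set of escaped characters (`&`, `<`, `>`, `"`) is an assumption about what the serializer needs.

```javascript
// Hypothetical sketch, not linkedom's implementation.
// Characters escaped inside a double-quoted XML attribute value.
const XML_ATTR_ENTITIES = {
  '&': '&amp;',
  '<': '&lt;',
  '>': '&gt;',
  '"': '&quot;'
};

// No "g" flag here, so test() keeps no state between calls.
const NEEDS_ESCAPE = /[&<>"]/;
const ESCAPE_ALL = /[&<>"]/g;

function escapeXMLAttr(value) {
  const text = String(value);
  // Fast path: return the value untouched when nothing needs escaping.
  if (!NEEDS_ESCAPE.test(text)) return text;
  // Single pass: one replace call with a lookup table.
  return text.replace(ESCAPE_ALL, ch => XML_ATTR_ENTITIES[ch]);
}

function serializeAttr(name, value) {
  return `${name}="${escapeXMLAttr(value)}"`;
}

console.log(serializeAttr('title', 'Tom & "Jerry"'));
// prints: title="Tom &amp; &quot;Jerry&quot;"
```

The early `test()` check is there so that values with nothing to escape pay only for one regular expression scan. Note that a value which already contains `&amp;` is treated as raw text and comes out as `&amp;amp;`.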

> why don't we escape all html entities

I think you are better off with JSDOM there ... it's 100% standard, and 100% slower than LinkeDOM

😆 I'm trying to switch from JSDOM to linkedom because JSDOM is 100,000% slower (JSDOM sometimes takes 5 minutes to process an XML document that linkedom handles in 1 second!). But yes, we still need a bit more "standardization". I don't think it's a bad idea for linkedom, but you're the boss, it's your choice. We can use our fork. Thanks for this lib and the work, it's huge.

To be fair, XML is not the main target here, and you are suggesting XML-related changes and code paths that would slow down the common use case by far, for example escaping all HTML entities ... I understand this is valuable for your business, but this project is about perf: perf for the most common use cases, which are not XML. Apologies if my answers are not the most welcoming, but I am trying to preserve the original idea of this project, which is to work for the most common cases, as fast as possible ... your quest feels a bit against that original goal, hence my nitpicking here. I hope you can understand, and I hope your fork works great for your use case too!

esm/interface/xml-attr.js

esm/shared/constants.js

dt-jean-baptiste-lemee commented Aug 18, 2023

dt-jean-baptiste-lemee mentioned this pull request Aug 18, 2023

Escape characters on xml attributes DiliTrust/linkedom#1

Merged

dt-jean-baptiste-lemee force-pushed the escape-charaters-on-xml-attributes branch from f47e99e to 3f63e45 Compare August 18, 2023 19:47

dt-jean-baptiste-lemee marked this pull request as ready for review August 18, 2023 19:52

dt-jean-baptiste-lemee force-pushed the escape-charaters-on-xml-attributes branch 2 times, most recently from 3e2579b to 0ce8770 Compare August 21, 2023 20:23

WebReflection reviewed Aug 23, 2023

esm/interface/xml-attr.js (outdated)

WebReflection reviewed Aug 23, 2023

esm/interface/xml-attr.js (outdated)

WebReflection reviewed Aug 23, 2023

esm/shared/constants.js (outdated)

dt-jean-baptiste-lemee force-pushed the escape-charaters-on-xml-attributes branch from 0ce8770 to 30c9bcf Compare August 24, 2023 19:26

dt-jean-baptiste-lemee requested a review from WebReflection August 24, 2023 19:27

dt-jean-baptiste-lemee force-pushed the escape-charaters-on-xml-attributes branch 6 times, most recently from 2a7512b to eb4d4e6 Compare August 28, 2023 17:59

dt-jean-baptiste-lemee marked this pull request as draft August 28, 2023 19:12

dt-jean-baptiste-lemee force-pushed the escape-charaters-on-xml-attributes branch from eb4d4e6 to 695e3e5 Compare August 28, 2023 19:26

dt-jean-baptiste-lemee marked this pull request as ready for review August 28, 2023 19:26

Escape characters on xml attributes

8085ee9

dt-jean-baptiste-lemee force-pushed the escape-charaters-on-xml-attributes branch from 695e3e5 to 8085ee9 Compare August 28, 2023 19:31
