First comment node is missing after calling `DOMParser.parseFromString()` #9861

psmyrek · 2021-06-14T06:22:33Z

Provide a description of the task

This issue has been found during working on HTML comments.

The DOM string parsed by HtmlDataProcessor#_toDom() does not contain the first comment node, if it is the first node in the DOM string.

This issue is caused because the return value from:

new DOMParser().parseFromString( '<!--COMMENT 1--><p>PARAGRAPH 1</p><!--COMMENT 2-->', 'text/html' ).body.childNodes

is a two-elements list: NodeList [ p,  ] without the first comment node . The first comment is somehow hoisted and inserted before the newly created HTML document:

This issue could be fixed by wrapping the DOM string with a <body> or even better with a <div>, just like it is done for BasicHtmlWriter#getHtml()

The text was updated successfully, but these errors were encountered:

psmyrek · 2021-06-17T07:38:54Z

I think I understood the reason for this behavior, which is described in the rules for parsing tokens in HTML content.

In short: parsing tokens in an HTML string starts with the so-called "initial" insertion mode. When a parser is in this state and encounters a comment node, it inserts this comment node as the last child of the Document object. The parser then proceeds to successive insertion modes by creating and appending subsequent nodes (like <html>, <head>, <body>), which in turn leads to the fact that the first comment becomes the first node in the document object and it is located before the <html> element.

Other (engine): Fixed parsing leading HTML comments by `HtmlDataProcessor.toView()`. Closes #9861.

psmyrek added type:task This issue reports a chore (non-production change) and other types of "todos". squad:compat domain:v4-compatibility This issue reports a CKEditor 4 feature/option that's missing in CKEditor 5. labels Jun 14, 2021

Mgsy added this to the iteration 44 milestone Jun 17, 2021

psmyrek self-assigned this Jun 18, 2021

psmyrek mentioned this issue Jun 21, 2021

Move leading comment nodes into the document fragment #9927

Merged

mlewand added the package:engine label Jun 21, 2021

ma2ciek closed this as completed in #9927 Jun 25, 2021

ma2ciek added a commit that referenced this issue Jun 25, 2021

Merge pull request #9927 from ckeditor/ck/9861

12dc7ba

Other (engine): Fixed parsing leading HTML comments by `HtmlDataProcessor.toView()`. Closes #9861.

This was referenced Jan 14, 2022

<script> or <style> that's used at the beginning of an HTML data string is lost #11109

Closed

<script> or <style> that's used at the beginning of an HTML data string is lost #11110

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

First comment node is missing after calling `DOMParser.parseFromString()` #9861

First comment node is missing after calling `DOMParser.parseFromString()` #9861

psmyrek commented Jun 14, 2021 •

edited

Loading

psmyrek commented Jun 17, 2021 •

edited

Loading

First comment node is missing after calling DOMParser.parseFromString() #9861

First comment node is missing after calling DOMParser.parseFromString() #9861

Comments

psmyrek commented Jun 14, 2021 • edited Loading

Provide a description of the task

psmyrek commented Jun 17, 2021 • edited Loading

First comment node is missing after calling `DOMParser.parseFromString()` #9861

First comment node is missing after calling `DOMParser.parseFromString()` #9861

psmyrek commented Jun 14, 2021 •

edited

Loading

psmyrek commented Jun 17, 2021 •

edited

Loading