bug when parsing <script> tag using some template system #29

ghostoy · 2011-09-01T06:35:50Z

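var htmlparser = require('htmlparser'),
    util = require('util'),
    // The callback is intentionally empty; the result is read from handler.dom below.
    handler = new htmlparser.DefaultHandler(function(err, dom){}),
    parser = new htmlparser.Parser(handler),
    // A template stored in a <script> tag, with markup directly after the opening tag.
    rawHtml = '<script type="text/template"><h1>Heading1</h1></script>';

parser.parseComplete(rawHtml);
// Print the full DOM tree (depth: null means no depth limit).
console.log(util.inspect(handler.dom, false, null));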

This code drops the leading "<" of <h1> inside the script tag and outputs:

[ { raw: 'script type="text/template"',
    data: 'script type="text/template"',
    type: 'script',
    name: 'script',
    attribs: { type: 'text/template' },
    children: 
     [ { raw: 'h1>Heading1</h1>',  // discard <
         data: 'h1>Heading1</h1>',
         type: 'text' } ] } ]

fb55 · 2011-10-25T20:49:21Z

The funny thing is that if you add a space between the script tag and the h1 tag, it actually works: https://github.com/FB55/node-htmlparser/blob/master/tests/23-template_script_tags.js

elfsternberg · 2011-11-11T22:48:37Z

Nothing funny about it, @fb55. The problem is deep inside parseTags(), where it consumes the first less-than symbol following any tag, including the script tag, but then correctly goes back into text-parsing mode to handle all of the template.
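
To make the described behaviour concrete, here is a minimal sketch of a toy tokenizer. It is not the parseTags() code from htmlparser; the function name `tokenize` and the token shape are hypothetical, chosen only for illustration. The point it shows is that after an opening script tag the parser should step past the ">" and nothing more, then treat everything up to the closing script tag as text, including any "<".

```javascript
// Toy tokenizer for illustration only; NOT the htmlparser implementation.
// Simplifying assumption: attribute values never contain ">".
function tokenize(html) {
  const tokens = [];
  let pos = 0;
  let rawTextTag = null; // name of the raw-text element we are inside, if any

  while (pos < html.length) {
    if (rawTextTag !== null) {
      // Inside <script>: everything up to the matching close tag is text,
      // including "<" characters that would otherwise start a tag.
      const close = '</' + rawTextTag;
      let end = html.toLowerCase().indexOf(close, pos);
      if (end === -1) end = html.length;
      if (end > pos) tokens.push({ type: 'text', data: html.slice(pos, end) });
      pos = end;
      rawTextTag = null;
      continue;
    }

    if (html[pos] === '<') {
      let end = html.indexOf('>', pos);
      if (end === -1) end = html.length;
      const data = html.slice(pos + 1, end);
      tokens.push({ type: 'tag', data: data });
      const name = data.split(/\s/)[0].toLowerCase();
      if (name === 'script') rawTextTag = name;
      // Step past ">" only. Consuming one more character here is the kind
      // of mistake that would eat the "<" of a following <h1>.
      pos = end + 1;
    } else {
      let end = html.indexOf('<', pos);
      if (end === -1) end = html.length;
      tokens.push({ type: 'text', data: html.slice(pos, end) });
      pos = end;
    }
  }
  return tokens;
}

const tokens = tokenize('<script type="text/template"><h1>Heading1</h1></script>');
console.log(tokens);
```

With the input from the original report, the middle token is a single text token whose data keeps the leading "<" of the h1 tag, with or without a space after the opening script tag.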

fb55 · 2011-11-12T11:41:29Z

I fixed the bug in my own fork; the test linked above now passes without the additional space, which has been removed.

were left unmolested. This is to ensure that script tags can be used for some language other than Javascript. The test data has a unit of whitespace at the very front to work around tautologistics/node-htmlparser#29

kirbysayshi pushed a commit to kirbysayshi/node-htmlparser that referenced this issue Dec 19, 2013

Added a test

dc5fe9c

tautologistics#29

kirbysayshi pushed a commit to kirbysayshi/node-htmlparser that referenced this issue Dec 19, 2013

Replaced _tagStack with _contentFlags, tweaked DefaultHandler

bc12cd8

That fixed tautologistics#29.

fb55 added a commit to fb55/high5 that referenced this issue Apr 8, 2014

Replaced _tagStack with _contentFlags, tweaked DefaultHandler

b0276b3

That fixed tautologistics/node-htmlparser#29.

fb55 added a commit to fb55/htmlparser2 that referenced this issue Oct 21, 2018

Added a test

93d5f91

tautologistics/node-htmlparser#29

fb55 added a commit to fb55/htmlparser2 that referenced this issue Oct 21, 2018

Replaced _tagStack with _contentFlags, tweaked DefaultHandler

50818bc

That fixed tautologistics/node-htmlparser#29.

Vatoth pushed a commit to fasterize/node-htmlparser that referenced this issue Mar 5, 2021

Added a test

83a0ade

tautologistics/node-htmlparser#29

Vatoth pushed a commit to fasterize/node-htmlparser that referenced this issue Mar 5, 2021

Replaced _tagStack with _contentFlags, tweaked DefaultHandler

da9376d

That fixed tautologistics/node-htmlparser#29.

Vatoth pushed a commit to fasterize/node-htmlparser that referenced this issue Mar 5, 2021

Added a test

68eb7bf

tautologistics/node-htmlparser#29

Vatoth pushed a commit to fasterize/node-htmlparser that referenced this issue Mar 5, 2021

Replaced _tagStack with _contentFlags, tweaked DefaultHandler

2336f88

That fixed tautologistics/node-htmlparser#29.

Vatoth pushed a commit to fasterize/node-htmlparser that referenced this issue Mar 5, 2021

Added a test

0a787b3

tautologistics/node-htmlparser#29

Vatoth pushed a commit to fasterize/node-htmlparser that referenced this issue Mar 5, 2021

Replaced _tagStack with _contentFlags, tweaked DefaultHandler

d7cf9a4

That fixed tautologistics/node-htmlparser#29.
