Simple strong/em case fails to be parsed (foo bar) #63

TheCloudlessSky · 2012-10-05T01:10:53Z

The the sample case of ***foo bar*** fails to parse correctly.

It should be parsed as ["em", ["strong", "foo bar"]] but is instead parsed as [["strong", "*foo bar"], "*"].

However if spaces are added (* **foo** *), it produces the correct HTML.

The text was updated successfully, but these errors were encountered:

lorddev · 2012-10-05T04:55:54Z

I think if you want em + strong you need to use underscores. From what I understand, 3 asterisks is supposed to be used in order to produce actual asterisks, e.g. foo bar

TheCloudlessSky · 2012-10-05T08:01:47Z

@lorddev Having ***foo bar*** produce foo bar is consistent with at least StackOverflow's and GitHub's markdown. To produce asterisks around bolded text would be with escaped asterisks **\*foo bar\***, which works fine and seems most intuitive.

lorddev · 2012-10-05T17:32:14Z

Ok. I must having been thinking of Google+, which implements only the asterisks and underscores subset of markdown.

ashb · 2012-10-06T11:20:44Z

This is quite probably a parsing bug as most other parsers treat it as foo bar as you can see here: http://babelmark.bobtfish.net/?markdown=***foo+bar***

(I know I made some decisions on purpose of what to parse and what to just ignore but I don't think this was one of them)

TheCloudlessSky · 2012-10-06T12:38:24Z

@ashb That's what I think too. I apologize for not submitting a pull request; I'm still learning the code base and how the parsing works. I did, however, add the test case in the inline_strong_em of regressions.t.js and it failed.

ashb · 2012-10-06T13:06:33Z

The parsing of strong and em is a little bit ... interesting (along with most of the rest of the parsing) and has some fruity backtracking like stuff in it. The strong_em helper function is what deals parsing of **

markdown-js/lib/markdown.js

Line 977 in 50f6d69

function strong_em( tag, md ) {

ashb · 2012-10-06T13:20:49Z

Hmmm I've taken a look and the way the strong/em state is currently split out is what's causing the problem I suspect.

The problem is that it doesn't keep the ordering of which of a strong/em was last opened, so it closes the wrong one (the strong) as this is first in the regex pattern and doesn't know that it should check if it should close an em instead of a strong.

I suspect we'll have to rewrite that parser helper func to use a single state variable instead of two split ones.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simple strong/em case fails to be parsed (foo bar) #63

Simple strong/em case fails to be parsed (foo bar) #63

TheCloudlessSky commented Oct 5, 2012

lorddev commented Oct 5, 2012

TheCloudlessSky commented Oct 5, 2012

lorddev commented Oct 5, 2012

ashb commented Oct 6, 2012

TheCloudlessSky commented Oct 6, 2012

ashb commented Oct 6, 2012

ashb commented Oct 6, 2012

Simple strong/em case fails to be parsed (***foo bar***) #63

Simple strong/em case fails to be parsed (***foo bar***) #63

Comments

TheCloudlessSky commented Oct 5, 2012

lorddev commented Oct 5, 2012

TheCloudlessSky commented Oct 5, 2012

lorddev commented Oct 5, 2012

ashb commented Oct 6, 2012

TheCloudlessSky commented Oct 6, 2012

ashb commented Oct 6, 2012

ashb commented Oct 6, 2012

Simple strong/em case fails to be parsed (foo bar) #63

Simple strong/em case fails to be parsed (foo bar) #63