Whitespace prevents random component to be interpreted #161

Jonarod · 2018-03-10T09:34:55Z

Say my render logic is like this:

render((
    <Markdown
        children={md}
        options={{
            overrides: {
                MyComponent: {
                    component: MyComponent,
                },
            },
        }} />
), document.body);

MyComponent being as simple as possible like:

const MyComponent = props => <h1>{props.say}</h1>

Now, the problem occurs depending on the markdown input syntax. Observe those two similar inputs:

Input 1 (equal sign with spaces):

Say hello: <MyComponent say = "hello" />

Input 2 (equal sign withOUT spaces):

Say hello: <MyComponent say="hello" />

Input 1 outputs nothing, while Input 2 correctly prints the component's content.
Is there any way to make markdown-to-jsx more resilient to whitespaces so that both syntax works ?
I have been struggling like 3 hours just to understand where was my mistake while it was just a space messing... :/
Anyway, nice library :)

The text was updated successfully, but these errors were encountered:

Jonarod · 2018-03-10T09:43:44Z

After further investigations, it seems a more global problem:

These markdown input render differently:

<div class="green">
   Hello
</div>

<div class ="green">
   Hello
</div>

<div class= "green">
   Hello
</div>

<div class = "green">
   Hello
</div>

if this is parsed using regex, would it be possible to add something like:

/ *= */

if yes, I'd be glad to help

Jonarod · 2018-03-10T10:19:22Z

Got into the code and found these bits:

const ATTR_EXTRACTOR_R = /([-A-Z0-9_:]+)(?:\s*=\s*(?:(?:"((?:\\.|[^"])*)")|(?:'((?:\\.|[^'])*)')|(?:\{((?:\\.|{[^}]*?}|[^}])*)\})))?/gi;

and

function attrStringToMap (str) {
        const attributes = str.match(ATTR_EXTRACTOR_R);

        return attributes ? attributes.reduce(function (map, raw, index) {
            const delimiterIdx = raw.indexOf('=');

            if (delimiterIdx !== -1) {
                const key = normalizeAttributeKey(raw.slice(0, delimiterIdx));
                const value = unquote(raw.slice(delimiterIdx + 1));
...

Should be where the planets align together.
Now, I could suggest another approach to prevent parsing attributes twice. In fact, the regex oddly catches relevant groups into one single part, then it is parsed again using indexOf('='), right ?
Couldn't we leverage regex groups to naturally handle attributes key/value split for us ? something like:

/([-A-Z0-9_:]+)\s*=\s*(?:"([^"]+)"|'([^']+)'|\{([^{]+)\})/gi

where 4 groups can be extracted: attribute key, double quoted value, single quoted value and curly brackets enclosed value. Then unquote would not even be needed.

What do you think ?

Jonarod · 2018-03-10T11:27:27Z

Submitted PR in a very lightweight fashion (by just trimming down spaces before parsing) + corresponding tests :)

quantizor · 2018-03-12T02:33:09Z

Released as 6.5.1, thanks for your contribution!

Jonarod mentioned this issue Mar 10, 2018

Support spaces between equal sign for html attributes #162

Merged

quantizor closed this as completed Mar 12, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Whitespace prevents random component to be interpreted #161

Whitespace prevents random component to be interpreted #161

Jonarod commented Mar 10, 2018

Jonarod commented Mar 10, 2018

Jonarod commented Mar 10, 2018

Jonarod commented Mar 10, 2018

quantizor commented Mar 12, 2018

Whitespace prevents random component to be interpreted #161

Whitespace prevents random component to be interpreted #161

Comments

Jonarod commented Mar 10, 2018

Input 1 (equal sign with spaces):

Input 2 (equal sign withOUT spaces):

Jonarod commented Mar 10, 2018

Jonarod commented Mar 10, 2018

Jonarod commented Mar 10, 2018

quantizor commented Mar 12, 2018