Render unescaped HTML #2

AndreasHeintze · 2015-09-29T10:15:49Z

In the docs it says we can define a raw tag to render unescaped HTML.
Well that works, but it's not very clean.

<raw content="{ json(aListData.data).toString() }"></raw>

I would like Riot to support an alternative way of displaying unescaped HTML, why not like this:

{{ html }}

{{ json(aListData.data).toString() }}

If it's not that hard to solve and if it doesn't increase the code size too much...

The text was updated successfully, but these errors were encountered:

GianlucaGuarini · 2015-09-29T12:05:12Z

I would just use something like:

{! myHtml !}

Where the ! flag enables the unescaped html rendering using any kind of template delimiters

tipiirai · 2015-09-29T13:39:17Z

How about double {{ brackets }} ? Would be analogous to Moustache and Handlebars with triple brackets.

GianlucaGuarini · 2015-09-29T13:46:44Z

@tipiirai it will be a problem for the users using other template delimiters. What should use someone using riot.settings.brackets = '${ }'?

sylvainpolletvillard · 2015-10-01T21:24:38Z

Couldn't we just improve the <raw> tag to do so : <raw>{html}</raw> ?

tipiirai · 2015-10-05T05:28:13Z

Raw tag is actually harder to implement in above way since there is no (easy) way to access the HTML. I prefer a custom delimiters for the job. Not sure why checking ! character is easier than { character when using custom template delimiters.

I think @aMarCruz is best to know.

aMarCruz · 2015-10-10T15:59:46Z

Hi @tipiirai , you are right, and checking for ! is more convenient. The popular bracket { is "in use" by function bodies, literal objects, regexes, es6 classes, methods, and template strings, je... and of course by riot.

In the new version of tmpl (I'm testing with riot right now) there a little hack that allows compile with --type es6 --expr and shorthands: a caret. In expressions such as {^ foo: 1 }, the caret is removed and the expression is not passed to babel.
I think something like that can be used for raw html with quoted attribute values, or .tag files. But again, custom tags embedded in the markup (* .html files) containing characters not allowed by the html specs for text elements can be altered by browsers, and <> are in these set. The only "secure" from, to my knowledge, is to use a CDATA block.

aMarCruz · 2015-11-16T05:28:38Z

Taking up this issue and thinking a bit more...
The change to insert HTML occurs at mount time (we can not use innerHTML, right?) and therefore the compiler needs to know that an expression is raw text, and it then tell to riot.tag2 about this.

For simplicity the compiler needs only one character starting the expression, as in the {^ } hack in my previous post this needs to be an invalid starting JS character, to avoid confusing the parser... e.g. It can not be ! because {! foo } is a valid expression. Goods chars are =, #, $, @, etc. no problem here.

The real question is how to tell to riot.tag2 (and to the mount function) an expression is raw text? preserving this first non-valid char?

This is a problem that I found with the precompiled expressions, in order to support mixed modes. How to know when the expression was already compiled? expressions has no meta-data. Because this, we are using the riot- and __ prefixes and other tricks.

...maybe I'm wrong and this is not a problem. Thoughts?

aMarCruz · 2015-11-16T05:41:37Z

Thinking more...
the compiler needs to take care of the first character to temporarily remove it if a parser runs in the expression, nothing more. In the tag construction, riot removes the character and inserts the text as html.
Seems easy...

tipiirai · 2015-11-18T08:46:12Z

My preference would still be {{ str }}. The next best is {< str }, because it feels like outputting HTML (starts with "<") on UNIX shell (redirection operator).

I like {= str } also, which is pretty standard in templating, but usually it means escaped output.

sylvainpolletvillard · 2015-11-18T14:15:43Z

Double curly braces are used by default by very popular libraries like Angular, Handlebars (Ember), Mustache... Personnally I often put double curly braces by mistake inside my riot tags, because of the mustache background I come from.

I am concerned that double curly braces could now work, but without the security provided by HTML escaping. Many users could use it by mistake and have tags that seem to work perfectly, but are vulnerable to XSS attacks.

Inserting raw HTML content is an uncommon need, we do not need the syntax to be beautiful or handy. It must be explicit and warns the user about the risks. I vote for an explicit tag or for @GianlucaGuarini proposal with exclamation marks {! myHtml !}

aMarCruz · 2015-11-18T19:18:13Z

Why we need new brackets? the brackets is not the problem.
With a single character at the beginning of the expression both the compiler and tmpl know that this is html.
Inject HTML in the DOM can be done easily, too.

The real problem is riot reads expressions from the DOM. This is the cause of many many issues.
Sorry, I have made several posts about this but my english is poor. So again, think about

  <p>{ "hello<br>world" }</p>

the compiler generates code for call riot.tag2 with the html param (a JS string) as '{"Hello world"}'
riot.compile runs the generated code through the browser and call its callback (riot.mount).
riot.mount instantiate a Tag (in mountTo). The Tag ctor calls mkdom.
mkdom create a root html element (mostly a div) and sets the innerHTML to the JS string.
Next, tag.mount is called. Inside this, parseExpressions is called.
Remember div.innerHTML contains '{"Hello world"}' and all is ok, yes? ...err no.

parseExpressions start walking the div through the firstChild/nextSibling html properties ...now I'm sure you see now the problems to come...
fistChild of  is a text node, but there are three:
1. {"Hello (#text)
2.   (HTMLBRElement)
3. world"} (#text)
browser's html parser (correctly) interpreted its content, which breaks the tmpl function.

Another things like { "foo" } does not breaks tmpl, but do not generate the expected result. Raw html can not be inserted by expressions with the current implementation.

We need not depend on the browser for reading the expressions source. I think this is inneficient anyway. In my opinion, we have 2 choises:
a) Go one-way: Evaluate the expression from the vdom, and test vs the real DOM with the results.
b) Use of precompiled expressions. This requires changes in all the code.

With both solutions, we lose the ability to set/modify expressions directly into the DOM.

EDIT: This also explains why I am denying the use of < > in custom brackets.

aMarCruz · 2015-11-18T19:36:57Z

...mmm thinking more...
perhaps another solution is that the compiler replaces the expression with a call to a precompiled special function that bypasses the text throgh the html parser. I will after some testing with this.

GianlucaGuarini · 2015-11-18T20:16:22Z

@aMarCruz I understand the issue but I think we can just use a simpler solution without changing to much the riot source code: we can simply parse the expressions tagging the ones using a special template delimiter {! Hello there !} ( so they should no be escaped from the compiler ). Then in the update method we simply print the unescaped html string in the expression as el.innerHTML. I would like to not overcomplicate the things and the source code.

aMarCruz · 2015-11-18T21:10:45Z

The compiler does not escape anything. Unless I'm missing something, conversion from " " into ~~"\n"~~ the HTMLBRElement happens in the browser after mkdom injects the correct html string. This happens with other html elements, too. I find no way to avoid this (in the browser).

aMarCruz · 2015-11-18T21:17:18Z

...or you mean the compiler must create a special, precompiled function for hidding the desired string from the browser's html parser?

aMarCruz · 2015-11-18T21:43:56Z

e.g. assume {@ Hi }.
the compiler see '@' as first char inside the expression (any brackets it have), and replaces this with a precompiled expression id, to register the function with its id in the tmpl cache:

riot.tag2(`tag`, `<p>{#0001}</p>`, '', '', function (opts) {
}, { "#0001": function(){ return "Hi<br>" } });

...now, riot.tag2 call to an inner tmpl function (e.g. registerFn), to insert these fn into the cache:

function registerFn(obj) {
    extend(cache, obj)
}

mkdom set div.innerHTML to {#0001}. The html parser sees this as text. There is no conversion.

Later, update see the format and knows this is raw html, anyway calls tmpl with "{#0001}". tmpl detect the format and call the #0001 function in the cache. Finally, update injects the html.

I'm ok?

GianlucaGuarini · 2015-11-19T07:35:40Z

@aMarCruz I guess this are the steps to render unescaped strings in a tag element.
Assuming we use the string {! 'Hi there' !}:

the riot-compiler should leave untouched the string (maybe it needs to escape the quotes)
the riot-tmpl should contain a function isRaw to detect expressions starting and ending with !
- we can make the ! an option like riot.settings.rawDelimiter
the riot-tmpl should return only Hi there
in the update method in riot we just check whether isRaw on an expression is true and in that case we will print the string as innerHTML

At moment I would not invest too much time working on this feature (we have too many things to do), we can plan it for riot 2.4.0

GianlucaGuarini · 2015-12-01T07:14:56Z

@aMarCruz can you have a look at this feature please? I would like to include it in the next riot release. Let me know if I should work on it as well

aMarCruz · 2015-12-01T08:03:59Z

@GianlucaGuarini , sure.

aMarCruz · 2015-12-02T01:07:13Z

Ok, This jsbin have 2 samples, one from riot#744, the other from the @GianlucaGuarini sample above, but using ~~expressions~~ properties for  and color.
The trick to avoid the parseExpressions() issues was simple and obvious: encode the <> chars.

I'm using = as flag (e.g. {= '<html>' }) because the '!' is a valid JS starting char (e.g. {!foo}), so I need one additional test for the closing sequence !}.
< is good too, but I don't like {< ' ' } or {< x > 0 ? '<hr>' : '' }.
For simplicity and to not rely on riot.settings, I would prefer a non-customizable flag.

@GianlucaGuarini ,
The compiler temporarily removes the = for any external parsers -- babel, etc. Double quotes do not need to be encoded, in fact, this cause issues to fix in this version.
You can use tmpl.isRaw. The only change in tmpl() is to remove =, from there, the source is treated as a normal expression.

The code in this block (only for test) inject the elements in the parent node 'cause we can't use innerHTML here (all the expressions live in #text nodes) and I don't know if it breaks parent-child relations or generates sync issues. So please write this part and test, I will push the PR for the other modules when you are ready.

GianlucaGuarini · 2015-12-03T07:52:25Z

thanks @aMarCruz I cant wait to test it :)

GianlucaGuarini · 2016-04-09T14:53:01Z

I thought about this feature and I would like that riot-tmpl could be smart enough to handle this feature.
For expressions like:

{! raw } Hello { user }

I would like that riot-tmpl will produce:

// tmpl(string, data, mustEscape)
var str = '{! raw } Hello { user }',
  data = { raw: '<b>Cool</b>', user: '<i>Foo<i>' }
tmpl(str, data, true) // escape only the output of expressions not prefixed with `!`
// => <b>Cool</b> Hello &lt;i&gt;Foo&lt;i&gt;
tmpl(str, data, false || undefined) // behave as the current riot-tmpl
// => <b>Cool</b> Hello <i>Foo<i>

@aMarCruz let me know if you need help on this please. This will be part of a riot major release 3.0.0, so this means we can start tagging it riot-tmpl@3.0.0-alpha

aMarCruz · 2017-06-03T02:35:50Z

This issue will be solved in the near future.

fabien · 2017-06-24T09:46:28Z

@GianlucaGuarini @aMarCruz I'm using riot.util.tmpl throughout my application whenever I need to evaluate an expression, this works nicely. However, by default the results will be unescaped. This is different from its use within Riot itself, which uses the DOM methods to stay safe.

For now, I solved this with a custom method:

riot.evalTmpl = function(expression, data, mustEscape) {
    expression = String(expression || '').replace(/(\{\s*\!?)\s*([^}]*)\s*\}/g, function(m, prefix, expr) {
        if (prefix.indexOf('!') > -1 || !mustEscape) {
            return '{ ' + expr.trim() + ' }';
        } else {
            return '{ _.escape(' + expr.trim() + ') }';
        }
    });
    return riot.util.tmpl(expression, data);
};

What do you think if my approach for the time being? It's similar to your proposal, but of course it's more of a syntactic-sugar preprocessor trick right now.

GianlucaGuarini · 2017-06-24T16:09:18Z

@fabien we will completely rewrite riot-tmpl in riot@4 so at moment I wouldn't like to add new features to the current implementation, thanks for sharing your code

tipiirai mentioned this issue Oct 13, 2015

Render unescaped HTML riot/riot#1237

Closed

GianlucaGuarini mentioned this issue Nov 8, 2015

Riot next - roadmap to riot 2.4.0 riot/riot#1322

Closed

9 tasks

aMarCruz mentioned this issue Nov 18, 2015

Cannot use Riot in Chrome App due to Content Security Policy riot/riot#1076

Closed

aMarCruz mentioned this issue Dec 2, 2015

expressions with html not evaluated riot/riot#744

Closed

GianlucaGuarini mentioned this issue Apr 9, 2016

isRaw is returning false if the expression contains other text content before the template delimiters #13

Closed

aMarCruz self-assigned this May 12, 2016

aMarCruz closed this as completed Jun 3, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Render unescaped HTML #2

Render unescaped HTML #2

AndreasHeintze commented Sep 29, 2015

GianlucaGuarini commented Sep 29, 2015

tipiirai commented Sep 29, 2015

GianlucaGuarini commented Sep 29, 2015

sylvainpolletvillard commented Oct 1, 2015

tipiirai commented Oct 5, 2015

aMarCruz commented Oct 10, 2015

aMarCruz commented Nov 16, 2015

aMarCruz commented Nov 16, 2015

tipiirai commented Nov 18, 2015

sylvainpolletvillard commented Nov 18, 2015

aMarCruz commented Nov 18, 2015

aMarCruz commented Nov 18, 2015

GianlucaGuarini commented Nov 18, 2015

aMarCruz commented Nov 18, 2015

aMarCruz commented Nov 18, 2015

aMarCruz commented Nov 18, 2015

GianlucaGuarini commented Nov 19, 2015

GianlucaGuarini commented Dec 1, 2015

aMarCruz commented Dec 1, 2015

aMarCruz commented Dec 2, 2015

GianlucaGuarini commented Dec 3, 2015

GianlucaGuarini commented Apr 9, 2016

aMarCruz commented Jun 3, 2017

fabien commented Jun 24, 2017

GianlucaGuarini commented Jun 24, 2017

Render unescaped HTML #2

Render unescaped HTML #2

Comments

AndreasHeintze commented Sep 29, 2015

GianlucaGuarini commented Sep 29, 2015

tipiirai commented Sep 29, 2015

GianlucaGuarini commented Sep 29, 2015

sylvainpolletvillard commented Oct 1, 2015

tipiirai commented Oct 5, 2015

aMarCruz commented Oct 10, 2015

aMarCruz commented Nov 16, 2015

aMarCruz commented Nov 16, 2015

tipiirai commented Nov 18, 2015

sylvainpolletvillard commented Nov 18, 2015

aMarCruz commented Nov 18, 2015

aMarCruz commented Nov 18, 2015

GianlucaGuarini commented Nov 18, 2015

aMarCruz commented Nov 18, 2015

aMarCruz commented Nov 18, 2015

aMarCruz commented Nov 18, 2015

GianlucaGuarini commented Nov 19, 2015

GianlucaGuarini commented Dec 1, 2015

aMarCruz commented Dec 1, 2015

aMarCruz commented Dec 2, 2015

GianlucaGuarini commented Dec 3, 2015

GianlucaGuarini commented Apr 9, 2016

aMarCruz commented Jun 3, 2017

fabien commented Jun 24, 2017

GianlucaGuarini commented Jun 24, 2017