Translate markdown with angle brackets to HTML (fixes #476) #497

jcads · 2021-04-02T14:24:03Z

https://deploy-preview-497--glean-dictionary-dev.netlify.app/apps/fenix/metrics/browser_search_with_ads

Fixes #476.

Pull Request checklist

The pull request has a descriptive title (and a reference to an issue it fixes, if applicable)
All tests and linter checks are passing
The pull request is free of merge conflicts

Iinh

Hi @jcads, thank you for working on this issue. This solution looks like a great start; I have some input below.

I believe the cause of this bug is that < and > are reserved characters in HTML, so the browser mistook <provider-name> for an HTML tag. To fix this, we could use character entities to display these special characters instead.

For example, if we replace the special characters with their equivalent HTML entities, such as:

htmlText = text.replace("<", "&lt;");

and then pass it to parse(htmlText), the browser would then be able to read "<".

Wondering what @wlach thinks about this approach. If this seems reasonable, should we also take care of the other edge cases, such as &?
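A minimal sketch of the entity approach described above, under stated assumptions: `escapeTagLike` and the sample description are hypothetical and not taken from the PR. Because `String.prototype.replace` with a string pattern only replaces the first match, the sketch uses a global regex, and it escapes only tag-like sequences so that other uses of `<` and `>` (for example markdown blockquotes) are left alone.

```javascript
// Hypothetical helper (not from the PR): escape only tag-like sequences such
// as <provider-name> or </b>, leaving other uses of "<" and ">" untouched.
function escapeTagLike(text) {
  return text.replace(/<(\/?[A-Za-z][\w-]*)>/g, "&lt;$1&gt;");
}

// Made-up metric description containing a placeholder in angle brackets.
const description = "Records searches made with <provider-name> that showed ads.";

// The escaped string is what would then be handed to parse()/parseInline().
console.log(escapeTagLike(description));
```

One caveat with escaping the whole string up front: text inside backticks is escaped again by the markdown renderer, so placeholders inside code spans may end up double-escaped.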

Iinh · 2021-04-02T17:50:24Z

src/components/Markdown.svelte

+{#if text.includes('<')}
+  {#if inline}
+    {text}
+  {:else}
+    {#each lines as text}
+      <p style="margin: 0">{text}</p>
+    {/each}
+  {/if}


We should always use parse/parseInline to render text in this component as the goal is to display markdown content.

Got it @Iinh. Thank you.

wlach

Hi @jcads -- thanks for the PR. I think this solution is not quite right. As @Iinh said, we want to continue using parse/parseInline to render the text.

What's going on here is the markdown renderer is creating <provider-name> as an HTML element. We can confirm this by looking at the developer console:

I think a shorter/cleaner solution would be to enhance the renderer object to escape these objects.

https://marked.js.org/using_pro#renderer

I think you might need to add a renderer for tag types, but I'm not 100% sure. Give it a try and see!

wlach · 2021-04-02T20:13:16Z

Thanks @Iinh! This is actually a better approach than what I suggested. I think the markdown parser will automatically handle characters like & ok -- it's just HTML-like tags that confuse it. @jcads do you want to give her approach a try?

wlach

Looks great, thanks for persisting on this @jcads. Let me know if / when you're interested in working on something else.

PR #497 fixed some cases, but not all. Angle brackets in code blocks have their own escaping behaviour, which doesn't work well with the ad hoc escaping we're doing. Strictly speaking, a string like <provider> is considered HTML in markdown. However, this *probably* isn't how we want to interpret it in the Glean Dictionary. We'll address this by modifying the renderer's behaviour when it encounters an HTML tag, rather than doing a naive regex replace against the whole markdown string.
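A rough sketch of what a renderer-level fix could look like, not the actual change made in #549. The `renderer` option and `marked.use` are part of marked's documented API; `escapeHtml` is a hypothetical helper, and the handling of both a string and a token argument is an assumption made to cover the differing `html()` signatures across marked releases.

```javascript
// Hypothetical helper: escape the characters that would otherwise let the
// browser treat the text as markup.
function escapeHtml(raw) {
  return raw
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;");
}

// Renderer override in the shape marked's `renderer` option accepts.
// Assumption: older marked releases call html() with a string, newer ones
// with a token object carrying the raw markup in `text`.
const renderer = {
  html(htmlOrToken) {
    const raw = typeof htmlOrToken === "string" ? htmlOrToken : htmlOrToken.text;
    return escapeHtml(raw);
  },
};

// With marked installed, this would be registered roughly as:
//   marked.use({ renderer });
console.log(renderer.html("<provider-name>"));
```

Since only tokens the parser has already classified as HTML reach `html()`, text inside backticks is not affected by this override and keeps the renderer's own escaping.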

This is pedantic, but strictly speaking, something called <provider-name> is considered an HTML tag unless it's in a code block (backticks). See mozilla/glean-dictionary#549 and mozilla/glean-dictionary#497. I'm going to fix this upstream but figured I might as well file a PR here to fix the underlying issue.

fix: Angle brackets not translated to html correctly

c41d41a

Iinh self-requested a review April 2, 2021 17:48

Iinh reviewed Apr 2, 2021

Iinh requested a review from wlach April 2, 2021 17:57

wlach suggested changes Apr 2, 2021

wlach changed the title ~~fix: Angle brackets not translated to html correctly~~ Translate markdown with angle brackets to HTML (fixes #476) Apr 2, 2021

Add tests

f774332

jcads force-pushed the markdown branch from da1db39 to f774332 Compare April 3, 2021 08:12

Fix tests

103a0ea

jcads requested review from Iinh and wlach April 3, 2021 08:27

wlach approved these changes Apr 3, 2021

wlach merged commit 09af671 into mozilla:main Apr 3, 2021

wlach added a commit that referenced this pull request Apr 5, 2021

Make variable declaration in markdown component reactive

c6814fa

This fixes pagination when navigating through long lists of metrics (the descriptions weren't being updated after #497)

wlach mentioned this pull request Apr 5, 2021

Make variable declaration in markdown component reactive #509

Merged

3 tasks

wlach added a commit that referenced this pull request Apr 5, 2021

Make variable declaration in markdown component reactive (#509)

4b2fda7

This fixes pagination when navigating through long lists of metrics (the descriptions weren't being updated after #497)

wlach mentioned this pull request Apr 26, 2021

Fix angle brackets (hopefully for the last time) #549

Merged

3 tasks

wlach mentioned this pull request Apr 26, 2021

Properly quote descriptions in ad metric definitions mozilla-mobile/fenix#19243

Merged

3 tasks

gabrielluong mentioned this pull request Apr 26, 2021

CI for https://github.com/mozilla-mobile/fenix/pull/19243 mozilla-mobile/fenix#19253

Closed

3 tasks
