Adopting a text-based diagram syntax in Jupyter Markdown #101

bollwyvl · 2023-03-14T23:43:57Z

Hello folks! This JEP proposes adopting this syntax:

```mermaid
flowchart LR
  chicken --> egg --> chicken
```

Which renders all over as:

flowchart LR
    chicken --> egg --> chicken

It was written entirely within a checkout of this PR, which you can try out directly on .

Thanks!

Resolve #100

Voting from @jupyter/software-steering-council

@echarles
- Yes
- No
- Abstain
@fcollonval
- Yes
- No
- Abstain
@ibdafna
- Yes
- No
- Abstain
@isabela-pf
- Yes
- No
- Abstain
@ivanov
- Yes
- No
- Abstain
@JohanMabille
- Yes
- No
- Abstain
@minrk
- Yes
- No
- Abstain
@rpwagner
- Yes
- No
- Abstain
@SylvainCorlay
- Yes
- No
- Abstain
@willingc
- Yes
- No
- Abstain
@Zsailer
- Yes
- No
- Abstain

bollwyvl · 2023-03-22T16:16:32Z

Hi folks! It's been a week, and we've collected a few emoji ❤️ .

it was my hope to do the work to get this into the JupyterLab 4/Notebook 7 release window, which is rapidly closing.

Any feedback welcome!

manics · 2023-03-22T16:41:02Z

What are the expectations on Jupyter clients that comply with this addition? For example, are they expected to:

use mermaidjs 10 (or version X) for rendering all mermaid diagrams
display all mermaid diagrams "in the same way" as mermaid 10 (or version X) but using their own code if they want?
interpret mermaid diagrams in any way they wish?
display only the non-experimental diagram types, e.g. C4 Diagrams has the warning:

C4 Diagram: This is an experimental diagram for now. The syntax and properties can change in future releases. Proper documentation will be provided when the syntax is stable.

bollwyvl · 2023-03-22T17:07:37Z

interpret mermaid diagrams in any way they wish?

As a "living" dependency with maintenance costs, I think this has to be the assumption: as noted, changes can happen in minor versions, and 9.x is still getting releases, which speak well the the upstream maintainer attention to detail.

That being said, the flowchart diagrams originally authored from very early in mermaid's release history still represent fairly well in 10.x. And everyone changes grammars now and again, or even the meaning of existing keywords, if a better algorithm comes along (e.g. in graphviz prior to 2.28 overlap=false meant voronoi, but now means prism, but would need voronoi to draw the same way).

Perhaps if rendering fails, the right play is to fall back to the the raw text, which could convey some meaning... ideally with grammar-aware syntax highlighting, but then that can "fail".

only the non-experimental diagram types

Perhaps. But I think throwing the kitchen sink, including the new, sharp knives, isn't the worst play. The one that kills me is the ElkJS backend for flowchart: it's super heavy on the wire and behind a feature flag, but super powerful. And mindmap is the sole user of cytoscape.js (I think).

bollwyvl · 2023-03-22T23:01:25Z

@manics thinking more about it: i've also updated the JupyterLab PR with a treatment of how an "unparseable" might be treated... moving it down to the "warning" class of feedback seems reasonable, as again, you can still kinda tell what is going on in the diagram by reading the text.

as-rendered	click on the diagram text/arrow

bollwyvl · 2023-03-22T23:35:43Z

xx-text-based-diagrams-in-markdown/text-based-diagrams.md

+and the full `d3` metapackage. Some advanced features use the `cytoscape` and `elkjs`
+_rendering engines_, but are only loaded when needed.
+
+## Unparseable Diagrams


@manics here's me expounding a bit more

And for reference, here's how GitHub handles unparseables:

flowchart LR a --> b -->

Note that it:

puts the (undismissable) error up front with (unhighlighted) source below

does not provide the (semi-) helpful grammar message

allows for copying (but not viewing) the raw source

bollwyvl · 2023-03-23T13:53:24Z

xx-text-based-diagrams-in-markdown/text-based-diagrams.md

+
+...and they should _mostly_ render the same way everywhere.
+
+_MermaidJS 10_ also includes a number of new diagrams, not available on earlier major


@manics i've split out the diagrams and features not available in <10

psychemedia · 2023-04-25T14:28:36Z

I note the proposed syntax diverges from the syntax used by MyST, which is the flavour of md preferred for writing Jupyter Book documents.

For example, via https://myst-tools.org/docs/mystjs/diagrams :

It is possible to add mermaid diagrams using the {mermaid} directive, for example:

```{mermaid}
flowchart LR
 A[Jupyter Notebook] --> C
 B[MyST Markdown] --> C
 C(mystjs) --> D{AST}
 D <--> E[LaTeX]
 E --> F[PDF]
 D --> G[Word]
 D --> H[React]
 D --> I[HTML]
 D <--> J[JATS]
```

The jupyterlab-myst extension supports stylised rendering of several MyST admonition types, including mermaid:

There are also various flavours of IPython magic that support text2diagram rendering (example).

westurner · 2023-04-25T23:49:36Z

GitHub has just mermaid (instead of {mermaid}) what is the jupyterbook syntax and does JupyterBook also support what just works on GitHub:
https://github.blog/2022-02-14-include-diagrams-markdown-files-mermaid/ :

```mermaid
  graph TD;
      A-->B;
      A-->C;
      B-->D;
      C-->D;
```

psychemedia · 2023-04-26T09:15:50Z

@westurner The MyST parser renders the ```{mermaid} initiated block correctly, but not ```mermaid. I note also that Quarto uses the ```{mermaid} syntax [docs].

In general, I get the feeling {} route has certain advantages, not least because it allows passage of additional contextual information in the top line of the block (for example, ```{admonition} Title, ```{figure} IMAGE-URL etc.

From a "content creator" end user perspective, writing content, the {} approach appears to have more traction. From a developer perspective, the Github flavoured route may be more attractive.

psychemedia · 2023-04-26T10:12:49Z

In MyST, the convention is ```CODE-LANGUAGE for executable code blocks (docs) whereas in Quarto/qmd the curly-bracketed syntax is used for executable content too (```{python}, ```{r}) [docs].

In the .ipynb context, Jupyter markdown is distinguished separately from code within the "markdown" vs. "code" keyed JSON object. But as Jupytext shows, it is quite possibly to use extended markdown or extended Python formats to represent notebook content that excludes cell outputs.

krassowski · 2023-04-26T21:36:34Z

I believe that the discussion on the use of braces or not is orthogonal to this JEP. Jupyter (Notebok, Lab, etc) currently de facto uses GtiHub-flavoured markdown and this JEP follows that; there is a separate JEP about allowing other Markdown flavours: #99 and if both will be accepted everyone will be happy. But should only this one be accepted, I think we should not introduce discrepancies by changing to braced format for only one singular case.

rgbkrk · 2023-05-03T05:21:17Z

👍 to supporting whatever GitHub flavored markdown supports. Too bad they didn't include mermaid in their spec.

westurner · 2023-05-04T02:24:19Z

@westurner The MyST parser renders the ```{mermaid} initiated block correctly, but not ```mermaid. I note also that Quarto uses the ```{mermaid} syntax [docs].

In general, I get the feeling {} route has certain advantages, not least because it allows passage of additional contextual information in the top line of the block (for example, ```{admonition} Title, ```{figure} IMAGE-URL etc.

From a "content creator" end user perspective, writing content, the {} approach appears to have more traction. From a developer perspective, the Github flavoured route may be more attractive.

FWIW YAML (and YAML-LD with an implicit @context) probably has a better-defined one-line grammar than the existing MyST Markdown parser and JupyterBook.

westurner · 2023-05-04T02:26:38Z

Value of YAML-LD in MyST in: code-fence attr syntax:
- We could add schema.org JSON-LD metadata to inlined code blocks, datasets etc

ivanov · 2023-06-26T22:44:38Z

I want to start off by conceding that having a text-to-graphic diagramming standard within the notebook would be useful, and acknowledge the significant amount of effort and thoroughness that's been put into preparing this JEP.

One of the challenges I see with this proposal is that an external tool is being used without reference to the version of the tool that is part of the document containing the diagrams.

To make this discussion more concrete: Mermaid JS adds a new chart types in minor versions, 10.2.0 introduced quadrant chart, for example. A new user could start using these quadrant charts, or other Mermaid JS features which are not in the version of Mermaid JS that shipped with whatever Jupyter tool they are using, and be frustrated when it doesn't work. Similarly, two users sharing a notebook document but using different versions of the same Jupyter tool which happen to bundle different versions of Mermaid could end up with a proper render of a diagram for one of them, and an un-rendered/error diagram in another. This doesn't have to be different users, the same situation can arise from one user using a combination of Jupyter tools (a JupyterLab version that has adopted the latest Mermaid JS version, and an nbconvert version that is using an older version).

I hope you can understand how an unspecified and possibly changing dependency can also lead to a complement problem, where diagrams that used to render stop rendering as features in MermaidJS are deprecated and removed, and the Mermaid JS version changes.

One of the differences I see between this and an analog in code cells caused by syntax changes in programming code languages is that at least the previous rendered results are available when receiving a notebook document, even if the recipient lacks a kernel matching programming language version.

Finally, it seems like this introduces a javascript runtime environment requirement for converting a notebook to a PDF document. Notebooks using mermaid syntax will have to either be rendered via webpdf or would have to pass though a Mermaid filter for pandoc (I found a couple, they use the Mermaid CLI which requires nodejs: https://github.com/raghur/mermaid-filter and https://github.com/timofurrer/pandoc-mermaid-filter). I do not believe we've had such a requirement in the past.

For these reasons, I am voting against this proposal. I would prefer an alternate path where we come up with a way to try out extensions to markdown like this without committing the project to a particular one. That way, even if JupyterLab and Notebook 7+ ship supporting mermaid syntax, others are also possible, including the ones mentioned in the JEP, or still others, like Wavedrom, or ones that haven't been made yet. One way to think about this JEP is that it adds a secondary "kernel" to documents, and further blurs the operational distinction between code and markdown cells, as markdown cells now contain "code" (in the form of diagram syntax) and that new markdown "code" relies on a particular "kernel" (version of mermaidJS). But I can also see how this could be a staring point for further refinement later, the way we used to have words that were specialized for Python (what started as "pyin" and "pyout" instead of the current "execution_count" and "execution_result").

Should this proposal pass as is, I would like to encourage further work to address cross-compatibility, such as capturing the Mermaid JS version used during authoring into the ipynb document, so other tools have a chance of resolving the ambiguity. To me, the alternative of capturing the MermaidJS version within the notebook document format seems like it would lead to too many minor version updates just for this one component.

Another proposal to address some of these concerns would be to add a rendered version of the diagrams (SVGs) as attachments to the markdown cell, with some convention for how to unambiguously use a pre-rendered version in cases where rendering is unavailable (no javascript) or fails (MermaidJS version mismatch).

echarles · 2023-06-27T07:00:35Z

Based on @ivanov comment on #101 (comment) and on question shared yesterday at the SSC meeting by @ibdafna I am converting my Yes to an Abstain.

I am also formalising here some feedbacks/questions I have asked yesterday during yesterday meeting:

This JEP is very detailed and logical which is great, my doubt is rather on the foundation on which it applies. We don't have a well defined Markdown definition across all Jupyter projects as far as I understand, so building on sand will not drive us far.
MyST is an option to have that rock-solid common definition. It sounds that embracing MyST is a much broader scope than this JEP, however I would prefer having that discussion to take informed decision on this JEP. Based on the recent experience on the JEP, I don't except a potential discussion around MyST to happen later that sooner, so please carry-on without that parameter.

echarles · 2023-06-27T14:45:53Z

How this PR relates/articulates/coordinates with #99 and #103?

bollwyvl · 2023-06-27T23:35:22Z

#103

Presumably it is orthogonal, as this proposed change only adopts existing, widely-adopted syntax with relatively sane failure modes. Put differently, if something in 103 breaks due to this PR, it will probably break other de facto GFM+ features employed on many other platforms.

#99

Welp, it would have to pile on a significant amount more metadata to be precise.

To dovetail with one of @ivanov's key points: let's not be overly-harsh on little old mermaid: we have a bunch of non-portable things done in markdown (and other Jupyter-adopted %languages) that result in pixel-level, or worse, differences: MathJax2, MathJax3, MathJax4, KaTeX, different math fonts, localized typefaces, and they are all extendable at pretty much all levels. If we think mermaid is heavy, wait until every tool would have to ship the matrix of all of the above!

the following is getting totally off topic, now, please ignore for the purposes of reviewing this proposal...

Frankly, I'm of a mind that the current model of many best effort tools (e.g. markdown, widgets, etc) really don't provide the level of reproducibility we yearn for. The above-mentioned "typesetting kernel," (also discussed in a number of other places) is indeed likely the high road.

I think the path there would pretty much abolish cell_type, making source a mimebundle-enabled "rich input" so that "A The Markdown" cell would just be "a self-contained cell my fancy whizzywhig client knows how to render into text/html" such that a downstream viewer might only have to view it as a completely standards-compliant, isolated document fragment. Capture the whole, standards-based description of the used (not installed) renderers. But then extend this thinking to the backend: kernels could tell us what syntax deviations they support, and almost certainly could turn text/x-ipython into text/python. And provide a description of their environments, with tools such as SPDX.

I'd go on to abolish cell_id, hoisting that content to the keys of a cells map, which manages ordering in some other way (linked list, "weight", whatever). Then all cells (and by extension all cell components, pretty much down to the line-of-code level) would be concretely addressable with a standards-compliant URL patterns. This would then allow transcluding them, ideally via some built in, accessibility-first technique, such as . And, as a side-effect: real annotation and commenting.

But these are topics for a whole other day, the discussion around which would be... greatly facilitated by having a portable diagram syntax! ;P

rock-solid

Mermaid has been running like a champ for just a hair shy of a decade, and has a (number of) formal, closed-form grammars, but indeed, just a single implementation. I'd just like to be able to use it in Jupyter tools. This proposal tries to make as few uninformed claims about other syntaxes/tools/features... just the diagram ones that have proven stable enough for me to spend my time (trying) to implement against, and my findings in using them.

fcollonval · 2023-07-11T09:00:16Z

Following the discussion at the SSC meeting yesterday, the mermaid JS website lists the available integrations. In addition to Git webservice (such as GitHub), online editing platforms are also integrating it, like Notion, Observable, HackMD.

bollwyvl · 2023-07-11T12:48:32Z

online editing platforms are also integrating it

Yep, this is the called out in the summary as one of the main motivators of why this proposal concentrates on just one diagram format with just one markdown syntax. Even if a platform doesn't offer this syntax natively, open source platforms Jupyter already uses (such as Discourse) can enable it via first-party plugin.

That Discourse plugin offers one "metadata" extension to the syntax (~~~mermaid height=500) which is actually super useful for page layout to avoid heavy visual page reflows, but probably isn't widely-enough implemented to commit to supporting everywhere.

rpwagner · 2023-07-11T17:16:41Z

open source platforms Jupyter already uses (such as Discourse) can enable it via first-party plugin.

I think the distinction between hosted services, such as GitHub or Discourse, and standalone applications that need to support the exchange of documents in an agreed upon format (e.g., Jupyter notebooks) is important. Looking at the list of integrations I see that code editors exclusively use plugins to enable Mermaid.js support, and that nearly all the documentation generations tools do, as well. This suggests that the current practice for standalone applications is to recommend or maintain a well-supported plugin.

Zsailer · 2023-07-31T17:12:32Z

Hey folks, I think we should pause voting here temporarily—

I think the voting pattern on this JEP presents an interesting problem we haven't faced before; we should use this opportunity to learn/define how to handle this type of situation going forward.

While this JEP technically has more "Yes" votes (4) than "No" votes (1) at this point, there is an equal number of abstentions (4). It would make me extremely uncomfortable to pass this JEP without more buy-in from the SSC.

Further, when I read this thread, the temperature in the room that I'm perceiving is that these abstentions as "soft no"s—folks feel hesitant towards the proposal, but don't want to be the blocker to acceptance. If we are seeing a majority of these types of abstentions, (I believe) this raises a flag.

I believe SSC should work together to ensure that every member has enough time/info to formulate a firm opinion on each JEP (also, the SSC should ensure this happens in a timely manner—we'll get better at this over time). Otherwise, abstentions can become a crutch and allow JEPs to pass without more thorough investigation.

I think this JEP needs more discussion before voting, so I propose we pull it out of voting phase temporarily. I'll mention this in our next SSC meeting.

I want to thank @bollwyvl for proactively engaging every question/concern mentioned here and giving thorough responses. Also, thank you to everyone who has given careful comment/feedback on the JEP so far.

I think this exposed a gap in the SSC that we need to address. I've opened an issue on the SSC team-compass to discuss further, jupyter/software-steering-council-team-compass#4. I'll update this thread on our discussion after next week's meeting.

echarles · 2023-07-31T18:05:10Z

This JEP is very detailed and logical which is great, my doubt is rather on the foundation on which it applies. We don't have a well defined Markdown definition across all Jupyter projects as far as I understand, so building on sand will not drive us far.

In this particular JEP, I have abstained not because of the JEP content itself (and I really want to thank @bollwyvl for the quality work he is doing here, and at a lot of other places), but rather because of the lack of definitions of Markdown outside of this JEP. This is not the first time I have similar feeling where a JEP as such sounds good, and it is easier to say yes to it, instead of looking wider, which in general, sounds synonym to "blocking", or "moving much slower". I don't expect anyone to agree that it is preferable to look wider, instead of having tiny steps even on unstable and unclear things, but I felt maybe useful to share the reasons for my abstention.

See jupyter/enhancement-proposals#101

bollwyvl added 2 commits March 14, 2023 18:26

start with template

68ad31c

Adopting a text-based diagram syntax in Jupyter Markdown

0306b10

bollwyvl force-pushed the add-mermaid-to-markdown branch from 6242983 to 0306b10 Compare March 14, 2023 23:44

bollwyvl mentioned this pull request Mar 15, 2023

Markdown table of contents doesn't handle YAML frontmatter jupyterlab/jupyterlab#14200

Closed

fcollonval mentioned this pull request Mar 16, 2023

Weekly Team Meetings: Jan-Jun 2023 jupyterlab/frontends-team-compass#170

Closed

expand on comments by @manics about unparseable diagrams

c39c35d

bollwyvl commented Mar 22, 2023

View reviewed changes

update with nbconvert progress, split out new and experimental features

9350833

bollwyvl commented Mar 23, 2023

View reviewed changes

krassowski mentioned this pull request Apr 15, 2023

Pre-proposal: Adopting a text-based diagram syntax in Jupyter Markdown #100

Open

bollwyvl mentioned this pull request Apr 27, 2023

Pre-proposal: subdomain for published schemas under jupyter.org #107

Closed

tonyfast mentioned this pull request May 2, 2023

Add JEP for adding $schema to notebook format #97

Merged

33 tasks

Zsailer added the under discussion (RFC) label May 2, 2023

bollwyvl mentioned this pull request May 31, 2023

Update to marked 7.0.2, add mermaidjs 10.3.1 jupyterlab/jupyterlab#14102

Merged

15 tasks

fcollonval mentioned this pull request Jun 5, 2023

SSC meeting minutes jupyter/software-steering-council-team-compass#2

Open

JohanMabille approved these changes Jun 6, 2023

View reviewed changes

psychemedia mentioned this pull request Jun 6, 2023

mermaid.js scripts in ipynb markdown mwouts/jupytext#1072

Open

Zsailer approved these changes Jun 12, 2023

View reviewed changes

echarles approved these changes Jun 13, 2023

View reviewed changes

fcollonval added Waiting decision and removed under discussion (RFC) labels Jun 26, 2023

Zsailer mentioned this pull request Jul 31, 2023

What does abstaining from a JEP actually mean? jupyter/software-steering-council-team-compass#4

Open

fcollonval mentioned this pull request Aug 3, 2023

Vote on Mermaid graph renderer for Markdown cells jupyterlab/frontends-team-compass#208

Closed

63 tasks

rowanc1 added a commit to executablebooks/mystmd that referenced this pull request Aug 3, 2023

🧜‍♀️ Support mermaid as a code block

d6ea7c3

See jupyter/enhancement-proposals#101

rowanc1 mentioned this pull request Aug 3, 2023

🧜‍♀️ Support mermaid as a code block executablebooks/mystmd#529

Merged

rowanc1 added a commit to executablebooks/mystmd that referenced this pull request Aug 3, 2023

🧜‍♀️ Support mermaid as a code block (#529)

fee1eea

See jupyter/enhancement-proposals#101

echarles mentioned this pull request Aug 15, 2023

A call to improve SSC participation jupyter/software-steering-council-team-compass#8

Closed

lazlop mentioned this pull request Dec 11, 2023

guides open223/docs.open223.info#1

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adopting a text-based diagram syntax in Jupyter Markdown #101

Adopting a text-based diagram syntax in Jupyter Markdown #101

bollwyvl commented Mar 14, 2023 •

edited by fcollonval

bollwyvl commented Mar 22, 2023

manics commented Mar 22, 2023 •

edited

bollwyvl commented Mar 22, 2023 •

edited

bollwyvl commented Mar 22, 2023

bollwyvl Mar 22, 2023

bollwyvl Mar 23, 2023

bollwyvl Mar 23, 2023

psychemedia commented Apr 25, 2023

westurner commented Apr 25, 2023

psychemedia commented Apr 26, 2023 •

edited

psychemedia commented Apr 26, 2023 •

edited

krassowski commented Apr 26, 2023

rgbkrk commented May 3, 2023

westurner commented May 4, 2023

westurner commented May 4, 2023

ivanov commented Jun 26, 2023

echarles commented Jun 27, 2023 •

edited

echarles commented Jun 27, 2023

bollwyvl commented Jun 27, 2023

fcollonval commented Jul 11, 2023

bollwyvl commented Jul 11, 2023

rpwagner commented Jul 11, 2023

Zsailer commented Jul 31, 2023

echarles commented Jul 31, 2023


		...and they should _mostly_ render the same way everywhere.

		_MermaidJS 10_ also includes a number of new diagrams, not available on earlier major

Adopting a text-based diagram syntax in Jupyter Markdown #101

Are you sure you want to change the base?

Adopting a text-based diagram syntax in Jupyter Markdown #101

Conversation

bollwyvl commented Mar 14, 2023 • edited by fcollonval

Voting from @jupyter/software-steering-council

bollwyvl commented Mar 22, 2023

manics commented Mar 22, 2023 • edited

bollwyvl commented Mar 22, 2023 • edited

bollwyvl commented Mar 22, 2023

bollwyvl Mar 22, 2023

Choose a reason for hiding this comment

bollwyvl Mar 23, 2023

Choose a reason for hiding this comment

bollwyvl Mar 23, 2023

Choose a reason for hiding this comment

psychemedia commented Apr 25, 2023

westurner commented Apr 25, 2023

psychemedia commented Apr 26, 2023 • edited

psychemedia commented Apr 26, 2023 • edited

krassowski commented Apr 26, 2023

rgbkrk commented May 3, 2023

westurner commented May 4, 2023

westurner commented May 4, 2023

ivanov commented Jun 26, 2023

echarles commented Jun 27, 2023 • edited

echarles commented Jun 27, 2023

bollwyvl commented Jun 27, 2023

fcollonval commented Jul 11, 2023

bollwyvl commented Jul 11, 2023

rpwagner commented Jul 11, 2023

Zsailer commented Jul 31, 2023

echarles commented Jul 31, 2023

bollwyvl commented Mar 14, 2023 •

edited by fcollonval

manics commented Mar 22, 2023 •

edited

bollwyvl commented Mar 22, 2023 •

edited

psychemedia commented Apr 26, 2023 •

edited

psychemedia commented Apr 26, 2023 •

edited

echarles commented Jun 27, 2023 •

edited