Add new parser for text-based notebooks #83

chrisjsewell · 2020-03-17T20:12:11Z

Once mwouts/jupytext#458 is merged, and a new jupytext version is released, we can progress with the folowing:

Also these files won't work directly with the myst-parser extension (because it won't know what to do with code-cell and raw-cell). You need to add a separate parser to myst-nb that calls jupytext.myst.matches_mystnb, to work out if the markdown file should be read as pure markdown or converted to a notebook. (it should just need to be a small subclass of NotebookParser)

@AakashGfude will also need to use it in some manner in #55 to work out which markdown files to convert/execute/cache

Originally posted by @chrisjsewell in #82 (comment)

The text was updated successfully, but these errors were encountered:

choldgraf · 2020-03-17T20:18:46Z

sounds good - that's similar to what jupyter book does currently, if it gets a text file that's not .ipynb, it does a quick check for whether it's a jupytext file and, if so, converts it into a notebook before processing.

One snag here will be same line number / reporting challenge in #71 - in this case, we do have a "natural" line number to report, even though we're working with a notebook, but we'll need to find a way to pass this through the build system to use when needed

chrisjsewell · 2020-03-17T20:53:03Z

One snag here will be same line number / reporting challenge in #71 -

Yeh absolutely, thats the 'major' complication of pre-conversion to a notebook, rather than treating these cells as 'pure' sphinx directives (as done by jupyter-sphinx). I think/hope it is solvable though.

To re-iterate why trying to achieve this by adapting jupyter-sphinx is problematic:

You have to maintain two separate parsing flows for notebooks and text-based documents. As opposed to here, where once you do the initial conversion, both continue down the same process. Two flows can lead to inconsistencies, as we have now for the CSS formatting.
It would be difficult/impossible to integrate jupyter-sphinx into the approach being taken in FEAT: Jupyter-cache integration #55 for execution and caching. This is because you have to make a full (sphinx) parse of the document to identify the code cells, which precludes having the execution before any parsing takes place. In this approach, jupytext effectively acts as a 'fast parse', that doesn't need to use the full sphinx machinery, just identify the boundaries (and metadata) between cells.
It would be difficult to see how jupyter-sphinx could utilise the required front-matter metadata, output by jupytext, to extract the kernel information (currently it uses a seperate directive to specify the kernel, which isn't really an option for a jupytext conversion)
Sphinx directives cannot take arbitrary metadata (as may be output by jupytext), you have to define all options up front. For example, this would mean that users could only add specific metadata to their code cell, otherwise it would break the directive. Or you have to make the jupytext converter output only select metadata, in a specific format, which would 'break' true round-trip conversion.

chrisjsewell · 2020-03-18T00:51:00Z

One snag here will be same line number / reporting challenge in #71 - in this case, we do have a "natural" line number to report, even though we're working with a notebook

Using

nb = jupytext.myst.matches_mystnb(text, store_line_numbers=True)

Now stores the line numbers in cell metadata, under _source_lines.
This can be parsed into: https://github.com/ExecutableBookProject/MyST-NB/blob/ab4ba1d0964a7fe0a6cd516143ccc0a472b63570/myst_nb/parser.py#L71
as

start_line=nb_cell["metadata"].get("_source_lines", [0])[0]

chrisjsewell added the enhancement New feature or request label Mar 17, 2020

chrisjsewell mentioned this issue Mar 18, 2020

Add documentation of myst-nb format mwouts/jupytext#458

Merged

chrisjsewell mentioned this issue Mar 20, 2020

*always* use jupytext to read in markdown files? #88

Closed

choldgraf mentioned this issue Mar 26, 2020

Discussion for beta release executablebooks/meta#48

Closed

chrisjsewell linked a pull request Apr 1, 2020 that will close this issue

Draft of text-based notebooks #116

Merged

choldgraf closed this as completed in #116 Apr 1, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add new parser for text-based notebooks #83

Add new parser for text-based notebooks #83

chrisjsewell commented Mar 17, 2020

choldgraf commented Mar 17, 2020

chrisjsewell commented Mar 17, 2020 •

edited

Loading

chrisjsewell commented Mar 18, 2020

Add new parser for text-based notebooks #83

Add new parser for text-based notebooks #83

Comments

chrisjsewell commented Mar 17, 2020

choldgraf commented Mar 17, 2020

chrisjsewell commented Mar 17, 2020 • edited Loading

chrisjsewell commented Mar 18, 2020

chrisjsewell commented Mar 17, 2020 •

edited

Loading