prevent to_md() from clobbering document (and friends!) #15

zkamvar · 2020-05-28T23:23:26Z

Description

I've done a couple of things here (I apologize for shoving too many things in a single PR, but I figured they were all roughly related):

Fixed a bug I introduced in add sourcepos argument #13/ensure Rmd chunks retain sourcepos attribute #14 where code chunks would have an extra sourcepos option added if read in with sourcepos = TRUE (whoops 😳)
Updated to_xml() to only post-process code blocks that have a language attribute to allow bare or unevaluated code fences.
I noticed that the original xml document was being affected by the round trip (because xml2 objects operate on pass-by-reference, which is summarized SO WELL in this coffee cup gif), so I have updated the processing to copy the document before processing and it no longer goes through a disk write (there was a weird namespace issue with this process that was affecting the block processing, so I use the technique I found in my own package)

Example

library(tinkr)
library(magrittr)
path <- system.file("extdata", "example2.Rmd", package = "tinkr")
rmd <- tinkr::to_xml(path, sourcepos = TRUE)
rmd$body %>%
  xml2::xml_find_first(".//d1:code_block") %>%
  xml2::xml_attrs()
#>  sourcepos      space   language       name    include       eval 
#>  "2:1-4:3" "preserve"        "r"    "setup"    "FALSE"     "TRUE"
tmp <- tempfile()
to_md(rmd, tmp)
# blocks are processed correctly
readLines(tmp) %>%
  head(10) %>%
  cat(sep = "\n")
#> ---
#> title: "Untitled"
#> author: "M. Salmon"
#> date: "September 6, 2018"
#> output: html_document
#> ---
#> 
#> ```{r setup, include=FALSE, eval=TRUE}
#> knitr::opts_chunk$set(echo = TRUE)
#> ```
# unevaluated blocks are processed correctly
readLines(tmp) %>%
  tail(29) %>%
  head(10) %>%
  cat(sep = "\n")
#> 
#> Non-RMarkdown blocks are also considered
#> 
#> ```bash
#> echo "this is an unevaluted bash block"
#> ```
#> 
#> ```
#> This is an ambiguous code block
#> ```
# The attributes have not changed
rmd$body %>%
  xml2::xml_find_first(".//d1:code_block") %>%
  xml2::xml_attrs()
#>  sourcepos      space   language       name    include       eval 
#>  "2:1-4:3" "preserve"        "r"    "setup"    "FALSE"     "TRUE"

^{Created on 2020-05-28 by the reprex package (v0.3.0)}

This serves as a followup to ropensci#14

- to_xml only processes info of blocks that have curly braces - transform_code_blocks only looks for the code blocks with a language attribute (avoids ```{NA} code chunks) - example blocks have been added to Rmarkdown example - tests incorporated

The to_md() function would clobber the original xml document because all of the functions on xml documents are pass by reference instead of the trusty ol' R pass-by-value :/

maelle

Thanks a ton!!

maelle · 2020-05-29T13:08:07Z

R/to_md.R

 #' # file.edit("newmd.md")
+#' file.remove(newmd)


much better! should I submit this to CRAN one day? If I do you'll have to become an author.

It would be nice to have on CRAN at some point, but I might like to give #9 a shot first 😉

maelle · 2020-05-29T13:11:43Z

R/to_md.R

  code_blocks <- xml %>%
-    xml2::xml_find_all(xpath = './/d1:code_block',
+    xml2::xml_find_all(xpath = './/d1:code_block[@language]',


so much more elegant! ✨

maelle · 2020-05-29T13:14:15Z

Would it be fine to add you as an author before merging? If so, please go ahead and add your metadata.

zkamvar added 4 commits May 28, 2020 15:24

add sourcepos to ignored attrs

8be4527

This serves as a followup to ropensci#14

Allow processing of un-evaluated blocks

1e912fc

- to_xml only processes info of blocks that have curly braces - transform_code_blocks only looks for the code blocks with a language attribute (avoids ```{NA} code chunks) - example blocks have been added to Rmarkdown example - tests incorporated

fix check note from example

f0bd4e5

update to_md to not clobber xml body

b6cb878

The to_md() function would clobber the original xml document because all of the functions on xml documents are pass by reference instead of the trusty ol' R pass-by-value :/

zkamvar changed the title ~~Znk code processing~~ prevent to_md() from clobbering document (and friends!) May 28, 2020

maelle approved these changes May 29, 2020

View reviewed changes

add Zhian as author

bc62b03

maelle merged commit 2acfbea into ropensci:master May 29, 2020

zkamvar deleted the znk-code-processing branch September 22, 2020 22:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

prevent to_md() from clobbering document (and friends!) #15

prevent to_md() from clobbering document (and friends!) #15

zkamvar commented May 28, 2020

maelle left a comment

maelle May 29, 2020

zkamvar May 29, 2020

maelle May 29, 2020

maelle commented May 29, 2020

prevent to_md() from clobbering document (and friends!) #15

prevent to_md() from clobbering document (and friends!) #15

Conversation

zkamvar commented May 28, 2020

Description

Example

maelle left a comment

Choose a reason for hiding this comment

maelle May 29, 2020

Choose a reason for hiding this comment

zkamvar May 29, 2020

Choose a reason for hiding this comment

maelle May 29, 2020

Choose a reason for hiding this comment

maelle commented May 29, 2020