Store "unnumbered" class in DocBook role attribute #8481

lifeunleaded · 2022-12-11T08:32:12Z

Markdown allows marking a heading as unnumbered, which is stored as a class token internally. This change will recognize this particular class token and append it to the role attribute, or create a role attribute with it if needed. This does not imply any processing in DocBook but is intended to let customized stylesheets identify these sections and act accordingly.

The enrichRole function is intentionally designed so that more class tokens can be supported in the future by extending the cand list.

fixes #1402

tarleb · 2022-12-11T18:38:47Z

Thank you!

I think the code in enrichRole could be cleaned up a little. Check out the functions in the Data.List module, especially lookup and partition; those could help to keep the function succinct.

Edit: also perhaps maybeToList, but that depends on the chosen approach.

jgm · 2022-12-11T18:39:36Z

Is the use of role="unnumbered" standard (i.e., supported in any standard xslt stylesheets)? Or is it that there's no standard way to do this, and you just want to provide a way that it can be done?

lifeunleaded · 2022-12-11T18:45:11Z

@tarleb Thank you. I did look for an import of Data.List and couldn't find one, so I thought it would be too intrusive to add. It can certainly be made more readable with Data.List functions.

@jgm No, there is no standard method (that I know of). In the issue descriptions people mentioned "workarounds" just to be able to find these sections, so that's the intention of this: The XPath selector to identify the sections that are supposed to be unnumbered is deterministic, but the work to then act accordingly with section labelling has to be done in the stylesheets.

jgm · 2022-12-11T19:16:47Z

No problem to use Data.List.

lifeunleaded · 2022-12-11T19:56:58Z

@tarleb @jgm On second thought, this is a trivial filter to create, since the class token already comes in from the Markdown reader and we can add attribute pairs that become XML attributes. Maybe just write it, add it to the filters repo, and point to it? That would allow anyone who already has a stylesheet to tailor which attribute and token to use. My Lua is not stellar, but would something like the code below do it? That would remove the need to hardcode it entirely.

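-- Return true if the list contains the given value.
local function contains(list, val)
   for i = 1, #list do
      if list[i] == val then
         return true
      end
   end
   return false
end

function Header (hdr)
    if contains(hdr.classes, 'unnumbered') then
        -- Note: this assignment replaces any attributes already set on the header.
        hdr.attributes = {{'role', 'unnumbered'}}
    end
    return hdr
end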

tarleb · 2022-12-11T22:06:40Z

Pandoc already does quite a few slightly opinionated things in various writers, so I think that making the change in the writer would be fine.

Slightly shorter filter:

function Header (hdr)
  if hdr.classes:includes 'unnumbered' then
    hdr.attributes.role = 'unnumbered'
    return hdr
  end
end

jgm · 2022-12-12T04:08:46Z

I agree, I think it's fine to change this in the writer.

lifeunleaded · 2022-12-12T08:11:49Z

Thank you. I will rewrite the enrichRole function and update.

lifeunleaded · 2023-01-02T18:19:29Z

@tarleb It is now more succinct. The Prelude lookup sufficed for this. To be honest I'm not sure it's more readable. I'll gladly take further pointers on changing it.

tarleb · 2023-01-02T18:58:29Z

src/Text/Pandoc/Writers/DocBook.hs

+enrichRole mattrs cls = [("role",rolevals) | rolevals /= ""]<>(filter (\x -> (fst x) /= "role") mattrs)
+  where
+    rolevals = T.unwords((filter (`elem` cand) cls)<>(maybeToList(lookup "role" mattrs)))
+    cand = ["unnumbered"]


Nice, thanks! This is good to merge from my POV.

Your comments on readability made me wonder, and I ended up writing an (untested!!) version trying to strike a balance between readability and conciseness. Not sure if I succeeded, but here we go:

enrichRole :: [(Text, Text)] -> [Text] -> [(Text, Text)]
enrichRole mattrs cls =
  [("role", T.unwords roles) | not (null roles)] <> nonRole
  where
    (roleAttr, nonRole) = partition (\(key, _v) -> key == "role") mattrs
    roles = nub $ ["unnumbered" | "unnumbered" `elem` cls] <> map snd roleAttr

lifeunleaded · 2023-01-03

@tarleb Thank you for the suggestion! I'm fine either way, although having cand separate was something I hoped to keep, as I can envision more class tokens that would make sense to add as role tokens in DocBook, and adding to that list is (perhaps) less intimidating to future contributors. Let me know if you would prefer it changed and, if not, whether there's more to do before merge (rebase on current master?).

tarleb · 2023-01-07

Agreed, keeping the list of "role classes" separate seems sensible.

I just noticed that some lines break the 80-character limit; we generally want to keep lines below that length. Once that's fixed we can squash-merge everything into a single commit; no need for a rebase.

EDIT: We're less strict about this limit in the tests, those are fine.
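To illustrate the two points in this thread together, here is a minimal, standalone sketch that combines the partition-based version suggested above with a separate list of role classes. It is not the merged code: the name roleClasses and the sample attribute values are hypothetical, and the only assumption beyond base is the text package.

```haskell
{-# LANGUAGE OverloadedStrings #-}

import Data.List (nub, partition)
import Data.Text (Text)
import qualified Data.Text as T

-- | Class tokens that should be copied into the DocBook role attribute.
-- Hypothetical name; kept separate so further tokens can be added later.
roleClasses :: [Text]
roleClasses = ["unnumbered"]

-- | Merge matching class tokens into the "role" attribute, creating the
-- attribute if it does not exist yet. Other attributes are left unchanged.
enrichRole :: [(Text, Text)] -> [Text] -> [(Text, Text)]
enrichRole attrs classes =
  [("role", T.unwords roles) | not (null roles)] <> nonRole
  where
    (roleAttrs, nonRole) = partition (\(key, _) -> key == "role") attrs
    roles = nub $ filter (`elem` roleClasses) classes <> map snd roleAttrs

main :: IO ()
main = do
  -- No role yet: one is created.
  print $ enrichRole [] ["unnumbered"]
  -- prints [("role","unnumbered")]

  -- Existing role: the class token is put in front of it.
  print $ enrichRole [("role", "appendix")] ["unnumbered", "other"]
  -- prints [("role","unnumbered appendix")]

  -- No matching class: attributes pass through untouched.
  print $ enrichRole [("id", "intro")] ["other"]
  -- prints [("id","intro")]
```

Note that nub here compares whole role values rather than individual tokens, so an existing role such as "unnumbered appendix" would not be deduplicated against the class token; splitting the existing value with T.words first would be one way to handle that.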

lifeunleaded · 2023-01-17

@tarleb Sorry, I didn't get to the 80-character fix before it was merged. I will keep it in mind in the future.

tarleb · 2023-01-17

No problem at all. Thanks again, looking forward to the next one!

lifeunleaded force-pushed the issue1402 branch from 8c9ca0e to 157d556 on January 2, 2023 18:15

lifeunleaded force-pushed the issue1402 branch from 157d556 to fec74c6 on January 2, 2023 18:17

tarleb reviewed Jan 2, 2023

jgm merged commit 4746d0c into jgm:main Jan 13, 2023

lifeunleaded deleted the issue1402 branch January 24, 2023 07:46
