DocBook Reader doesn't seem to parse elements within "code" xml #1449

bexelbie · 2014-07-23T13:52:28Z

elements such as <screen> can contain other elements such as <command>. These inner elements appear to be ignored.

All of these elements are calling codeWithLang and that doesn't appear to account for nested tags.

What led me to the above conclusion was that code like this

<screen>
]# <command>ls</command>
</screen>

results in (in this case html, but I've tried odt as well with similar results)

<pre><code>~]# 
            </code></pre>

The inner block is gone.

The text was updated successfully, but these errors were encountered:

jgm · 2014-07-23T15:00:26Z

Pandoc's document model doesn't allow structure in code blocks. But we should at least extract the string content of these inner tags, recursively.

bexelbie · 2014-07-24T08:30:03Z

Why does pandoc not allow markup within code blocks? I believe some of the output formats, such as html, support it. In an idea world wouldn't it be the Writer's job to dump the inner tags that it can't support while preserving their content.

(This is similar to the issue with the odt writer dumping markdown that uses the pre html tag without preserving the string content)

Of course, we can't include structure in the code block, but this way we at least preserve the text. Closes jgm#1449.

jgm · 2014-07-28T20:32:36Z

+++ bcexelbi [Jul 24 14 01:30 ]:

Why does pandoc not allow markup within code blocks? I believe some of
the output formats, such as html, support it. In an idea world wouldn't
it be the Writer's job to dump the inner tags that it can't support
while preserving their content.

In an ideal world, perhaps. And adding additional structure to the
CodeBlock type could be considered in the future. But such changes
would require a lot of changes to the code (they would affect all the
writers and readers, and they would make writing filters much less
straightforward). Not many formats allow structure inside code blocks,
and those that do vary widely in the kind of structure they allow.

Pandoc tries to find a happy medium, in its representation of documents,
between simplicity and expressiveness. (Not the "least common
denominator," but not far from that.)

jgm closed this as completed in 9c3f768 Jul 23, 2014

mpickering pushed a commit to mpickering/pandoc that referenced this issue Jul 25, 2014

DocBook reader: Better handle elements inside code environments.

ca19f39

Of course, we can't include structure in the code block, but this way we at least preserve the text. Closes jgm#1449.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DocBook Reader doesn't seem to parse elements within "code" xml #1449

DocBook Reader doesn't seem to parse elements within "code" xml #1449

bexelbie commented Jul 23, 2014

jgm commented Jul 23, 2014

bexelbie commented Jul 24, 2014

jgm commented Jul 28, 2014

DocBook Reader doesn't seem to parse elements within "code" xml #1449

DocBook Reader doesn't seem to parse elements within "code" xml #1449

Comments

bexelbie commented Jul 23, 2014

jgm commented Jul 23, 2014

bexelbie commented Jul 24, 2014

jgm commented Jul 28, 2014