HTML table that contains a list #379

qzchenwl · 2012-01-18T03:34:14Z

For the file table.html:

<table>
  <tr>
    <td>
      <ul>
        <li>item1</li>
        <li>item2</li>
      </ul>
    </td>
  </tr>
</table>

pandoc.old -f html -t html table.html

<ul>
<li>item1</li>
<li>item2</li>
</ul>

pandoc.new -f html -t html table.html

<table>
<tbody>
<tr class="odd">
<td align="left"><ul>
<li>item1</li>
<li>item2</li>
</ul></td>
</tr>
</tbody>
</table>

jgm · 2012-01-28T21:57:05Z

If you apply the patch, then do

pandoc -f html -t markdown | pandoc

on this input, you'll see why I had pPlain instead of block.

The problem is that we can't extract information about the widths of the table columns from the HTML. So we just set them all to 0, which pandoc interprets as meaning "just put the cells on one line and create a simple, not a multiline table".
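The rule described above can be illustrated with a minimal sketch. This is not pandoc's actual code (pandoc is written in Haskell); the function name `table_style` and the string labels are hypothetical, chosen only to show how all-zero column widths lead to a simple table.

```python
def table_style(col_widths):
    """Pick a table style from relative column widths.

    Hypothetical helper, not a pandoc API: a width of 0 stands for
    "unknown", which is what the HTML reader is described as emitting.
    """
    # All widths unknown: cells go on one line, so a simple table is chosen.
    if all(w == 0 for w in col_widths):
        return "simple"
    # At least one known width: cells may wrap, so a multiline table is chosen.
    return "multiline"


# The HTML reader sets every column width to 0 for the one-column table above.
print(table_style([0.0]))
```

A cell holding a block such as a bullet list cannot fit on one line, which is why this choice matters for the input in this issue.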

I suppose that more general tables could be supported in HTML by assuming that all the columns are equal width, but that will often produce funny results.
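As a rough sketch of the equal-width assumption mentioned here (again illustrative Python, not pandoc code; `equal_widths` is a hypothetical name), each of n columns would get a relative width of 1/n:

```python
def equal_widths(n_cols):
    """Return equal relative widths for n_cols columns.

    Hypothetical helper: the widths are fractions of the total table width.
    """
    if n_cols <= 0:
        return []
    return [1.0 / n_cols] * n_cols


# A four-column table would get a quarter of the width per column.
print(equal_widths(4))
```

A column holding a single short word would then get as much space as one holding a long paragraph, which is one way the "funny results" could arise.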

jgm · 2013-12-09T19:22:32Z

The latest pandoc handles the original input as expected. Closing.

add block support for table cell

18f5ae1

unknown and others added 7 commits February 15, 2012 01:12

reader consume ByteString, previously consume String

4a86590

docx hello world

479f86c

extract files from docx

a94a724

extract text

45005cb

extract plain text from docx

183324c

parse bullet list

214305d

parse rich table

3729b37

jgm closed this Dec 9, 2013
