Skip to content

Using plain text OCR and Markdown for EPUB PDF export

hendrack edited this page Jan 30, 2023 · 20 revisions

I have seen a feature request in the issues page for exporting to epub. This can be done quite easy using the Markdown format, it is very easy to learn and does not involve much coding. Here is a cheat sheet.

I use these workflows in Linux, so Windows users may have to adapt a bit.

Here are some examples:

https://jdhao.github.io/2019/05/30/markdown2pdf_pandoc/
https://learnbyexample.github.io/customizing-pandoc/
https://keleshev.com/my-book-writing-setup/

  • LaTeX or ConTeXT

CTAN/markdown package. With this, you can either embed Markdown syntax in your latex files, or embed .md files themself. I don't recommend LaTeX or ConTeXT for ePub exporting.

This IDE has a nice addon called bookdown and allows you to export Markdown and r-markdown to PDF/ePub.

Pandoc on steroids. Can export to various formats.

  • ePub fine tuning

The ePub format is mostly HTML/CSS code, if you export your OCR Markdown text to Ppub, you can use Sigil to edit your ePub, like cover picture, ToC, CSS etc.