All notable changes to the EPUP Page Approximator will be documented here.
- Added the option to provide the chapter map as key:value pairs instead of a simple list.
- Added the functionality to rebuild a valid page list from non-standard page markers in the ebook by passing a tag selector as the
pages
argument. - Added the
--attributes\-a
argument to select attribute values of detected page markers. - Fixed some bugs involving HTML decoding and zip file handling.
- Guides moved to the new Wiki
- The order in which the HTML documents are parsed is now defined by the order they are listed in the spine instead of their order in the content.opf.
- There are two new options for dealing with special cases pertaining to the book spine.
--nonlinear\-l
for documents with thelinear='no'
setting and--unlisted\-u
for documents not listed in the spine. - More bugfixes with path handling.
- added new
--romanfrontmatter\-f
option to specif if and how many pages of front matter the book contains, which will be paginated in Roman numerals. - The
--tocpages
option now also supports roman numerals for defining the page number of sections before page one. - More bugfixes and performance improvements.
- Added new
--autopage
flag for generating a dynamic number of pages by using the value of thepages
argument as size definition of a single page. - Added new
--suggest
flag for use together with--autopage
. This flag will display the dynamically generated page count without saving it to a file. - Added a new "words" mode for the
--pagingmode
option. - The
pages
argument accepts the string "bookstats", resulting in displaying the character-, line- and word count of the current book.
- Added new flag
--page-map
for generating a page-map.xml file for compatibility with Adobe Digital Editions based readers.
- Introduced the
--tocpages\-t
option for matching page numbers to Table of Content markers. - Implemented logic for starting a book at page 0 if the
--tocpages
list defines a 'first' page. - A more pythonic naming scheme for files
- Source code fully documented
- Enormous performance improvement.
- Added a new
--pagingmode\-p
option to allow finer control over how pages are split up. - Fixed edge cases where page breaks could be generated outside the
body
tag of the document.
- Switched to a proper node manipulation based approach. Much slower compared to the previous regex method, but much more reliable and guarantees that page breaks can't end up in invalid locations.
- Added the
--breakmode\-b
option, to handle page breaks in the middle of words. - Lots of general bugfixes.
- Initial release.