Skip to content

ugent-library/PDFCreator

Repository files navigation

PDFCreator

Creates a PDF/A file from a list of images (JPEG,JPEG2000,TIF) with optional HOCR support.

Usage:

  # Create a PDF out of a list of images
  ./pdfcreator -v -o test.pdf /tmp/*.tif

  # Create a PDF using a batch input
  ./pdfcreator -v -b -i ".*tif" -e ".*pdf" < input.txt

  where input.txt is a space delimited file containing an output file
  and an source image directory for every line.

  E.g.

  my_file.pdf  /data/files/my_images
  my_file2.pdf /data/files/my_images2

Options:

  -v           - verbose
  -a 1A|1B|2A|2B|3A|3B|NONE   - PDF/A compliance
  -b           - batchmode
  -i regex     - include files [batchmode]
  -e regex     - exclude files [batchmode]
  -o file      - output file
  -t title     - title of the document
  -x version   - PDF verson ("1.4" -> "1.7")

Remarks:

  By default PDFCreator will generate a PDF/A-1B file that fills an area specified
by the image resolution tags. 

  By default PDFCreator will read the ICC Color profile from the supplied images.
If no color profile is provided sRGB will be used as default.

  When the HTML files are available that match the name of the image files 
  (e.g. 0001.tif vs 0001.html), then these files will be interpreted as HOCR
  data and included in a PDF file with fulltext search.

Authors:

 * Patrick Hochstenbach < patrick . hochstenbach @ ugent . be >
 * Nicolas Franck < nicolas . franck @ ugent . be >

License and copyright

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
 
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU Affero General Public License for more details.
 
You should have received a copy of the GNU Affero General Public License along with this program.  If not, see <http://www.gnu.org/licenses/>.

About

Create PDF files from images + optional HOCR support

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published