Skip to content
PdfParser, a standalone PHP library, provides various tools to extract data from a PDF file.
Branch: master
Clone or download
smalot Merge pull request #231 from rbairwell/patch-1
Add default timezone (UTC)
Latest commit 0c85b15 Mar 28, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
samples Fix #18, #58 based on MR from @xelan Jan 23, 2019
src/Smalot/PdfParser Update ElementDate.php Mar 28, 2019
.atoum.php
.gitattributes fix EOL chars to unix style Oct 13, 2013
.gitignore remove composer.lock from git Dec 8, 2013
.travis.yml PHP 7.3 is no longer allowed to fail Dec 5, 2018
LICENSE.txt change license to LGPLv3 according to TCPDF license Jan 3, 2017
README.md
composer.json Merge pull request #168 from mplx/fix/mbstring Jun 23, 2018

README.md

PdfParser

Pdf Parser, a standalone PHP library, provides various tools to extract data from a PDF file.

Build Status Current Version composer.lock

Total Downloads Monthly Downloads Daily Downloads

Website : http://www.pdfparser.org

Test the API on our demo page.

This project is supported by Actualys.

Features

Features included :

  • Load/parse objects and headers
  • Extract meta data (author, description, ...)
  • Extract text from ordered pages
  • Support of compressed pdf
  • Support of MAC OS Roman charset encoding
  • Handling of hexa and octal encoding in text sections
  • PSR-0 compliant (autoloader)
  • PSR-1 compliant (code styling)

Currently, secured documents are not supported.

This Library is still under active development. As a result, users must expect BC breaks when using the master version.

Documentation

Read the documentation on website.

Original PDF References files can be downloaded from this url : http://www.adobe.com/devnet/pdf/pdf_reference_archive.html

License

This library is under the LGPLv3 license.

You can’t perform that action at this time.