Read PDF files with PHP 5.6 (based on spatie/pdf-to-text package)
This package is a PHP 5.6+ fork of Spatie PDF To Text package. If you use PHP7, please use the original package.
This package provides a class to extract text from a pdf.
\JBPapp\PdfToText\Pdf::getText('book.pdf'); //returns the text from the pdf
Behind the scenes this package leverages pdftotext. You can verify if the binary installed on your system by issueing this command:
which pdftotext
If it is installed it will return the path to the binary.
To install the binary you can use this command on Ubuntu or Debian:
apt-get install poppler-utils
If you're on RedHat or CentOS use this:
yum install poppler-utils
You can install the package via composer:
$ composer require spatie/pdf-to-text
Extracting text from a pdf is easy.
$text = (new Pdf())
->setPdf('book.pdf')
->text();
Or easier:
\JBPapp\PdfToText\Pdf::getText('book.pdf')
By default the package will assume that the pdftotext
is located at /usr/bin/pdftotext
.
If you're using the a different location pass the path to the binary in constructor
$text = (new Pdf('/custom/path/to/pdftotext'))
->setPdf('book.pdf')
->text();
or as the second parameter to the getText
-function:
\JBPapp\PdfToText\Pdf::getText('book.pdf', '/custom/path/to/pdftotext')
Please see CHANGELOG for more information what has changed recently.
$ composer test
Please see CONTRIBUTING for details.
If you discover any security related issues, please email freek@spatie.be instead of using the issue tracker.
The MIT License (MIT). Please see License File for more information.