php-tesseract

This extension is currently under development.

Installation

Initial Setup

$ git clone https://github.com/insecia/php-tesseract.git
$ cd php-tesseract
$ docker-compose build extension-builder

Compile Extension

$ docker-compose run --rm extension-builder
$ docker-compose build php

Run Tests

$ docker-compose run --rm php make test

Run Script

$ docker-compose run --rm php php script_name.php

Usage

Basic usage

$tesseract = \Tesseract\Tesseract::fromFile('image.jpg');
$textContent = $tesseract->getText();

It's also possible to define a certain rectangular area of the image from which the tesseract lib should extract text.

$tesseract = \Tesseract\Tesseract::fromFile('image.jpg');
$textContent = $tesseract->getRectangle(500, 500, 1000, 1000)->getText();

A tesseract instance can also be created from a string that contains the binary content of an image. This has the advantage of not requiring the creation of a temporary file.

$textContent = \Tesseract\Tesseract::fromString($imageContent)->getText();

One or multiple languages can also be specified. Note that the language file for the specified languages must be installed. Refer to the Dockerfile for usage under Alpine or the tesseract-ocr documentation.

$tesseract = \Tesseract\Tesseract::fromFile('image.jpg', [
    \Tesseract\Language\GERMAN,
    \Tesseract\Language\ENGLISH
]);
$textContent = $tesseract->getText();

It is also possible to choose a different page seg mode.

$tesseract = \Tesseract\Tesseract::fromFile('image.jpg');
echo $tesseract->setPageSegMode(\Tesseract\PageSegMode\SINGLE_WORD)->getText();

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
phpstorm-stubs		phpstorm-stubs
src		src
tests		tests
travis		travis
.travis.yml		.travis.yml
Dockerfile		Dockerfile
Dockerfile.builder		Dockerfile.builder
DockerfileUbuntu.builder		DockerfileUbuntu.builder
README.md		README.md
config.m4		config.m4
docker-compose.yml		docker-compose.yml
php_future.h		php_future.h
php_tess.cpp		php_tess.cpp
php_tess.h		php_tess.h
php_tesseract.h		php_tesseract.h

cnsuhao/php-tesseract

Folders and files

Latest commit

History

Repository files navigation

php-tesseract

Installation

Initial Setup

Compile Extension

Run Tests

Run Script

Usage

Basic usage

About

Resources

Stars

Watchers

Forks

Languages