GitHub - yurimarx/ocr-service: OCR Interoperability Service

InterSystems IRIS Interoperability OCR Service

This is an InterSystems IRIS Interoperability OCR Service that extracts text from images and PDFs. The file is sent in a multipart request, either from a form or from any HTTP client.

What the service does

This application receives an HTTP multipart request containing a file, extracts the text with Tesseract OCR, and returns the result.

Prerequisites

Make sure you have git and Docker Desktop installed.

Installation: Docker

Clone the repo (or git pull it) into any local directory:

$ git clone https://github.com/yurimarx/ocr-service.git

Open the terminal in this directory and run:

$ docker-compose build

Run the IRIS container with your project:

$ docker-compose up -d

OCR and NLP working together:

How to Run the OCR Production

Open the production
Set the host destination folder for the uploaded files.

Start the production.
Open Postman, or create a multipart form, and send a POST request to localhost:9980/ with a form-data file attribute. Use an image, or a PDF that contains an image.

Check the returned text. The first version supports English and Portuguese only.
Send 2 or 3 files with some text
Go to the NLP Domain Explorer
Analyze the texts and enjoy!
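
For readers without Postman, the upload step above can be sketched with curl. This is a minimal example under stated assumptions, not the project's documented client: the form field name `file` and the `OCR_URL` override variable are guesses for illustration, and the default address is the localhost:9980/ mentioned in the steps.

```shell
#!/usr/bin/env bash
# Sketch: upload one file to the OCR production as multipart/form-data.
# Assumptions (not confirmed by the README): the form field is named "file";
# OCR_URL is a hypothetical override for the default http://localhost:9980/.

ocr_upload() {
  local path="$1"
  local url="${OCR_URL:-http://localhost:9980/}"
  # -F builds a multipart/form-data body and implies POST;
  # the "@" prefix tells curl to send the contents of the file at that path.
  curl -sS -F "file=@${path}" "$url"
}

# Example call (any image, or a PDF with an image inside):
#   ocr_upload ./sample.png
```

If the assumptions hold, calling `ocr_upload` on two or three files covers the "Send 2 or 3 files" step before moving on to the NLP Domain Explorer. If the service rejects the request, check the field name it expects in the production settings.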

