Distributed system based on Apache Kafka to run OCR on images and extract images and texts from PDF files.
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
giles-eco
LICENSE
README.md

README.md

Giles Ecosystem

The Giles Ecosystem is a distributed system to run OCR on images and extract images and texts from PDF files. This repository contains the user-facing component of this system called "Giles". The system requires the following software:

The core components of the Giles Ecosystem are located in the following repositories:

The above applications have dependencies to libraries located in the following repositories:

Additionally, Giles depends on:

There are some additional components of the Giles Ecosystem that can be added if required:

There is a Docker Compose file for testing and evaluation purposes that sets up the Giles Ecosystem in Docker. You can find that file here: https://github.com/diging/giles-eco-docker

You can detailed installation information and the documentation of Giles' API here.