ocrs-models

This project contains tools for training PyTorch models for use with the Ocrs OCR engine.

About the models

The Ocrs engine splits text detection and recognition into three phases, each of which corresponds to a different model in this repository:

Text detection: This is a semantic segmentation model which classifies each pixel in a greyscale input image as text/non-text. Consumers then post-process clusters of text pixels to get oriented bounding boxes for words.
Layout analysis (VERY WIP): This is a graph model which takes word bounding boxes as input nodes and classifies each node's relation to nearby nodes (e.g. start / middle / end of line).
Text recognition: This is a CRNN model that takes a greyscale image of a text line as input and returns a sequence of characters.
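
To make the detection post-processing step more concrete, here is a minimal sketch of how a consumer might turn the per-pixel text/non-text output into word boxes. It is not the code Ocrs uses: the function name `text_boxes`, the `threshold` parameter, and the nested-list input format are all assumptions made for illustration, and it produces axis-aligned boxes rather than the oriented boxes described above.

```python
from collections import deque


def text_boxes(prob_map, threshold=0.5):
    """Group text pixels into 4-connected clusters and return one box per cluster.

    `prob_map` is a hypothetical stand-in for the detection model's output:
    a list of rows, each a list of per-pixel text probabilities.
    Boxes are (left, top, right, bottom) with inclusive pixel coordinates.
    """
    height = len(prob_map)
    width = len(prob_map[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []

    for y in range(height):
        for x in range(width):
            if seen[y][x] or prob_map[y][x] < threshold:
                continue

            # Flood-fill the cluster that contains (x, y), tracking its extent.
            seen[y][x] = True
            queue = deque([(x, y)])
            left, top, right, bottom = x, y, x, y
            while queue:
                cx, cy = queue.popleft()
                left, right = min(left, cx), max(right, cx)
                top, bottom = min(top, cy), max(bottom, cy)
                for nx, ny in ((cx + 1, cy), (cx - 1, cy), (cx, cy + 1), (cx, cy - 1)):
                    if (
                        0 <= nx < width
                        and 0 <= ny < height
                        and not seen[ny][nx]
                        and prob_map[ny][nx] >= threshold
                    ):
                        seen[ny][nx] = True
                        queue.append((nx, ny))
            boxes.append((left, top, right, bottom))

    return boxes


# Toy 3x5 "image" with two separate clusters of text pixels.
example = [
    [0.9, 0.8, 0.1, 0.0, 0.7],
    [0.1, 0.9, 0.0, 0.0, 0.9],
    [0.0, 0.0, 0.0, 0.0, 0.0],
]
print(text_boxes(example))
```

A real consumer would more likely fit a rotated rectangle to each cluster so that slanted words get tight boxes; the flood fill here only shows the "cluster pixels, then take their extent" idea.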

All models can be exported to ONNX for downstream use.

Datasets

The models are trained exclusively on datasets which a) are open and b) have non-restrictive licenses. This currently includes:

HierText (CC-BY-SA 4.0)

Pre-trained models

Pre-trained models are available from Hugging Face in PyTorch checkpoint, ONNX and RTen formats.

Training custom models

See the Training guide for a walk-through of the process to train models from scratch or fine-tune existing models.
