OCR Experiments

My experiments with some neural Optical Character Recognition (OCR) models. At the moment, I'm really just referencing other people's code, although eventually I'll have some more details on my own extensions.

The main novelty at the moment is that I'm trying to get everything running in containers so that other people might be able to use them as well. Currently, my interest is mostly due to needing to process reams of notes written in the course of my research, rather than anything more cutting edge. However, if that sounds like the sort of thing that might be useful to you, I've created a few utilities in my pdf-stuff repo which might be of interest.

Functional Models

I've been choosing projects to try based on their reported metrics, papers, and GitHub activity. The ones I've managed to get running so far are:

- Clova AI Research's CRAFT-PyTorch and Deep Text Recognition Benchmark
- PaddleOCR v2.0
- Handwriting OCR
  - Semi-functional; I think the pinned version of OpenCV needs to be updated.

Each of these has corresponding Dockerfiles and a Makefile, which means anyone should be able to get something running fairly easily.

TODO

- Sample code and results for each listed repo.
- Change entrypoints so they can be used as one-off utilities.
- Fix user/group IDs for containers.
- Ensure that containers leverage GPU/CUDA where possible.
- In general, modify Dockerfiles to obey "best practices".
- Script or make rule for downloading models from Google Drive.
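
Until the Dockerfiles are updated, some of these items (one-off use, user/group IDs, GPU access) can be approximated when the container is run. Below is a minimal sketch of a wrapper that assembles such a `docker run` command. The image tag, the `/data` mount point, and the `scan.png` argument are hypothetical placeholders, not names taken from this repo. The `--gpus all` flag assumes the NVIDIA Container Toolkit is installed on the host.

```python
import os
import shlex


def docker_run_command(image, workdir, args=(), use_gpu=False, uid=None, gid=None):
    """Build an argv list for a one-off `docker run` invocation.

    `image` is whatever tag the image was built with (hypothetical here).
    `workdir` is a host directory mounted at /data inside the container.
    """
    # Default to the calling user's IDs so files written to the mounted
    # directory are not owned by root (Unix-only calls).
    uid = os.getuid() if uid is None else uid
    gid = os.getgid() if gid is None else gid

    cmd = [
        "docker", "run",
        "--rm",                      # remove the container when it exits
        "--user", f"{uid}:{gid}",    # run as the given user/group ID
        "--volume", f"{os.path.abspath(workdir)}:/data",
        "--workdir", "/data",
    ]
    if use_gpu:
        # Assumption: the host has the NVIDIA Container Toolkit set up.
        cmd += ["--gpus", "all"]

    cmd.append(image)
    cmd.extend(args)  # passed to the image's ENTRYPOINT (or replaces its CMD)
    return cmd


if __name__ == "__main__":
    # Hypothetical image tag and input file, for illustration only.
    cmd = docker_run_command("ocr-experiments/paddle-ocr", ".", ["scan.png"], use_gpu=True)
    print(shlex.join(cmd))
    # To actually run it: subprocess.run(cmd, check=True)
```

Whether the trailing arguments behave like a command-line utility depends on the image defining an `ENTRYPOINT`; otherwise they simply replace the default `CMD`.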

rldotai/ocr-experiments
