This repository has been archived by the owner on May 29, 2024. It is now read-only.

Sanity checker for wrongly parsed characters #6

Open
anuran-roy opened this issue Jun 17, 2022 · 0 comments

Some characters might not be detected correctly by the OCR. A hacky but plausible way to correct them is to use a dictionary that maps each commonly misread character to the list of characters it gets mistaken for, like this:

{
   1: ["l", "I", "i", "L", 7, 4],
   2: ["Z", "z"],
   3: [8, "e"],
   ...
}
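
A minimal sketch of how such a dictionary could drive a sanity check, not a definitive implementation. It assumes each key is the intended character and each list holds what the OCR may output instead, and it stores everything as strings so lookups are uniform. The names `CONFUSIONS`, `build_reverse_map`, `candidates`, and `correct` are hypothetical, and `is_valid` stands in for whatever rule decides that a token looks right (a word list, a regex, "digits only", and so on):

```python
from itertools import product

# Hypothetical confusion map: intended character -> what OCR may emit instead.
# Only the three entries given in the issue are included.
CONFUSIONS = {
    "1": ["l", "I", "i", "L", "7", "4"],
    "2": ["Z", "z"],
    "3": ["8", "e"],
}


def build_reverse_map(confusions):
    """Invert the map: character seen in OCR output -> possible intended characters."""
    reverse = {}
    for intended, lookalikes in confusions.items():
        for seen in lookalikes:
            reverse.setdefault(seen, []).append(intended)
    return reverse


def candidates(token, confusions=CONFUSIONS, limit=1000):
    """List alternative readings of a token, starting with the token itself."""
    reverse = build_reverse_map(confusions)
    # For every position: keep the character as read, then try its substitutes.
    options = [[ch] + reverse.get(ch, []) for ch in token]
    found = []
    for combo in product(*options):
        found.append("".join(combo))
        if len(found) >= limit:  # guard against combinatorial blow-up
            break
    return found


def correct(token, is_valid, confusions=CONFUSIONS):
    """Return the first candidate accepted by is_valid, else the token unchanged."""
    for candidate in candidates(token, confusions):
        if is_valid(candidate):
            return candidate
    return token


if __name__ == "__main__":
    # Example: a field that is expected to contain digits only.
    print(correct("l23", str.isdigit))
    print(correct("Z0Z2", str.isdigit))
```

Because the token as read is always the first candidate, anything that already passes the check is left alone, and a token with no valid reading comes back unchanged instead of being guessed at.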