You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, guys!
I am trying to reuse the OCR transformation module in TextFlint, but I somehow find it rather trivial...
I quote the code about the OCR rules in the source code as below:
Here, the rules do not even cover the alphabet... And there are for sure more rules, eg., "w" => "vv". "m" => "rn".
I have found a dataset here (https://github.com/jie-mei/MiBio-OCR-dataset), which contains some OCR errors retrieved from real-world.
Although I find it quite annoying to parse the files in the aforementioned dataset... I believe that it may be benefitial to this work!
The text was updated successfully, but these errors were encountered:
Hi, guys!
I am trying to reuse the OCR transformation module in TextFlint, but I somehow find it rather trivial...
I quote the code about the OCR rules in the source code as below:
Here, the rules do not even cover the alphabet... And there are for sure more rules, eg., "w" => "vv". "m" => "rn".
I have found a dataset here (https://github.com/jie-mei/MiBio-OCR-dataset), which contains some OCR errors retrieved from real-world.
Although I find it quite annoying to parse the files in the aforementioned dataset... I believe that it may be benefitial to this work!
The text was updated successfully, but these errors were encountered: