-
Notifications
You must be signed in to change notification settings - Fork 888
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add vulgar fraction for 1/2 #69
Comments
This is not limited to English, but applies to more Latin based languages. |
|
|
Pango, which is what we use to render the images with text2image, supports MathML. |
Now we only need a Tesseract which can detect formulae in images and generate hOCR with MathML for those formulae. :-) |
https://github.com/tesseract-ocr/tesseract/blob/master/ccmain/equationdetect.h |
tesseract-ocr/tesseract#2274 (comment) It is possible to finetune to recognize fractions. See above comment. |
Also with a tool such as https://www.calligraphr.com/en/ it is possible to create a ttf with the desired form of characters and then use it for generating synthetic data. It will work well for Latin script based languages that do not have many ligatures or combining marks. |
Font which has the fractions with numbers vertically above each other with a horizontal bar in between - |
https://graphicdesign.stackexchange.com/questions/71097/fractions-in-indesign-typing-not-%C2%BD-alt-0189 has a short list:
|
@theraysmith
Please see tesseract-ocr/tesseract#841 (comment)
https://cloud.githubusercontent.com/assets/1194896/25436113/477a23b6-2a60-11e7-967f-c4b97b21e3a9.png
I could not find any font which has 1/2 in this vertical format with straight line between 1 and 2.
The text was updated successfully, but these errors were encountered: