v0.3.2
IMPORTANT:
Due to this bug the run_server.py is not creating the %userprofile%/.ocr_translate folder automatically.
If this is your first install of the server please manually create the %userprofile%/.ocr_translate folder (you can type %userprofile% in file explorer and create a folder named .ocr_translate).
This will be fixed in the next release of the code
What's Changed
- All feature for box/ocr/tsl have been moved to plugins in separate packages (They are still bundled together with the EXE release)
- Improved pre-parsing of OCRed text before translation for languages with latin alphabet
- Introduced a way to remove ghost character generated at the begin/end of every string (e.g. tesseract would produce random character at the begin/end of many string, probably due to speech bubble edges included in the box).
- Introduced Trie capability (only for languages with a list/freq file ... for now only English)
- Can use trie to detect if an incorrect work ("helloworld") should be split into multiple valid words (["hello", "world"])
- Added English word list/freq file.
- From the plugin for easyocr, the boxes now are merged with higher tolerances, to reduce occurrence of multiple boxes in a single speech bubble (It would cause translation to be much worse since boxes are translated 1by1)
NOTE: There is also a plugin to run translation with google translate. It is not included in the EXE release as it does not fit with the main idea of the tool being something that will run entirely on system, but if there is request for it I can include it in successive release bundles (anyway you would need to select it to use it).