This repository has been archived by the owner on Mar 17, 2022. It is now read-only.

Illegible words recognition in Persian lang #259

Closed
ImanX opened this issue Feb 17, 2019 · 2 comments

ImanX commented Feb 17, 2019

Summary:
I integrated tess-two and imported the Persian .traineddata file into the project.
tess-two runs, but it returns illegible text like this:

   ـاغ {.
    ٥ ج.: { ٠
    ٤ \ ٤2,
    } 13
    ؤ. …
    « چ \ ة 8۱
    :} 3 ١.٠
    ٠ ء,٬, "و ۱١ |
    ), ٠
    } ( \ ق {۰
    | } چ
    د … ة ؛ ٠
    ؛ \ ؤ ٠٠
    دغ٬ ؤ \ 3
    حس {؛ | غ
    3 ق : « }
    دا ) { 3 د.
    » < {:
    ٠ دێ .
    ؛ ,? 33٠ ,
    { -3 ٠_
    {سم

Tess-two version: 5.4.1

Android version: 6.0

Phone/device model: Samsung S6

Phone/device architecture (armeabi, armeabi-v7a, x86, mips, arm64-v8a, x86_64, mips64): ARM64

Contributor

Robyer commented Mar 5, 2019

@ImanX tess-two 5.4.1 is more than 3 years old; you should try the latest version, 9.0.0.

Owner

rmtheis commented Mar 13, 2019

You might try asking on the Tesseract mailing list and including a sample input image so you can get suggestions about what image processing to do in order to get a better result. While your current result is clearly not what you're looking for, it does look like Tesseract is working as intended. Robyer's suggestion of trying a newer version is a good one too.
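As a rough illustration of the kind of image processing that can be tried before OCR, here is a minimal sketch of Otsu binarization in plain Java. It is only one common preprocessing step and not necessarily what the mailing list would suggest for this particular image. Tesseract also binarizes its input internally, so rescaling, cropping, or deskewing may matter more in practice. The class `OtsuBinarizer` and its method names are hypothetical, the input is assumed to be 8-bit grayscale values, and converting an Android `Bitmap` to and from this array is left out.

```java
/**
 * Illustrative sketch only (hypothetical helper, not part of tess-two):
 * Otsu thresholding over 8-bit grayscale pixel values (0..255).
 */
final class OtsuBinarizer {

    private OtsuBinarizer() {}

    /**
     * Returns the threshold t that maximizes the between-class variance
     * when pixels <= t form one class and pixels > t form the other.
     */
    static int threshold(int[] gray) {
        if (gray.length == 0) {
            throw new IllegalArgumentException("empty image");
        }

        // Histogram of intensities; values are masked to the low 8 bits.
        int[] hist = new int[256];
        for (int v : gray) {
            hist[v & 0xFF]++;
        }

        long total = gray.length;
        long sumAll = 0;
        for (int i = 0; i < 256; i++) {
            sumAll += (long) i * hist[i];
        }

        long countLow = 0;   // number of pixels with value <= t
        long sumLow = 0;     // sum of those pixel values
        double bestVariance = -1.0;
        int best = 0;

        for (int t = 0; t < 256; t++) {
            countLow += hist[t];
            sumLow += (long) t * hist[t];
            if (countLow == 0) {
                continue;    // no pixels in the low class yet
            }
            long countHigh = total - countLow;
            if (countHigh == 0) {
                break;       // every pixel is already in the low class
            }
            double meanLow = (double) sumLow / countLow;
            double meanHigh = (double) (sumAll - sumLow) / countHigh;
            double diff = meanLow - meanHigh;
            // Between-class variance, up to a constant factor of 1 / total^2.
            double variance = (double) countLow * countHigh * diff * diff;
            if (variance > bestVariance) {
                bestVariance = variance;
                best = t;
            }
        }
        return best;
    }

    /** Maps each pixel to 0 (at or below the threshold) or 255 (above it). */
    static int[] binarize(int[] gray) {
        int t = threshold(gray);
        int[] out = new int[gray.length];
        for (int i = 0; i < gray.length; i++) {
            out[i] = (gray[i] & 0xFF) > t ? 255 : 0;
        }
        return out;
    }
}
```

Calling `OtsuBinarizer.binarize(pixels)` on the grayscale values of a photo yields a black-and-white array, which would still have to be written back into a bitmap before being passed to tess-two.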

rmtheis closed this as completed on Mar 13, 2019