Add Arabic-Indic numerals to Arabic #71

Closed
Shreeshrii opened this issue May 2, 2017 · 9 comments

Comments

@Shreeshrii
Contributor

Please see tesseract-ocr/tesseract#858

Include both the ASCII digits 0-9 and the Arabic-Indic digits ( ٠ ١ ٢ ٣ ٤ ٥ ٦ ٧ ٨ ٩) for Arabic.

@Shreeshrii
Contributor Author

  • U+0030 through U+0039 (ASCII digits 0-9)
  • U+0660 through U+0669 (Arabic-Indic digits)
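For reference, the two ranges above can be enumerated and sanity-checked with Python's standard `unicodedata` module. This is only an illustrative sketch; the names `ASCII_DIGITS`, `ARABIC_INDIC_DIGITS`, and `digit_table` are made up for this example and are not part of Tesseract or langdata.

```python
import unicodedata

# The two code point ranges requested in this issue.
ASCII_DIGITS = [chr(cp) for cp in range(0x0030, 0x003A)]         # 0-9
ARABIC_INDIC_DIGITS = [chr(cp) for cp in range(0x0660, 0x066A)]  # U+0660..U+0669


def digit_table():
    """Pair each ASCII digit with its Arabic-Indic counterpart and numeric value."""
    return [
        (ascii_ch, arabic_ch, unicodedata.digit(arabic_ch))
        for ascii_ch, arabic_ch in zip(ASCII_DIGITS, ARABIC_INDIC_DIGITS)
    ]


if __name__ == "__main__":
    for ascii_ch, arabic_ch, value in digit_table():
        print(ascii_ch, arabic_ch, value, unicodedata.name(arabic_ch))
```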

@Testya

Testya commented May 2, 2017

Please add the Arabic comma too, (،) U+060C.
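One way to check whether a sample of text (for example a candidate training text) already covers the characters requested so far is a simple set difference. This is a hypothetical helper for illustration only; `REQUIRED_CHARS` and `missing_chars` are invented names, and nothing here reflects how langdata is actually generated.

```python
# Characters requested in this thread: ASCII digits, Arabic-Indic digits,
# and the Arabic comma (U+060C).
REQUIRED_CHARS = (
    {chr(cp) for cp in range(0x0030, 0x003A)}
    | {chr(cp) for cp in range(0x0660, 0x066A)}
    | {"\u060C"}
)


def missing_chars(sample_text):
    """Return the requested characters that never appear in sample_text, sorted by code point."""
    return sorted(REQUIRED_CHARS - set(sample_text))


if __name__ == "__main__":
    sample = "0123456789 ٠١٢٣٤٥٦٧٨٩"
    # Print each missing character as a U+XXXX code point.
    print([f"U+{ord(ch):04X}" for ch in missing_chars(sample)])
```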

@aboelmor

Any idea when the Eastern Arabic numerals will be added to the language packs?

@theraysmith
Contributor

Added to my local copy for next round of training. Then I will push updated langdata as well.

@Shreeshrii
Contributor Author

@theraysmith

I hope you have seen the other comments about using only the Persian digit range for Persian and the Arabic digit range for Arabic.
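To make the distinction concrete: in Unicode, the Arabic-Indic digits occupy U+0660 through U+0669, while the Extended Arabic-Indic digits used for Persian occupy U+06F0 through U+06F9. The sketch below reports which digit ranges appear in a string. The function names and the range labels are assumptions chosen for this example, not anything defined by Tesseract.

```python
def digit_script(ch):
    """Classify a single character by digit range, or return None if it is not one of them."""
    cp = ord(ch)
    if 0x0030 <= cp <= 0x0039:
        return "ascii"
    if 0x0660 <= cp <= 0x0669:
        return "arabic-indic"            # used for Arabic
    if 0x06F0 <= cp <= 0x06F9:
        return "extended-arabic-indic"   # used for Persian
    return None


def digit_scripts_used(text):
    """Return the set of digit ranges that occur anywhere in text."""
    return {label for label in map(digit_script, text) if label is not None}


if __name__ == "__main__":
    # U+0663 is Arabic-Indic three, U+06F4 is Extended Arabic-Indic four.
    print(digit_scripts_used("\u0663 \u06F4 5"))
```

A text meant for Arabic-only training would be expected to report no `"extended-arabic-indic"` entry, and a Persian-only text no `"arabic-indic"` entry.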

@theraysmith
Contributor

theraysmith commented Aug 8, 2017 via email

@AbdelsalamHaa

Hi guys, I'm using Tesseract 4 with ara.traineddata to extract text from an image.
It works well for letters, but numbers are not recognized well at all. From the comments above, there should be some other traineddata for numbers only. Can anybody guide me on where to find it?

Thanks a lot.

@amitdo

amitdo commented Apr 24, 2020

It seems that Ray didn't push the data to our side (langdata_lstm and best/fast repos).

@amitdo

amitdo commented Feb 26, 2021

This issue should be re-opened.


6 participants