Tesseract 4.0.0 training data for Javanese Script (Aksara Jawa)

Shreeshrii/tessdata_jav_java

tessdata_jav_java

Tesseract 4.0.0 training data for Javanese Script (Aksara Jawa)

Built in response to this issue

Traineddata files

To run training for Javanese script

  • Clone this repo
  • To continue training from the existing data, run ./plustrain.sh
  • To customize the training, update the training text in langdata and the font list in makedata.sh
  • The training text must be UTF-8 encoded.
  • Use Unicode fonts that cover the Javanese code range.
  • Tesseract's source code will also need changes; the additions will be similar to those for Khmer/Myanmar or Thai.
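
The encoding and font requirements above can be checked before starting a training run. The following is a minimal sketch and not part of this repo: the function names is_utf8 and list_javanese_fonts are hypothetical, it assumes iconv and fontconfig's fc-list are installed, and it probes font coverage with a single code point (U+A9B2, JAVANESE LETTER HA), which does not guarantee that a font covers the whole Javanese block.

```shell
# Hypothetical helper: succeeds (exit 0) only if the file decodes cleanly as UTF-8.
is_utf8() {
    iconv -f UTF-8 -t UTF-8 "$1" > /dev/null 2>&1
}

# Hypothetical helper: list installed font families that contain U+A9B2.
# Assumes fontconfig is installed; one code point is only a rough probe.
list_javanese_fonts() {
    fc-list ':charset=a9b2' family | sort -u
}
```

For example, `is_utf8 path/to/training_text.txt && echo "encoding ok"` reports whether a training text is safe to use, where the path is a placeholder for the file you keep in langdata.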

Custom bash scripts, run in the following order

./makeeval.sh

./makedata.sh

./mergedata.sh

./plustrain.sh
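
Since the order matters, it can help to chain the scripts so that a failure in one step stops the rest. The following is a minimal sketch under that assumption; run_in_order is a hypothetical helper, not something shipped in this repo, and it assumes each script signals failure with a non-zero exit status.

```shell
# Hypothetical helper: run each script in the order given and
# stop at the first one that exits with a non-zero status.
run_in_order() {
    for script in "$@"; do
        echo "Running $script"
        if ! "$script"; then
            echo "Failed: $script" >&2
            return 1
        fi
    done
}

# Usage, from the root of the cloned repo:
#   run_in_order ./makeeval.sh ./makedata.sh ./mergedata.sh ./plustrain.sh
```

Stopping early avoids starting a long training run on data that an earlier step failed to build.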
