Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Madcat Arabic handwritten text line recognition #2356

Merged
merged 168 commits into from
May 17, 2018

Conversation

aarora8
Copy link
Contributor

@aarora8 aarora8 commented Apr 14, 2018

It's using 750k utterances from madcat Arabic data. It is getting the line image from the MAR (minimum area rectangle). It contains the recent TDNN training recipe for End2end and regular chain recipe. @hhadian

add README(README.txt)
add scripts for data preparation (text, wav.scp and utt2spk file) (local/prepare_data.sh, local/process_data.py, local/create_line_image_from_page_image.py)
add scripts for feature extraction (local/make_features.py)
add scripts for lexicon, language modeling, grammar (local/train_lm.sh, local/prepare_lexicon.py, local/prepare_dict.sh)
add script for GMM-HMM training and using chain model (local/chain/run_cnn_1a.sh, run.sh, run_end2end.sh, local/chain/run_cnn_chainali_1b.sh, local/chain/run_flatstart_cnn1a.sh, local/chain/compare_wer.sh)
Other (cmd.sh, link to image/steps/utils, v1/local/score.sh, path.sh, local/check_tools.sh)

Some of its info and features are as follows:
-It is getting the line image from the MAR (minimum area rectangle).
-It is currently building the language model from training utterances only.
-Its lexicon size is 95k words, OOV rate is around 1.5%.
-For quick debugging and experiments, it can be run with a subset of the dataset based on writing conditions (writing style, speed, carefulness) of the image.
-It contains the recent TDNN training recipe for End2end and regular chain.
-WER 12.97% with line image formed by stitching the word images.
-WER 15.03% with line image formed using MAR.

To do:
replace PIL in create_line_image_from_page_image
replace convex_hull library routine
update configs

@danpovey
Copy link
Contributor

danpovey commented May 4, 2018 via email

@@ -0,0 +1,226 @@
#!/bin/bash

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@aarora8, there should be results at the top of these files, obtained from the compare_wer.sh script.

Copy link
Contributor Author

@aarora8 aarora8 May 15, 2018

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

currently, run_cnn_chainali_1b.sh was giving a little bit worse result than run_flatstart_cnn1a.sh. I am currently running it with higher epochs and more tree-leaves. should i update it after current run completion or with recent results.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sorry, I realized I haven't run run_cnn_1a.sh, run_cnn_chainali_1b and run_cnn_end2endali_1a, with the recent code. I have currently ran run_flatstart_cnn1a.sh and run_cnn_end2endali_1b (UC) scripts. I updated the results on those two scripts, I will run and update the results for other scripts aswell.

adding latest results, removing dev from extract features to save time
@danpovey
Copy link
Contributor

danpovey commented May 15, 2018 via email

@aarora8
Copy link
Contributor Author

aarora8 commented May 15, 2018 via email

@danpovey
Copy link
Contributor

danpovey commented May 15, 2018 via email

@danpovey danpovey merged commit 108832d into kaldi-asr:master May 17, 2018
dpriver pushed a commit to dpriver/kaldi that referenced this pull request Sep 13, 2018
Skaiste pushed a commit to Skaiste/idlak that referenced this pull request Sep 26, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants