Add recipe for OCR task on IAM handwriting dataset #4707

kenzheng99 · 2022-10-11T23:05:14Z

The current best performance is about 6.8 CER, which is still somewhat short of SOTA results on this dataset. Let me know if anyone has suggestions for further tuning this recipe.
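For readers unfamiliar with the metric: CER (character error rate) is conventionally the character-level edit distance between hypothesis and reference, divided by the number of reference characters. The sketch below is only an illustration of that definition, not the scoring code used by this recipe. The function names `edit_distance` and `cer` are hypothetical, and it assumes the 6.8 figure above is a percentage.

```python
from typing import Sequence


def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(
                min(
                    prev[j] + 1,         # deletion of a reference character
                    curr[j - 1] + 1,     # insertion of a hypothesis character
                    prev[j - 1] + cost,  # substitution, or match when cost is 0
                )
            )
        prev = curr
    return prev[-1]


def cer(refs: Sequence[str], hyps: Sequence[str]) -> float:
    """Corpus-level CER in percent: total edits / total reference characters."""
    total_chars = sum(len(r) for r in refs)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    total_edits = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    return 100.0 * total_edits / total_chars
```

Under this definition, a single wrong character in an 11-character line gives a CER of roughly 9.1, so a corpus-level value near 6.8 corresponds to about one character error in every 15 reference characters.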

sw005320 · 2022-10-28T11:08:56Z

Please fix https://github.com/espnet/espnet/actions/runs/3323163660/jobs/5524495069#step:8:8669

codecov · 2022-10-28T18:31:49Z

Codecov Report

Merging #4707 (2169367) into master (143c946) will increase coverage by 0.01%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #4707      +/-   ##
==========================================
+ Coverage   80.31%   80.32%   +0.01%     
==========================================
  Files         527      527              
  Lines       46311    46311              
==========================================
+ Hits        37193    37200       +7     
+ Misses       9118     9111       -7

| Flag | Coverage Δ | |
|---|---|---|
| test_integration_espnet1 | `66.37% <ø> (+0.13%)` | ⬆️ |
| test_integration_espnet2 | `48.96% <ø> (+0.11%)` | ⬆️ |
| test_python | `68.56% <ø> (ø)` | |
| test_utils | `23.30% <ø> (ø)` | |

Flags with carried forward coverage won't be shown.

| Impacted Files | Coverage Δ | |
|---|---|---|
| espnet/tts/pytorch_backend/tts.py | `78.63% <0.00%> (+0.29%)` | ⬆️ |
| ...et/nets/pytorch_backend/e2e_asr_mix_transformer.py | `84.97% <0.00%> (+0.46%)` | ⬆️ |
| espnet/asr/asr_utils.py | `76.53% <0.00%> (+0.87%)` | ⬆️ |
| ...pnet/nets/pytorch_backend/transformer/optimizer.py | `88.88% <0.00%> (+2.77%)` | ⬆️ |


ftshijt

Very cool PR! Could you please also update the corresponding entry in egs2/TEMPLATE/README.md?

egs2/iam/ocr1/local/data.sh

egs2/iam/ocr1/run.sh

mergify · 2022-10-30T03:31:12Z

This pull request is now in conflict :(

sw005320 · 2022-10-31T11:32:24Z

@kenzheng99, could you address @ftshijt's comments?

sw005320 · 2022-11-01T13:58:46Z

egs2/iam/ocr1/README.md

+- pytorch version: `pytorch 1.10.0`
+- Git hash: `5a6319300231b8193f1b6e8465d572be63150119`
+  - Commit date: `Sat Sep 24 12:14:08 2022 -0400`
+


Can you add a pre-trained model?

mergify · 2022-11-02T16:17:23Z

This pull request is now in conflict :(

sw005320 · 2022-11-03T14:32:30Z

Please address the comments (the $IAM issue and the model upload).

kenzheng99 · 2022-11-04T00:39:34Z

Updated the last comment! For model upload, I have just made a HuggingFace account (username is kenzheng99) and requested access to the ESPnet org. Once I get access, I can upload my model.

ftshijt · 2022-11-06T06:20:32Z

> Updated the last comment! For model upload, I have just made a HuggingFace account (username is kenzheng99) and requested access to the ESPnet org. Once I get access, I can upload my model.

Very cool! You can make a separate PR for the update of the link. Will merge this PR. Many thanks for your contribution!

mergify bot added ESPnet2 README labels Oct 11, 2022

sw005320 added the Recipe label Oct 11, 2022

kenzheng99 force-pushed the iam-ocr-recipe branch from e0f9d83 to f063a09 on October 14, 2022 15:31

kenzheng99 added 5 commits October 25, 2022 13:51

add OCR recipe for IAM handwriting dataset

7de4cc8

fix feats_type=extracted to not require cmvn in asr.sh

38ba290

document code and add readme

6855d56

reformat data_prep.py

59f0646

sort imports in data_prep.py

75b0109

kenzheng99 force-pushed the iam-ocr-recipe branch from f063a09 to 75b0109 on October 25, 2022 17:56

reduce line lengths to 80

7eaa043

sw005320 requested a review from ftshijt October 28, 2022 17:41

remove trailing whitespace

043affb

ftshijt reviewed Oct 29, 2022


egs2/iam/ocr1/local/data.sh (resolved)

egs2/iam/ocr1/local/data.sh (outdated, resolved)

egs2/iam/ocr1/run.sh (outdated, resolved)

egs2/iam/ocr1/run.sh (outdated, resolved)

Merge branch 'master' into iam-ocr-recipe

91d1dd4

mergify bot added the conflicts label Oct 30, 2022

Merge branch 'master' into iam-ocr-recipe

459f5b0

mergify bot removed the conflicts label Oct 30, 2022

kenzheng99 added 2 commits October 31, 2022 11:48

update IAM recipe from PR feedback

b91d7b5

Merge branch 'master' into iam-ocr-recipe

7425c63

sw005320 reviewed Nov 1, 2022


mergify bot added the conflicts label Nov 2, 2022

Merge branch 'master' into iam-ocr-recipe

f943afa

mergify bot removed the conflicts label Nov 3, 2022

add check for IAM value

2169367

ftshijt merged commit b221db0 into espnet:master Nov 6, 2022
