[feature] Part 2 from use datasets for recognition #891

felixdittrich92 · 2022-04-13T19:50:21Z

This PR:

- Enables all datasets to be used for the recognition task (except the obj_detection one)
- Adds the corresponding tests
- One special case: SynthText is too big to process in memory and would lead to a memory overflow, so its samples are saved in a cache for faster reloading and to avoid memory explosions 😅
- All other datasets work fine without caching
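
As a rough illustration of the two ideas above (turning detection-style samples into word-level recognition samples, and caching the result for a very large dataset), here is a minimal sketch. It is not the code from this PR: `crop_words`, `load_or_build`, the type aliases, and the pickle-based cache format are all hypothetical names and choices made for the example, and images are plain nested lists so the snippet has no dependencies.

```python
import pickle
from pathlib import Path
from typing import Callable, List, Tuple

# Hypothetical types for illustration only: an image is a 2D list of pixel
# values, a box is (xmin, ymin, xmax, ymax) in absolute pixel coordinates.
Image = List[List[int]]
Box = Tuple[int, int, int, int]
Sample = Tuple[Image, str]


def crop_words(image: Image, boxes: List[Box], labels: List[str]) -> List[Sample]:
    """Turn one detection-style sample into (word crop, text) recognition samples."""
    samples: List[Sample] = []
    for (xmin, ymin, xmax, ymax), label in zip(boxes, labels):
        crop = [row[xmin:xmax] for row in image[ymin:ymax]]
        # Skip degenerate boxes that would yield an empty crop
        if crop and crop[0]:
            samples.append((crop, label))
    return samples


def load_or_build(cache_file: Path, build: Callable[[], List[Sample]]) -> List[Sample]:
    """Return the cached samples if the cache file exists, else build and store them."""
    if cache_file.exists():
        with cache_file.open("rb") as f:
            return pickle.load(f)
    samples = build()
    with cache_file.open("wb") as f:
        pickle.dump(samples, f)
    return samples
```

With this layout, the expensive cropping pass runs only on the first load; later loads read the cache file directly. Whether a single pickle file is the right cache format for a dataset as large as SynthText is a separate design question that this sketch does not settle.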

Issues:
Closes #867
(Documentation for this will follow in another PR, #855.)

Any feedback is welcome 🤗

codecov · 2022-04-13T20:01:22Z

Codecov Report

Merging #891 (ab2b437) into main (1a4e4df) will increase coverage by 0.00%.
The diff coverage is 98.48%.

@@           Coverage Diff           @@
##             main     #891   +/-   ##
=======================================
  Coverage   94.71%   94.72%           
=======================================
  Files         135      135           
  Lines        5375     5440   +65     
=======================================
+ Hits         5091     5153   +62     
- Misses        284      287    +3

| Flag | Coverage Δ | |
|---|---|---|
| unittests | `94.72% <98.48%> (+<0.01%)` | ⬆️ |

Flags with carried forward coverage won't be shown. Click here to find out more.

| Impacted Files | Coverage Δ | |
|---|---|---|
| doctr/datasets/synthtext.py | `94.73% <92.59%> (-2.41%)` | ⬇️ |
| doctr/datasets/cord.py | `97.72% <100.00%> (+0.29%)` | ⬆️ |
| doctr/datasets/funsd.py | `97.36% <100.00%> (+0.39%)` | ⬆️ |
| doctr/datasets/ic03.py | `97.72% <100.00%> (+0.35%)` | ⬆️ |
| doctr/datasets/ic13.py | `96.77% <100.00%> (+0.62%)` | ⬆️ |
| doctr/datasets/iiit5k.py | `96.96% <100.00%> (+0.19%)` | ⬆️ |
| doctr/datasets/imgur5k.py | `93.33% <100.00%> (+0.83%)` | ⬆️ |
| doctr/datasets/sroie.py | `97.29% <100.00%> (+0.42%)` | ⬆️ |
| doctr/datasets/svhn.py | `95.23% <100.00%> (+0.50%)` | ⬆️ |
| doctr/datasets/svt.py | `97.61% <100.00%> (+0.39%)` | ⬆️ |
... and 2 more


Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 1a4e4df...ab2b437. Read the comment docs.

felixdittrich92 · 2022-04-20T20:55:46Z

@charlesmindee wdyt?

charlesmindee

Thanks!

felixdittrich92 added 26 commits January 11, 2022 08:34

backup

81c313e

Merge branch 'mindee:main' into main

50574b5

Merge branch 'mindee:main' into main

5a6ed54

Merge branch 'mindee:main' into main

b9958a7

Merge branch 'mindee:main' into main

14c4651

Merge branch 'mindee:main' into main

779731f

Merge branch 'mindee:main' into main

ce2cdda

Merge branch 'mindee:main' into main

d13dc43

Merge branch 'mindee:main' into main

9a07d73

Merge branch 'mindee:main' into main

a002a70

Merge branch 'mindee:main' into main

6ad096e

Merge branch 'mindee:main' into main

1e77fd4

Merge branch 'mindee:main' into main

2be762c

Merge branch 'mindee:main' into main

e2f2055

Merge branch 'mindee:main' into main

bdc4e67

Merge branch 'mindee:main' into main

b525021

Merge branch 'mindee:main' into main

417a27b

Merge branch 'mindee:main' into main

9b3f5a1

Merge branch 'mindee:main' into main

93074a8

Merge branch 'mindee:main' into main

c64e209

Merge branch 'mindee:main' into main

fdc8381

Merge branch 'mindee:main' into main

bd68b07

Merge branch 'mindee:main' into main

7ac6ee2

Merge branch 'mindee:main' into main

1c79f32

Merge branch 'mindee:main' into main

45e43ac

datasets and tests

ab2b437

felixdittrich92 mentioned this pull request Apr 13, 2022

[datasets] Add MJSynth (Synth90K) #827

Merged

felixdittrich92 mentioned this pull request Apr 13, 2022

[datasets][PoC] Enable dataset usage for recognition task #867

Closed

2 tasks

charlesmindee approved these changes Apr 27, 2022


charlesmindee merged commit 624ee6c into mindee:main Apr 27, 2022

felixdittrich92 deleted the crop-2 branch April 27, 2022 19:40

frgfm added the ext: tests, module: datasets, and type: new feature labels May 2, 2022

felixdittrich92 added this to the 0.6.0 milestone Jun 28, 2022

felixdittrich92 mentioned this pull request Jun 29, 2022

Release tracker - v0.6.0 #791

Closed

85 tasks
