feat: Added Vietnamese entry in VOCAB #878

calibretaliation · 2022-03-31T07:14:40Z

I added vietnamese VOCABS for Vietnamese devs if they want to use doctr for vietnamese like me :)

codecov · 2022-03-31T09:57:40Z

Codecov Report

Merging #878 (9c8420a) into main (7f396ca) will decrease coverage by 0.01%.
The diff coverage is 100.00%.

❗ Current head 9c8420a differs from pull request most recent head dbfd583. Consider uploading reports for the commit dbfd583 to get more accurate results

@@            Coverage Diff             @@
##             main     #878      +/-   ##
==========================================
- Coverage   94.84%   94.82%   -0.02%     
==========================================
  Files         133      133              
  Lines        5200     5201       +1     
==========================================
  Hits         4932     4932              
- Misses        268      269       +1

Flag	Coverage Δ
unittests	`94.82% <100.00%> (-0.02%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
doctr/datasets/vocabs.py	`100.00% <100.00%> (ø)`
doctr/transforms/functional/base.py	`95.65% <0.00%> (-1.45%)`	⬇️
doctr/transforms/modules/base.py	`94.59% <0.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 7f396ca...dbfd583. Read the comment docs.

charlesmindee

Thanks for the PR! Just a style check not passing, you need to split the line 34 which is too long in 2 :)

calibretaliation · 2022-04-04T06:43:27Z

@charlesmindee Thanks for your comment, I have added the new commit of style fix.
However, can you give me a help on how to use the library on Vietnamese OCR please ? Do I have to change something or just applying the new vocabs ?

doctr/datasets/vocabs.py

charlesmindee · 2022-04-04T09:20:46Z

@charlesmindee Thanks for your comment, I have added the new commit of style fix. However, can you give me a help on how to use the library on Vietnamese OCR please ? Do I have to change something or just applying the new vocabs ?

Hi @calibretaliation, if you want to use the librairy on Vietnamese OCR, you need to apply the you new vocab on a recognition model, you can keep the detection model as it is. However, you need to retrain the recognition model on you vocabulary with a labelled vietnamese dataset of word crops. For your PR, it would indeed be nice to indent so that flake8 is running without raising any error 🙏

charlesmindee

You need to align everything under the parenthesis, otherwise it is OK!

doctr/datasets/vocabs.py

charlesmindee

Thanks!

* feat: Added Vietnamese entry in VOCAB - update style fix 2 * feat: Added Vietnamese entry in VOCAB - update style fix 3

frgfm · 2022-04-27T11:22:14Z

Missing PR labels here as well @charlesmindee :)

Also, perhaps we should add specific contribution guidelines for vocab addition? I remember that for portuguese we had back & forth iterations, so perhaps we could ask to add a reference in the PR or better, as a comment in the code?

felixdittrich92 · 2022-04-27T11:23:40Z

@charlesmindee
@frgfm
There are also missing additions in the documentation for the last added vocabs

calibretaliation mentioned this pull request Mar 31, 2022

Can I add some vocabs of Vietnamese to VOCABS file ? #879

Closed

charlesmindee requested changes Mar 31, 2022

View reviewed changes

felixdittrich92 reviewed Apr 4, 2022

View reviewed changes

doctr/datasets/vocabs.py Outdated Show resolved Hide resolved

feat: Added Vietnamese entry in VOCAB - update style fix 2

9c8420a

charlesmindee reviewed Apr 5, 2022

View reviewed changes

doctr/datasets/vocabs.py Outdated Show resolved Hide resolved

feat: Added Vietnamese entry in VOCAB - update style fix 3

dbfd583

charlesmindee approved these changes Apr 6, 2022

View reviewed changes

charlesmindee merged commit 2c697ff into mindee:main Apr 6, 2022

felixdittrich92 pushed a commit to felixdittrich92/doctr that referenced this pull request Apr 7, 2022

feat: Added Vietnamese entry in VOCAB (mindee#878)

8a72716

* feat: Added Vietnamese entry in VOCAB - update style fix 2 * feat: Added Vietnamese entry in VOCAB - update style fix 3

calibretaliation deleted the vietnamese-vocabs branch April 7, 2022 06:54

frgfm added module: datasets Related to doctr.datasets type: new feature New feature labels May 2, 2022

felixdittrich92 added this to the 0.6.0 milestone Jun 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Added Vietnamese entry in VOCAB #878

feat: Added Vietnamese entry in VOCAB #878

calibretaliation commented Mar 31, 2022

codecov bot commented Mar 31, 2022 •

edited

charlesmindee left a comment

calibretaliation commented Apr 4, 2022

charlesmindee commented Apr 4, 2022

charlesmindee left a comment

charlesmindee left a comment

frgfm commented Apr 27, 2022

felixdittrich92 commented Apr 27, 2022

feat: Added Vietnamese entry in VOCAB #878

feat: Added Vietnamese entry in VOCAB #878

Conversation

calibretaliation commented Mar 31, 2022

codecov bot commented Mar 31, 2022 • edited

Codecov Report

charlesmindee left a comment

Choose a reason for hiding this comment

calibretaliation commented Apr 4, 2022

charlesmindee commented Apr 4, 2022

charlesmindee left a comment

Choose a reason for hiding this comment

charlesmindee left a comment

Choose a reason for hiding this comment

frgfm commented Apr 27, 2022

felixdittrich92 commented Apr 27, 2022

codecov bot commented Mar 31, 2022 •

edited