Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[datasets] Add a synthetic recognition dataset #262

Closed
fg-mindee opened this issue May 17, 2021 · 1 comment · Fixed by #760
Closed

[datasets] Add a synthetic recognition dataset #262

fg-mindee opened this issue May 17, 2021 · 1 comment · Fixed by #760
Assignees
Labels
module: datasets Related to doctr.datasets side-project Long-term issue which is particularly independant of other issues topic: text recognition Related to the task of text recognition
Milestone

Comments

@fg-mindee
Copy link
Contributor

fg-mindee commented May 17, 2021

The library would greatly benefit from synthetic data. It could be very helpful for character classification, and text recognition.

Here are suggestions about the augmentations operating on discrete spectrum:

  • choice of character in the vocab
  • choice of font family
  • choice of font style (bold, italic, etc)
    The last two would not be relevant for handwritten characters

And those operating on a continuous spectrum: all common image classifications augmentations (rotation, color modifications, etc.)

@fg-mindee fg-mindee added help wanted Extra attention is needed module: datasets Related to doctr.datasets labels May 17, 2021
@fg-mindee fg-mindee added this to the 0.3.0 milestone May 17, 2021
@charlesmindee charlesmindee added the side-project Long-term issue which is particularly independant of other issues label Jun 1, 2021
@fg-mindee fg-mindee modified the milestones: 0.3.0, 0.4.0 Jul 1, 2021
@fg-mindee fg-mindee added the topic: text recognition Related to the task of text recognition label Aug 25, 2021
@fg-mindee fg-mindee modified the milestones: 0.4.0, 0.4.1 Sep 20, 2021
@fg-mindee fg-mindee modified the milestones: 0.4.1, 1.0.0, 0.5.0 Oct 30, 2021
@felixdittrich92
Copy link
Contributor

@fg-mindee
@charlesmindee
following #640 what is your idea to implement this or better what you expect to generate ?
My closed PR has implemented a Word-Image generator with the depending word labels where you can specify:

  • language (vocab)
  • image size
  • background (noise /white)
  • font (family / size)
  • was planned also to add: rotation / distorsion / skew / blur

My orientation was a bit on this: TRDG
An example:
0_200
label: "sweat"

wdyt ? :)

@fg-mindee fg-mindee removed the help wanted Extra attention is needed label Dec 26, 2021
@fg-mindee fg-mindee self-assigned this Dec 26, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
module: datasets Related to doctr.datasets side-project Long-term issue which is particularly independant of other issues topic: text recognition Related to the task of text recognition
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants