[datasets] Add a synthetic recognition dataset #262
Labels
module: datasets
Related to doctr.datasets
side-project
Long-term issue which is particularly independant of other issues
topic: text recognition
Related to the task of text recognition
Milestone
The library would greatly benefit from synthetic data. It could be very helpful for character classification, and text recognition.
Here are suggestions about the augmentations operating on discrete spectrum:
The last two would not be relevant for handwritten characters
And those operating on a continuous spectrum: all common image classifications augmentations (rotation, color modifications, etc.)
The text was updated successfully, but these errors were encountered: