Create "Damage Paper" Augmentation #17

proofconstruction · 2021-07-08T09:42:25Z

Motivation
It's not uncommon to scan documents that have undergone some physical deformation, like tearing, folding, or crinkling. The resulting changes in the surface of the paper generally become more apparent after digitization, causing difficulties for humans and machines reading the text.

See, for example, the image at the top of this post

It would be useful to be able to generate images of "damaged" documents, for training models in settings like healthcare (medical records), law (contracts), finance (invoicing), and so on.

There are several forms of damage that could be applied. Off the top of my head:

crumpling
tearing
burning
dissolving

... to name a few

Implementation
There are several paths forward for something like this. The most naive way would be to take some images of damaged paper and use these as the base image for existing pipelines. One much more sophisticated approach would be generating a 3D model (perhaps using Blender API?) and applying an image of a document as a texture.

kwcckw · 2021-07-08T13:59:38Z

One much more sophisticated approach would be generating a 3D model (perhaps using Blender API?)

Interesting, or we can just use some damaged paper reference in paper phase?

proofconstruction · 2021-07-08T14:14:15Z

Interesting, or we can just use some damaged paper reference in paper phase?

Can you explain more what you mean?

kwcckw · 2021-07-08T14:25:21Z

The most naive way would be to take some images of damaged paper and use these as the base image for existing pipelines

Actually i think this should be good enough, but how a 3D model can further improve the results? Do you have any related link on that?

proofconstruction · 2021-07-08T15:43:46Z

I don’t have a link, and I’m not sure what to look for.

What I mean is:

We use a pipeline to create an image, just like we can now
After the image is generated, we pass it through the CrumplePaperAugmentation to simulate rolling the paper into a ball, tearing it, etc.
The result is very different than using the existing image pipeline to apply text to a picture of damaged paper, because the text does not deform in the same way the paper is.

jboarman · 2021-07-08T22:31:24Z

Before tacking the more complicated concepts in this issue, we might want to decompose the concepts into simpler techniques (like in #21) that we can evolve into more the complicated techniques that we'll need for fuller implementation needed for this issue.

jboarman · 2021-07-11T19:45:23Z

The mechanical deformations proposed here are good ones that should be independently proposed on a per-transform basis along with examples of the expected output. For now, I think we should allow issue #21 (single paper fold) to reach completion so that we can pivot accordingly at that point based on how that issue matures.

jboarman added the enhancement New feature or request label Jul 8, 2021

jboarman mentioned this issue Jul 8, 2021

Add Single Fold to Paper #21

Closed

jboarman added discussion Open dialog about how we can approach and solve various issues and removed enhancement New feature or request labels Jul 11, 2021

jboarman closed this as completed Jul 11, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create "Damage Paper" Augmentation #17

Create "Damage Paper" Augmentation #17

proofconstruction commented Jul 8, 2021

kwcckw commented Jul 8, 2021

proofconstruction commented Jul 8, 2021

kwcckw commented Jul 8, 2021

proofconstruction commented Jul 8, 2021

jboarman commented Jul 8, 2021

jboarman commented Jul 11, 2021

Create "Damage Paper" Augmentation #17

Create "Damage Paper" Augmentation #17

Comments

proofconstruction commented Jul 8, 2021

kwcckw commented Jul 8, 2021

proofconstruction commented Jul 8, 2021

kwcckw commented Jul 8, 2021

proofconstruction commented Jul 8, 2021

jboarman commented Jul 8, 2021

jboarman commented Jul 11, 2021