Resize and noise policies #2252

Open
ducha-aiki wants to merge 8 commits into master

Conversation

@ducha-aiki
Contributor

Works only for databases stored as cv::Mat.
@immars kindly rebased my branch onto current master and added Gaussian noise.
P.S. Travis is failing because of a different OpenCV version, which does not have the CLAHE function.
Adds different resize and noise policies to the data_transformer, like:

transform_param {
  crop_size: 227
  resize_param {
    prob: 0.5
    resize_mode: WARP
    height: 256
    width: 256
    interp_mode: LINEAR
  }
  resize_param {
    prob: 0.25
    resize_mode: BY_LARGE_SIZE
    height: 256
    width: 256
    interp_mode: AREA
  }
  resize_param {
    prob: 0.2
    resize_mode: BY_SMALL_SIZE_AND_PAD
    height: 256
    width: 256
    pad_mode: MIRRORED
    interp_mode: NEAREST
  }
  resize_param {
    prob: 0.05
    resize_mode: BY_SMALL_SIZE_AND_PAD
    height: 256
    width: 256
    pad_mode: CONSTANT
    pad_value: 104
    pad_value: 117
    pad_value: 124
    interp_mode: NEAREST
  }
  noise_param {
    prob: 0.6
  }
  noise_param {
    hist_eq: true
    prob: 0.2
  }
  noise_param {
    decolorize: true
    gauss_blur: true
    prob: 0.2
  }
}

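A note for readers: the prob fields sum to 1.0 across both the resize_param and the noise_param entries above, which suggests at most one policy of each kind is applied per image. A minimal C++ sketch of how such per-policy sampling could work; the ResizePolicy struct and pickPolicy helper here are hypothetical illustrations, not the PR's actual code:

#include <cstdlib>
#include <vector>

// Hypothetical stand-in for one resize_param entry.
struct ResizePolicy {
  float prob;         // probability of applying this policy
  int height, width;  // target size; resize_mode, interp_mode, etc. would live here too
};

// Treat the prob fields as a discrete distribution and pick one entry;
// returns -1 when the draw falls past all entries (image left untouched).
int pickPolicy(const std::vector<ResizePolicy>& policies) {
  float r = static_cast<float>(std::rand()) / RAND_MAX;
  float cumulative = 0.f;
  for (size_t i = 0; i < policies.size(); ++i) {
    cumulative += policies[i].prob;
    if (r < cumulative) return static_cast<int>(i);
  }
  return -1;  // probs summed to < 1 and the draw missed: no resize
}

The same sampling would apply to the noise_param entries.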
@lukeyeager
Contributor

We've done something similar with DIGITS on the database creation side of things. I would love to see these options in caffe instead - nice work!

A few thoughts from our research:

  1. Padding with REPEAT_NEAREST was terrible. The network learns to recognize those distinctive lines at the edge of the image, and it throws off the training. CONSTANT wasn't much better. We had better luck padding with random noise than with anything else. By all means, leave REPEAT_NEAREST in there and let people try it for themselves, but I'd like to see RANDOM added as another option in the future (a sketch of the idea appears after this comment).
  2. Our go-to resize method is halfway in between your FIT_SMALL_SIZE (or "crop") and FIT_LARGE_SIZE_AND_PAD (or "fill") - something we call "half crop, half fill" (great name, I know). You might want to look into it.

Both of those are more suggestions than critiques. This looks like a step in the right direction!

FYI, here is what HALF_CROP with RANDOM noise looks like (resize code here):

[image: resize_options]
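For the RANDOM padding suggestion above, a minimal OpenCV sketch of the idea, assuming a fit-inside resize followed by a noise-filled canvas (an illustration, not DIGITS' actual resize code):

#include <algorithm>
#include <opencv2/core/core.hpp>
#include <opencv2/imgproc/imgproc.hpp>

// Resize src to fit inside target_h x target_w (keeping aspect ratio),
// then paste it onto a canvas pre-filled with uniform random noise.
cv::Mat resizeWithRandomPad(const cv::Mat& src, int target_h, int target_w) {
  double scale = std::min(static_cast<double>(target_w) / src.cols,
                          static_cast<double>(target_h) / src.rows);
  cv::Mat resized;
  cv::resize(src, resized, cv::Size(), scale, scale, cv::INTER_LINEAR);

  cv::Mat canvas(target_h, target_w, src.type());
  cv::randu(canvas, cv::Scalar::all(0), cv::Scalar::all(256));  // noise fill

  int x = (target_w - resized.cols) / 2;  // center the image on the canvas
  int y = (target_h - resized.rows) / 2;
  resized.copyTo(canvas(cv::Rect(x, y, resized.cols, resized.rows)));
  return canvas;
}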

@ducha-aiki
Contributor Author

Thanks for the suggestions. In my experience, the best padding is mirroring, the second best is warping, and the third best is padding with the mean pixel.

I like DIGITS, but resizing when creating the dataset is too rigid compared to online data augmentation.
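The mirror and mean-pixel paddings ranked above map directly onto OpenCV border types; a minimal sketch, again illustrative rather than the PR's exact implementation:

#include <opencv2/core/core.hpp>
#include <opencv2/imgproc/imgproc.hpp>

// Pad src by `pad` pixels on each side by mirroring the image content.
cv::Mat padMirrored(const cv::Mat& src, int pad) {
  cv::Mat dst;
  cv::copyMakeBorder(src, dst, pad, pad, pad, pad, cv::BORDER_REFLECT_101);
  return dst;
}

// Pad src with a constant mean pixel (the BGR mean from the example config).
cv::Mat padMeanPixel(const cv::Mat& src, int pad) {
  cv::Mat dst;
  cv::copyMakeBorder(src, dst, pad, pad, pad, pad, cv::BORDER_CONSTANT,
                     cv::Scalar(104, 117, 124));
  return dst;
}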

@lukeyeager
Contributor

resizing when creating the dataset is too rigid compared to online data augmentation

Right, I agree. That's why I'm excited to see this going into caffe!

@ducha-aiki
Contributor Author

Have you tested mean or mirror padding?

@lukeyeager
Contributor

Have you tested mean or mirror padding?

I haven't. Mirror sounds promising. I'll check it out when I find some time.

@bhack
Contributor

bhack commented Apr 3, 2015

/cc @mtamburrano

weiliu89 added a commit to weiliu89/caffe that referenced this pull request Apr 14, 2015
@bhack
Contributor

bhack commented Jan 1, 2016

Just another PR dying by naturally drifting away from master. Is this project still alive?

@ducha-aiki
Contributor Author

@bhack more dead than alive. It is merged into my main branch, elu (https://github.com/ducha-aiki/caffe), but the separate headers haven't been merged there yet.

@bhack
Contributor

bhack commented Jan 2, 2016

@lukeyeager is your team still interested?

@bhack
Contributor

bhack commented Jan 2, 2016

@ducha-aiki Does it make sense to rebase?

@ducha-aiki
Contributor Author

@bhack that is not a question for me :) I can rebase, but I am not as patient as you and @mtamburrano ;)
so I will rebase only if it is needed.

@ducha-aiki
Contributor Author

@bhack Actually, I agree with https://github.com/zer0n/deepframeworks/blob/master/README.md:
caffe is nice as an industry standard for deployment and fast training of standard models.

But it is not as suitable for experiments as it used to be.

@bhack
Contributor

bhack commented Jan 2, 2016

@ducha-aiki I'm not so patient ;) I've stopped contributing until the MIA status of the BVLC core devs is clarified. Other frameworks are gaining momentum, so I think there is more choice now. I'll try to use your fork directly instead of trying to rebase this.

@ducha-aiki
Contributor Author

@bhack if you give me a way to contact you, I will notify you when I merge current master into my fork.
Btw, multiscale training is not always good, at least the variant I have tested :(
https://github.com/ducha-aiki/caffenet-benchmark/blob/master/Augmentation.md

@bhack
Contributor

bhack commented Jan 2, 2016

Things like basic augmentation could be part of the industry standard ;)

@bhack
Contributor

bhack commented Jan 2, 2016

I think the ImageNet dataset is rich enough to cover different scales. But in other cases, with a smaller dataset, it could be useful.

@bhack
Contributor

bhack commented Jan 3, 2016

@ducha-aiki Can it be used with an lmdb created with the convert_imageset utility?

@ducha-aiki
Contributor Author

@bhack yes, but only with compressed (i.e. jpeg, png, etc.) images.

@lukeyeager
Contributor

@lukeyeager is your team still interested?

Yes. I think somebody around here implemented their own version of this, but it would be great to have it standard in BVLC/caffe.

@ducha-aiki
Contributor Author

@lukeyeager @bhack
Rebased onto current bvlc master here: https://github.com/ducha-aiki/caffe/tree/augmentations
Not linted, and no tests written yet.

@ducha-aiki
Contributor Author

@shelhamer @ronghanghu
I have rebased this PR onto current master in another branch and can write tests and expand or shrink the feature set in the augmentation branch, if caffe is interested in data augmentation.
If not, I will abandon this. Any feedback, please?

@mtamburrano
Contributor

Really useful PR; we have already tested it, and it would be nice to see it merged into master.
Just my 2 cents.

@TimZaman

What a shame... RIP PR
