
is it possible to specify different batch sizes for train and validation? #8550

Closed
ophiry opened this issue Nov 21, 2017 · 7 comments


ophiry commented Nov 21, 2017

For the training data, there are reasons to keep batches relatively small (batch size can affect training results).
For the validation set, however, using a single reasonably big batch allows simple use of metric functions that don't work well when averaged across batches (precision / recall / F1).

@cbensimon
Contributor

If you don't specify any batch_size in your model definition (I think you never have to, but I may be wrong), your model has a dynamic batch size. You then use the batch_size parameter of the fit and predict methods of your Keras models.
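The point about a dynamic batch size can be illustrated without Keras at all: the batch size is a property of how the data is chunked at call time, not of the model. A minimal pure-Python sketch of that idea (the `batched` helper is illustrative, not a Keras function):

```python
def batched(data, batch_size):
    """Split a sequence into consecutive chunks of at most batch_size items."""
    return [data[i:i + batch_size] for i in range(0, len(data), batch_size)]

x_train = list(range(100))
x_val = list(range(50))

# Training: many small batches (as in fit(x_train, batch_size=32)).
train_batches = batched(x_train, 32)

# Validation: one big batch (as in predict(x_val, batch_size=len(x_val))).
val_batches = batched(x_val, len(x_val))

print(len(train_batches))  # 4 batches: 32 + 32 + 32 + 4
print(len(val_batches))    # 1 batch of 50
```

The same model weights are used either way; only the chunking of the input differs between the two calls.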


ophiry commented Nov 22, 2017

I use fit_generator.
The batch size of the training data is constant, though not explicitly defined in Keras.
The validation data is passed as numpy arrays (not as a generator), which are larger than the batch size of the training data.

I see that even though no batch size was defined, Keras breaks the validation data into batches the size of the training batch (I added a dummy metric that returns the length of the batch to confirm this).

@cbensimon
Contributor

For that scenario, my advice is to do the epochs loop yourself:
for each epoch, call fit_generator with epochs=1,
and then call predict on your validation data with the desired batch size.

Another solution is to use a generator for the validation data, so you have control over the batch_size of your validation data

You could also use a custom Keras Callback
(in which you call predict inside the on_epoch_end method).
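The epochs-loop and callback suggestions above share one pattern: train with one batch size, then evaluate with another after each epoch. Here is a framework-agnostic sketch of that pattern; in real Keras the hook would be `keras.callbacks.Callback.on_epoch_end`, and `DummyModel` stands in for a real model:

```python
class ValidationCallback:
    """Mimics a Keras callback that evaluates with its own batch size."""
    def __init__(self, model, x_val, batch_size):
        self.model = model
        self.x_val = x_val
        self.batch_size = batch_size
        self.history = []

    def on_epoch_end(self, epoch, logs=None):
        # Predict over the full validation set in one large batch,
        # independent of the batch size used during training.
        preds = self.model.predict(self.x_val, batch_size=self.batch_size)
        self.history.append(preds)

class DummyModel:
    def predict(self, x, batch_size):
        # Stand-in for a real model: record how it was called.
        return {"n_samples": len(x), "batch_size": batch_size}

model = DummyModel()
cb = ValidationCallback(model, x_val=list(range(500)), batch_size=500)
for epoch in range(3):
    # model.fit_generator(train_gen, epochs=1)  # training step elided
    cb.on_epoch_end(epoch)  # then validate with the desired batch size
print(cb.history[0])  # {'n_samples': 500, 'batch_size': 500}
```

With a real model, you would compute precision/recall/F1 (or any whole-set metric) on `preds` inside `on_epoch_end`, sidestepping per-batch averaging entirely.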

@cbensimon
Contributor

But you're right: maybe a validation_batch_size parameter is missing for the case where validation_data is not a generator.


boulderZ commented Jun 23, 2018

I need this feature now. I am implementing a custom metric that calculates correlation. If I use the training batch size of 32, it is too small for my custom metric, and I can get runs that are all zeros, resulting in NaN output for the validation metric. I would think this could be a common issue for any metric that needs a longer string of data to process accurately or in a stable way. It seems like this would be an easy feature to add.
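The failure mode described here is easy to reproduce: Pearson correlation divides by the product of the standard deviations, which is zero whenever a batch is constant (e.g. all zeros). A NumPy sketch with a small epsilon guard (the `pearson` function and its names are illustrative, not a Keras metric):

```python
import numpy as np

def pearson(y_true, y_pred, eps=1e-8):
    """Pearson correlation with an epsilon guard against zero-variance batches."""
    yt = y_true - y_true.mean()
    yp = y_pred - y_pred.mean()
    denom = np.sqrt((yt ** 2).sum() * (yp ** 2).sum())
    return float((yt * yp).sum() / (denom + eps))

rng = np.random.default_rng(0)
big = rng.normal(size=1000)
print(pearson(big, big))   # ~1.0 on a large batch

tiny = np.zeros(4)         # an all-zero batch, as described above
print(pearson(tiny, tiny)) # 0.0 instead of nan, thanks to the guard
```

The guard keeps training alive, but the underlying complaint stands: a correlation estimated on 32 samples is noisy regardless, so a larger validation batch is the real fix.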

@MatthewScholefield

@boulderZ I'm not sure if this would help your issue, but when I ran into a NaN issue, I was able to fix it with value / K.minimum(1.0, other_value). However, this didn't fix my issue of the metric being averaged across batches. This issue has some relevant posts that detail more of what @cbensimon and @ophiry are talking about.


chmn106 commented Jul 27, 2019

My advice is to define your own validation function, using a self-defined metric function.
