Enable element-wise weighing of outputs in model.fit(..., sample_weight=weights) #10775

rcasero · 2018-07-25T15:02:04Z

Summary

Add element-wise weighting of the loss function.
Described in Issue #10561

Keras API Design Review google doc with comments enabled here
https://docs.google.com/document/d/19BDXgNmeTgpgb9xYKzNboXyM7XX2PeM3mlvCFCdIQj0/edit?usp=sharing

Related Issues

None, as far as I know.

PR Overview

[y] This PR requires new unit tests [y/n] (make sure tests are included)

As described in the API Design Review, I've had some trouble with this, and could use some guidance.

[y] This PR requires to update the documentation [y/n] (make sure the docs are up-to-date)

Help added in the code, but as noted in the API Design Review doc, I also need a bit of guidance with that.

[?] This PR is backwards compatible [y/n]

I don't know how to test this.

[y] This PR changes the current API [y/n] (all API changes need to be approved by @fchollet )

It adds a new possible value 'element' to the option sample_weight_mode in model.compile().

rcasero · 2018-08-01T11:15:10Z

I have asked for guidance in the thread "How to debug a keras pull request" in the mailing list.

…work.errors_impl.InvalidArgumentError: Incompatible shapes

Some text lines were too long

fchollet · 2018-08-05T23:50:59Z

@pavithrasv what's your take on this feature?

fchollet · 2018-08-05T23:51:26Z

Thank you for the PR @rcasero. Will review soon.

Dapid

Looks great! We need some tests, though.

keras/engine/training_utils.py

Dapid · 2018-08-06T09:08:27Z

I think this feature is very interesting, and I will be using it a lot.

My use case is predictions of some experimental results. When I am predicting 1D features, I can use "temporal" mode and mask out the points where I am missing data, but when I am predicting 2D, I can no longer use that trick. My solution now is to wrap my model and manually force the outputs at the missing pixels to 0; essentially multiplying the outputs by the "present" mask.

This mode would allow me to have simpler models, avoid the extra complication of extracting the inner model when I need to use it, and also use the same structure for 1 and 2D.

rcasero · 2018-08-06T12:47:27Z

I wrote a test script, but I'd need help to turn it into test units. (As I mention in the Keras API Design Review google doc, I've tried following the instructions to test keras, but it fails for me even with the unpatched main keras branch.)

keras/engine/training.py

keras/engine/training_utils.py

pavithrasv · 2018-08-08T04:04:01Z

I can see this feature being useful for use cases like the one @Dapid has mentioned. It could also be used to mask unknown elements in a sample. Did a first pass through the code, will review again after unit tests have been added.

rcasero · 2018-08-22T18:00:20Z

Suggestions by @pavithrasv pushed in commit 157244c

rcasero · 2018-08-23T11:12:06Z

I implemented this feature because I generate training data with unknown elements, as @pavithrasv mentions. Training a network this way is working for me.

As mentioned above, I have a testing script, but I don't know how to generate unit tests in keras.

fchollet · 2018-09-12T18:46:43Z

@pavithrasv Could you please take a look at the recent changes? Thank you.

pavithrasv

Thank you for the changes. Can you add unit tests to test_training.py :https://github.com/keras-team/keras/blob/master/tests/keras/engine/test_training.py

rcasero · 2018-09-18T10:27:33Z

@pavithrasv What I see in "View changes" proposed by you is the patch I've submitted, right? Do I need to do anything about that?

Roger about the unit tests.

pavithrasv · 2018-09-18T15:35:11Z

Yes, the only change I had requested after that commit were the unit tests. Thank you!

rcasero · 2018-09-18T15:58:12Z

@pavithrasv I've found an error if one has two outputs, and one is e.g. (None, 22, 22, 2) instead of (None, 22, 22, 1). Code and error below. Any help appreciated!

import os
os.environ['KERAS_BACKEND'] = 'tensorflow'
import keras
import keras.backend as K
import numpy as np

from keras.models import Model, Sequential
from keras.layers import Activation, Conv2D, Input
from keras.layers.normalization import BatchNormalization

from keras.utils import multi_gpu_model

# remove warning "Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA"
os.environ['TF_CPP_MIN_LOG_LEVEL'] = '2'  # Just disables the warning, doesn't enable AVX/FMA

# image_data_format = 'channels_first'
image_data_format = 'channels_last'

K.set_image_data_format(image_data_format)

# simulate input images
im = np.zeros(shape=(10, 64, 64, 3), dtype='uint8')

# simulate network output
out = 2 * np.ones(shape=(10, 64, 64, 1), dtype='float32')
aux_out = 5 * np.ones(shape=(10, 22, 22, 1), dtype='float32')
# simulate training weights for network output
# weight = np.ones(shape=(10, 64, 64, 1), dtype='float32')
weight = np.ones(shape=(10, 64, 64, 1), dtype='float32')
aux_weight = np.ones(shape=(10, 22, 22, 1), dtype='float32')

# simulate validation data
im_validation = 3 * np.ones(shape=(5, 64, 64, 3), dtype='uint8')
out_validation = 4 * np.ones(shape=(5, 64, 64, 1), dtype='float32')

validation_data = (im_validation, out_validation)

# optimizer
optimizer = keras.optimizers.SGD(lr=0.01, decay=1e-6, momentum=0.9, nesterov=True)

'''Multi-output CNN with outputs of different number of features
'''

# create network model
input = Input(shape=im.shape[1:], dtype='float32')
x = Conv2D(filters=32, kernel_size=(3, 3), strides=1, padding='same')(input)
x = BatchNormalization(axis=3)(x)
x = Activation('relu')(x)

main_output = Conv2D(filters=1, kernel_size=(1, 1), strides=1, padding='same', name='main_output')(x)
aux_output = Conv2D(filters=2, kernel_size=(1, 1), strides=3, padding='same', name='aux_output')(x)

model = Model(inputs=input, outputs=[main_output, aux_output])

'''list format (sample_weight_mode=['element', 'element'])
'''

model.compile(loss='mae', optimizer=optimizer, metrics=['accuracy'],
              sample_weight_mode=['element', 'element'])

model.fit(im, [out, np.repeat(aux_out, repeats=2, axis=3)],
          sample_weight=[weight, np.repeat(aux_weight, repeats=2, axis=3)],
          batch_size=3, epochs=3)

model.fit(im, [out, np.repeat(aux_out, repeats=2, axis=3)],
          sample_weight=[weight, np.repeat(aux_weight, repeats=2, axis=3)],
          batch_size=3, epochs=3)
Epoch 1/3
Traceback (most recent call last):
  File "<input>", line 3, in <module>
  File "/home/rcasero/Software/keras_branch_sample_weight/keras/engine/training.py", line 1070, in fit
    validation_steps=validation_steps)
  File "/home/rcasero/Software/keras_branch_sample_weight/keras/engine/training_arrays.py", line 199, in fit_loop
    outs = f(ins_batch)
  File "/home/rcasero/Software/keras_branch_sample_weight/keras/backend/tensorflow_backend.py", line 2661, in __call__
    return self._call(inputs)
  File "/home/rcasero/Software/keras_branch_sample_weight/keras/backend/tensorflow_backend.py", line 2631, in _call
    fetched = self._callable_fn(*array_vals)
  File "/home/rcasero/.conda/envs/cytometer_tensorflow/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1454, in __call__
    self._session._session, self._handle, args, status, None)
  File "/home/rcasero/.conda/envs/cytometer_tensorflow/lib/python3.6/site-packages/tensorflow/python/framework/errors_impl.py", line 519, in __exit__
    c_api.TF_GetCode(self.status.status))
tensorflow.python.framework.errors_impl.InvalidArgumentError: Input to reshape is a tensor with 2904 values, but the requested shape has 1452
	 [[Node: loss/aux_output_loss/Reshape = Reshape[T=DT_FLOAT, Tshape=DT_INT32, _device="/job:localhost/replica:0/task:0/device:GPU:0"](_arg_aux_output_sample_weights_0_4/_113, loss/aux_output_loss/Shape_1)]]
	 [[Node: loss/add/_151 = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_723_loss/add", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]

gabrieldemarmiesse · 2018-09-18T20:39:45Z

I think something is going wrong around those lines:

# reduce weight array to same ndim as score_array (needed for
# sample_weight_mode='element')
if weight_ndim > K.ndim(score_array):
      weights = K.reshape(weights, K.shape(score_array))

If you do print(K.int_shape(weights)) and print(K.int_shape(score_array)) before the reshape, the error should be obvious. I didn't pull your branch though, so no guarantees.

…_mode='element'

rcasero · 2018-09-19T02:02:11Z

Thanks @gabrieldemarmiesse . You were right, the problem was there. Furthermore, the patch had a conceptual error, because the element-wise weights should have the size of score_array, not the output. Both have been fixed with commit d5701ea.

I still need to do some tests and write the test units.

rcasero · 2018-09-19T13:04:06Z

My last commits broke the 10782.8 test, Python: 2.7, KERAS_BACKEND=cntk in the Travis CI build, in particular tests/test_multiprocessing.py::test_multiprocessing_predict_error.

Can't debug for the moment, because keras docker requires nvidia-docker, which doesn't accept Ubuntu 17.10 (my current distribution).

On a related note, I had to:

Edit keras/docker/Makefile, adding --network host to the build, because otherwise my computer cannot reach ubuntu servers

docker build -t keras --build-arg python_version=$(PYTHON_VERSION) ...

Edit keras/docker/Dockerfile, replacing

git clone git://github.com/

with

git clone http://github.com/

and replacing

pip install git+git://github.com/keras-team/keras.git

with

pip install git+http://github.com/keras-team/keras.git

because otherwise the git and pip commands time out without completing.

gabrieldemarmiesse · 2018-09-19T13:47:57Z

The error is not due to your commit. It is a flaky test. I'll restart the build for you. You can also install keras cpu by folowing the instructions in .travis.yml to debug stuff locally.

rcasero · 2018-09-19T15:17:19Z

Thanks, @gabrieldemarmiesse . I'll look at .travis.yml for the testing.

gabrieldemarmiesse · 2018-11-02T16:13:56Z

I'll try to fix the test. Please use git pull before working again on your branch.

gabrieldemarmiesse · 2018-11-02T16:56:57Z

I can't review this anymore since I added some commits. We need new reviewers. The build is passing, so it's ready for review.

rcasero · 2018-11-02T17:40:27Z

Thanks. Anything else that needs to be done? Do we need that error message if the weights ndim is not 1 less than the output's ndim?

gabrieldemarmiesse · 2018-11-02T18:23:27Z

We need this error message. You already wrote it. It looks like this now:

if sample_weight is not None and sample_weight.shape != score_array_shape:
    raise ValueError('Found a `sample_weight` array with shape ' +
                             str(sample_weight.shape) +
                             ' for output with shape ' +
                             str(y.shape) +
                             '. When sample_weight_mode="element", ' +
                             'weights and score_array must have the same size.'
                             'Your `sample_weight` array should have the '
                             'following shape: ' + str(score_array_shape))

gabrieldemarmiesse · 2018-11-02T18:25:14Z

I think we just need to wait for the build, and if it passes, for a review from another member of the keras team. Thanks for your work @rcasero !

rcasero · 2018-11-02T18:39:52Z

Thanks for your help, @gabrieldemarmiesse

gabrieldemarmiesse · 2018-11-18T11:35:51Z

@pavithrasv, this PR is ready. Could you please take a look?

rcasero · 2018-12-05T17:51:16Z

@pavithrasv Just bumping this up

gabrieldemarmiesse · 2018-12-09T19:31:25Z

@fchollet please take a look at it when you have the time.

The tests have been added.

rcasero · 2019-02-04T14:41:36Z

@fchollet @gabrieldemarmiesse bumping this up, as it seems to have been forgotten

gabrieldemarmiesse · 2019-02-07T07:56:41Z

I haven't forgotten, but each PR changing the API needs @fchollet 's approval. We need to be better organised for this.

rcasero · 2019-02-07T12:24:50Z

@gabrieldemarmiesse @fchollet Not blaming anyone. :) I'd just love this to be merged so that updates to keras don't keep breaking the branch, and we can refer to it in a publication. I've just merged the official keras into my branch, as there were some new conflicts. Now it passes the tests again.

rcasero · 2019-02-07T12:50:03Z

@todiketan Perhaps you could ask that question in the thread you mention (just reply to my message there), as it's off-topic here? (This is an issue about adding an element-wise weighting feature to keras).

fchollet · 2019-09-11T21:17:02Z

Thank you for preparing this PR.

We are no longer adding new features to multi-backend Keras, as we are refocusing development efforts on tf.keras. If you are still interested in submitting this PR, please direct it to tf.keras in the TensorFlow repository instead.

rcasero · 2019-09-12T10:01:01Z

@fchollet Thanks for letting me know. Could you give me a couple of pointers of where the code goes within https://github.com/tensorflow/tensorflow/tree/master/tensorflow/python/keras? The code structure seems to be different to regular keras, although the filenames are the same.

The PR makes the following changes:

keras/engine/training.py
- In compile(), add an 'element' option to sample_weight_modes
- instance a K.placeholder with the size of the loss function for the weights.
keras/engine/training_utils.py: In weighted_masked_objective(),
- check that the size of the weights array coincides with the size of the loss function
- calculate the size of the score_array_shape
tests/keras/engine/test_training.py
- test unit

rcasero added 2 commits July 6, 2018 18:35

Issue keras-team#10561: Enable element-wise sample_weights

0d3bfb5

Issue keras-team#10561: Add doc strings for element-wise sample_weights

f1c5c1d

rcasero mentioned this pull request Jul 25, 2018

[API DESIGN REVIEW] Enable element-wise weighing of outputs in model.fit(..., sample_weight=weights) #10561

Closed

rcasero added 2 commits August 5, 2018 17:40

Fix issue #1: when batch_size>1, weights give tensorflow.python.frame…

6003480

…work.errors_impl.InvalidArgumentError: Incompatible shapes

Fix PEP8-check errors found by Travis build

cbaa2f5

Some text lines were too long

Dapid suggested changes Aug 6, 2018

View reviewed changes

keras/engine/training_utils.py Outdated Show resolved Hide resolved

small refactor suggested by @Dapid

86b241c

pavithrasv reviewed Aug 8, 2018

View reviewed changes

keras/engine/training.py Outdated Show resolved Hide resolved

keras/engine/training_utils.py Outdated Show resolved Hide resolved

keras/engine/training_utils.py Outdated Show resolved Hide resolved

keras/engine/training_utils.py Outdated Show resolved Hide resolved

fchollet assigned pavithrasv Aug 9, 2018

Issue keras-team#10775, code refactorings suggested by @pavithrasv

157244c

pavithrasv previously requested changes Sep 12, 2018

View reviewed changes

rcasero added 2 commits September 19, 2018 02:41

fix bug. Weights must have the shape of score_array for sample_weight…

d5701ea

…_mode='element'

fix PEP8-check errors

8171545

removed the keras_test decorator.

7eb5990

gabrieldemarmiesse added the Reviewers wanted label Nov 2, 2018

gabrieldemarmiesse added 5 commits November 2, 2018 18:43

Simplified the logic in standardize_weights().

5de3414

Simplified the test.

bb4c21a

Inverted the order of optimizer/loss.

7b8b1ed

Added a test and simplified the logic when making the placeholder.

45eef01

No need to break lines.

973d9eb

gabrieldemarmiesse assigned fchollet Dec 9, 2018

Merge remote-tracking branch 'upstream/master'

038e1db

rcasero added 4 commits August 14, 2019 14:55

Merge branch 'master' of https://github.com/keras-team/keras

0681160

minor

381e1e7

minor

f674bc2

Merge branch 'master' of https://github.com/keras-team/keras

2193b0c

fchollet closed this Sep 11, 2019

Enable element-wise weighing of outputs in model.fit(..., sample_weight=weights) #10775

Enable element-wise weighing of outputs in model.fit(..., sample_weight=weights) #10775

Uh oh!

Conversation

rcasero commented Jul 25, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Related Issues

PR Overview

Uh oh!

rcasero commented Aug 1, 2018

Uh oh!

fchollet commented Aug 5, 2018

Uh oh!

fchollet commented Aug 5, 2018

Uh oh!

Dapid left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Dapid commented Aug 6, 2018

Uh oh!

rcasero commented Aug 6, 2018

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pavithrasv commented Aug 8, 2018

Uh oh!

rcasero commented Aug 22, 2018

Uh oh!

rcasero commented Aug 23, 2018

Uh oh!

fchollet commented Sep 12, 2018

Uh oh!

pavithrasv left a comment

Choose a reason for hiding this comment

Uh oh!

rcasero commented Sep 18, 2018

Uh oh!

pavithrasv commented Sep 18, 2018

Uh oh!

rcasero commented Sep 18, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gabrieldemarmiesse commented Sep 18, 2018

Uh oh!

rcasero commented Sep 19, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rcasero commented Sep 19, 2018

Uh oh!

gabrieldemarmiesse commented Sep 19, 2018

Uh oh!

rcasero commented Sep 19, 2018

Uh oh!

gabrieldemarmiesse commented Nov 2, 2018

Uh oh!

gabrieldemarmiesse commented Nov 2, 2018

Uh oh!

rcasero commented Nov 2, 2018

Uh oh!

gabrieldemarmiesse commented Nov 2, 2018

Uh oh!

gabrieldemarmiesse commented Nov 2, 2018

Uh oh!

rcasero commented Nov 2, 2018

Uh oh!

gabrieldemarmiesse commented Nov 18, 2018

Uh oh!

rcasero commented Dec 5, 2018

Uh oh!

gabrieldemarmiesse commented Dec 9, 2018

Uh oh!

rcasero commented Feb 4, 2019

Uh oh!

gabrieldemarmiesse commented Feb 7, 2019

Uh oh!

rcasero commented Feb 7, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

rcasero commented Jul 25, 2018 •

edited

Loading

rcasero commented Sep 18, 2018 •

edited

Loading

rcasero commented Sep 19, 2018 •

edited

Loading

rcasero commented Feb 7, 2019 •

edited

Loading