
How can we use Leaky ReLU and Parametric ReLU as activation functions? #117

Closed · gaoyuankidult opened this issue May 11, 2015 · 10 comments

@gaoyuankidult

Leaky ReLU and Parametric ReLU are implemented as classes; how can we use them as activation functions?

Thanks

@patyork (Contributor) commented May 11, 2015

There's a PReLU example in the Kaggle Otto example; it can be used as a template for all of the advanced activations:

from keras.layers.advanced_activations import LeakyReLU, PReLU
..
..
model.add(Dense(512, 512, activation='linear')) # Add any layer, with the default of an identity/linear squashing function (no squashing)
model.add(LeakyReLU(alpha=.001))   # add an advanced activation
...
...
model.add(Dense(512, 123, activation='linear')) # Add any layer, with the default of an identity/linear squashing function (no squashing)
model.add(PReLU((123,)))   # add an advanced activation
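
For reference, the same pattern with a later, Keras 2-style API (Dense taking a single units argument and PReLU inferring its parameter shape from the previous layer) would look roughly like this; a minimal sketch, with a hypothetical 784-dimensional input:

from keras.models import Sequential
from keras.layers import Dense, LeakyReLU, PReLU

model = Sequential()
model.add(Dense(512, input_shape=(784,)))  # linear by default: no squashing here
model.add(LeakyReLU(alpha=0.001))          # advanced activation added as its own layer
model.add(Dense(123))
model.add(PReLU())                         # parameter shape inferred from the previous layer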

@gaoyuankidult (Author)

Thanks @patyork. But what if I would like to use them as the inner_activation?

@patyork (Contributor) commented May 12, 2015

Hmm, it doesn't appear to be possible at the moment to use the advanced activations as the inner activation of recurrent layers. @fchollet may be able to say for sure. This is a good point.

@fchollet (Member)

Not currently possible with Keras. It would be possible to modify a recurrent layer to add this capability, though.
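
(In later Keras releases, one workaround is to pass a plain Python callable as a recurrent layer's activation; that covers a fixed-slope leaky ReLU, though not a trainable PReLU. A rough sketch against a Keras 2-style SimpleRNN, with hypothetical timestep/feature sizes:)

from keras import backend as K
from keras.models import Sequential
from keras.layers import SimpleRNN

def leaky_relu(x):
    # fixed-slope leaky ReLU; the slope is not trainable, unlike PReLU
    return K.relu(x, alpha=0.01)

model = Sequential()
model.add(SimpleRNN(128, input_shape=(10, 64), activation=leaky_relu))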

@gaoyuankidult (Author)

@patyork @fchollet Adding more flags might violate the principles of this library. Perhaps we need to come up with a good way of doing this.

@wxs (Contributor) commented Jun 9, 2015

@fchollet would a reasonable stopgap approach here be to add a "dummy" layer whose get_output() is just the identity, but also exposes the correct PReLU activation as a separate method, say activation()? Then by adding it to your model, nothing changes except that its parameters become part of the gradient update.

Finally: when adding the recurrent layer you can set its activation function to dummy.activation. Does that make sense? Something like this:

dummy = DummyPReLU(512)
model.add(dummy)
model.add(SimpleRNN(512,512, activation=dummy.activation))
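
Roughly, such a dummy layer could subclass the existing PReLU and only override its output to be the identity, while exposing the parametric activation as a separate method; a sketch assuming the 2015-era layer API (get_input/get_output) and that PReLU keeps its slopes in self.alphas:

from keras.layers.advanced_activations import PReLU

class DummyPReLU(PReLU):
    # Identity layer: passes its input through unchanged, but its alphas
    # remain in self.params, so they still receive gradient updates.
    def get_output(self, train=False):
        return self.get_input(train)

    # The parametric activation itself, usable as activation=dummy.activation.
    def activation(self, X):
        pos = (X + abs(X)) / 2.0
        neg = self.alphas * ((X - abs(X)) / 2.0)
        return pos + neg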

stale bot commented May 24, 2017

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 30 days if no further activity occurs, but feel free to re-open a closed issue if needed.

stale bot closed this as completed Jun 23, 2017
@iamtodor

The problem is still present.
My code:

model.add(Conv2D(filters=512, kernel_size=(3, 3), kernel_initializer='he_normal',
                     padding='same'))
model.add(PReLU())
model.add(BatchNormalization())

I got the following warning:

/data/Ilya/projects/whale/env/lib/python3.5/site-packages/keras/activations.py:115: UserWarning: Do not pass a layer instance (such as PReLU) as the activation argument of another layer. Instead, advanced activation layers should be used just like any other layer in a model. identifier=identifier.__class__.__name__))

And then messages like this were printed:

2018-06-28 22:20:22.411542: W tensorflow/core/common_runtime/bfc_allocator.cc:217] Allocator (GPU_0_bfc) ran out of memory trying to allocate 1.55GiB. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory is available.

And eventually an OOM error.
How should PReLU be handled?

@kirarpit

@iamtodor, I think the problem here is that you didn't define any activation function in the first layer.
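
For reference, the pattern the warning asks for is to leave the convolution's activation linear (the default) and add PReLU as its own layer, never passing a layer instance through the activation argument; a minimal sketch with a hypothetical input shape:

from keras.models import Sequential
from keras.layers import Conv2D, PReLU, BatchNormalization

model = Sequential()
model.add(Conv2D(filters=512, kernel_size=(3, 3), kernel_initializer='he_normal',
                 padding='same', input_shape=(64, 64, 3)))  # hypothetical input shape
model.add(PReLU())              # advanced activation as a standalone layer
model.add(BatchNormalization())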

@naarkhoo

Five years later, I have the same question, but it seems to be only a warning now. Can you elaborate, please?
