Attention Mechanism not working #56
Closed
SaharaAli16 opened this issue May 27, 2021 · 10 comments

SaharaAli16 commented May 27, 2021

Hi,
I have added an attention layer (following the example) to my simple LSTM network shown below.

# Imports assumed from context (Keras plus the `attention` package from this repo):
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import LSTM, Dropout, Dense
from attention import Attention

timestep = timesteps  # `timesteps` is defined earlier in the notebook
features = 11
model = Sequential()
model.add(LSTM(64, input_shape=(timestep, features), return_sequences=True))
model.add(Dropout(0.2))
model.add(LSTM(32, return_sequences=True))
model.add(LSTM(16, return_sequences=True))
model.add(Attention(32))
model.add(Dense(32))
model.add(Dense(16))
model.add(Dense(1))
print(model.summary())
The code worked fine up until last week, and the model summary showed the attention layer's details like this:
(screenshot: model summary including the attention layer)

However, now running the same code gives me a weird error.
ValueError: tf.function-decorated function tried to create variables on non-first call.

I also noticed that the model summary has changed:
(screenshot: changed model summary)

I am tight on time due to an upcoming deadline, so any assistance would be highly appreciated.
P.S. This was a fully working model that stopped working all of a sudden, for no apparent reason.

philipperemy (Owner) commented

Try downgrading your TensorFlow version.

SaharaAli16 commented May 28, 2021

I changed the way I was defining the model, without downgrading TensorFlow, and it started working again. New model definition:

# Imports assumed from context:
from tensorflow.keras.models import Model
from tensorflow.keras.layers import Input, LSTM, Dropout, Dense
from attention import Attention

timestep = timesteps  # `timesteps` is defined earlier in the notebook
features = 11

model_input = Input(shape=(timestep, features))
x = LSTM(64, return_sequences=True)(model_input)
x = Dropout(0.2)(x)
x = LSTM(32, return_sequences=True)(x)
x = LSTM(16, return_sequences=True)(x)
x = Attention(32)(x)
x = Dense(32)(x)
x = Dense(16)(x)
x = Dense(1)(x)
model = Model(model_input, x)
print(model.summary())

philipperemy (Owner) commented

Great!

SaharaAli16 commented May 30, 2021

Quick follow-up question: can you tell me how to downgrade TensorFlow to 2.3? The current version in Colab is 2.5, and I am hitting the reported issue again, even with the new model definition.
I know %tensorflow_version 2.x cannot downgrade TF to 2.3.

SaharaAli16 reopened this May 30, 2021
philipperemy (Owner) commented

I think this should work:

!pip install tensorflow==2.3

Like this:
(screenshot: the pip install running in a Colab cell)
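
A minimal sketch of the full check, assuming a Colab notebook (the restart step reflects how Colab picks up reinstalled packages):

!pip install tensorflow==2.3

# After the install, restart the Colab runtime so the new version is
# actually loaded, then verify:
import tensorflow as tf
print(tf.__version__)  # expected: 2.3.x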

SaharaAli16 (Author) commented

Alright, so that worked. Next up: I cannot use multiple Attention layers in one ensembled model. I have model1 with one attention layer and model2 with another, but when I concatenate the two models, I get this error:
ValueError: The name "last_hidden_state" is used 2 times in the model. All layer names should be unique.
I believe this is because the attention layer itself contains multiple inner/nested layers, and a model cannot have two layers with the same name. I tried renaming the attention layer, but since the name applies only to the wrapper, the renaming didn't help and the error persists.
Any workaround for this?

philipperemy (Owner) commented

@SaharaAli16 yes, you have to remove the hard-coded names inside the layer: https://github.com/philipperemy/keras-attention-mechanism/blob/0f8b440e8e74fb25309b2d391f7280bf4f13129a/attention/attention.py#L24. Otherwise Keras will complain that the names already exist when you instantiate a second Attention instance.
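
To illustrate the workaround, here is a minimal sketch of per-block name prefixing. The attention_block helper, its layer names, and the simplified Luong-style weighted sum are all illustrative, not the library's actual implementation:

import tensorflow as tf
from tensorflow.keras import layers

def attention_block(x, prefix):
    # x: LSTM outputs of shape (batch, timesteps, features).
    # Every inner layer carries a unique `prefix`, so two attention
    # blocks can coexist in one merged model without name collisions.
    score = layers.Dense(1, name=f"{prefix}_score")(x)                 # (batch, T, 1)
    weights = layers.Softmax(axis=1, name=f"{prefix}_weights")(score)  # attention over time
    weighted = layers.Multiply(name=f"{prefix}_weighted")([x, weights])
    # Sum over the time axis to get one context vector per sample.
    return layers.Lambda(lambda t: tf.reduce_sum(t, axis=1),
                         name=f"{prefix}_context")(weighted)

Building model1 with attention_block(x, "att1") and model2 with attention_block(x, "att2") sidesteps the duplicate-name error when the two models are concatenated.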

shlomi-schwartz commented

The suggested setup:

timestep = timesteps
features = 11

model_input = Input(shape=(timestep,features))
x = LSTM(64, return_sequences=True)(model_input)
x = Dropout(0.2)(x)
x = LSTM(32, return_sequences=True)(x)
x = LSTM(16, return_sequences=True)(x)
x = Attention(32)(x)
x = Dense(32)(x)
x = Dense(16)(x)
x = Dense(1)(x)
model = Model(model_input, x)
print(model.summary())

This no longer works with TF 2.7.0.

Error:

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-6-88e2c30c5093> in <module>()
      7 x = LSTM(32, return_sequences=True)(x)
      8 x = LSTM(16, return_sequences=True)(x)
----> 9 x = Attention(32)(x)
     10 x = Dense(32)(x)
     11 x = Dense(16)(x)

1 frames
/usr/local/lib/python3.7/dist-packages/keras/engine/base_layer.py in __init__(self, trainable, name, dtype, dynamic, **kwargs)
    339              trainable.dtype is tf.bool)):
    340       raise TypeError(
--> 341           'Expected `trainable` argument to be a boolean, '
    342           f'but got: {trainable}')
    343     self._trainable = trainable

TypeError: Expected `trainable` argument to be a boolean, but got: 32

SaharaAli16 (Author) commented

I would suggest copying the source code into your own project and using it directly. That should work.

philipperemy (Owner) commented

Yes, this issue was fixed in the latest release (4.1) of the attention mechanism.

pip install attention --upgrade

will solve it.
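
For completeness, a sketch of calling the layer after the upgrade. Passing units by keyword is an assumption based on the traceback above (where the bare positional 32 ended up in the base Layer's trainable argument); check the installed version's signature:

# After `pip install attention --upgrade` (attention >= 4.1):
from attention import Attention

x = Attention(units=32)(x)  # keyword argument avoids the positional clash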
