Output of softmax GRU and LSTM layers does not add up to 1 #6255

Closed
louisabraham opened this issue Apr 14, 2017 · 2 comments

Comments

@louisabraham

import numpy as np
from keras.models import Sequential
from keras.layers import LSTM

m = Sequential()
m.add(LSTM(3, input_shape=(3, 2), activation='softmax'))
# Each row of a softmax output should sum to 1, but it does not:
print(m.predict(np.random.rand(5, 3, 2)).sum(axis=-1))

[ 0.56759441  0.59162366  0.57279199  0.52342385  0.54326206]

This works fine with Dense or SimpleRNN.
If this is the expected behaviour, it should be stated in the documentation, because it is NOT what a user would expect from a softmax activation.
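For comparison, a minimal sketch of the SimpleRNN case mentioned above (same toy shapes; only the layer import changes):

import numpy as np
from keras.models import Sequential
from keras.layers import SimpleRNN

m = Sequential()
m.add(SimpleRNN(3, input_shape=(3, 2), activation='softmax'))
# Here the softmax is the last operation applied to the output,
# so each row sums to 1 (up to floating point error)
print(m.predict(np.random.rand(5, 3, 2)).sum(axis=-1))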

@joelthchao
Contributor

joelthchao commented Apr 15, 2017

Yes, this is somewhat misleading. The activation here is applied directly to each hidden unit inside the recurrent cell (before the gating), so the gated output no longer sums to 1. However, an LSTM is not normally used this way. The usual approach is:

m.add(LSTM(hidden_unit, input_shape=(3, 2)))  # keep the default activations; return the last hidden state
m.add(Dense(3, activation='softmax'))         # softmax over the 3 output units
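For completeness, a minimal runnable sketch of this pattern, assuming a toy value of hidden_unit = 8 (not specified in the thread) and the same input shapes as the original example:

import numpy as np
from keras.models import Sequential
from keras.layers import LSTM, Dense

hidden_unit = 8  # assumed size; any positive integer works

m = Sequential()
m.add(LSTM(hidden_unit, input_shape=(3, 2)))  # recurrent encoder with its default activations
m.add(Dense(3, activation='softmax'))         # normalization happens in this final layer

# Each row of the prediction now sums to 1 (up to floating point error)
print(m.predict(np.random.rand(5, 3, 2)).sum(axis=-1))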

@stale stale bot added the stale label Jul 14, 2017
@stale

stale bot commented Jul 14, 2017

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 30 days if no further activity occurs, but feel free to re-open a closed issue if needed.

@stale stale bot closed this as completed Aug 13, 2017