Stack LSTM Net for Paddle Book6 #5503

QiJune · 2017-11-08T23:33:41Z

qingqing01 · 2017-11-10T02:02:43Z

python/paddle/v2/framework/layers.py

+            'isReverse': is_reverse,
+            'gateActivation': gate_activation,
+            'cellActivation': cell_activation,
+            'candidateActivation': candidate_activation


These attribute names have been changed to snake_case; please update them.
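
As an illustration of the requested rename, the sketch below converts the four camelCase keys from the diff to snake_case mechanically. The helper `camel_to_snake` is hypothetical and not part of Paddle; the names the operator actually expects should be confirmed against the op definition.

```python
import re


def camel_to_snake(name):
    """Convert a camelCase identifier to snake_case (hypothetical helper)."""
    # Insert "_" before every uppercase letter that is not the first
    # character, then lowercase the whole string.
    return re.sub(r"(?<!^)(?=[A-Z])", "_", name).lower()


# Attribute keys as they appear in the diff under review.
old_attrs = [
    "isReverse",
    "gateActivation",
    "cellActivation",
    "candidateActivation",
]
new_attrs = [camel_to_snake(a) for a in old_attrs]
print(new_attrs)
```

The resulting keys match the Python-side argument names already used in the diff (`is_reverse`, `gate_activation`, and so on).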

qingqing01 · 2017-11-10T02:03:42Z

python/paddle/v2/framework/layers.py

+            'cellActivation': cell_activation,
+            'candidateActivation': candidate_activation
+        })
+    return hidden


Cell is also an output of the op, so it should be returned along with hidden.

qingqing01 · 2017-11-10T02:09:40Z

python/paddle/v2/framework/tests/test_understand_sentiment_dynamic_lstm.py

+    inputs = [fc1, lstm1]
+
+    for i in range(2, stacked_num + 1):
+        fc = layers.fc(input=inputs, size=hid_dim)


This is inconsistent with the book: https://github.com/PaddlePaddle/book/blob/develop/06.understand_sentiment/train.py#L58

This fc has two inputs and therefore two sets of weights, each with its own initialization. Note in particular that the weight for the lstm input is initialized to 0.

fc_para_attr = paddle.attr.Param(learning_rate=1e-3)
lstm_para_attr = paddle.attr.Param(initial_std=0., learning_rate=1.)
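
To make the effect of the zero initialization concrete, here is a minimal plain-Python sketch of an fc layer over two inputs. It is not Paddle code: `matvec` and `multi_input_fc` are hypothetical helpers, the sizes are made up, and it assumes that `initial_std=0.` with a zero mean yields an all-zero weight matrix.

```python
import random


def matvec(x, w):
    """Multiply a row vector x (length n) by an n-by-m weight matrix w."""
    return [sum(x[i] * w[i][j] for i in range(len(x)))
            for j in range(len(w[0]))]


def multi_input_fc(inputs, weights, bias):
    """fc over several inputs: sum of per-input projections plus one bias."""
    out = list(bias)
    for x, w in zip(inputs, weights):
        out = [o + p for o, p in zip(out, matvec(x, w))]
    return out


random.seed(0)
in_dim, hid_dim = 3, 4  # illustrative sizes only
fc_in = [random.gauss(0, 1) for _ in range(in_dim)]
lstm_in = [random.gauss(0, 1) for _ in range(hid_dim)]

# One weight matrix per input; the lstm one starts at zero (assumption:
# initial_std=0. with zero mean means every entry is exactly 0).
w_fc = [[random.gauss(0, 0.1) for _ in range(hid_dim)] for _ in range(in_dim)]
w_lstm = [[0.0] * hid_dim for _ in range(hid_dim)]
bias = [0.0] * hid_dim

both = multi_input_fc([fc_in, lstm_in], [w_fc, w_lstm], bias)
fc_only = multi_input_fc([fc_in], [w_fc], bias)
print(both == fc_only)
```

With the lstm weight at zero, the layer's initial output depends only on the fc input; the lstm path only starts to contribute once training has updated its weight.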

qingqing01 · 2017-11-10T02:13:54Z

python/paddle/v2/framework/tests/test_understand_sentiment_dynamic_lstm.py

+    for i in range(2, stacked_num + 1):
+        fc = layers.fc(input=inputs, size=hid_dim)
+        lstm = layers.dynamic_lstm(
+            input=fc, size=hid_dim, is_reverse=(i % 2) == 0)


Another difference from the book: there, the lstm's candidate_activation (called act in the book) is relu:

https://github.com/PaddlePaddle/book/blob/develop/06.understand_sentiment/train.py#L80
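
For readers unfamiliar with where the candidate activation enters, the following is a scalar sketch of one textbook LSTM step with the three activations as parameters. It is an assumption-laden illustration, not the Paddle kernel: `lstm_step` is a hypothetical function, peephole connections are omitted, and the pre-activation values are made up.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def relu(x):
    return max(0.0, x)


def lstm_step(pre, c_prev, gate_act=sigmoid, cell_act=math.tanh,
              cand_act=math.tanh):
    """One scalar LSTM step.

    pre holds the pre-activations (input gate, forget gate, candidate,
    output gate); c_prev is the previous cell state.
    """
    i_pre, f_pre, g_pre, o_pre = pre
    i = gate_act(i_pre)          # input gate
    f = gate_act(f_pre)          # forget gate
    o = gate_act(o_pre)          # output gate
    g = cand_act(g_pre)          # candidate cell value
    c = f * c_prev + i * g       # new cell state
    h = o * cell_act(c)          # new hidden state
    return h, c


pre = (0.5, 0.5, -2.0, 0.5)  # negative candidate pre-activation
h_tanh, c_tanh = lstm_step(pre, c_prev=0.0)
h_relu, c_relu = lstm_step(pre, c_prev=0.0, cand_act=relu)
print(c_tanh, c_relu)
```

With a negative candidate pre-activation, tanh writes a negative value into the cell while relu writes nothing, so the two settings are not interchangeable when trying to reproduce the book's results.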

qingqing01 · 2017-11-10T02:16:35Z

python/paddle/v2/framework/tests/test_understand_sentiment_dynamic_lstm.py

+    prediction = layers.fc(input=[fc_last, lstm_last],
+                           size=class_dim,
+                           act='softmax')
+    cost = layers.cross_entropy(input=prediction, label=label)


For numerical stability we have softmax_with_cross_entropy_op. Should the demo use it in place of softmax + cross_entropy?
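
The stability concern can be reproduced in a few lines of plain Python. The sketch below contrasts a two-step softmax followed by a log with a fused computation using the standard log-sum-exp trick. Both function names are hypothetical, and it is an assumption that softmax_with_cross_entropy_op is implemented along these lines.

```python
import math


def naive_softmax_cross_entropy(logits, label):
    """Softmax first, then -log of the true-class probability."""
    exps = [math.exp(z) for z in logits]   # overflows for large logits
    total = sum(exps)
    probs = [e / total for e in exps]
    return -math.log(probs[label])         # fails if the probability is 0.0


def fused_softmax_cross_entropy(logits, label):
    """Cross entropy computed directly from logits via log-sum-exp."""
    m = max(logits)  # shift by the max so every exponent is <= 0
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]


# On moderate logits the two agree.
print(naive_softmax_cross_entropy([1.0, 2.0, 3.0], 2))
print(fused_softmax_cross_entropy([1.0, 2.0, 3.0], 2))

# On extreme logits the naive version breaks while the fused one does not.
extreme = [-800.0, 0.0]
try:
    naive_softmax_cross_entropy(extreme, 0)
except (ValueError, OverflowError) as err:
    print("naive failed:", err)
print(fused_softmax_cross_entropy(extreme, 0))
```

In the extreme case the true-class probability underflows to zero, so taking its log fails, whereas the fused form never materializes the probability.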

qingqing01 · 2017-11-10T02:17:43Z

python/paddle/v2/framework/tests/test_understand_sentiment_dynamic_lstm.py

+        paddle.reader.shuffle(
+            paddle.dataset.imdb.train(word_dict), buf_size=1000),
+        batch_size=BATCH_SIZE)
+    place = core.CPUPlace()


Should we add a GPU example?

# place = core.GPUPlace(0)

qingqing01 · 2017-11-10T02:18:34Z

python/paddle/v2/framework/tests/test_understand_sentiment_dynamic_lstm.py

+            outs = exe.run(g_main_program,
+                           feed={"words": tensor_words,
+                                 "label": tensor_label},
+                           fetch_list=[cost, acc])


Will this be used as a demo later? If so, shouldn't it also evaluate on the test set? (A TODO for a follow-up PR would also be fine.)
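
A test-set pass could be structured roughly as follows. This is only a sketch under assumptions: `evaluate` and `run_batch` are hypothetical names, and `run_batch` stands in for whatever executor call returns the mean cost and accuracy of one batch.

```python
def evaluate(batches, run_batch):
    """Average cost and accuracy over a dataset, weighted by batch size.

    run_batch is a hypothetical callable that returns
    (mean_cost, mean_accuracy) for a single batch.
    """
    total_cost = 0.0
    total_acc = 0.0
    total = 0
    for batch in batches:
        cost, acc = run_batch(batch)
        n = len(batch)
        total_cost += cost * n
        total_acc += acc * n
        total += n
    if total == 0:
        raise ValueError("empty dataset")
    return total_cost / total, total_acc / total


def fake_run_batch(batch):
    """Stand-in for a real forward pass: each sample is (prediction, label)."""
    correct = sum(1 for pred, label in batch if pred == label)
    return 0.0, correct / len(batch)


batches = [[(1, 1), (0, 1)], [(1, 1)]]
cost, acc = evaluate(batches, fake_run_batch)
print(cost, acc)
```

Weighting by batch size matters because the last batch of a dataset is often smaller than the rest; a plain mean over batches would overweight its samples.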

qingqing01

I approve this PR, but some of the review comments above still need to be addressed later. I created issue #5591 to track them.

QiJune added 7 commits November 6, 2017 17:39

add lstm layer

1014ef6

set hidden shape

c778a94

rename input parameter

3ecc946

Merge remote-tracking branch 'baidu/develop' into book6_lstm

318cc12

add dynamic lstm

f69cfb8

Merge remote-tracking branch 'baidu/develop' into book6_lstm

6e1d953

refine dynamic lstm layer

8fd3551

QiJune requested a review from qingqing01 November 8, 2017 23:34

QiJune added 2 commits November 9, 2017 11:57

Merge remote-tracking branch 'baidu/develop' into book6_lstm

692cc5d

change parameter using XavierInitializer by default

104fdd9

qingqing01 reviewed Nov 10, 2017

QiJune added 2 commits November 13, 2017 11:03

refine dynamic lstm layer

a5a49c1

Merge remote-tracking branch 'baidu/develop' into book6_lstm

6cff7b7

qingqing01 approved these changes Nov 13, 2017

qingqing01 mentioned this pull request Nov 13, 2017

The configuration test_understand_sentiment_dynamic_lstm.py is not consistent with book. #5591

Closed

QiJune merged commit 29f494f into PaddlePaddle:develop Nov 13, 2017
