reorganize sequence 2 sequence. #112

lcy-seso · 2017-06-21T07:37:12Z

reorganize codes of attention without attention.

kuke

Almost LGTM

kuke · 2017-06-27T11:08:35Z

nmt_without_attention/README.md

@@ -84,19 +85,17 @@ encoded_vector = paddle.networks.bidirectional_gru(


 ### 无注意力机制的解码器
-PaddleBook中[机器翻译](https://github.com/PaddlePaddle/book/blob/develop/08.machine_translation/README.cn.md)的相关章节中，已介绍了带注意力机制（Attention Mechanism）的 Encoder-Decoder 结构，本例则介绍的是不带注意力机制的 Encoder-Decoder 结构。关于注意力机制，读者可进一步参考 PaddleBook 和参考文献\[[3](#参考文献)]。
+-PaddleBook中[机器翻译](https://github.com/PaddlePaddle/book/blob/develop/08.machine_translation/README.cn.md)的相关章节中，已介绍了带注意力机制（Attention Mechanism）的 Encoder-Decoder 结构，本例则介绍的是不带注意力机制的 Encoder-Decoder 结构。关于注意力机制，读者可进一步参考 PaddleBook 和参考文献\[[3](#参考文献)]。


"本例则介绍的是" -> "本例介绍的则是"? 感觉会好一点

kuke · 2017-06-27T11:12:13Z

nmt_without_attention/README.md

-* `prob`表示生成句子的得分，随之其后则是翻译生成的句子；
-* `<s>` 表示句子的开始，`<e>`表示一个句子的结束，如果出现了在词典中未包含的词，则用`<unk>`替代。
+- 第一行为输入的源语言句子。
+- 第二 ~ `beam_size + 1` 行是柱搜索生成的 `beam_size` 条翻译结果


第二 ~ beam_size + 1 行 -> 第二至第beam_size + 1 行，
在文字中少用符号是不是有好点？anyway，个人偏好。

kuke · 2017-06-27T11:12:17Z

nmt_without_attention/README.md

+- 第一行为输入的源语言句子。
+- 第二 ~ `beam_size + 1` 行是柱搜索生成的 `beam_size` 条翻译结果
+    - 一行之内以“\t”分隔为两列，第一列是句子的log 概率，第二列是翻译结果的文本。
+    - `<s>` 表示句子的开始，`<e>`表示一个句子的结束，如果出现了在词典中未包含的词，则用`<unk>`替代。


<s> -> 符号<s>， <unk>-> 符号<unk>

kuke · 2017-06-27T11:16:40Z

nmt_without_attention/README.md

@@ -198,22 +193,25 @@ else:



Need to reorganize line 189 because we don't have params --train and --generate now.

kuke · 2017-06-27T11:20:02Z

nmt_without_attention/README.md

-* `<s>` 表示句子的开始，`<e>`表示一个句子的结束，如果出现了在词典中未包含的词，则用`<unk>`替代。
+- 第一行为输入的源语言句子。
+- 第二 ~ `beam_size + 1` 行是柱搜索生成的 `beam_size` 条翻译结果
+    - 一行之内以“\t”分隔为两列，第一列是句子的log 概率，第二列是翻译结果的文本。


一行之内 -> 相同行的输出

kuke · 2017-06-27T11:23:20Z

nmt_without_attention/generate.py

+def generate(source_dict_dim, target_dict_dim, model_path, beam_size,
+             batch_size):
+    """
+    sequence generation for NMT


sequence -> Sequence

kuke · 2017-06-27T11:25:42Z

nmt_without_attention/generate.py

+
+if __name__ == "__main__":
+    generate(
+        source_dict_dim=3000,


source_dict_dim is 30000 in the original configuration. Should 3000 be OK? the same in train.py.

this is a bug. fix this.

lcy-seso

follow comments.

lcy-seso · 2017-06-28T05:49:00Z

nmt_without_attention/README.md

@@ -84,19 +85,17 @@ encoded_vector = paddle.networks.bidirectional_gru(


 ### 无注意力机制的解码器
-PaddleBook中[机器翻译](https://github.com/PaddlePaddle/book/blob/develop/08.machine_translation/README.cn.md)的相关章节中，已介绍了带注意力机制（Attention Mechanism）的 Encoder-Decoder 结构，本例则介绍的是不带注意力机制的 Encoder-Decoder 结构。关于注意力机制，读者可进一步参考 PaddleBook 和参考文献\[[3](#参考文献)]。
+-PaddleBook中[机器翻译](https://github.com/PaddlePaddle/book/blob/develop/08.machine_translation/README.cn.md)的相关章节中，已介绍了带注意力机制（Attention Mechanism）的 Encoder-Decoder 结构，本例则介绍的是不带注意力机制的 Encoder-Decoder 结构。关于注意力机制，读者可进一步参考 PaddleBook 和参考文献\[[3](#参考文献)]。


lcy-seso · 2017-06-28T05:49:37Z

nmt_without_attention/README.md

-* `prob`表示生成句子的得分，随之其后则是翻译生成的句子；
-* `<s>` 表示句子的开始，`<e>`表示一个句子的结束，如果出现了在词典中未包含的词，则用`<unk>`替代。
+- 第一行为输入的源语言句子。
+- 第二 ~ `beam_size + 1` 行是柱搜索生成的 `beam_size` 条翻译结果


lcy-seso · 2017-06-28T05:50:26Z

nmt_without_attention/README.md

-* `<s>` 表示句子的开始，`<e>`表示一个句子的结束，如果出现了在词典中未包含的词，则用`<unk>`替代。
+- 第一行为输入的源语言句子。
+- 第二 ~ `beam_size + 1` 行是柱搜索生成的 `beam_size` 条翻译结果
+    - 一行之内以“\t”分隔为两列，第一列是句子的log 概率，第二列是翻译结果的文本。


lcy-seso · 2017-06-28T05:50:34Z

nmt_without_attention/README.md

+- 第一行为输入的源语言句子。
+- 第二 ~ `beam_size + 1` 行是柱搜索生成的 `beam_size` 条翻译结果
+    - 一行之内以“\t”分隔为两列，第一列是句子的log 概率，第二列是翻译结果的文本。
+    - `<s>` 表示句子的开始，`<e>`表示一个句子的结束，如果出现了在词典中未包含的词，则用`<unk>`替代。


lcy-seso · 2017-06-28T05:59:48Z

nmt_without_attention/generate.py

+
+if __name__ == "__main__":
+    generate(
+        source_dict_dim=3000,


this is a bug. fix this.

lcy-seso · 2017-06-28T06:00:14Z

nmt_without_attention/generate.py

+def generate(source_dict_dim, target_dict_dim, model_path, beam_size,
+             batch_size):
+    """
+    sequence generation for NMT


lcy-seso · 2017-06-28T06:10:06Z

nmt_without_attention/README.md

@@ -198,22 +193,25 @@ else:



kuke

Great. LGTM

lcy-seso requested a review from kuke June 21, 2017 07:37

lcy-seso force-pushed the refine_seq2seq branch 5 times, most recently from 28f9377 to 107d31e Compare June 22, 2017 05:45

rewrite nmt without attention.

555e089

lcy-seso force-pushed the refine_seq2seq branch 10 times, most recently from 96a1cff to 00eb42a Compare June 26, 2017 03:42

Merge branch 'develop' into refine_seq2seq

ddaba7f

lcy-seso force-pushed the refine_seq2seq branch from 00eb42a to ddaba7f Compare June 27, 2017 05:24

kuke reviewed Jun 27, 2017

View reviewed changes

lcy-seso commented Jun 28, 2017

View reviewed changes

lcy-seso force-pushed the refine_seq2seq branch from 3a17d6f to 58ac283 Compare June 28, 2017 06:14

follow comments.

52cd312

kuke approved these changes Jun 28, 2017

View reviewed changes

lcy-seso force-pushed the refine_seq2seq branch from 58ac283 to 52cd312 Compare June 28, 2017 08:23

lcy-seso merged commit 6517398 into PaddlePaddle:develop Jun 28, 2017

lcy-seso deleted the refine_seq2seq branch June 28, 2017 09:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reorganize sequence 2 sequence. #112

reorganize sequence 2 sequence. #112

lcy-seso commented Jun 21, 2017

kuke left a comment

kuke Jun 27, 2017

lcy-seso Jun 28, 2017

kuke Jun 27, 2017

lcy-seso Jun 28, 2017

kuke Jun 27, 2017

lcy-seso Jun 28, 2017

kuke Jun 27, 2017

lcy-seso Jun 28, 2017

kuke Jun 27, 2017

lcy-seso Jun 28, 2017

kuke Jun 27, 2017

lcy-seso Jun 28, 2017

kuke Jun 27, 2017

lcy-seso Jun 28, 2017

lcy-seso left a comment

lcy-seso Jun 28, 2017

lcy-seso Jun 28, 2017

lcy-seso Jun 28, 2017

lcy-seso Jun 28, 2017

lcy-seso Jun 28, 2017

lcy-seso Jun 28, 2017

lcy-seso Jun 28, 2017

kuke left a comment

reorganize sequence 2 sequence. #112

reorganize sequence 2 sequence. #112

Conversation

lcy-seso commented Jun 21, 2017

kuke left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lcy-seso left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kuke left a comment

Choose a reason for hiding this comment