refine NER. #102

lcy-seso · 2017-06-16T10:27:47Z

refactor NER demo.

lcy-seso · 2017-06-16T10:28:27Z

README hasn't updated yet.

guoshengCS · 2017-06-17T08:50:36Z

sequence_tagging_for_ner/network_conf.py

+
+    forward_hidden, rnn_forward = stacked_rnn(word_caps_vector, hidden_dim,
+                                              hidden_para_attr, rnn_para_attr)
+    backward_hidden, rnn_backward = stacked_rnn(


It seems that the two stacked_rnn branches import separated hidden layers at the first level, while the two RNN branches in the raw config share the same hidden layer. It matters if strict consistency is required.

This is a bug, these two RNN should not share the parameters. I will fix this. Thanks for your comment.

I modify the network configuration to keep it consistent with the original one.

guoshengCS · 2017-06-17T09:03:25Z

sequence_tagging_for_ner/infer.py

+            pred_str = ""
+            for w, tag in zip(test_sample[0],
+                              probs[start_id:start_id + len(test_sample[0])]):
+                pred_str += "%s[%s] " % (id_2_word[w], id_2_label[tag])


Since lowercase is used, the outputs might be different from the raw texts. It should be noticed if the raw texts are wanted.

This hasn't be fixed yet. During inferring only normalized text is printed.

guoshengCS · 2017-06-25T15:03:08Z

sequence_tagging_for_ner/README.md

 ```

-其中第一列为原始句子序列（第二、三列分别为词性标签和句法分析中的语块标签，这里暂时不用），第四列为采用了I-TYPE方式表示的NER标签（I-TYPE和BIO方式的主要区别在于语块开始标记的使用上，I-TYPE只有在出现相邻的同类别实体时对后者使用B标记，其他均使用I标记），句子之间以空行分隔。
+- 第一列为原始句子序列
+- 第二、三列分别为词性标签和句法分析中的语块标签，本利不使用


"本利不使用"应改为"本例不使用"

guoshengCS · 2017-06-25T15:09:19Z

sequence_tagging_for_ner/README.md

+    1. 输入文本的词典
+    2. 为词典中的词语提供预训练好的词向量
+    2. 标记标签的词典
+   标记标签词典已附在`data`目录中，对应于`data/target.txt`文件。输入文本的词典以及词典中词语的预训练的词向量来自：[Stanford CS224d](http://cs224d.stanford.edu/)课程作业。**为运行本例，请首先在`data`目录下运行`download.sh`脚本下载预训练的词向量。** 完成后会将这两个文件一并放入`data`目录下，输入文本的词典和预训练的词向量分别对应：`data/vocab.txt`和`data/wordVectors.txt`这两个文件。


"下载预训练的词向量"是否应为"下载输入文本的词典和预训练的词向量"

lcy-seso

follow comments, thank you.

lcy-seso · 2017-06-26T01:49:26Z

sequence_tagging_for_ner/README.md

+    1. 输入文本的词典
+    2. 为词典中的词语提供预训练好的词向量
+    2. 标记标签的词典
+   标记标签词典已附在`data`目录中，对应于`data/target.txt`文件。输入文本的词典以及词典中词语的预训练的词向量来自：[Stanford CS224d](http://cs224d.stanford.edu/)课程作业。**为运行本例，请首先在`data`目录下运行`download.sh`脚本下载预训练的词向量。** 完成后会将这两个文件一并放入`data`目录下，输入文本的词典和预训练的词向量分别对应：`data/vocab.txt`和`data/wordVectors.txt`这两个文件。


lcy-seso · 2017-06-26T01:49:35Z

sequence_tagging_for_ner/README.md

 ```

-其中第一列为原始句子序列（第二、三列分别为词性标签和句法分析中的语块标签，这里暂时不用），第四列为采用了I-TYPE方式表示的NER标签（I-TYPE和BIO方式的主要区别在于语块开始标记的使用上，I-TYPE只有在出现相邻的同类别实体时对后者使用B标记，其他均使用I标记），句子之间以空行分隔。
+- 第一列为原始句子序列
+- 第二、三列分别为词性标签和句法分析中的语块标签，本利不使用


lcy-seso requested a review from guoshengCS June 16, 2017 10:27

lcy-seso force-pushed the refine_ner branch 2 times, most recently from b682e7d to 9dbe1cd Compare June 16, 2017 10:39

refine NER.

132a26a

lcy-seso force-pushed the refine_ner branch from 9dbe1cd to 132a26a Compare June 16, 2017 10:42

guoshengCS reviewed Jun 17, 2017

View reviewed changes

fix incorrect parameter sharing between bidirectional rnns.

ba0ff69

lcy-seso force-pushed the refine_ner branch 4 times, most recently from 0ddc309 to 670378c Compare June 21, 2017 05:33

guoshengCS approved these changes Jun 25, 2017

View reviewed changes

update README.

72a7215

lcy-seso commented Jun 26, 2017

View reviewed changes

lcy-seso force-pushed the refine_ner branch from 670378c to 72a7215 Compare June 26, 2017 01:50

lcy-seso merged commit 436f480 into PaddlePaddle:develop Jun 26, 2017

lcy-seso deleted the refine_ner branch June 26, 2017 04:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refine NER. #102

refine NER. #102

lcy-seso commented Jun 16, 2017

lcy-seso commented Jun 16, 2017

guoshengCS Jun 17, 2017

lcy-seso Jun 19, 2017

lcy-seso Jun 19, 2017

guoshengCS Jun 17, 2017

lcy-seso Jun 26, 2017

guoshengCS Jun 25, 2017

lcy-seso Jun 26, 2017

guoshengCS Jun 25, 2017

lcy-seso Jun 26, 2017

lcy-seso left a comment

lcy-seso Jun 26, 2017

lcy-seso Jun 26, 2017

refine NER. #102

refine NER. #102

Conversation

lcy-seso commented Jun 16, 2017

lcy-seso commented Jun 16, 2017

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lcy-seso left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment