
About 'X' label #1

Closed · kugwzk opened this issue Mar 19, 2019 · 22 comments
Labels: good first issue (Good for newcomers)

Comments

kugwzk commented Mar 19, 2019

In my opinion, you should remove the 'X' label's signal from evaluation. Because you add more labels than the standard dataset has, I can't tell how much of the F1-score increase comes from the extra 'X' labels. I think the 'X' label is not equivalent to the 'O' label in the standard dataset and the BERT paper, but in your code they may be treated the same.

kamalkraj (Owner) commented Mar 19, 2019

The 'X' label is not considered in the F1 metrics:

if m and label_map[label_ids[i][j]] != "X":

Label 'X' is not equal to label 'O'.
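
For reference, a minimal sketch of how that filtering might look inside the evaluation loop (variable names like input_mask and pred_ids are assumptions, not necessarily this repo's):

y_true, y_pred = [], []
for i, mask_row in enumerate(input_mask):   # input_mask: attention mask per sentence
    true_seq, pred_seq = [], []
    for j, m in enumerate(mask_row):
        if m and label_map[label_ids[i][j]] != "X":
            true_seq.append(label_map[label_ids[i][j]])   # gold label
            pred_seq.append(label_map[pred_ids[i][j]])    # predicted label
    y_true.append(true_seq)
    y_pred.append(pred_seq)
# y_true / y_pred can then be scored with e.g. seqeval.metrics.f1_score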

kugwzk (Author) commented Mar 19, 2019

But did you use it during training?

kamalkraj (Owner) commented Mar 19, 2019

For training I used "X".
During inference and F1 computation, only the first output label of each token is considered, the same as in the BERT paper.

kugwzk (Author) commented Mar 19, 2019

I think adding extra labels to the standard CoNLL-2003 NER dataset makes the results hard to compare with previous work. Could you remove the 'X' label during training and still get a similar result?

kamalkraj (Owner):

If you remove "X" during training, or replace "X" with "O", model performance drops to ~89 F1.

kugwzk (Author) commented Mar 19, 2019

That's my point: using the 'X' label inflates the F1-score, which isn't a fair comparison. I get a similar result, about 91.3 F1, without it. And I believe the original BERT paper also removes the 'X' label; they reach a higher F1-score because they use document context. In short, the 'X' label shouldn't carry any signal.

kamalkraj (Owner):

91.3 without using "X" ??

kugwzk (Author) commented Mar 19, 2019

Yes. I take the WordPiece outputs from the BERT model and keep only each word's first sub-token vector, so I get the same number of vectors as there are words in the standard dataset. Then a softmax classification layer produces the final result. But I only use BertModel from pytorch_pretrained_bert.

kamalkraj (Owner):

For example, after extracting features for the sentence below:

Jim Hen ##son was a puppet ##eer

are you giving only the hidden states of [Jim, Hen, was, a, puppet] to the linear classification layer?

kugwzk (Author) commented Mar 19, 2019

Yes, because I think fine-tuned BERT can learn this pattern.
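
To make the exchange above concrete, here is a minimal runnable sketch of that first-sub-token selection (shapes and names are illustrative assumptions, not code from this repo):

import torch
import torch.nn as nn

hidden_dim, num_labels = 768, 9  # BERT-base hidden size, CoNLL-2003 tag set
tokens = ["Jim", "Hen", "##son", "was", "a", "puppet", "##eer"]
hidden_states = torch.randn(len(tokens), hidden_dim)  # stand-in for BERT output

# keep only the hidden state of each word's first sub-token
keep = [i for i, t in enumerate(tokens) if not t.startswith("##")]
word_states = hidden_states[keep]        # states for [Jim, Hen, was, a, puppet]
logits = nn.Linear(hidden_dim, num_labels)(word_states)  # one tag per word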

kamalkraj (Owner):

I will try it this way and let you know.

kamalkraj (Owner) commented Mar 19, 2019

@kugwzk
Can you share your code?
How are you handling padding after extracting the first sub-token hidden states from BERT?

kugwzk (Author) commented Mar 19, 2019

I just record the original word positions using a dict in Python. For example, [Jim Hen ##son was a puppet ##eer] maps to [0, 1, 3, 4, 5], and then I pad the original word sequence again in the classifier layer. It may be slow :).
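
A rough sketch of that bookkeeping (hypothetical names; zero-padding is one choice among several):

import torch

tokens = ["Jim", "Hen", "##son", "was", "a", "puppet", "##eer"]
word_pos = [i for i, t in enumerate(tokens) if not t.startswith("##")]
# word_pos == [0, 1, 3, 4, 5]

def gather_and_pad(hidden, word_pos, max_words):
    # hidden: (seq_len, hidden_dim); gather the first-sub-token states,
    # then zero-pad back to a fixed word count for batched classification
    picked = hidden[word_pos]
    pad = hidden.new_zeros(max_words - len(word_pos), hidden.size(1))
    return torch.cat([picked, pad], dim=0)   # (max_words, hidden_dim)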

alphanlp:

I think we can apply a mask to the 'X' label during training.
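
One common way to implement that (a sketch under assumptions; X_LABEL_ID stands for whatever id the label map assigns to "X"):

import torch.nn as nn

X_LABEL_ID = 0  # hypothetical id of the "X" label in the label map

def masked_ce_loss(logits, labels):
    # logits: (batch, seq_len, num_labels); labels: (batch, seq_len)
    # ignore_index drops "X" positions from the loss, so sub-word
    # continuations contribute no gradient during training
    loss_fct = nn.CrossEntropyLoss(ignore_index=X_LABEL_ID)
    return loss_fct(logits.view(-1, logits.size(-1)), labels.view(-1))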

ereday commented May 2, 2019

I agree with @kugwzk about the misuse of the 'X' label. @tkukurin In the latest version, are you still using 'X' during training (or evaluation), or have you already removed it as suggested?

kamalkraj (Owner):

@ereday
The latest version still uses 'X'. I have code that works without the 'X' label; I need to clean it up a bit. I will try to push the code by Monday.

Nic-Ma commented May 6, 2019

Hi @kamalkraj,

I am Nic from NVIDIA; thanks for your contribution to this project!
I tried replacing [CLS] and X with O directly, and removed [SEP].
I think you were supposed to release the new code today; have you finished?
Thanks.

kamalkraj (Owner) commented May 6, 2019

@toxic2m
Check out the experiment branch.

kamalkraj added the 'good first issue' label on May 6, 2019
Nic-Ma commented May 8, 2019

Hi @kamalkraj,

Actually, I have already done this part locally, and I suggest mapping [CLS] and [SEP] to O directly.
Then your FC layer only outputs the real number of classes, which can give better performance.
Thanks.
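
A tiny sketch of that mapping (the tag list is the standard CoNLL-2003 set; the helper name is illustrative):

REAL_LABELS = ["O", "B-PER", "I-PER", "B-ORG", "I-ORG",
               "B-LOC", "I-LOC", "B-MISC", "I-MISC"]

def collapse_label(label):
    # [CLS], [SEP], and X all fold into "O", so the FC layer only
    # has to score the real tag set
    return label if label in REAL_LABELS else "O"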

sbmaruf commented May 16, 2019

Has anyone been able to reproduce the BERT paper's NER results (92.4 F1 for BERT-Base)?
The experiment on the master branch supports the result given in the BERT paper, but @kugwzk has pointed out the problem with it. Has anyone been able to reproduce the results without the inferred 'X' tags?

kugwzk (Author) commented May 21, 2019

@sbmaruf The CoNLL-2003 NER result reported in the original BERT paper used document context, which is different from the standard sentence-based evaluation. You can read more about that here: allenai/allennlp#2067 (comment)

sbmaruf commented May 24, 2019

@kugwzk Thanks for your reply, but how do you add document-level context for NER?
Any idea or code repo?
