app.nlp.ner: Turning ChainNer2 into an annotator, many cleanups #32

Merged: 3 commits into factorie:master on Aug 8, 2013

Conversation

@alextp (Contributor) commented Aug 8, 2013

Since ChainNer2 is the one NER system we have with state-of-the-art performance on CoNLL, I think it's desirable to revisit it and make it work in the new framework.

The current situation, where its maintenance amounts to "make it compile", is not ideal for something that so much effort was put into. This pull request turns it into an annotator and simplifies the code in a way that hopefully preserves correctness.

@alextp (Contributor, Author) commented Aug 8, 2013

@andrewmccallum @samanz , please review and comment.

@andrewmccallum (Member) commented Aug 8, 2013
Since this is for CoNLL 2003 training data, let's name it something that doesn't have "NER2" in it.
I suggest either "NER1b" or "NER3", with a mild preference for the former.

@samanz (Member) commented Aug 8, 2013

Thanks for doing this! One thing: with the new lexicon objects we lose the BILOU labels on the lexicon features that my lexicons class provided, which gave a boost in F1 when I trained this model originally. This should not be that big a change to add to Lexicon.
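To make that concrete, the old behaviour was roughly this (a minimal sketch with made-up names, not the actual Lexicon API): a lexicon phrase that matches a multi-token span should tag each token with a BILOU-prefixed version of the lexicon feature, rather than the same bare feature on every token.

```scala
// Minimal sketch with made-up names (not the actual factorie Lexicon API):
// a lexicon match over a span of tokens emits BILOU-prefixed features,
// one per token, instead of the same bare lexicon feature on every token.
object BilouLexiconSketch {
  def bilouLexiconFeatures(matchLength: Int, lexiconName: String): Seq[String] =
    if (matchLength == 1) Seq("U-" + lexiconName)
    else Seq("B-" + lexiconName) ++
         Seq.fill(matchLength - 2)("I-" + lexiconName) ++
         Seq("L-" + lexiconName)

  // A 3-token match against a PER lexicon gives Seq("B-PER", "I-PER", "L-PER").
}
```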

Also, if we are going to have this model work out of the box like the other models, we do need our own Brown clusters file (which we could generate) and a trainer object with the following default options:

ner.aggregate = true,
ner.twoStage = true,
ner.bP = true
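
Roughly, I'm imagining a trainer whose no-flag defaults match those settings; a minimal sketch (the names here are made up, not the actual factorie option classes):

```scala
// Sketch only: an options holder whose defaults match the settings above.
// Names are illustrative, not the actual factorie option classes.
case class NerTrainerOpts(
  aggregate: Boolean = true,      // ner.aggregate
  twoStage: Boolean = true,       // ner.twoStage
  bP: Boolean = true,             // ner.bP
  brownClusterFile: String = ""   // the Brown clusters file we'd need to ship or generate
)

object TrainNerSketch {
  def main(args: Array[String]): Unit = {
    val opts = NerTrainerOpts()   // running with no flags gets the best configuration
    println(opts)
  }
}
```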

Those are all things I can take care of after you merge, though. Otherwise it looks good to me.

opts.sigmaSq is not being used anymore and can be removed.

There were many problems with deserializing a trained model in ChainNer2 for some reason, and I'd like to test that now with the new serialization.

@alextp (Contributor, Author) commented Aug 8, 2013

My plan is to extend this code so it will also work with OntoNotes labels, as nothing in it is very CoNLL-specific. This pull request mainly tries to get it into runnable shape again.

I'm fine with naming it NER3.

@daltonj commented Aug 8, 2013

+1 for revisiting this. It would be great to have it for comparison with previous work. Let me know if I can help test it.

@alextp (Contributor, Author) commented Aug 8, 2013

Converted the naming to NER3. Is this ready for merge now?

@alextp (Contributor, Author) commented Aug 8, 2013

@samanz , thanks for the comments.

Is there value in keeping the code around for aggregate = false, bP = false, and twoStage = false?

Vineet is working on some word embeddings that can already outperform Brown clusters on a worse NER system, and I think the plan going forward is to check those into lexicons and use them in this NER, so I'm not too worried about shipping a model with Brown clusters right now.

@samanz (Member) commented Aug 8, 2013

@alextp twoStage = false is important: it lets somebody who wants most of the accuracy, without the added time penalty of running two models, use only the first model. It should always be true during training.

The other two are useful for adding new features: by selectively turning them on, you can replicate different experiments from the paper the features were derived from, and see how a new feature changes the outcome of a model with a specific set of features enabled. Since setting them all to true adds all the features, which gives the best results, they should all be true by default.
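
To put it another way, the flags gate groups of features roughly like this (an illustrative sketch, not the real ChainNer2 feature code):

```scala
// Illustrative sketch of the flag semantics; not the real ChainNer2 feature code.
object FeatureFlagsSketch {
  def addBaselineFeatures(): Unit = println("base token features")
  def addAggregateFeatures(): Unit = println("feature group gated by ner.aggregate")
  def addBPFeatures(): Unit = println("feature group gated by ner.bP")
  def runSecondStageModel(): Unit = println("second model over first-stage predictions")

  // All three flags on = full feature set (best results, the default).
  // Turning one off reproduces an ablation from the paper the features came from;
  // twoStage = false skips the second model entirely at test time.
  def run(aggregate: Boolean, bP: Boolean, twoStage: Boolean): Unit = {
    addBaselineFeatures()
    if (aggregate) addAggregateFeatures()
    if (bP) addBPFeatures()
    if (twoStage) runSecondStageModel()
  }
}
```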

@alextp (Contributor, Author) commented Aug 8, 2013

I see. Thanks for the explanation :-)

@alextp (Contributor, Author) commented Aug 8, 2013

Also, I have to run initSecondaryFeatures only when using the second model, but after the first model has run, right?

@samanz (Member) commented Aug 8, 2013

That's correct. It gets the features by looking at the value of each token's attr[BilouConllNerLabel].categoryValue, which is hopefully the prediction from the previous model. This may not be the best way of doing it, since there is no guarantee that this is the source of the label; in particular, when loading the model, categoryValue is the same as target.categoryValue. It isn't a big problem, since this code is only ever used here and we can control where the values come from, but it isn't perfect, and we need to take care that the wrong values don't seep in.
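
Concretely, the intended flow is something like this (a sketch with stand-in types; only the categoryValue access pattern mirrors the real code):

```scala
// Sketch of the intended two-stage flow. Tok is a stand-in type; in the real
// code the predicted tag lives in token.attr[BilouConllNerLabel].categoryValue.
object TwoStageSketch {
  case class Tok(word: String, var predictedBilou: String = "O")

  def runFirstModel(tokens: Seq[Tok]): Unit =
    tokens.foreach(t => t.predictedBilou = "O") // stand-in for the first CRF's output

  // Features that read the labels the first model just wrote (e.g. the
  // neighbouring tokens' predicted tags). Only safe to build after stage one.
  def secondaryFeatures(tokens: Seq[Tok]): Seq[Seq[String]] =
    tokens.indices.map { i =>
      val prev = if (i > 0) tokens(i - 1).predictedBilou else "<S>"
      val next = if (i < tokens.length - 1) tokens(i + 1).predictedBilou else "</S>"
      Seq("PREV_PRED=" + prev, "NEXT_PRED=" + next)
    }

  def tag(tokens: Seq[Tok], twoStage: Boolean): Unit = {
    runFirstModel(tokens)                       // stage one always runs
    if (twoStage) {
      val feats = secondaryFeatures(tokens)     // i.e. initSecondaryFeatures goes here
      println(feats)                            // placeholder for running the second model
    }
  }
}
```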

Do you think that we should have separate labels for first stage and second stage models?

@andrewmccallum (Member) commented Aug 8, 2013
I'm fine with you merging this whenever it is ready.

@alextp (Contributor, Author) commented Aug 8, 2013

Since the labels are created within the model, I think the current approach is fine. Thanks. I will merge this and continue work on another pull request once I have more changes.

alextp added a commit that referenced this pull request Aug 8, 2013
app.nlp.ner: Turning ChainNer2 into an annotator, many cleanups
@alextp merged commit 4e82605 into factorie:master Aug 8, 2013