Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FIX] changed Config to RbConfig to provide ruby 2.2.0 compatibility #1

Merged
merged 1 commit into from
Nov 27, 2015
Merged

[FIX] changed Config to RbConfig to provide ruby 2.2.0 compatibility #1

merged 1 commit into from
Nov 27, 2015

Conversation

PaulRenvoise
Copy link
Contributor

No description provided.

inukshuk added a commit that referenced this pull request Nov 27, 2015
[FIX] changed Config to RbConfig to provide ruby 2.2.0 compatibility
@inukshuk inukshuk merged commit 04e27d7 into inukshuk:master Nov 27, 2015
@inukshuk
Copy link
Owner

Thanks! Are you using the gem? I have moved on to wapiti instead so I'm not really maintaining this anymore. Glad to push a release to Rubygems of course, but thought I'd let you know.

@PaulRenvoise
Copy link
Contributor Author

I tried Wapiti already, i'm just seeking what's the differences between these two.
I read somewhere that it's more convenient to use gazetteers with CRF++, is that true ?
Thanks ! :)

@inukshuk
Copy link
Owner

To be honest, I'm not sure what gazetteers even are? Do you mean something like a key-value dictionary?

In any case, I started out with crfpp and then moved on to wapiti mostly because I preferred to work with ANSI C instead of C++ and wapiti did provided all the features I needed. For what it's worth, I am using a dictionaries for some of my feature classifications, but I'm doing the classification in Ruby anyway so I would not have noticed a difference there.

@PaulRenvoise
Copy link
Contributor Author

That's it, let's say for NER, a gazetteer (or dictionary) will be in the form "LOC New_York".

From what i'm seeing now, wapiti seems more promising than CRFPP! May I ask what do you classify?
I'm having troubles understanding some basics things and i'm seeking an implementation to see how it works.

@inukshuk
Copy link
Owner

I use wapiti for anystyle a parser for bibliographies; in that repository you can see how I use the wapiti gem. It has served me very well; the only issue I've run in so far is that it does not work to train a model repeatedly (e.g, if you train it with data A and then later with data B the resulting model will not be correct), but that's a minor nuisance because it's easy to work around it (you can simply train a new model with data A+B).

Anyway, in my experience, the biggest effort is to create good training data; to tackle this I came up with the idea of a graphical editor which works great if I may say so myself. You can see an example on anystyle.io.

Out of curiosity, what do you want to classify?

@PaulRenvoise
Copy link
Contributor Author

I'm using wapiti to do NER, and i'm currently at the stage of finding the best (or a least a suitable) set of features. I have labeled data, and test data, but the results are not good enough to me. Also, the unigram/bigram/both observation is still blurry to me..
I'm going check out anystyle!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants