tiendung / ruby-nlp

Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer

This URL has Read+Write access

tiendung (author)
Mon Aug 11 03:49:48 -0700 2008
ruby-nlp / nlp.rb
100644 19 lines (13 sloc) 0.6 kb
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
require 'measure'
require 'rjb'
 
Rjb::load('stanford-postagger.jar:stanford-ner.jar', ['-Xmx200m'])
 
CRFClassifier = Rjb::import('edu.stanford.nlp.ie.crf.CRFClassifier')
classifier = CRFClassifier.getClassifierNoExceptions("ner-eng-ie.crf-4-conll.ser.gz")
 
MaxentTagger = Rjb::import('edu.stanford.nlp.tagger.maxent.MaxentTagger')
MaxentTagger.init("left3words-wsj-0-18.tagger")
 
Sentence = Rjb::import('edu.stanford.nlp.ling.Sentence')
 
sent = "Good afternoon Rajat Raina, how are you today?"
 
Measure.time "NER", 10 do
  puts classifier.testString( sent )
  puts MaxentTagger.tagString( sent )
end