public
Description: A Simple Ruby Rule-Based Part of Speech Tagger based on Eric Brill Tagger
Clone URL: git://github.com/taf2/rb-brill-tagger.git
ensure words we suggest are longer then 3 characters
taf2 (author)
Tue Jun 24 21:37:49 -0700 2008
commit  67a1644da5ea98fe4b2d33cedf8a09fbdedfb99d
tree    51666ca7cd50d04a971cd595e485ef200718de9e
parent  bbfabcdabcd7e8b4722333b8ec86ecd45c941926
...
48
49
50
51
 
52
53
54
...
48
49
50
 
51
52
53
54
0
@@ -48,7 +48,7 @@ module Brill
0
         end
0
         tag
0
       }
0
- results = tags.select{|tag| tag.last.match(/NN/) }
0
+ results = tags.select{|tag| tag.last.match(/NN/) and tag.first.size > 3 }
0
       if results.size > max
0
         counts = {}
0
         tags = []

Comments

    No one has commented yet.