added list of stopwords, much more aggressive than before, feel free to use your own
Jacob Perkins committed Apr 29, 2011
1 parent 16f0b5f commit fa9ad21
Showing 3 changed files with 425 additions and 3 deletions.
1 change: 0 additions & 1 deletion FIXME.txt
@@ -1,2 +1 @@
-- text tokenizer needs some aggressive filtering of stopwords. Look at other Lucene analyzers
 - standardize the way jars are dealt with, using relative paths in the scripts themselves is crufty and doesn't scale
