Permalink
Switch branches/tags
Nothing to show
Find file
Fetching contributors…
Cannot retrieve contributors at this time
6 lines (3 sloc) 363 Bytes
As part of my learning process, a rewrite of Storm's wordcount topology in Scala with some extra toys: uses Lucene's ShingleFilter to count the word 2-grams in the Twitter sample firehose. An output bolt pushes the results into Redis.
Sample results: https://gist.github.com/1244665
Uses language detection code from http://code.google.com/p/language-detection