Added sample program that identifies the most common phrases of length
1, 2 or 3 from English text.
Gives decent results on Alice's Adventures in Wonderland.
Parameters may need some tuning.
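
For reference, a minimal sketch of the task this sample program addresses: finding the most common phrases of length 1, 2 or 3 in English text. This is not the repository's code. It swaps in exact counting with Python's collections.Counter in place of any Bloom filter, and the names top_phrases, max_len and top_k are hypothetical, chosen for illustration only.

```python
import re
from collections import Counter


def top_phrases(text, max_len=3, top_k=5):
    """Return the top_k most common phrases for each length 1..max_len.

    Exact counting baseline (an assumption for illustration), not the
    space-efficient approach described in the commit messages.
    """
    # Lowercase and keep runs of letters and apostrophes as words.
    words = re.findall(r"[a-z']+", text.lower())
    results = {}
    for n in range(1, max_len + 1):
        # Slide a window of n words across the text and count each phrase.
        counts = Counter(
            " ".join(words[i:i + n]) for i in range(len(words) - n + 1)
        )
        results[n] = counts.most_common(top_k)
    return results


if __name__ == "__main__":
    sample = "the cat sat on the mat the cat ran"
    for length, phrases in top_phrases(sample, top_k=2).items():
        print(length, phrases)
```

Exact counting keeps every distinct phrase in memory, which is fine for a single book but grows with the vocabulary; that cost is the motivation for a more space-efficient structure.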
Extended algorithm into an "incremental forgetful Bloom filter",
a space-efficient data structure for identifying most frequent
Implemented and tested forgetful Bloom filter.