- trainable-tokenizer 15 Fast and trainable tokenizer for natural languages relying on maximum entropy methods.
- bc-thesis 3 Fast and Trainable Tokenizer for Natural Languages - my bachelor thesis
- block-cipher-design 3 Some Haskell code to prototype a design for a lame block cipher.
- mulan-ensemble 2 Ensemble classifiers implementation for Mulan and a paper using it for experiments with text classification.
- quex 2 OBSOLETE: Quex 0.59.1 fixed this issue. (This is a fork of the Quex Lexical Analyzer Generator by Frank-Rene Schäfer fixing a trivial show-stopping bug.)