Report or block saffsd
Contact Support about this user's behavior.Report abuse
Stand-alone language identification system
My entry to the Kaggle 2012 Stack Overflow competition. Ranked 10th on the final public leaderboard.
Tools to manipulate and extract data from wikipedia dumps
Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the languages therein.
- part-of-speech tagging, shallow parsing, and named entity recognition for biomedical text -
My entry to the Kaggle 2013 StumbleUpon competition. Ranked 4th on the final private leaderboard.