Popular repositories


    Stand-alone language identification system

    Python 822 148

  2. kaggle-stackoverflow2012

    My entry to the Kaggle 2012 Stack Overflow competition. Ranked 10th on the final public leaderboard.

    Python 39 28

  3. wikidump

    Tools to manipulate and extract data from wikipedia dumps

    Python 36 15

  4. polyglot

    Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the languages therein.

    Python 14 5

  5. geniatagger

    Part-of-speech tagging, shallow parsing, and named entity recognition for biomedical text.

    C++ 11 12

  6. kaggle-stumbleupon2013

    My entry to the Kaggle 2013 StumbleUpon competition. Ranked 4th on the final private leaderboard.

    Python 8 4

2 contributions in the last year
