  1. gensim

    Topic Modelling for Humans

  2. Utils for streaming large files (S3, HDFS, gzip, bz2...)

  3. bounter

    Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.

  4. Persistent dict, backed by sqlite3 and pickle, multithread-safe.

  5. Data repository for pretrained NLP models and NLP corpora.

  6. Movie plots by genre tutorial at PyData Berlin 2016

