Topic Modelling for Humans
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Persistent dict, backed by sqlite3 and pickle, multithread-safe.
Repository to build and test Gensim wheels
Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.
Movie plots by genre tutorial at PyData Berlin 2016
Presentations & notebooks from our talks /workshops/meetups/etc
Data repository for pretrained NLP models and NLP corpora.
Code for the GPU mega-benchmark article
Code for the word2vec HTTP server running at https://rare-technologies.com/word2vec-tutorial/#bonus_app
Tools and services for evaluating topic models
Scripts and utilities for the RaReBot competition
Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition
[NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]