GitHub is home to over 31 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Topic Modelling for Humans
Persistent dict, backed by sqlite3 and pickle, multithread-safe.
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.
Movie plots by genre tutorial at PyData Berlin 2016
Presentations & notebooks from our talks /workshops/meetups/etc
Data repository for pretrained NLP models and NLP corpora.
Code for the GPU mega-benchmark article
Code for the word2vec HTTP server running at https://rare-technologies.com/word2vec-tutorial/#bonus_app
Tools and services for evaluating topic models
Scripts and utilities for the RaReBot competition
Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition
[NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]