GitHub is home to over 28 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Topic Modelling for Humans
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Movie plots by genre tutorial at PyData Berlin 2016
Persistent dict, backed by sqlite3 and pickle, multithread-safe.
Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.
Presentations & notebooks from our talks /workshops/meetups/etc
Data repository for pretrained NLP models and NLP corpora.
Code for the GPU mega-benchmark article
Code for the word2vec HTTP server running at https://rare-technologies.com/word2vec-tutorial/#bonus_app
Tools and services for evaluating topic models
Scripts and utilities for the RaReBot competition
Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition
[NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]