RARE Technologies
Repositories
-
gensim
Topic Modelling for Humans
-
smart_open
Utils for streaming large files (S3, HDFS, gzip, bz2...)
-
sqlitedict
Persistent dict, backed by sqlite3 and pickle, multithread-safe.
-
gensim-wheels
Repository to build and test Gensim wheels
-
bounter
Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.
-
movie-plots-by-genre
Movie plots by genre tutorial at PyData Berlin 2016
-
talks
Presentations & notebooks from our talks, workshops, meetups, etc.
-
gensim-data
Data repository for pretrained NLP models and NLP corpora.
-
benchmark_GPU_platforms
Code for the GPU mega-benchmark article
-
w2v_server_googlenews
Code for the word2vec HTTP server running at https://rare-technologies.com/word2vec-tutorial/#bonus_app
-
topic_eval
Tools and services for evaluating topic models
-
rarebot
Scripts and utilities for the RaReBot competition
-
sparsesvd
Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition
-
gensim-simserver
[NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]