A repository for benchmarking popular neural network frameworks used to learn unsupervised word embeddings, on different hardware platforms. This repository is the result of this issue raised on the Gensim repo and has the following objectives:
- Compare the word2vec implementations of popular neural network frameworks by reporting metrics like time to train, peak memory, word vector quality, etc.
- Benchmark these implementations across various cloud platforms
- Carry out the benchmarking in self-contained, fully reproducible scripts for repeatable deployment
Below is a short description of the frameworks supported and the code used to train the word vectors.
Framework | Description | Code used | References |
---|---|---|---|
TensorFlow | TensorFlow is an open-source software library for numerical computation using data flow graphs, widely used for machine learning and deep neural network research. Can run on both CPU and GPU. | The code used in benchmarking has been picked up directly from the tutorials on the official github page and modified. | link to github code |
OriginalC | This tool was made available by the original author (Tomas Mikolov) and provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. | Uses the unmodified code; simply run the executable with the relevant parameters. | link to code link to toolkit |
Gensim | Gensim is a Python library for topic modelling, document indexing and similarity retrieval with large corpora. Its target audience is the natural language processing (NLP) and information retrieval (IR) community. | Refer to the code given in the word2vec tutorial on the official github repo. | link to reference code |
DL4J | Deeplearning4j is a commercial-grade, open-source, distributed deep-learning library written for Java and Scala. Integrated with Hadoop and Spark, DL4J is designed to be used in business environments on distributed GPUs and CPUs. | Modified word2vec example given on the official github repo. | link to github example |
For the purpose of reproducibility, the benchmarks need to be run inside a Docker container. There are two ways to obtain the image:
- Build your own Docker image using the `Dockerfile` provided in the repo. This step assumes you have installed Docker. Run the following command inside the repo directory:

```sh
sudo docker build -f Dockerfile-cpu -t benchmarkword2vec-cpu .
```
- Download the pre-built Docker image from Docker's public registry:

```sh
sudo docker pull manneshiva/playground:benchmarkword2vec-gpu
```
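Whichever route you choose, a quick sanity check that the image is now available locally (standard Docker CLI, nothing repo-specific):

```sh
# List local images and filter for the benchmark image
sudo docker images | grep benchmarkword2vec
```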
Exploiting the GPU to train word vectors inside Docker requires `nvidia-docker`. Requirements, an installation guide and an explanation of why it is necessary can be found here. Follow the same steps mentioned above, replacing `docker` with `nvidia-docker` and `benchmarkword2vec-cpu` with `benchmarkword2vec-gpu`:

```sh
sudo nvidia-docker build -f Dockerfile-gpu -t benchmarkword2vec-gpu .
```
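Before kicking off a long training run, it can be worth confirming that the container actually sees the GPU. A minimal check, assuming the image is built on an NVIDIA CUDA base image (which ships with `nvidia-smi`):

```sh
# Should print the GPU(s) visible inside the container;
# a failure here usually points to the nvidia-docker installation
sudo nvidia-docker run --rm benchmarkword2vec-gpu nvidia-smi
```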
Use the `benchmarkword2vec-{c/g}pu-tfsource` Dockerfile to benefit from SSE4.2/AVX/FMA optimizations, which result in faster training times for TensorFlow. This may not work on all machines.
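Following the naming pattern of the commands above, a CPU build from that Dockerfile might look like the sketch below; the filename `Dockerfile-cpu-tfsource` is an assumption inferred from the `Dockerfile-cpu`/`Dockerfile-gpu` pattern, so check the repo for the actual name:

```sh
# Hypothetical filename inferred from the Dockerfile-cpu / Dockerfile-gpu pattern
sudo docker build -f Dockerfile-cpu-tfsource -t benchmarkword2vec-cpu-tfsource .
```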
Run the Docker image:

```sh
# CPU
docker run -v absPathTo/benchmark-word2vec-frameworks/:/benchmark-word2vec-frameworks/ -w /benchmark-word2vec-frameworks/ --rm -it -p 8888:8888 manneshiva/playground:benchmarkword2vec-cpu

# GPU
nvidia-docker run -v absPathTo/benchmark-word2vec-frameworks/:/benchmark-word2vec-frameworks/ -w /benchmark-word2vec-frameworks/ --rm -it -p 8888:8888 manneshiva/playground:benchmarkword2vec-gpu
```
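`absPathTo` is a placeholder for the absolute path to your checkout. One way to fill it in, assuming your shell is currently in the directory that contains the clone (CPU image shown):

```sh
# Mount the repo from the current directory into the container
docker run -v $(pwd)/benchmark-word2vec-frameworks/:/benchmark-word2vec-frameworks/ \
  -w /benchmark-word2vec-frameworks/ --rm -it -p 8888:8888 \
  manneshiva/playground:benchmarkword2vec-cpu
```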
To run the benchmarks, e.g.:

```sh
python benchmark.py --file data/text8 --frameworks tensorflow gensim originalc --epochs 5 --size 100 --workers 4
```

For usage help: `python benchmark.py -h`. A fully explicit invocation is sketched after the options table below.
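The example above points `--file` at the text8 corpus under `data/`. If you don't already have it, text8 can be fetched from Matt Mahoney's site; placing it at `data/text8` to match the command is an assumption about the repo layout:

```sh
# Download and unpack the text8 corpus (first 10^8 bytes of cleaned English Wikipedia)
wget http://mattmahoney.net/dc/text8.zip
unzip text8.zip -d data/
```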
Available options:
Parameter | Description |
---|---|
--frameworks | Specify the frameworks to run the benchmarks on (delimited by spaces). If none are provided, benchmarks will be run on all supported frameworks. |
--file | Path to the text corpus |
--epochs | Number of iterations (epochs) over the corpus. Default: 5 |
--size | Dimensionality of the embeddings/feature vectors. Default: 100 |
--window | Maximum distance between the current and predicted word within a sentence. Default: 5 |
--min_count | Discard words that appear fewer than MIN_COUNT times. Default: 5 |
--workers | Number of worker threads used to train the model. Default: 5 |
--sample | Threshold for the occurrence of words; those that appear with higher frequency in the training data will be randomly down-sampled. Default: 1e-3, useful range is (0, 1e-5) |
--sg | Use the skip-gram model. Default: 1 (use 0 for the continuous bag-of-words model) |
--negative | Number of negative examples. Default: 5, common values are 3 - 10 (0 = not used) |
--alpha | The initial learning rate. Default: 0.025 |
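Putting the options together, a run that sets every parameter explicitly, using the defaults and examples from the table above (the `dl4j` framework identifier is assumed by analogy with the names used earlier; check `python benchmark.py -h` for the exact spelling):

```sh
python benchmark.py \
    --file data/text8 \
    --frameworks tensorflow gensim originalc dl4j \
    --epochs 5 \
    --size 100 \
    --window 5 \
    --min_count 5 \
    --workers 5 \
    --sample 1e-3 \
    --sg 1 \
    --negative 5 \
    --alpha 0.025
```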
On completion, the report can be found in the `platform-report.json` file.
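For a quick look at the results without opening the notebook, Python's standard-library JSON pretty-printer works on any valid JSON file:

```sh
# Pretty-print the benchmark report to the terminal
python -m json.tool platform-report.json
```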
To generate graphics from the report, fire up the notebook:

```sh
jupyter notebook --allow-root
```

then go to `localhost:8888` and open `visualize_report.ipynb`.
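If the notebook is not reachable from the host, it is usually because Jupyter binds to `127.0.0.1` inside the container; binding to all interfaces fixes this. A hedged variant using standard Jupyter flags, not repo-specific:

```sh
# Bind to all interfaces so the port mapping (-p 8888:8888) reaches the server
jupyter notebook --allow-root --ip=0.0.0.0 --port=8888
```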
[TODO]