MiniWord2Vec

This application implements both the skip-gram and CBoW (continuous bag-of-words) training techniques used in the Word2Vec algorithm.

Requirements

Supports Python 2 and 3. Install the package requirements with:

pip install -r requirements.txt

Note

By default, the code uses CuPy to run its computations on the GPU. CuPy needs the CUDA Toolkit to be installed first. cuDNN and NCCL are optional CUDA-related libraries; if you want CuPy to use them, install them before installing CuPy.

To run on the CPU instead, replace import cupy as np with import numpy as np.
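
As an alternative to editing the import by hand, the backend could be chosen automatically. The following is only a sketch and not part of this repository: the ON_GPU flag is a hypothetical name, and it assumes that a successful cupy import also means a working CUDA setup.

```python
# Sketch only: fall back to NumPy when CuPy is not installed.
# ON_GPU is a hypothetical flag, not something the repository defines.
try:
    import cupy as np  # GPU backend (assumes CUDA is set up correctly)
    ON_GPU = True
except ImportError:
    import numpy as np  # CPU backend
    ON_GPU = False

# The same array code then runs on either backend.
x = np.ones((2, 3))
print("backend:", np.__name__, "sum:", float(x.sum()))
```

Because CuPy mirrors much of the NumPy API, code written against the np alias usually needs no further changes, although this is not guaranteed for every function.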

Data

The training data can be found in the data/ folder.

Usage

For training, use the run script. For CBoW, use:

./run cbow

For Skipgram, use:

./run skipgram
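
To illustrate how the two modes differ, here is a minimal sketch of how training examples are commonly built for each. It is not taken from cbow.py or skipgram.py; the function names and the toy sentence are assumptions made for illustration.

```python
def skipgram_pairs(tokens, window=3):
    """Skip-gram: each center word is paired with every neighbor it should predict."""
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs


def cbow_pairs(tokens, window=3):
    """CBoW: the surrounding context words together predict the center word."""
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        context = [tokens[j] for j in range(lo, hi) if j != i]
        if context:
            pairs.append((context, center))
    return pairs


sentence = ["the", "quick", "brown", "fox"]
print(skipgram_pairs(sentence, window=1))
print(cbow_pairs(sentence, window=1))
```

Skip-gram produces one example per (center, neighbor) pair, while CBoW produces one example per position with the whole context as input.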

Tuning Parameters

You can edit the parameters by specifying their values in the run file. Parameters that can be edited:

Dimension of the word embeddings, default: 300
Number of training epochs, default: 100
Window size for CBoW, default: 3

Output

After each epoch, the outputs are currently saved as .npy files in the utils/ folder.
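
A saved .npy file can be read back with numpy.load. The sketch below writes and reloads a placeholder array so that it runs on its own; the file name embeddings_epoch_1.npy and the array shape are assumptions, since the actual names and contents depend on what the training scripts write.

```python
import os
import tempfile

import numpy as np

# Hypothetical file name and shape, used only to make the example runnable.
path = os.path.join(tempfile.mkdtemp(), "embeddings_epoch_1.npy")
vocab_size, dim = 5, 300  # 300 matches the default embedding dimension

# Stand-in for the file a training epoch would produce.
np.save(path, np.zeros((vocab_size, dim)))

# Load the array back for inspection or downstream use.
embeddings = np.load(path)
print(embeddings.shape)
```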
