Code for the experimental parts of the course project in CS-439: Optimization for Machine Learning.
The implementation is based on this repository's code and uses PyTorch.
The following packages were used for the experiments. Newer versions are also likely to work.
- torchvision==0.2.1
- numpy==1.15.4
- torch==0.4.1
- pandas==0.23.4
- scikit_learn==0.20.3
To install them automatically:

```
pip install -r requirements.txt
```
- `optimizers/` contains the custom optimizers, namely CompSGD, ErrorFeedbackSGD and OneBitAdam.
- `models/` contains the deep net architectures. Only ResNets were experimented with.
- `results/` contains the results of the experiments, stored in pickle files.
- `utils/` contains utility functions for saving/loading objects, convex optimization, the progress bar, etc.
- `checkpoints/` contains the saved model checkpoints with all the nets' parameters. The folder is empty here, as those files are very large.
We clarify the notations here. In particular,
- ssgd: SGD with sign gradient compression.
- sgd_topk: SGD with top-k gradient compression.
- sgd_pcak: SGD with k-PCA gradient compression.
- sssgd: SGD with scaled sign gradient compression.
- ussgd: Unscaled SignSGD (MEM-SGD), i.e., SGD with sign gradient compression and error feedback.
- ssgdf: Error-feedback SignSGD, i.e., SGD with scaled sign gradient compression and error feedback.
- onebit_adam_unscaled: the original version of one-bit Adam.
- onebit_adam_scaled: the scaled version of one-bit Adam.
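
To illustrate the `ssgdf` variant (scaled sign compression with error feedback), here is a minimal sketch of a single update step. It is illustrative only: the function name, the per-tensor `memory` buffer, and the scaling by the mean absolute value are assumptions, not the repository's actual `ErrorFeedbackSGD` API.

```python
import torch

def ef_signsgd_step(param, grad, memory, lr):
    """One error-feedback scaled-sign SGD step (illustrative sketch).

    `memory` is a persistent buffer holding the accumulated
    compression error from previous steps (the error feedback).
    """
    # Add the previously uncompressed residual back to the update.
    corrected = lr * grad + memory
    # Scaled sign compression: a sign vector scaled by the mean magnitude.
    compressed = corrected.abs().mean() * corrected.sign()
    # Store the new compression error for the next step.
    memory.copy_(corrected - compressed)
    # Apply the compressed update.
    param.data.add_(-compressed)
```

The residual stored in `memory` guarantees that, summed over iterations, no part of the gradient signal is permanently discarded by the compression.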
- `run.ipynb` has three parts: tuning the learning rates, running the experiments, and plotting the figures that appear in the report.
- `main.py` can be called from the command line to run a single network training and testing. It takes a variety of optional arguments; type `python main.py --help` for further details.
- `utils.hyperparameters.py` facilitates the definition of all the hyper-parameters of the experiments.
- `tune_lr.py` allows tuning the learning rate for a given network architecture/data set/optimizer configuration.
- `main_experiments.py` contains the experiments in the report.
- `plot_graph.py` contains the code for plotting the results.
- `print_stats.py` contains the code to list the best performance of each experiment done by `tune_lr.py`.