GaussianK-SGD

Introduction

This repository contains the code for the paper Understanding Top-k Sparsification in Distributed Deep Learning. Key features include:

  • Distributed training with gradient sparsification.
  • Measurement of gradient distributions on various deep learning models, including feed-forward neural networks (FFNs), CNNs, and LSTMs.
  • A computation-efficient top-k approximation (called gaussian-k) for gradient sparsification; see the sketch below.

For more details about the algorithm, please refer to our papers.
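
As a rough illustration of the gaussian-k idea, here is a minimal sketch (not the code in this repository; the function names are hypothetical and the zero-mean Gaussian assumption follows the paper's observation about gradient distributions): the top-k magnitude threshold is estimated from simple statistics of the gradient instead of a full sort or selection.

import torch

def gaussian_k_threshold(grad: torch.Tensor, density: float) -> torch.Tensor:
    """Estimate the magnitude threshold that keeps about density * n elements,
    assuming the gradient values roughly follow a zero-mean Gaussian."""
    std = grad.flatten().std()
    # For a zero-mean Gaussian, P(|g| > t) = density  =>  t = std * sqrt(2) * erfinv(1 - density).
    return std * (2.0 ** 0.5) * torch.erfinv(torch.tensor(1.0 - density, device=grad.device))

def gaussian_k_sparsify(grad: torch.Tensor, density: float):
    """Return (values, indices) of entries whose magnitude exceeds the estimated
    threshold -- roughly the top-k entries, found without sorting."""
    flat = grad.flatten()
    thr = gaussian_k_threshold(grad, density)
    indices = (flat.abs() >= thr).nonzero(as_tuple=False).flatten()
    return flat[indices], indices

if __name__ == "__main__":
    g = torch.randn(1_000_000)          # stand-in for a flattened gradient
    values, indices = gaussian_k_sparsify(g, density=0.001)
    print(f"kept {values.numel()} of {g.numel()} elements (target ~1000)")

The point of the estimate is to avoid the cost of an exact top-k selection on large gradient tensors; the implementation in this repository may refine the threshold further.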

Installation

Prerequisites

The code assumes a CUDA-capable GPU environment with NCCL, Python with pip, and Horovod for distributed training (installed in the Quick Start below); the remaining Python dependencies are listed in requirements.txt.

Quick Start

git clone https://github.com/hclhkbu/GaussianK-SGD.git
cd GaussianK-SGD
HOROVOD_GPU_ALLREDUCE=NCCL pip install --no-cache-dir horovod   # optional if Horovod is already installed
pip install -r requirements.txt
dnn=resnet20 nworkers=4 compressor=topk density=0.001 ./run.sh

Assuming that you have 4 GPUs on a single node and everything works correctly, you will see 4 workers running on the node, training the ResNet-20 model on the CIFAR-10 dataset using SGD with top-k sparsification.
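
For orientation, the communication pattern behind top-k sparsified SGD can be sketched as follows (a minimal sketch assuming PyTorch and Horovod; sparse_allreduce is a hypothetical helper, not this repository's API): each worker sends only the values and indices of its largest-magnitude gradient entries and rebuilds a dense averaged gradient from the gathered pieces.

import torch
import horovod.torch as hvd

def sparse_allreduce(grad: torch.Tensor, density: float) -> torch.Tensor:
    """Average a gradient across workers while communicating only ~density * n entries per worker."""
    flat = grad.flatten()
    k = max(1, int(density * flat.numel()))
    # Exact top-k selection; gaussian-k would replace this with the
    # threshold-based selection sketched in the Introduction.
    _, indices = torch.topk(flat.abs(), k)
    values = flat[indices]
    # Every worker gathers all workers' sparse (values, indices) pairs ...
    all_values = hvd.allgather(values)
    all_indices = hvd.allgather(indices)
    # ... then accumulates them into a dense buffer and averages.
    dense = torch.zeros_like(flat)
    dense.index_add_(0, all_indices, all_values)
    return (dense / hvd.size()).view_as(grad)

if __name__ == "__main__":
    hvd.init()
    torch.manual_seed(hvd.rank())
    g = torch.randn(4096)               # stand-in for one worker's local gradient
    avg = sparse_allreduce(g, density=0.01)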

Papers

  • S. Shi, X.-W. Chu, K. Cheung and S. See, “Understanding Top-k Sparsification in Distributed Deep Learning,” 2019.

Referred Models
