A compressed adaptive optimizer for training large-scale deep learning models using PyTorch
hashing
deep-learning
neural-network
pytorch
transformer
imagenet
count-min-sketch
language-model
adagrad
adam-optimizer
sgd-optimizer
count-sketch
sgd-momentum
-
Updated
Nov 26, 2019 - Python