Update on "[Gradient Compression] Error feedback for PowerSGD (still … · pytorch/pytorch@5309f70

Commit

Update on "[Gradient Compression] Error feedback for PowerSGD (still …

…need to fix the key in error_dict)"


Support an optional error feedback for PowerSGD -- storing the difference (i.e., the local error caused by compression) between the input gradient (adjusted by the existing error) and the gradient after decompression, and reinserting it at the next iteration.

Still need to add an index field to GradBucket as the key of error_dict. This is because the current key, input tensor of the bucket, can change across steps, as the buckets may be rebuilt in forward pass in order to save peak memory usage.

This is halfway of error feedback. Plan to add the new index field in a separate PR.

Original PR issue: Investigate Applying PowerSGD to Communication Hook for Gradient Compression #47202

Differential Revision: [D25240290](https://our.internmc.facebook.com/intern/diff/D25240290/)

[ghstack-poisoned]

Loading branch information

wayi committed Dec 2, 2020

1 parent 65b0e32 commit 5309f70

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Commit

There are no files selected for viewing

0 comments on commit `5309f70`

Commit

There are no files selected for viewing

0 comments on commit 5309f70

0 comments on commit `5309f70`