Summary:
[sparsification] Fix the sparsity calculation error when using the accumulate_mask option. Purpose: enable building up the mask cumulatively by permanently removing s% of the unpruned weights every X steps.
Previously, sparsity was calculated over ALL weights, which is wrong when the accumulate_mask option is used. In that case, a parameter is masked to 0 permanently, and future sparsification should be performed only over the still-unmasked weights.
The ._masks attribute stores the masks. In between the X steps, ._masks is still applied to the weights, but the masks themselves are only updated every X steps. When ._masks is updated, it prunes away s% of the weights that were previously "1" (kept) in ._masks. In addition, the mask is also applied to the gradient, so the parameter is effectively removed from the architecture, not just set to zero.
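For illustration, here is a minimal PyTorch sketch of the intended behavior. The names (AccumulateMaskPruner, _update_mask, update_interval) are hypothetical and not taken from this diff; the point is that the pruning threshold is computed over surviving weights only, and that the mask is applied to both the weights and their gradients.

```python
import torch

class AccumulateMaskPruner:
    """Sketch of cumulative magnitude pruning (hypothetical names).

    Every `update_interval` steps, permanently prune s% of the weights
    that are still unmasked; in between, keep applying the stored mask
    to both the weights and their gradients.
    """

    def __init__(self, param: torch.nn.Parameter, s: float, update_interval: int):
        self.param = param
        self.s = s                            # fraction pruned per update, e.g. 0.1
        self.update_interval = update_interval
        self._mask = torch.ones_like(param, dtype=torch.bool)
        # Zero out gradients of masked entries so pruned weights never
        # receive updates -- they are removed from the architecture,
        # not merely set to zero.
        param.register_hook(lambda grad: grad * self._mask)

    def step(self, step_num: int) -> None:
        if step_num % self.update_interval == 0:
            self._update_mask()
        # Re-apply the mask on every step so masked weights stay at zero.
        with torch.no_grad():
            self.param.mul_(self._mask)

    def _update_mask(self) -> None:
        # The fix described above: compute the threshold over *unmasked*
        # weights only, so s% is measured among surviving weights rather
        # than among ALL weights.
        surviving = self.param[self._mask].abs()
        k = int(self.s * surviving.numel())
        if k == 0:
            return
        threshold = surviving.kthvalue(k).values
        # Permanently remove the smallest-magnitude surviving weights;
        # previously-pruned entries can never come back.
        self._mask &= self.param.abs() > threshold
```

Under these assumptions, calling step() once per training iteration applies the (unchanged) mask between updates and tightens it every update_interval steps, so overall sparsity compounds as (1 - s)^n surviving weights after n mask updates.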
Differential Revision: D18698077