This repository has been archived by the owner on May 1, 2023. It is now read-only.

Refine pruning logic #222

Merged
nzmora merged 10 commits into master from refine_pruning_logic on Apr 8, 2019

Conversation

@nzmora (Contributor) commented Apr 8, 2019

Add finer control over the pruning logic, to accommodate more pruning
use-cases.
The full description of the new logic is available in the updated [documentation
of the CompressionScheduler](https://nervanasystems.github.io/distiller/schedule.html#pruning-fine-control), which is also part of this PR.

In this commit:

  • Added a new callback to the CompressionScheduler:
    compression_scheduler.before_parameter_optimization, which is invoked
    after the gradients are computed, but before the weights are updated
    by the optimizer (see the training-loop sketch after this list).

  • We skip the first mini-batch of the first epoch (global_mini_batch_id == 0)
    because of PyTorch's SGD implementation (details later).

  • We register a backward hook on each parameter in order to mask its
    gradients. This gives us finer control over the parameter updates
    (see the gradient-masking sketch after this list).

  • Added several DropFilter schedules.
    DropFilter is a method to regularize networks, and it can also be
    used to prepare a network for permanent filter pruning (see the
    DropFilter sketch after this list).

  • Add documentation of pruning fine-control
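
Below is a minimal sketch of where the new callback sits in a training loop.
The surrounding callback names follow the CompressionScheduler, but the
argument lists are abbreviated and the function and variable names
(train_one_epoch, train_loader, steps_per_epoch, criterion) are illustrative,
so the real signatures may differ.

```python
def train_one_epoch(epoch, model, criterion, optimizer, train_loader,
                    compression_scheduler, steps_per_epoch):
    compression_scheduler.on_epoch_begin(epoch)
    for minibatch_id, (inputs, targets) in enumerate(train_loader):
        compression_scheduler.on_minibatch_begin(epoch, minibatch_id, steps_per_epoch)

        loss = criterion(model(inputs), targets)
        # The scheduler may add regularization terms to the loss here
        loss = compression_scheduler.before_backward_pass(epoch, minibatch_id,
                                                          steps_per_epoch, loss)
        optimizer.zero_grad()
        loss.backward()

        # New in this PR: called after the gradients are computed,
        # but before the optimizer updates the weights
        compression_scheduler.before_parameter_optimization(epoch, minibatch_id,
                                                            steps_per_epoch, optimizer)
        optimizer.step()

        compression_scheduler.on_minibatch_end(epoch, minibatch_id, steps_per_epoch)
    compression_scheduler.on_epoch_end(epoch)
```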

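The gradient masking itself can be pictured with PyTorch's Tensor.register_hook,
which runs a function on a parameter's gradient during every backward pass. The
helper below (mask_gradients) and the random mask are hypothetical and only
illustrate the mechanism; the actual hook registration in this PR is done inside
Distiller, not by user code.

```python
import torch

def mask_gradients(param, mask):
    # Hypothetical helper: register a backward hook on a parameter so that
    # its gradient is multiplied by a binary mask before the optimizer step.
    # Zeroed gradient entries keep pruned weights from being updated.
    def hook(grad):
        return grad * mask
    return param.register_hook(hook)

# Illustrative usage with a random mask (not a real pruning criterion)
conv = torch.nn.Conv2d(3, 16, kernel_size=3)
mask = (torch.rand_like(conv.weight) > 0.5).float()
handle = mask_gradients(conv.weight, mask)  # handle.remove() detaches the hook
```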
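For intuition, the sketch below shows what a single DropFilter step does
conceptually: entire convolution filters (output channels) are zeroed at
random, similar to dropout applied at filter granularity. The drop_filters
helper is purely illustrative; in Distiller, DropFilter is driven by the
schedule files added in this PR rather than by a helper like this.

```python
import torch

def drop_filters(weight, drop_prob):
    # Illustrative only: zero whole filters (output channels) of a conv
    # weight tensor, each with probability drop_prob.
    num_filters = weight.shape[0]
    keep = (torch.rand(num_filters, device=weight.device) >= drop_prob).float()
    # Broadcast the per-filter keep/drop decision over each filter's weights
    return weight * keep.view(-1, 1, 1, 1)

conv = torch.nn.Conv2d(3, 16, kernel_size=3)
with torch.no_grad():
    conv.weight.copy_(drop_filters(conv.weight, drop_prob=0.1))
```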
Documentation build changes: 'pages' was deprecated and replaced by 'nav',
and the unused earlyexit.md file was removed. A new version of mkdocs moved
files around, creating and deleting files.
@nzmora nzmora merged commit 816a943 into master Apr 8, 2019
@nzmora nzmora deleted the refine_pruning_logic branch April 9, 2019 14:45
michaelbeale-IL pushed a commit that referenced this pull request Apr 24, 2023