System information
- TensorFlow version (you are using): 2.3.0
- Are you willing to contribute it (Yes/No): No
Motivation
Currently pruning in tensorflow_model_optimization does not result in a reduction in inference time. Even though the pruned model is sparser than the original, the inference time remains the same. (This was tested on a Resnet model.)
Describe the feature Pruning sets the weights to zero, but does not prune the networks edges. Update the pruning feature such that the new sparse weights result in a corresponding increase in speed.
System information
Motivation
Currently pruning in tensorflow_model_optimization does not result in a reduction in inference time. Even though the pruned model is sparser than the original, the inference time remains the same. (This was tested on a Resnet model.)
Describe the feature Pruning sets the weights to zero, but does not prune the networks edges. Update the pruning feature such that the new sparse weights result in a corresponding increase in speed.