2-stage pruning to favor distributed inference (local device compute half of the model, upload the feature for further computing on stronger devices or cloud).
-
Updated
May 31, 2018 - Python
2-stage pruning to favor distributed inference (local device compute half of the model, upload the feature for further computing on stronger devices or cloud).
Add a description, image, and links to the distributed-inferencing topic page so that developers can more easily learn about it.
To associate your repository with the distributed-inferencing topic, visit your repo's landing page and select "manage topics."