MLP-Rank: a graph-theoretical approach to structured pruning of deep neural networks, based on weighted PageRank centrality, as introduced in the related thesis.
Updated Apr 22, 2024 - Python
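
To illustrate the idea behind the description above, here is a minimal sketch of PageRank-style neuron scoring for a multilayer perceptron, followed by structured pruning of one hidden layer. It is not the repository's code or the thesis's exact formulation: the function names (`layer_scores`, `prune_mask`, `prune_hidden_layer`), the use of absolute weights as edge strengths, the per-layer forward propagation, and the damping value of 0.85 are all assumptions made for illustration.

```python
import numpy as np

def layer_scores(weights, damping=0.85):
    """Propagate PageRank-style importance forward through an MLP.

    weights: list of arrays; weights[i] has shape (n_out, n_in).
    Returns one score vector per layer, starting with the input layer.
    (Hypothetical helper; the scoring rule is an assumption.)
    """
    n_in = weights[0].shape[1]
    scores = [np.full(n_in, 1.0 / n_in)]      # uniform importance on the inputs
    for W in weights:
        A = np.abs(W)                          # edge strength = |weight| (assumption)
        col_sums = A.sum(axis=0, keepdims=True)
        col_sums[col_sums == 0] = 1.0          # avoid division by zero
        P = A / col_sums                       # each source neuron splits its score
        n_out = W.shape[0]
        s = damping * P @ scores[-1] + (1.0 - damping) / n_out
        scores.append(s)
    return scores

def prune_mask(score, keep_ratio):
    """Boolean mask that keeps the top-scoring fraction of neurons."""
    k = max(1, int(round(keep_ratio * score.size)))
    keep = np.argsort(score)[-k:]
    mask = np.zeros(score.size, dtype=bool)
    mask[keep] = True
    return mask

def prune_hidden_layer(weights, layer, keep_ratio):
    """Remove low-scoring neurons of hidden layer `layer` (index into the scores list)."""
    scores = layer_scores(weights)
    mask = prune_mask(scores[layer], keep_ratio)
    pruned = [W.copy() for W in weights]
    pruned[layer - 1] = pruned[layer - 1][mask, :]  # rows are this layer's neurons
    pruned[layer] = pruned[layer][:, mask]          # matching input columns of the next layer
    return pruned, mask
```

Because whole neurons are removed, the pruned weight matrices are simply smaller dense matrices, which is what makes the sparsity structured. For example, calling `prune_hidden_layer(weights, layer=1, keep_ratio=0.5)` on a network whose first hidden layer has 8 neurons returns matrices in which that layer has 4.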
Inference with structured sparsity and quantization.