Pruning Filters for Efficient ConvNets

Li, Hao; Kadav, Asim; Durdanovic, Igor; Samet, Hanan; Graf, Hans Peter

arXiv:1608.08710 (cs)

[Submitted on 31 Aug 2016 (v1), last revised 10 Mar 2017 (this version, v3)]

Abstract: The success of CNNs in various applications is accompanied by a significant increase in the computation and parameter storage costs. Recent efforts toward reducing these overheads involve pruning and compressing the weights of various layers without hurting original accuracy. However, magnitude-based pruning of weights reduces a significant number of parameters from the fully connected layers and may not adequately reduce the computation costs in the convolutional layers due to irregular sparsity in the pruned networks. We present an acceleration method for CNNs, where we prune filters from CNNs that are identified as having a small effect on the output accuracy. By removing whole filters in the network together with their connecting feature maps, the computation costs are reduced significantly. In contrast to pruning weights, this approach does not result in sparse connectivity patterns. Hence, it does not need the support of sparse convolution libraries and can work with existing efficient BLAS libraries for dense matrix multiplications. We show that even simple filter pruning techniques can reduce inference costs for VGG-16 by up to 34% and ResNet-110 by up to 38% on CIFAR10 while regaining close to the original accuracy by retraining the networks.
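
As a rough illustration of the idea in the abstract, the Python sketch below removes whole filters from one convolutional layer together with the matching input channels of the next layer, so both layers stay dense. It assumes filters are ranked by the sum of their absolute weights, which is one simple criterion; the abstract itself does not name one. The function names, the nested-list weight layout, and the toy numbers are hypothetical and chosen only for illustration, and the retraining step mentioned in the abstract is not shown.

```python
# Assumed layout: weights[f][c] is a flat list of kernel values
# for output filter f and input channel c (no framework required).

def filter_l1_norms(weights):
    """Sum of absolute kernel values for each output filter."""
    return [sum(abs(v) for kernel in filt for v in kernel) for filt in weights]

def prune_filters(weights, next_weights, num_prune):
    """Drop the num_prune filters with the smallest L1 norm from `weights`
    and the corresponding input channels from `next_weights`."""
    norms = filter_l1_norms(weights)
    ranked = sorted(range(len(weights)), key=lambda f: norms[f])
    drop = set(ranked[:num_prune])
    keep = [f for f in range(len(weights)) if f not in drop]
    pruned = [weights[f] for f in keep]
    # Each removed filter removes one feature map, so the next layer
    # loses the kernel slice that used to read that feature map.
    pruned_next = [[filt[c] for c in keep] for filt in next_weights]
    return pruned, pruned_next, keep

def conv_macs(weights, out_h, out_w):
    """Multiply-accumulate count of a dense convolution layer."""
    n_out, n_in, k_vals = len(weights), len(weights[0]), len(weights[0][0])
    return n_out * n_in * k_vals * out_h * out_w

def make_layer(scales, n_in, k):
    """Toy layer: filter i has every kernel value equal to scales[i]."""
    return [[[s] * (k * k) for _ in range(n_in)] for s in scales]

# Toy example: a 4-filter layer followed by a 5-filter layer, 3x3 kernels.
layer1 = make_layer([0.5, 0.2, 0.05, 0.9], n_in=3, k=3)
layer2 = make_layer([0.3] * 5, n_in=4, k=3)

pruned1, pruned2, kept = prune_filters(layer1, layer2, num_prune=1)

before = conv_macs(layer1, 32, 32) + conv_macs(layer2, 32, 32)
after = conv_macs(pruned1, 32, 32) + conv_macs(pruned2, 32, 32)
print("kept filters:", kept)
print("MACs before/after:", before, after)
```

Because entire filters and their input slices are removed, the remaining weights form smaller dense tensors rather than sparse ones, which is why no sparse convolution library is needed.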

Comments:	Published as a conference paper at ICLR 2017
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1608.08710 [cs.CV]
	(or arXiv:1608.08710v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1608.08710

Submission history

From: Hao Li
[v1] Wed, 31 Aug 2016 02:29:59 UTC (2,623 KB)
[v2] Thu, 15 Sep 2016 02:12:36 UTC (2,624 KB)
[v3] Fri, 10 Mar 2017 17:57:56 UTC (7,203 KB)
