Effective Model Compression via Stage-wise Pruning
Published in | International Journal of Automation and Computing, Vol. 20, No. 6, pp. 937-951 |
Main Authors | , , |
Format | Journal Article |
Language | English |
Published | Berlin/Heidelberg: Springer Berlin Heidelberg; Springer Nature B.V., 01.12.2023 |
Summary: | Automated machine learning (AutoML) pruning methods aim to search automatically for a pruning strategy that reduces the computational complexity of deep convolutional neural networks (deep CNNs). However, previous work has found that the results of many AutoML pruning methods cannot even surpass those of uniform pruning. This paper shows that the ineffectiveness of AutoML pruning is caused by insufficient and unfair training of the supernet. A deep supernet suffers from insufficient training because it contains too many candidates. To overcome this, a stage-wise pruning (SWP) method is proposed, which splits a deep supernet into several stage-wise supernets to reduce the number of candidates and uses inplace distillation to supervise the stage training. In addition, a wide supernet suffers from unfair training because the sampling probability of each channel is unequal; therefore, the fullnet and the tinynet are sampled in every training iteration to ensure that every channel is trained sufficiently. Remarkably, the proxy performance of subnets trained with SWP is closer to their actual performance than in most previous AutoML pruning work. Furthermore, experiments show that SWP achieves state-of-the-art results on both CIFAR-10 and ImageNet under the mobile setting. |
---|---|
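The fairness fix described in the summary, always sampling the fullnet (maximum widths) and the tinynet (minimum widths) in each training iteration alongside randomly sampled subnets, can be sketched as follows. This is a minimal illustrative sketch of that sampling rule only; the function name, the representation of subnets as per-layer channel counts, and the number of random subnets are assumptions, not the authors' implementation.

```python
import random

def sample_subnets(channel_choices, num_random=2, rng=None):
    """Per-iteration subnet sampling (illustrative sketch).

    channel_choices: one list per layer, giving the candidate channel
    counts for that layer (e.g., [[8, 16, 32], [16, 32, 64]]).
    Returns a list of subnet configurations, each a per-layer channel
    count. The fullnet and the tinynet are always included, so the
    widest and narrowest channels receive gradient updates every
    iteration instead of depending on random sampling alone.
    """
    rng = rng or random.Random()
    fullnet = [max(c) for c in channel_choices]   # widest subnet
    tinynet = [min(c) for c in channel_choices]   # narrowest subnet
    randoms = [[rng.choice(c) for c in channel_choices]
               for _ in range(num_random)]        # random widths
    return [fullnet, tinynet] + randoms
```

In a training loop, each sampled configuration would be run as a forward pass through the shared supernet weights, with the fullnet's outputs serving as the inplace-distillation targets for the smaller subnets.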
ISSN: | 2731-538X 1476-8186 2153-182X 2731-5398 1751-8520 2153-1838 |
DOI: | 10.1007/s11633-022-1357-9 |