Bimodal-Distributed Binarized Neural Networks

Bibliographic Details
Published in: Mathematics (Basel), Vol. 10, No. 21, p. 4107
Main Authors: Rozen, Tal; Kimhi, Moshe; Chmiel, Brian; Mendelson, Avi; Baskin, Chaim
Format: Journal Article
Language: English
Published: Basel: MDPI AG, 01.11.2022

Summary: Binary neural networks (BNNs) are an extremely promising method for significantly reducing deep neural networks' complexity and power consumption. Binarization techniques, however, suffer from non-negligible performance degradation compared to their full-precision counterparts. Prior work mainly focused on strategies for approximating the sign function during the forward and backward passes to reduce the quantization error of the binarization process. In this work, we propose a bimodal-distributed binarization method (BD-BNN). The newly proposed technique imposes a bimodal distribution on the network weights through kurtosis regularization. The method includes a teacher–trainer training scheme, termed weight distribution mimicking (WDM), which efficiently transfers the full-precision network's weight distribution to its binary counterpart. Preserving this distribution during binarization-aware training yields robust and informative binary feature maps and thus significantly reduces the generalization error of the BNN. Extensive evaluations on CIFAR-10 and ImageNet demonstrate that the newly proposed BD-BNN outperforms current state-of-the-art schemes.
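The kurtosis regularization mentioned in the summary can be sketched in a few lines. The abstract does not give the exact loss, so the penalty form and the target value below are assumptions for illustration: a symmetric two-point (±α) distribution has kurtosis exactly 1, so penalizing a layer's weight kurtosis for deviating from a target near 1 pushes the weights toward a bimodal shape before binarization.

```python
import numpy as np

def kurtosis(w):
    """Sample kurtosis E[(w - mu)^4] / sigma^4.

    Roughly 3 for Gaussian-distributed weights; exactly 1 for a
    symmetric two-point (bimodal) distribution at +/-alpha.
    """
    mu = w.mean()
    sigma = w.std()
    return float(((w - mu) ** 4).mean() / sigma ** 4)

def kurtosis_penalty(w, target=1.0):
    # Hypothetical regularizer: squared deviation of the layer's
    # weight kurtosis from a bimodal target. target=1.0 is an
    # illustrative assumption, not the paper's exact hyperparameter.
    return (kurtosis(w) - target) ** 2

# Weights already concentrated at +/-1 incur no penalty.
bimodal = np.array([1.0, -1.0, 1.0, -1.0])
print(kurtosis_penalty(bimodal))  # 0.0
```

During binarization-aware training, a term like `lam * kurtosis_penalty(w)` would presumably be added per layer to the task loss, so gradient descent reshapes the full-precision weights toward two modes; the per-layer weighting is likewise an assumption here.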
ISSN: 2227-7390
DOI: 10.3390/math10214107