Boundary between noise and information applied to filtering neural network weight matrices

Deep neural networks have been successfully applied to a broad range of problems where overparametrization yields weight matrices which are partially random. A comparison of weight matrix singular vectors to the Porter-Thomas distribution suggests that there is a boundary between randomness and lear...

Full description

Saved in:

Bibliographic Details
Published in	arXiv.org
Main Authors	Staats, Max, Thamm, Matthias, Rosenow, Bernd
Format	Paper Journal Article
Language	English
Published	Ithaca Cornell University Library, arXiv.org 08.06.2022
Subjects	Algorithms Artificial neural networks Computer Science - Learning Filtration Mathematical analysis Physics - Disordered Systems and Neural Networks
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Deep neural networks have been successfully applied to a broad range of problems where overparametrization yields weight matrices which are partially random. A comparison of weight matrix singular vectors to the Porter-Thomas distribution suggests that there is a boundary between randomness and learned information in the singular value spectrum. Inspired by this finding, we introduce an algorithm for noise filtering, which both removes small singular values and reduces the magnitude of large singular values to counteract the effect of level repulsion between the noise and the information part of the spectrum. For networks trained in the presence of label noise, we indeed find that the generalization performance improves significantly due to noise filtering.
ISSN:	2331-8422
DOI:	10.48550/arxiv.2206.03927