Optimizing Kernel Machines Using Deep Learning

Building highly nonlinear and nonparametric models is central to several state-of-the-art machine learning systems. Kernel methods form an important class of techniques that induce a reproducing kernel Hilbert space (RKHS) for inferring non-linear models through the construction of similarity functi...

Full description

Saved in:

Bibliographic Details
Published in	IEEE transaction on neural networks and learning systems Vol. 29; no. 11; pp. 5528 - 5540
Main Authors	Huan Song, Thiagarajan, Jayaraman J., Sattigeri, Prasanna, Spanias, Andreas
Format	Journal Article
Language	English
Published	United States IEEE 01.11.2018 The Institute of Electrical and Electronics Engineers, Inc. (IEEE) IEEE Computational Intelligence Society
Subjects	Artificial neural networks Case studies Computational modeling Computer applications Computer architecture Computing time Construction Data models Deep learning Deep neural networks (DNNs) Hilbert space Kernel kernel methods Learning algorithms Machine learning MATHEMATICS AND COMPUTING multiple kernel learning (MKL) Neural networks Nyström approximation Optimization Regularization Subspaces Task analysis Training
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Building highly nonlinear and nonparametric models is central to several state-of-the-art machine learning systems. Kernel methods form an important class of techniques that induce a reproducing kernel Hilbert space (RKHS) for inferring non-linear models through the construction of similarity functions from data. These methods are particularly preferred in cases where the training data sizes are limited and when prior knowledge of the data similarities is available. Despite their usefulness, they are limited by the computational complexity and their inability to support end-to-end learning with a task-specific objective. On the other hand, deep neural networks have become the de facto solution for end-to-end inference in several learning paradigms. In this paper, we explore the idea of using deep architectures to perform kernel machine optimization, for both computational efficiency and end-to-end inferencing. To this end, we develop the deep kernel machine optimization framework, that creates an ensemble of dense embeddings using Nyström kernel approximations and utilizes deep learning to generate task-specific representations through the fusion of the embeddings. Intuitively, the filters of the network are trained to fuse information from an ensemble of linear subspaces in the RKHS. Furthermore, we introduce the kernel dropout regularization to enable improved training convergence. Finally, we extend this framework to the multiple kernel case, by coupling a global fusion layer with pretrained deep kernel machines for each of the constituent kernels. Using case studies with limited training data, and lack of explicit feature sources, we demonstrate the effectiveness of our framework over conventional model inferencing techniques.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 AC52-07NA27344 USDOE National Nuclear Security Administration (NNSA) LLNL-JRNL-753878
ISSN:	2162-237X 2162-2388
DOI:	10.1109/TNNLS.2018.2804895