Selecting a neural network architecture for a supervised machine learning problem

Systems and methods for selecting a neural network for a machine learning problem are disclosed. A method includes accessing an input matrix. The method includes accessing a machine learning problem space associated with a machine learning problem and multiple untrained candidate neural networks for...

Full description

Saved in:
Bibliographic Details
Main Authors Amizadeh, Saeed, Casale, Francesco Paolo, Fusi, Nicolo, Yang, Ge
Format Patent
LanguageEnglish
Published 28.05.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Systems and methods for selecting a neural network for a machine learning problem are disclosed. A method includes accessing an input matrix. The method includes accessing a machine learning problem space associated with a machine learning problem and multiple untrained candidate neural networks for solving the machine learning problem. The method includes computing, for each untrained candidate neural network, at least one expressivity measure capturing an expressivity of the candidate neural network with respect to the machine learning problem. The method includes computing, for each untrained candidate neural network, at least one trainability measure capturing a trainability of the candidate neural network with respect to the machine learning problem. The method includes selecting, based on the at least one expressivity measure and the at least one trainability measure, at least one candidate neural network for solving the machine learning problem. The method includes providing an output representing the selected at least one candidate neural network.
Bibliography:Application Number: US201815976514