Compiling code for a machine learning model for execution on a specialized processor

The subject technology receives a neural network model in a model format, the model format including information for a set of layers of the neural network model, each layer of the set of layers including a set of respective operations. The subject technology generates neural network (NN) code from t...

Full description

Saved in:
Bibliographic Details
Main Authors Paek, Timothy S, Jeong, Minwoo, Kaur, Harveen, Dhanani, Jamil, Avery, Keith P, Westing, Brandt M, Rossi, Francesco, Shi, Xiaojin
Format Patent
LanguageEnglish
Published 16.11.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The subject technology receives a neural network model in a model format, the model format including information for a set of layers of the neural network model, each layer of the set of layers including a set of respective operations. The subject technology generates neural network (NN) code from the neural network model, the NN code being in a programming language distinct from the model format, and the NN code comprising a respective memory allocation for each respective layer of the set of layers of the neural network model, where the generating comprises determining the respective memory allocation for each respective layer based at least in part on a resource constraint of a target device. The subject technology compiles the NN code into a binary format. The subject technology generates a package for deploying the compiled NN code on the target device.
Bibliography:Application Number: US201916583191