EFFICIENT SIMULTANEOUS INFERENCE COMPUTATION FOR MULTIPLE NEURAL NETWORKS

Bibliographic Details
Main Authors: Staneker, Dirk; Horst, Hans-Georg; Dressler, Wolfgang; Wacker, Nicolai; Matschke, Thomas
Format: Patent
Language: English
Published: 16.09.2021

Summary: A method for the inference computation of a plurality of neural networks on a hardware platform. Each of the neural networks comprises a plurality of neurons, which respectively aggregate inputs into a network input using a transfer function characterized by weights and process this network input into an activation using an activation function. The method includes: identifying at least one unit, which comprises one or multiple transfer functions and/or complete neurons and exists in at least two of the networks in the same form or in a form that is similar according to a predefined criterion; performing a single inference computation for the unit on the hardware platform so that the unit provides a set of outputs; and processing this set of outputs in the respective networks as an output of the unit. A method for the simultaneous execution of multiple applications is also provided.
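
The three steps in the summary (identify a shared unit, compute it once, reuse its outputs in each network) can be illustrated with a minimal Python sketch. This is not the patented implementation: the names `dense`, `similar`, and `run_networks`, the ReLU activation, the two-stage network layout, and the weight tolerance used as the "predefined criterion" are all assumptions made for illustration.

```python
def dense(inputs, weights, biases):
    """Weighted-sum transfer function followed by a ReLU activation (assumed)."""
    return [max(0.0, sum(w * x for w, x in zip(row, inputs)) + b)
            for row, b in zip(weights, biases)]


def similar(unit_a, unit_b, tol=1e-3):
    """Hypothetical similarity criterion: same shape, all weights within tol."""
    (wa, ba), (wb, bb) = unit_a, unit_b
    if len(wa) != len(wb) or len(ba) != len(bb):
        return False
    if any(len(ra) != len(rb) for ra, rb in zip(wa, wb)):
        return False
    flat_a = [v for row in wa for v in row] + list(ba)
    flat_b = [v for row in wb for v in row] + list(bb)
    return all(abs(a - b) <= tol for a, b in zip(flat_a, flat_b))


def run_networks(x, networks):
    """Run every (first_unit, head_unit) network on input x.

    A first unit that matches one already evaluated is not recomputed;
    its cached outputs are fed to the head of the current network.
    Returns the per-network outputs and the number of first-unit evaluations.
    """
    cache = []          # (unit, outputs) pairs computed so far
    results = []
    evaluations = 0
    for first, head in networks:
        hidden = next((out for unit, out in cache if similar(unit, first)), None)
        if hidden is None:
            hidden = dense(x, *first)   # single inference computation for the unit
            evaluations += 1
            cache.append((first, hidden))
        results.append(dense(hidden, *head))
    return results, evaluations


# Two networks whose first units differ only within the tolerance.
shared = ([[1.0, -1.0], [0.5, 0.5]], [0.0, 0.1])
near = ([[1.0, -1.0], [0.5, 0.5004]], [0.0, 0.1])
head_a = ([[1.0, 1.0]], [0.0])
head_b = ([[2.0, -1.0]], [0.5])
x = [2.0, 1.0]

outputs, evaluations = run_networks(x, [(shared, head_a), (near, head_b)])
print(outputs, evaluations)
```

In this sketch the second network reuses the first network's unit outputs instead of evaluating its own nearly identical unit, so a small approximation error is accepted in exchange for one fewer computation. How tight the criterion should be is a design choice the summary leaves open.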
Bibliography: Application Number: US202117192250