EFFICIENT SIMULTANEOUS INFERENCE COMPUTATION FOR MULTIPLE NEURAL NETWORKS
A method for the inference computation of a plurality of neural networks on a hardware platform. Each of the neural networks comprise a plurality of neurons, which respectively aggregate inputs into a network input using a transfer function characterized by weights and process this network input int...
Saved in:
Main Authors | , , , , |
---|---|
Format | Patent |
Language | English |
Published |
16.09.2021
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | A method for the inference computation of a plurality of neural networks on a hardware platform. Each of the neural networks comprise a plurality of neurons, which respectively aggregate inputs into a network input using a transfer function characterized by weights and process this network input into an activation using an activation function. The method includes: identifying at least one unit, which comprises one or multiple transfer functions and/or complete neurons and exists in at least two of the networks in the same form or in a form that is similar according to a predefined criterion; performing a single inference computation for the unit on the hardware platform so that the unit provides a set of outputs; processing this set of outputs in the respective networks as an output of the unit. A method for the simultaneous execution of multiple applications is also provided. |
---|---|
Bibliography: | Application Number: US202117192250 |