TECHNIQUES FOR ACCELERATING NEURAL NETWORKS

Embodiments are generally directed to techniques for accelerating neural networks. Many embodiments include a hardware accelerator for a bi-directional multi-layered GRU and LC neural network. Some embodiments are particularly directed to a hardware accelerator that enables offloading of the entire...

Full description

Saved in:
Bibliographic Details
Main Authors SUBRAMONEY, Sreenivas, KALSI, Gurpreet S, CHAKENALLI NANJEGOWDA, Ramachandra, PILLAI, Kamlesh R
Format Patent
LanguageEnglish
French
German
Published 20.12.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Embodiments are generally directed to techniques for accelerating neural networks. Many embodiments include a hardware accelerator for a bi-directional multi-layered GRU and LC neural network. Some embodiments are particularly directed to a hardware accelerator that enables offloading of the entire LC+GRU network to the hardware accelerator. Various embodiments include a hardware accelerator with a plurality of matrix vector units to perform GRU steps in parallel with LC steps. For example, at least a portion of computation by a first matrix vector unit of a GRU step in a neural network may overlap at least a portion of computation by a second matrix vector unit of an output feature vector for the neural network. Several embodiments include overlapping computation associated with a layer of a neural network with data transfer associated with another of the neural network.
Bibliography:Application Number: EP20220753101