Techniques for Accelerating Neural Networks

Embodiments are generally directed to techniques for accelerating neural networks. Many embodiments include a hardware accelerator for a bi-directional multi-layered GRU and LC neural network. Some embodiments are particularly directed to a hardware accelerator that enables offloading of the entire...

Full description

Saved in:
Bibliographic Details
Main Authors Subramoney, Sreenivas, Kalsi, Gurpreet S, Pillai, Kamlesh R, Chakenalli Nanjegowda, Ramachandra
Format Patent
LanguageEnglish
Published 03.06.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Embodiments are generally directed to techniques for accelerating neural networks. Many embodiments include a hardware accelerator for a bi-directional multi-layered GRU and LC neural network. Some embodiments are particularly directed to a hardware accelerator that enables offloading of the entire LC+GRU network to the hardware accelerator. Various embodiments include a hardware accelerator with a plurality of matrix vector units to perform GRU steps in parallel with LC steps. For example, at least a portion of computation by a first matrix vector unit of a GRU step in a neural network may overlap at least a portion of computation by a second matrix vector unit of an output feature vector for the neural network. Several embodiments include overlapping computation associated with a layer of a neural network with data transfer associated with another of the neural network.
Bibliography:Application Number: US202117172627