Transposed convolution using systolic array

In one example, a neural network accelerator can execute a set of instructions to: load a first weight data element from a memory into a systolic array, the first weight data element having first coordinates; extract, from the instructions, information indicating a first subset of input data element...

Full description

Saved in:
Bibliographic Details
Main Authors Vivekraja, Vignesh, Huynh, Jeffrey T
Format Patent
LanguageEnglish
Published 09.04.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In one example, a neural network accelerator can execute a set of instructions to: load a first weight data element from a memory into a systolic array, the first weight data element having first coordinates; extract, from the instructions, information indicating a first subset of input data elements to be obtained from the memory, the first subset being based on a stride of a transposed convolution operation and second coordinates of first weight data element in a rotated array of weight data elements; based on the information, obtain the first subset of input data elements from the memory; load the first subset of input data elements into the systolic array; and control the systolic array to perform first computations based on the first weight data element and the first subset of input data elements to generate output data elements of an array of output data elements.
Bibliography:Application Number: US202318134726