Flexible-access instructions for efficient access of ML data

A graphics processing unit (GPU) and a method is disclosed that performs a convolution operation recast as a matrix multiplication operation. The GPU includes a register file, a processor and a state machine. The register file stores data of an input feature map and data of a filter weight kernel. T...

Full description

Saved in:
Bibliographic Details
Main Authors Frascati, Christopher P, Waters, Simon, Harihara, Rama S. B, Tannenbaum, David C
Format Patent
LanguageEnglish
Published 30.04.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A graphics processing unit (GPU) and a method is disclosed that performs a convolution operation recast as a matrix multiplication operation. The GPU includes a register file, a processor and a state machine. The register file stores data of an input feature map and data of a filter weight kernel. The processor performs a convolution operation on data of the input feature map and data of the filter weight kernel as a matrix multiplication operation. The state machine facilitates performance of the convolution operation by unrolling the data of the input feature map and the data of the filter weight kernel in the register file. The state machine includes control registers that determine movement of data through the register file to perform the matrix multiplication operation on the data in the register file in an unrolled manner.
Bibliography:Application Number: US202117173203