HYBRID-SPARSE NPU WITH FINE-GRAINED STRUCTURED SPARSITY

A neural processing unit is disclosed that supports dual-sparsity modes. A weight buffer is configured to store weight values in an arrangement selected from a structured weight sparsity arrangement or a random weight sparsity arrangement. A weight multiplexer array is configured to output one or mo...

Full description

Saved in:
Bibliographic Details
Main Authors SHIN, Jong Hoon, PEDRAM, Ardavan, HASSOUN, Joseph
Format Patent
LanguageEnglish
French
German
Published 27.03.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A neural processing unit is disclosed that supports dual-sparsity modes. A weight buffer is configured to store weight values in an arrangement selected from a structured weight sparsity arrangement or a random weight sparsity arrangement. A weight multiplexer array is configured to output one or more weight values stored in the weight buffer as first operand values based on the selected weight sparsity arrangement. An activation buffer is configured to store activation values. An activation multiplexer array includes inputs to the activation multiplexer array that are coupled to the activation buffer, and is configured to output one or more activation values stored in the activation buffer as second operand values in which each respective second operand value and a corresponding first operand value forming an operand value pair. A multiplier array is configured to output a product value for each operand value pair.
Bibliography:Application Number: EP20230190796