DISTRIBUTED PROCESSING ARCHITECTURE

Embodiments of the present disclosure include techniques for processing neural networks. Various forms of parallelism may be implemented using topology that combines sequences of processors. In one embodiment, the present disclosure includes a computer system comprising a plurality of processor grou...

Full description

Saved in:
Bibliographic Details
Main Authors HOEFLER, Torsten, BELK, Jonathan R, HEDDES, Mattheus C, GOEL, Deepak
Format Patent
LanguageEnglish
Published 08.07.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Embodiments of the present disclosure include techniques for processing neural networks. Various forms of parallelism may be implemented using topology that combines sequences of processors. In one embodiment, the present disclosure includes a computer system comprising a plurality of processor groups, the processor groups each comprising a plurality of processors. A plurality of network switches are coupled to subsets of the plurality of processor groups. A subset of the processors in the processor groups may be configurable to form sequences, and the network switches are configurable to form at least one sequence across one or more of the plurality of processor groups to perform neural network computations. Various alternative configurations for creating Hamiltonian cycles are disclosed to support data parallelism, pipeline parallelism, layer parallelism, or combinations thereof.
Bibliography:Application Number: US202016734092