PROCESSING IMAGES USING MIXTURE OF EXPERTS

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating predictions about images. One of the systems includes a neural network comprising a sequence of one or more network blocks that are each configured to perform operations comprising: obtaini...

Full description

Saved in:
Bibliographic Details
Main Authors Riquelme Ruiz, Carlos, Mustafa, Basil, Keysers, Daniel M, Neumann, Maxim, Houlsby, Neil Matthew Tinmouth, Jenatton, Rodolphe, Susano Pinto, André, Puigcerver i Perez, Joan
Format Patent
LanguageEnglish
Published 29.08.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating predictions about images. One of the systems includes a neural network comprising a sequence of one or more network blocks that are each configured to perform operations comprising: obtaining a block input that represents an intermediate representation of an input image; determining a plurality of patches of the block input or of an updated representation of the block input, wherein each patch comprises a different subset of elements of the block input or of the updated representation of the block input; assigning each patch to one or more respective expert modules of a plurality of expert modules of the network block; for each patch of the plurality of patches, processing the patch using the corresponding expert modules to generate respective module outputs; and generating a block output by combining the module outputs.
Bibliography:Application Number: US202218564915