System coherency in a distributed graphics processor hierarchy
Methods and systems may provide for executing, by a physically distributed set of compute slices, a plurality of work items. Additionally, the coherency of one or more memory lines associated with the plurality of work items may be maintained, by a cache fabric, across a graphics processor, a system...
Saved in:
Main Authors | , |
---|---|
Format | Patent |
Language | Chinese English |
Published |
16.10.2015
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Methods and systems may provide for executing, by a physically distributed set of compute slices, a plurality of work items. Additionally, the coherency of one or more memory lines associated with the plurality of work items may be maintained, by a cache fabric, across a graphics processor, a system memory and one or more host processors. In one example, a plurality of crossbar nodes track the one or more memory lines, wherein the coherency of the one or more memory lines is maintained across a plurality of level one (L1) caches and a physically distributed cache structure. Each L1 cache may be dedicated to an execution block of a compute slice and each crossbar node may be dedicated to a compute slice. |
---|---|
Bibliography: | Application Number: TW20154103891 |