PIUMA: Programmable Integrated Unified Memory Architecture

High performance large scale graph analytics is essential to timely analyze relationships in big data sets. Conventional processor architectures suffer from inefficient resource usage and bad scaling on graph workloads. To enable efficient and scalable graph analysis, Intel developed the Programmabl...

Full description

Saved in:
Bibliographic Details
Main Authors Aananthakrishnan, Sriram, Ahmed, Nesreen K, Cave, Vincent, Cintra, Marcelo, Demir, Yigit, Bois, Kristof Du, Eyerman, Stijn, Fryman, Joshua B, Ganev, Ivan, Heirman, Wim, Hoppe, Hans-Christian, Howard, Jason, Hur, Ibrahim, Kodiyath, MidhunChandra, Jain, Samkit, Klowden, Daniel S, Landowski, Marek M, Montigny, Laurent, More, Ankit, Ossowski, Przemyslaw, Pawlowski, Robert, Pepperling, Nick, Petrini, Fabrizio, Sikora, Mariusz, Seshasayee, Balasubramanian, Smith, Shaden, Szkoda, Sebastian, Tayal, Sanjaya, Tithi, Jesmin Jahan, Vandriessche, Yves, Wrosz, Izajasz P
Format Journal Article
LanguageEnglish
Published 13.10.2020
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:High performance large scale graph analytics is essential to timely analyze relationships in big data sets. Conventional processor architectures suffer from inefficient resource usage and bad scaling on graph workloads. To enable efficient and scalable graph analysis, Intel developed the Programmable Integrated Unified Memory Architecture (PIUMA). PIUMA consists of many multi-threaded cores, fine-grained memory and network accesses, a globally shared address space and powerful offload engines. This paper presents the PIUMA architecture, and provides initial performance estimations, projecting that a PIUMA node will outperform a conventional compute node by one to two orders of magnitude. Furthermore, PIUMA continues to scale across multiple nodes, which is a challenge in conventional multinode setups.
DOI:10.48550/arxiv.2010.06277