MUSA: A Multi-level Simulation Approach for Next-Generation HPC Machines

The complexity of High Performance Computing (HPC) systems is increasing in the number of components and their heterogeneity. Interactions between software and hardware involve many different aspects which are typically not transparent to scientific programmers and system architects. Therefore, pred...

Full description

Saved in:
Bibliographic Details
Published inSC16: International Conference for High Performance Computing, Networking, Storage and Analysis pp. 526 - 537
Main Authors Grass, Thomas, Allande, Cesar, Armejach, Adria, Rico, Alejandro, Ayguade, Eduard, Labarta, Jesus, Valero, Mateo, Casas, Marc, Moreto, Miquel
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.11.2016
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The complexity of High Performance Computing (HPC) systems is increasing in the number of components and their heterogeneity. Interactions between software and hardware involve many different aspects which are typically not transparent to scientific programmers and system architects. Therefore, predicting the behavior of current scientific applications on future HPC infrastructures is a challenging task. In this paper we present MUSA, an end-to-end methodology that employs a multi-level simulation infrastructure. By combining different levels of abstraction, MUSA is able to model the communication network, microarchitectural details and system software interactions, providing different trade-offs in terms of simulation cost and accuracy. We compare detailed MUSA simulations with native executions of up to 2,048 cores and find relative errors that are within 10% in the common case. In addition, we use MUSA to simulate up to 16,384 cores and successfully identify scalability bottlenecks due to different factors, e.g. memory contention or load imbalance. We also compare different system configurations, showing how MUSA can help system designers to assess the usefulness of future technologies in next-generation HPC machines.
AbstractList The complexity of High Performance Computing (HPC) systems is increasing in the number of components and their heterogeneity. Interactions between software and hardware involve many different aspects which are typically not transparent to scientific programmers and system architects. Therefore, predicting the behavior of current scientific applications on future HPC infrastructures is a challenging task. In this paper we present MUSA, an end-to-end methodology that employs a multi-level simulation infrastructure. By combining different levels of abstraction, MUSA is able to model the communication network, microarchitectural details and system software interactions, providing different trade-offs in terms of simulation cost and accuracy. We compare detailed MUSA simulations with native executions of up to 2,048 cores and find relative errors that are within 10% in the common case. In addition, we use MUSA to simulate up to 16,384 cores and successfully identify scalability bottlenecks due to different factors, e.g. memory contention or load imbalance. We also compare different system configurations, showing how MUSA can help system designers to assess the usefulness of future technologies in next-generation HPC machines.
Author Allande, Cesar
Rico, Alejandro
Ayguade, Eduard
Casas, Marc
Moreto, Miquel
Labarta, Jesus
Valero, Mateo
Grass, Thomas
Armejach, Adria
Author_xml – sequence: 1
  givenname: Thomas
  surname: Grass
  fullname: Grass, Thomas
– sequence: 2
  givenname: Cesar
  surname: Allande
  fullname: Allande, Cesar
– sequence: 3
  givenname: Adria
  surname: Armejach
  fullname: Armejach, Adria
– sequence: 4
  givenname: Alejandro
  surname: Rico
  fullname: Rico, Alejandro
– sequence: 5
  givenname: Eduard
  surname: Ayguade
  fullname: Ayguade, Eduard
– sequence: 6
  givenname: Jesus
  surname: Labarta
  fullname: Labarta, Jesus
– sequence: 7
  givenname: Mateo
  surname: Valero
  fullname: Valero, Mateo
– sequence: 8
  givenname: Marc
  surname: Casas
  fullname: Casas, Marc
– sequence: 9
  givenname: Miquel
  surname: Moreto
  fullname: Moreto, Miquel
BookMark eNotzEFPgzAYgOFqNHFOLl699A-Abb_Sr3gjRIfJUBPceSnwEWsYEGBG_70m8_QenuS9Zhf90BNjt1JEUorkvswiJaSJtD5jQYJWaoNgrYzhnK2UNBhqALxiwTx_CiFACS2tWLG82JXpA095cewWH3b0RR0v_eHYucUPPU_HcRpc_cHbYeIv9L2EG-ppOmH-lvHiD31P8w27bF03U_DfNds9Pb5nebh93Txn6Tb0Uhkd6pikqxrlrKnQ2qRpWwVOJzVWsWjQmriq0DmtBcQNgk4EktLCQg02NlTBmt2dvp6I9uPkD2762aNFlArgFzmgSxM
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/SC.2016.44
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9781467388153
1467388157
EISSN 2167-4337
EndPage 537
ExternalDocumentID 7877123
Genre orig-research
GroupedDBID 6IE
6IF
6IH
6IK
6IL
6IN
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
OCL
RIE
RIL
ID FETCH-LOGICAL-i1264-45e1abd2a86b7889dff23a49c7b50d7865bb7aa44035d734907e24083c3856eb3
IEDL.DBID RIE
IngestDate Wed Jun 26 19:24:05 EDT 2024
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i1264-45e1abd2a86b7889dff23a49c7b50d7865bb7aa44035d734907e24083c3856eb3
OpenAccessLink https://upcommons.upc.edu/bitstream/2117/99179/1/MUSA%20A%20Multi-Level%20Simulation%20Approach.pdf
PageCount 12
ParticipantIDs ieee_primary_7877123
PublicationCentury 2000
PublicationDate 2016-Nov.
PublicationDateYYYYMMDD 2016-11-01
PublicationDate_xml – month: 11
  year: 2016
  text: 2016-Nov.
PublicationDecade 2010
PublicationTitle SC16: International Conference for High Performance Computing, Networking, Storage and Analysis
PublicationTitleAbbrev SC
PublicationYear 2016
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0003204180
Score 1.7708803
Snippet The complexity of High Performance Computing (HPC) systems is increasing in the number of components and their heterogeneity. Interactions between software and...
SourceID ieee
SourceType Publisher
StartPage 526
SubjectTerms Computational modeling
Microarchitecture
Next generation networking
Parallel processing
Programming
Runtime
System software
Title MUSA: A Multi-level Simulation Approach for Next-Generation HPC Machines
URI https://ieeexplore.ieee.org/document/7877123
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NSwMxFHy0PXlSacVvcvDobpPdfGy8lWJZhJZCLfRWkmwCRW1F24u_3iS7VhQP3kJySEhCXiaZmQdw46hwShOb-FjvAYq0JlFMesxjK0acctjWLN8JL-f0YcEWLbjda2GstZF8ZtNQjH_51cbswlNZ328u4U_aNrSFlLVWa_-ekmeYkgI3DqQEy_5sGJhbPKX0R-aUGDhGhzD-6rLmizylu61OzccvN8b_jukIet8SPTTdB59jaNl1F8rxfDa4QwMUZbXJcyAEodnqpUnRhQaNgTjyN1U0CZi3dp2OjeV0iMaRWmnfezAf3T8Oy6RJlZCsiL_SJJRZonSVqYJrD2pl5VyWKyqN0AxXouBMa6EUpThnlciph8Q2mJvlJi8Y94D6BDrrzdqeAnJaOsX82SOy4AZoZM6dyowuTMYNJvoMumEWlq-1G8aymYDzv6sv4CAsQq3eu4TO9m1nr3wY3-rruH6fBKucmA
link.rule.ids 310,311,783,787,792,793,799,27939,55088
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwED2VMsAEqEV844GRpPmwnZitqkABmqpSW6lbZTtnqQJaBO3Cr8dOQhGIgc2Kh0S2dXfPee8dwJWhiZEqRM_megtQBGpPMmExDxYsNNIEWLF8Bzyb0IcpmzbgeqOFQcSSfIa-G5b_8oulXrurso49XImNtFuwzVxdUam1NjcqcRTQMA1qD9IwEJ1Rz3G3uE_pj94pZeq424P866UVY-TJX6-Urz9--TH-96v2of0t0iPDTfo5gAYuWpDlk1H3hnRJKaz1nh0liIzmL3WTLtKtLcSJrVXJwKHeyne6nMyGPZKX5Ep8b8Pk7nbcy7y6WYI3D21R41GGoVRFJFOuLKwVhTFRLKnQiWJBkaScKZVISWkQsyKJqQXF6OzNYh2njFtIfQjNxXKBR0CMEkYyG32SyPkBahFzIyOtUh1xHYTqGFpuFWavlR_GrF6Ak78fX8JONs77s_794PEUdt2GVFq-M2iu3tZ4bpP6Sl2Ue_kJa_if5Q
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=SC16%3A+International+Conference+for+High+Performance+Computing%2C+Networking%2C+Storage+and+Analysis&rft.atitle=MUSA%3A+A+Multi-level+Simulation+Approach+for+Next-Generation+HPC+Machines&rft.au=Grass%2C+Thomas&rft.au=Allande%2C+Cesar&rft.au=Armejach%2C+Adria&rft.au=Rico%2C+Alejandro&rft.date=2016-11-01&rft.pub=IEEE&rft.eissn=2167-4337&rft.spage=526&rft.epage=537&rft_id=info:doi/10.1109%2FSC.2016.44&rft.externalDocID=7877123