MPI+X: task-based parallelisation and dynamic load balance of finite element assembly

The main computing phases of numerical methods for solving partial differential equations are the algebraic system assembly and the iterative solver. This work focuses on the first task, in the context of a hybrid MPI+X paradigm. The matrix assembly consists of a loop over the elements, faces, edges...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of computational fluid dynamics Vol. 33; no. 3; pp. 115 - 136
Main Authors Garcia-Gasulla, Marta, Houzeaux, Guillaume, Ferrer, Roger, Artigues, Antoni, López, Victor, Labarta, Jesús, Vázquez, Mariano
Format Journal Article
LanguageEnglish
Published Abingdon Taylor & Francis 16.03.2019
Taylor & Francis Ltd
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The main computing phases of numerical methods for solving partial differential equations are the algebraic system assembly and the iterative solver. This work focuses on the first task, in the context of a hybrid MPI+X paradigm. The matrix assembly consists of a loop over the elements, faces, edges or nodes of the MPI partitions to compute element matrices and vectors and then of their assemblies. In a MPI+X hybrid parallelism context, X has consisted traditionally of loop parallelism using OpenMP, with different techniques to avoid the race condition, but presenting efficiency or implementation drawbacks. We propose an alternative, based on task parallelism using some extensions to the OpenMP programming model. In addition, dynamic load balance will be applied, especially efficient in the presence of hybrid meshes. This paper presents the proposed methodology, its implementation and its validation through the solution of large computational mechanics problems up to 16k cores.
AbstractList The main computing phases of numerical methods for solving partial differential equations are the algebraic system assembly and the iterative solver. This work focuses on the first task, in the context of a hybrid MPI+X paradigm. The matrix assembly consists of a loop over the elements, faces, edges or nodes of the MPI partitions to compute element matrices and vectors and then of their assemblies. In a MPI+X hybrid parallelism context, X has consisted traditionally of loop parallelism using OpenMP, with different techniques to avoid the race condition, but presenting efficiency or implementation drawbacks. We propose an alternative, based on task parallelism using some extensions to the OpenMP programming model. In addition, dynamic load balance will be applied, especially efficient in the presence of hybrid meshes. This paper presents the proposed methodology, its implementation and its validation through the solution of large computational mechanics problems up to 16k cores.
Author Artigues, Antoni
López, Victor
Garcia-Gasulla, Marta
Houzeaux, Guillaume
Ferrer, Roger
Vázquez, Mariano
Labarta, Jesús
Author_xml – sequence: 1
  givenname: Marta
  surname: Garcia-Gasulla
  fullname: Garcia-Gasulla, Marta
  email: marta.garcia@bsc.es
  organization: Barcelona Supercomputing Center
– sequence: 2
  givenname: Guillaume
  surname: Houzeaux
  fullname: Houzeaux, Guillaume
  email: guillaume.houzeaux@bsc.es
  organization: Barcelona Supercomputing Center
– sequence: 3
  givenname: Roger
  surname: Ferrer
  fullname: Ferrer, Roger
  organization: Barcelona Supercomputing Center
– sequence: 4
  givenname: Antoni
  surname: Artigues
  fullname: Artigues, Antoni
  organization: Barcelona Supercomputing Center
– sequence: 5
  givenname: Victor
  surname: López
  fullname: López, Victor
  organization: Barcelona Supercomputing Center
– sequence: 6
  givenname: Jesús
  surname: Labarta
  fullname: Labarta, Jesús
  organization: Barcelona Supercomputing Center
– sequence: 7
  givenname: Mariano
  surname: Vázquez
  fullname: Vázquez, Mariano
  organization: Barcelona Supercomputing Center
BookMark eNp9kE1LxDAQhoOs4OdPEAIepesk_UjrSRG_QNGDC97CtJlA1zRZky6y_94uq1dPMwPP-w48R2zmgyfGzgTMBdRwKaASdVnJuQTRzEUl1HTtsUMBsslAlmq23SuRbaEDdpTSEgAaWYtDtnh5e7r4uOIjps-sxUSGrzCic-T6hGMfPEdvuNl4HPqOu4CGt-jQd8SD5bb3_UicHA3kR44p0dC6zQnbt-gSnf7OY7a4v3u_fcyeXx-ebm-es06W-ZgpKCQg1apQVsmqVFYqEobImrJobCFN25lOWKqsaLGr0YBoZVHbui0bpfL8mJ3velcxfK0pjXoZ1tFPL7WUuZIKoICJKndUF0NKkaxexX7AuNEC9Nag_jOotwb1r8Epd73L9d6GOOB3iM7oETcuRBsnBX3S-f8VPxFqeUk
CitedBy_id crossref_primary_10_1016_j_future_2020_01_045
crossref_primary_10_1016_j_compfluid_2022_105723
crossref_primary_10_1080_10618562_2020_1771325
crossref_primary_10_1016_j_compfluid_2022_105577
crossref_primary_10_1016_j_proci_2022_07_222
crossref_primary_10_1080_10618562_2020_1783440
Cites_doi 10.1016/j.compbiomed.2015.12.003
10.1080/10618560701816387
10.1016/S0307-904X(00)00043-3
10.1007/s11831-014-9126-8
10.1002/nme.2989
10.1142/S0129626411000151
10.1016/j.procs.2011.04.229
10.1016/0020-0190(92)90041-S
10.1080/10618562.2016.1227070
10.1016/S0045-7825(98)00005-X
10.1108/09615530310456796
10.1016/j.compfluid.2012.04.017
10.1002/nme.2973
10.1137/S1064827598337373
10.1137/1.9780898718003
10.1016/j.jocs.2015.12.007
10.1016/j.parco.2003.12.016
10.1002/nme.2932
10.1002/9780470989746
10.1016/j.compfluid.2011.01.017
10.1155/2010/273921
10.1016/0045-7825(89)90157-6
10.1201/b21930
10.1016/j.jcp.2009.05.019
ContentType Journal Article
Copyright 2019 Informa UK Limited, trading as Taylor & Francis Group 2019
2019 Informa UK Limited, trading as Taylor & Francis Group
Copyright_xml – notice: 2019 Informa UK Limited, trading as Taylor & Francis Group 2019
– notice: 2019 Informa UK Limited, trading as Taylor & Francis Group
DBID AAYXX
CITATION
7QH
7SC
7TB
7U5
7UA
8FD
C1K
F1W
FR3
H8D
H96
JQ2
KR7
L.G
L7M
L~C
L~D
DOI 10.1080/10618562.2019.1617856
DatabaseName CrossRef
Aqualine
Computer and Information Systems Abstracts
Mechanical & Transportation Engineering Abstracts
Solid State and Superconductivity Abstracts
Water Resources Abstracts
Technology Research Database
Environmental Sciences and Pollution Management
ASFA: Aquatic Sciences and Fisheries Abstracts
Engineering Research Database
Aerospace Database
Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources
ProQuest Computer Science Collection
Civil Engineering Abstracts
Aquatic Science & Fisheries Abstracts (ASFA) Professional
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Civil Engineering Abstracts
Aquatic Science & Fisheries Abstracts (ASFA) Professional
Technology Research Database
Computer and Information Systems Abstracts – Academic
Mechanical & Transportation Engineering Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Aqualine
Water Resources Abstracts
Environmental Sciences and Pollution Management
Computer and Information Systems Abstracts Professional
Aerospace Database
Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources
ASFA: Aquatic Sciences and Fisheries Abstracts
Solid State and Superconductivity Abstracts
Engineering Research Database
Advanced Technologies Database with Aerospace
DatabaseTitleList Civil Engineering Abstracts

DeliveryMethod fulltext_linktorsrc
Discipline Applied Sciences
EISSN 1029-0257
EndPage 136
ExternalDocumentID 10_1080_10618562_2019_1617856
1617856
Genre Original Articles
GrantInformation_xml – fundername: Brazilian Ministry of Science, Technology and Innovation
  grantid: SEV2015-0493; TIN2015-65316
  funderid: 10.13039/501100004837
– fundername: Intel-BSC Exascale Lab collaboration project
– fundername: European Union
– fundername: Horizon2020 framework
  grantid: 689772; 676629
– fundername: Spanish Ministry of Science and Innovation
  funderid: http://dx.doi.org/10.13039/501100004837
– fundername: Comissió Interdepartamental de Recerca i Innovació Tecnológica
  grantid: 2014-SGR-1051
– fundername: Spanish Government
– fundername: Energy oriented Centre of Excellence (EoCoE)
– fundername: Generalitat de Catalunya
  funderid: http://dx.doi.org/10.13039/501100002809
– fundername: European Union's Horizon 2020 Programme
GroupedDBID .7F
.QJ
0BK
0R~
29J
30N
4.4
5GY
5VS
AAAVI
AAENE
AAJMT
AALDU
AAMIU
AAPUL
AAQRR
ABBKH
ABCCY
ABFIM
ABHAV
ABJVF
ABLIJ
ABPEM
ABPTK
ABQHQ
ABTAI
ABXUL
ACGEJ
ACGFS
ACGOD
ACIWK
ACTIO
ADCVX
ADGTB
ADXPE
AEGYZ
AEISY
AENEX
AEOZL
AEPSL
AEYOC
AFKVX
AFOLD
AFRAH
AFWLO
AGDLA
AGMYJ
AHDLD
AIJEM
AIRXU
AJWEG
AKBVH
AKOOK
ALMA_UNASSIGNED_HOLDINGS
ALQZU
AQRUH
AVBZW
AWYRJ
BLEHA
CCCUG
CE4
CS3
DGEBU
DKSSO
DU5
EBS
EJD
E~A
E~B
F5P
FUNRP
FVPDL
GTTXZ
H13
HF~
HZ~
H~P
IPNFZ
J.P
KYCEM
LJTGL
M4Z
NA5
NX~
O9-
P2P
PQEST
PQQKQ
RIG
RNANH
ROSJB
RTWRZ
S-T
SNACF
TEN
TFL
TFT
TFW
TN5
TNC
TTHFI
TWF
UT5
UU3
V1K
ZGOLN
~S~
AAYXX
ABJNI
ABPAQ
ABXYU
AHDZW
CITATION
TBQAZ
TDBHL
TUROJ
7QH
7SC
7TB
7U5
7UA
8FD
C1K
F1W
FR3
H8D
H96
JQ2
KR7
L.G
L7M
L~C
L~D
ID FETCH-LOGICAL-c253t-70420ae8747f72657f27e1deefd549f42dbcdc1fe6f1bac8ad01b248f8b597733
ISSN 1061-8562
IngestDate Thu Oct 10 20:48:43 EDT 2024
Fri Aug 23 02:39:49 EDT 2024
Tue Jun 13 19:51:36 EDT 2023
IsPeerReviewed true
IsScholarly true
Issue 3
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c253t-70420ae8747f72657f27e1deefd549f42dbcdc1fe6f1bac8ad01b248f8b597733
PQID 2237270040
PQPubID 52945
PageCount 22
ParticipantIDs crossref_primary_10_1080_10618562_2019_1617856
informaworld_taylorfrancis_310_1080_10618562_2019_1617856
proquest_journals_2237270040
PublicationCentury 2000
PublicationDate 2019-03-16
PublicationDateYYYYMMDD 2019-03-16
PublicationDate_xml – month: 03
  year: 2019
  text: 2019-03-16
  day: 16
PublicationDecade 2010
PublicationPlace Abingdon
PublicationPlace_xml – name: Abingdon
PublicationTitle International journal of computational fluid dynamics
PublicationYear 2019
Publisher Taylor & Francis
Taylor & Francis Ltd
Publisher_xml – name: Taylor & Francis
– name: Taylor & Francis Ltd
References CIT0030
CIT0010
CIT0031
CIT0034
CIT0011
CIT0014
CIT0036
CIT0013
CIT0016
CIT0038
CIT0015
CIT0017
CIT0019
CIT0001
CIT0023
CIT0022
Belytschko T. (CIT0005) 2014
CIT0025
CIT0024
CIT0004
CIT0026
CIT0007
CIT0029
CIT0009
CIT0008
References_xml – ident: CIT0007
  doi: 10.1016/j.compbiomed.2015.12.003
– ident: CIT0016
  doi: 10.1080/10618560701816387
– ident: CIT0004
  doi: 10.1016/S0307-904X(00)00043-3
– ident: CIT0008
  doi: 10.1007/s11831-014-9126-8
– volume-title: Nonlinear Finite Elements for Continua and Structures
  year: 2014
  ident: CIT0005
  contributor:
    fullname: Belytschko T.
– ident: CIT0009
  doi: 10.1002/nme.2989
– ident: CIT0010
  doi: 10.1142/S0129626411000151
– ident: CIT0026
  doi: 10.1016/j.procs.2011.04.229
– ident: CIT0025
  doi: 10.1016/0020-0190(92)90041-S
– ident: CIT0015
  doi: 10.1080/10618562.2016.1227070
– ident: CIT0022
  doi: 10.1016/S0045-7825(98)00005-X
– ident: CIT0030
  doi: 10.1108/09615530310456796
– ident: CIT0014
  doi: 10.1016/j.compfluid.2012.04.017
– ident: CIT0001
  doi: 10.1002/nme.2973
– ident: CIT0036
  doi: 10.1137/S1064827598337373
– ident: CIT0029
  doi: 10.1137/1.9780898718003
– ident: CIT0034
  doi: 10.1016/j.jocs.2015.12.007
– ident: CIT0019
  doi: 10.1016/j.parco.2003.12.016
– ident: CIT0024
  doi: 10.1002/nme.2932
– ident: CIT0023
  doi: 10.1002/9780470989746
– ident: CIT0013
  doi: 10.1016/j.compfluid.2011.01.017
– ident: CIT0038
  doi: 10.1155/2010/273921
– ident: CIT0011
  doi: 10.1016/0045-7825(89)90157-6
– ident: CIT0031
  doi: 10.1201/b21930
– ident: CIT0017
  doi: 10.1016/j.jcp.2009.05.019
SSID ssj0009281
Score 2.2760353
Snippet The main computing phases of numerical methods for solving partial differential equations are the algebraic system assembly and the iterative solver. This work...
SourceID proquest
crossref
informaworld
SourceType Aggregation Database
Publisher
StartPage 115
SubjectTerms Assembly
CFD
Computer applications
Differential equations
distributed memory parallelism
dynamic load balance
Dynamic loads
finite element
Finite element method
hybrid parallelism
Iterative methods
Load balancing
Mathematical analysis
Mathematical models
Matrix algebra
Matrix methods
Mechanics
MPI
MPI+X
Numerical methods
OpenMP
Partial differential equations
shared-memory parallelism
Vectors
Vectors (mathematics)
Title MPI+X: task-based parallelisation and dynamic load balance of finite element assembly
URI https://www.tandfonline.com/doi/abs/10.1080/10618562.2019.1617856
https://www.proquest.com/docview/2237270040
Volume 33
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9NAEF6F9MIFyksUCtoDnCIHP-N1bwGaBNQWhBwRcbF27d0qqhujxr70B_E7mfGuHYdEvC5WtFbW0cyXeaxnviHklVAROubAUjwSlh95qRUhM6Lvi4CpNAuYg83J5xej2dz_uAgWvd6PTtVSVYpheru3r-R_tAproFfskv0HzbabwgJ8Bv3CFTQM17_S8fnnD6_dt4u6NoOvryx0SdkA2bzzXOamUKd-PZDpwfODvOAQcWI5Y1ofFqglxpwDqYvIBxBJy2uRb73q3T4z7DBNpPVAiOaGyqtl-5w2UJ_iqCJuTfkaUl1umoPK1hXMiupW8qp2gNMKByBV1y3SJvLGzNr-UlxuiojHIIfLSlu3MU5AXnZPLrBZyrN0Y2WNtXhniEinkgltMYQaFguMsZZmzYVtXM1p3RhwzaRhgOp1rLGjO0V3vIQuq8T9cXus74uGmOexYA8r98WnZDI_O0vi00V8hxy4YNBYnxyMZ--_fd3QO7v1ONz2Jze9Ysx-s_cxW1HQFkfuTkxQBzrxIblnMhQ61nB7QHpy9ZDcN9kKNb5g_YjMAX2DxQndII_-gjwKyKMGERSRRw3yaKGoRh41yKMN8h6T-eQ0fjezzJAOK3UDr7RCsPo2lwzSUhW6oyBUbiidTEqVBX6kfDcTaZY6So6UI3jKeGY7wvWZYgKpDz3vCemvipV8SqiSkae4ko7HfcijbcEhPFaQMAh4EhvZR2TYCC35rrlYEsdQ3DZSTlDKiZHyEYm6ok3KGnFKgy3x_vDd40YPiflrrROIpUOs1PDtZ7-__Zzc3SD-mPTLm0q-gOi1FC8NdH4CoJqZGg
link.rule.ids 315,783,787,27938,27939,60220,61009
linkProvider Library Specific Holdings
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1JTwIxFG4UD3oR14ii9uDNDM6-eDNGAgrEAyTcmm4vMYxgZDjor7dvlgga44HjZNJOp9v7Xvu97xFyJSBBwxxYwBNh-YknrQSVEX1fBDFIFcQOBif3B2Fn5D-Og_FSLAzSKtGHhkIoIt-rcXHjYXRFibtBNyY2hhuZWUkLEbp52iRbIYaNYhiHPfgW3nXzRKVYxMIyVRTPX9Ws2KcV9dJfu3Vugtp1IqvGF8yTSWuRiZb8_KHruN7f7ZHdEqHSu2JK7ZMNPT0g9RKt0nIvmB-SUf-5ez2-pRmfTyw0hoqijnia6rSkCFHTBqqKlPc0nXFFBRIppaYzoPCCaJfqgr5ODYbXryL9OCKj9sPwvmOVSRos6QZeZkVm1dtcx8YtgcgNgwjcSDtKa1DG9QTfVUIq6YAOwRFcxlzZjnD9GGKB0need0xq09lUnxAKOvGAg3Y8bpzGyBbcwCMwgFGYL8Wh3SCtamjYW6HFwZxS4rTqNIadxspOa5BkeQBZlh-CQJGxhHn_lG1Wo83KZT1nBktFeFPv26drVH1JtjvDfo_1uoOnM7KDr5DV5oRNUsveF_rcwJxMXOTz-AvG4-6X
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV07T8MwELagSIiF8hSFAh7YUEreDzYEVC3QqgOVull27JNQQ1vRdIBfjy9xBAUhho5RZMexfXffJZ-_I-RCQIKBObCAJ8LyEy-1ElRG9H0RxJDKIHbwcHKvH3aG_sMoqNiEc0OrxBwaSqGIwlejcc8kVIy4K8xiYh23kZiVtBCg66t1sqGRgI0b27P7X7q7blGnFJtY2KY6xPNXN0vhaUm89JezLiJQu05ENfaSeDJuLXLRSj9-yDqu9HI7ZNvgU3pTbqhdsqYme6RusCo1nmC-T4a9QfdydE1zPh9bGAolRRXxLFOZIQhRPQQqy4L3NJtySQXSKFNFp0DhBbEuVSV5nWoEr15F9n5Ahu3759uOZUo0WKkbeLkVaZu3uYp1UgKRGwYRuJFypFIgdeIJvitFKlMHVAiO4GnMpe0I148hFih853mHpDaZTtQRoaASDzgox-M6ZYxswTU4Ag0XhX5SHNoN0qpWhs1KJQ7mGIHTatIYThozk9Ygyff1Y3nxCQTKeiXM-6dts1psZox6zjSSivA_vW8fr9D1Odkc3LXZU7f_eEK28A5S2pywSWr520KdaoyTi7NiF38CD1_tTQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=MPI%2BX%3A+task-based+parallelisation+and+dynamic+load+balance+of+finite+element+assembly&rft.jtitle=International+journal+of+computational+fluid+dynamics&rft.au=Garcia-Gasulla%2C+Marta&rft.au=Houzeaux%2C+Guillaume&rft.au=Ferrer%2C+Roger&rft.au=Artigues%2C+Antoni&rft.date=2019-03-16&rft.pub=Taylor+%26+Francis+Ltd&rft.issn=1061-8562&rft.eissn=1029-0257&rft.volume=33&rft.issue=3&rft.spage=115&rft_id=info:doi/10.1080%2F10618562.2019.1617856&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1061-8562&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1061-8562&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1061-8562&client=summon