MPI+X: task-based parallelisation and dynamic load balance of finite element assembly
The main computing phases of numerical methods for solving partial differential equations are the algebraic system assembly and the iterative solver. This work focuses on the first task, in the context of a hybrid MPI+X paradigm. The matrix assembly consists of a loop over the elements, faces, edges...
Saved in:
Published in | International journal of computational fluid dynamics Vol. 33; no. 3; pp. 115 - 136 |
---|---|
Main Authors | , , , , , , |
Format | Journal Article |
Language | English |
Published |
Abingdon
Taylor & Francis
16.03.2019
Taylor & Francis Ltd |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | The main computing phases of numerical methods for solving partial differential equations are the algebraic system assembly and the iterative solver. This work focuses on the first task, in the context of a hybrid MPI+X paradigm. The matrix assembly consists of a loop over the elements, faces, edges or nodes of the MPI partitions to compute element matrices and vectors and then of their assemblies. In a MPI+X hybrid parallelism context, X has consisted traditionally of loop parallelism using OpenMP, with different techniques to avoid the race condition, but presenting efficiency or implementation drawbacks. We propose an alternative, based on task parallelism using some extensions to the OpenMP programming model. In addition, dynamic load balance will be applied, especially efficient in the presence of hybrid meshes. This paper presents the proposed methodology, its implementation and its validation through the solution of large computational mechanics problems up to 16k cores. |
---|---|
AbstractList | The main computing phases of numerical methods for solving partial differential equations are the algebraic system assembly and the iterative solver. This work focuses on the first task, in the context of a hybrid MPI+X paradigm. The matrix assembly consists of a loop over the elements, faces, edges or nodes of the MPI partitions to compute element matrices and vectors and then of their assemblies. In a MPI+X hybrid parallelism context, X has consisted traditionally of loop parallelism using OpenMP, with different techniques to avoid the race condition, but presenting efficiency or implementation drawbacks. We propose an alternative, based on task parallelism using some extensions to the OpenMP programming model. In addition, dynamic load balance will be applied, especially efficient in the presence of hybrid meshes. This paper presents the proposed methodology, its implementation and its validation through the solution of large computational mechanics problems up to 16k cores. |
Author | Artigues, Antoni López, Victor Garcia-Gasulla, Marta Houzeaux, Guillaume Ferrer, Roger Vázquez, Mariano Labarta, Jesús |
Author_xml | – sequence: 1 givenname: Marta surname: Garcia-Gasulla fullname: Garcia-Gasulla, Marta email: marta.garcia@bsc.es organization: Barcelona Supercomputing Center – sequence: 2 givenname: Guillaume surname: Houzeaux fullname: Houzeaux, Guillaume email: guillaume.houzeaux@bsc.es organization: Barcelona Supercomputing Center – sequence: 3 givenname: Roger surname: Ferrer fullname: Ferrer, Roger organization: Barcelona Supercomputing Center – sequence: 4 givenname: Antoni surname: Artigues fullname: Artigues, Antoni organization: Barcelona Supercomputing Center – sequence: 5 givenname: Victor surname: López fullname: López, Victor organization: Barcelona Supercomputing Center – sequence: 6 givenname: Jesús surname: Labarta fullname: Labarta, Jesús organization: Barcelona Supercomputing Center – sequence: 7 givenname: Mariano surname: Vázquez fullname: Vázquez, Mariano organization: Barcelona Supercomputing Center |
BookMark | eNp9kE1LxDAQhoOs4OdPEAIepesk_UjrSRG_QNGDC97CtJlA1zRZky6y_94uq1dPMwPP-w48R2zmgyfGzgTMBdRwKaASdVnJuQTRzEUl1HTtsUMBsslAlmq23SuRbaEDdpTSEgAaWYtDtnh5e7r4uOIjps-sxUSGrzCic-T6hGMfPEdvuNl4HPqOu4CGt-jQd8SD5bb3_UicHA3kR44p0dC6zQnbt-gSnf7OY7a4v3u_fcyeXx-ebm-es06W-ZgpKCQg1apQVsmqVFYqEobImrJobCFN25lOWKqsaLGr0YBoZVHbui0bpfL8mJ3velcxfK0pjXoZ1tFPL7WUuZIKoICJKndUF0NKkaxexX7AuNEC9Nag_jOotwb1r8Epd73L9d6GOOB3iM7oETcuRBsnBX3S-f8VPxFqeUk |
CitedBy_id | crossref_primary_10_1016_j_future_2020_01_045 crossref_primary_10_1016_j_compfluid_2022_105723 crossref_primary_10_1080_10618562_2020_1771325 crossref_primary_10_1016_j_compfluid_2022_105577 crossref_primary_10_1016_j_proci_2022_07_222 crossref_primary_10_1080_10618562_2020_1783440 |
Cites_doi | 10.1016/j.compbiomed.2015.12.003 10.1080/10618560701816387 10.1016/S0307-904X(00)00043-3 10.1007/s11831-014-9126-8 10.1002/nme.2989 10.1142/S0129626411000151 10.1016/j.procs.2011.04.229 10.1016/0020-0190(92)90041-S 10.1080/10618562.2016.1227070 10.1016/S0045-7825(98)00005-X 10.1108/09615530310456796 10.1016/j.compfluid.2012.04.017 10.1002/nme.2973 10.1137/S1064827598337373 10.1137/1.9780898718003 10.1016/j.jocs.2015.12.007 10.1016/j.parco.2003.12.016 10.1002/nme.2932 10.1002/9780470989746 10.1016/j.compfluid.2011.01.017 10.1155/2010/273921 10.1016/0045-7825(89)90157-6 10.1201/b21930 10.1016/j.jcp.2009.05.019 |
ContentType | Journal Article |
Copyright | 2019 Informa UK Limited, trading as Taylor & Francis Group 2019 2019 Informa UK Limited, trading as Taylor & Francis Group |
Copyright_xml | – notice: 2019 Informa UK Limited, trading as Taylor & Francis Group 2019 – notice: 2019 Informa UK Limited, trading as Taylor & Francis Group |
DBID | AAYXX CITATION 7QH 7SC 7TB 7U5 7UA 8FD C1K F1W FR3 H8D H96 JQ2 KR7 L.G L7M L~C L~D |
DOI | 10.1080/10618562.2019.1617856 |
DatabaseName | CrossRef Aqualine Computer and Information Systems Abstracts Mechanical & Transportation Engineering Abstracts Solid State and Superconductivity Abstracts Water Resources Abstracts Technology Research Database Environmental Sciences and Pollution Management ASFA: Aquatic Sciences and Fisheries Abstracts Engineering Research Database Aerospace Database Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources ProQuest Computer Science Collection Civil Engineering Abstracts Aquatic Science & Fisheries Abstracts (ASFA) Professional Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
DatabaseTitle | CrossRef Civil Engineering Abstracts Aquatic Science & Fisheries Abstracts (ASFA) Professional Technology Research Database Computer and Information Systems Abstracts – Academic Mechanical & Transportation Engineering Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Aqualine Water Resources Abstracts Environmental Sciences and Pollution Management Computer and Information Systems Abstracts Professional Aerospace Database Aquatic Science & Fisheries Abstracts (ASFA) 2: Ocean Technology, Policy & Non-Living Resources ASFA: Aquatic Sciences and Fisheries Abstracts Solid State and Superconductivity Abstracts Engineering Research Database Advanced Technologies Database with Aerospace |
DatabaseTitleList | Civil Engineering Abstracts |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Applied Sciences |
EISSN | 1029-0257 |
EndPage | 136 |
ExternalDocumentID | 10_1080_10618562_2019_1617856 1617856 |
Genre | Original Articles |
GrantInformation_xml | – fundername: Brazilian Ministry of Science, Technology and Innovation grantid: SEV2015-0493; TIN2015-65316 funderid: 10.13039/501100004837 – fundername: Intel-BSC Exascale Lab collaboration project – fundername: European Union – fundername: Horizon2020 framework grantid: 689772; 676629 – fundername: Spanish Ministry of Science and Innovation funderid: http://dx.doi.org/10.13039/501100004837 – fundername: Comissió Interdepartamental de Recerca i Innovació Tecnológica grantid: 2014-SGR-1051 – fundername: Spanish Government – fundername: Energy oriented Centre of Excellence (EoCoE) – fundername: Generalitat de Catalunya funderid: http://dx.doi.org/10.13039/501100002809 – fundername: European Union's Horizon 2020 Programme |
GroupedDBID | .7F .QJ 0BK 0R~ 29J 30N 4.4 5GY 5VS AAAVI AAENE AAJMT AALDU AAMIU AAPUL AAQRR ABBKH ABCCY ABFIM ABHAV ABJVF ABLIJ ABPEM ABPTK ABQHQ ABTAI ABXUL ACGEJ ACGFS ACGOD ACIWK ACTIO ADCVX ADGTB ADXPE AEGYZ AEISY AENEX AEOZL AEPSL AEYOC AFKVX AFOLD AFRAH AFWLO AGDLA AGMYJ AHDLD AIJEM AIRXU AJWEG AKBVH AKOOK ALMA_UNASSIGNED_HOLDINGS ALQZU AQRUH AVBZW AWYRJ BLEHA CCCUG CE4 CS3 DGEBU DKSSO DU5 EBS EJD E~A E~B F5P FUNRP FVPDL GTTXZ H13 HF~ HZ~ H~P IPNFZ J.P KYCEM LJTGL M4Z NA5 NX~ O9- P2P PQEST PQQKQ RIG RNANH ROSJB RTWRZ S-T SNACF TEN TFL TFT TFW TN5 TNC TTHFI TWF UT5 UU3 V1K ZGOLN ~S~ AAYXX ABJNI ABPAQ ABXYU AHDZW CITATION TBQAZ TDBHL TUROJ 7QH 7SC 7TB 7U5 7UA 8FD C1K F1W FR3 H8D H96 JQ2 KR7 L.G L7M L~C L~D |
ID | FETCH-LOGICAL-c253t-70420ae8747f72657f27e1deefd549f42dbcdc1fe6f1bac8ad01b248f8b597733 |
ISSN | 1061-8562 |
IngestDate | Thu Oct 10 20:48:43 EDT 2024 Fri Aug 23 02:39:49 EDT 2024 Tue Jun 13 19:51:36 EDT 2023 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 3 |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-c253t-70420ae8747f72657f27e1deefd549f42dbcdc1fe6f1bac8ad01b248f8b597733 |
PQID | 2237270040 |
PQPubID | 52945 |
PageCount | 22 |
ParticipantIDs | crossref_primary_10_1080_10618562_2019_1617856 informaworld_taylorfrancis_310_1080_10618562_2019_1617856 proquest_journals_2237270040 |
PublicationCentury | 2000 |
PublicationDate | 2019-03-16 |
PublicationDateYYYYMMDD | 2019-03-16 |
PublicationDate_xml | – month: 03 year: 2019 text: 2019-03-16 day: 16 |
PublicationDecade | 2010 |
PublicationPlace | Abingdon |
PublicationPlace_xml | – name: Abingdon |
PublicationTitle | International journal of computational fluid dynamics |
PublicationYear | 2019 |
Publisher | Taylor & Francis Taylor & Francis Ltd |
Publisher_xml | – name: Taylor & Francis – name: Taylor & Francis Ltd |
References | CIT0030 CIT0010 CIT0031 CIT0034 CIT0011 CIT0014 CIT0036 CIT0013 CIT0016 CIT0038 CIT0015 CIT0017 CIT0019 CIT0001 CIT0023 CIT0022 Belytschko T. (CIT0005) 2014 CIT0025 CIT0024 CIT0004 CIT0026 CIT0007 CIT0029 CIT0009 CIT0008 |
References_xml | – ident: CIT0007 doi: 10.1016/j.compbiomed.2015.12.003 – ident: CIT0016 doi: 10.1080/10618560701816387 – ident: CIT0004 doi: 10.1016/S0307-904X(00)00043-3 – ident: CIT0008 doi: 10.1007/s11831-014-9126-8 – volume-title: Nonlinear Finite Elements for Continua and Structures year: 2014 ident: CIT0005 contributor: fullname: Belytschko T. – ident: CIT0009 doi: 10.1002/nme.2989 – ident: CIT0010 doi: 10.1142/S0129626411000151 – ident: CIT0026 doi: 10.1016/j.procs.2011.04.229 – ident: CIT0025 doi: 10.1016/0020-0190(92)90041-S – ident: CIT0015 doi: 10.1080/10618562.2016.1227070 – ident: CIT0022 doi: 10.1016/S0045-7825(98)00005-X – ident: CIT0030 doi: 10.1108/09615530310456796 – ident: CIT0014 doi: 10.1016/j.compfluid.2012.04.017 – ident: CIT0001 doi: 10.1002/nme.2973 – ident: CIT0036 doi: 10.1137/S1064827598337373 – ident: CIT0029 doi: 10.1137/1.9780898718003 – ident: CIT0034 doi: 10.1016/j.jocs.2015.12.007 – ident: CIT0019 doi: 10.1016/j.parco.2003.12.016 – ident: CIT0024 doi: 10.1002/nme.2932 – ident: CIT0023 doi: 10.1002/9780470989746 – ident: CIT0013 doi: 10.1016/j.compfluid.2011.01.017 – ident: CIT0038 doi: 10.1155/2010/273921 – ident: CIT0011 doi: 10.1016/0045-7825(89)90157-6 – ident: CIT0031 doi: 10.1201/b21930 – ident: CIT0017 doi: 10.1016/j.jcp.2009.05.019 |
SSID | ssj0009281 |
Score | 2.2760353 |
Snippet | The main computing phases of numerical methods for solving partial differential equations are the algebraic system assembly and the iterative solver. This work... |
SourceID | proquest crossref informaworld |
SourceType | Aggregation Database Publisher |
StartPage | 115 |
SubjectTerms | Assembly CFD Computer applications Differential equations distributed memory parallelism dynamic load balance Dynamic loads finite element Finite element method hybrid parallelism Iterative methods Load balancing Mathematical analysis Mathematical models Matrix algebra Matrix methods Mechanics MPI MPI+X Numerical methods OpenMP Partial differential equations shared-memory parallelism Vectors Vectors (mathematics) |
Title | MPI+X: task-based parallelisation and dynamic load balance of finite element assembly |
URI | https://www.tandfonline.com/doi/abs/10.1080/10618562.2019.1617856 https://www.proquest.com/docview/2237270040 |
Volume | 33 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9NAEF6F9MIFyksUCtoDnCIHP-N1bwGaBNQWhBwRcbF27d0qqhujxr70B_E7mfGuHYdEvC5WtFbW0cyXeaxnviHklVAROubAUjwSlh95qRUhM6Lvi4CpNAuYg83J5xej2dz_uAgWvd6PTtVSVYpheru3r-R_tAproFfskv0HzbabwgJ8Bv3CFTQM17_S8fnnD6_dt4u6NoOvryx0SdkA2bzzXOamUKd-PZDpwfODvOAQcWI5Y1ofFqglxpwDqYvIBxBJy2uRb73q3T4z7DBNpPVAiOaGyqtl-5w2UJ_iqCJuTfkaUl1umoPK1hXMiupW8qp2gNMKByBV1y3SJvLGzNr-UlxuiojHIIfLSlu3MU5AXnZPLrBZyrN0Y2WNtXhniEinkgltMYQaFguMsZZmzYVtXM1p3RhwzaRhgOp1rLGjO0V3vIQuq8T9cXus74uGmOexYA8r98WnZDI_O0vi00V8hxy4YNBYnxyMZ--_fd3QO7v1ONz2Jze9Ysx-s_cxW1HQFkfuTkxQBzrxIblnMhQ61nB7QHpy9ZDcN9kKNb5g_YjMAX2DxQndII_-gjwKyKMGERSRRw3yaKGoRh41yKMN8h6T-eQ0fjezzJAOK3UDr7RCsPo2lwzSUhW6oyBUbiidTEqVBX6kfDcTaZY6So6UI3jKeGY7wvWZYgKpDz3vCemvipV8SqiSkae4ko7HfcijbcEhPFaQMAh4EhvZR2TYCC35rrlYEsdQ3DZSTlDKiZHyEYm6ok3KGnFKgy3x_vDd40YPiflrrROIpUOs1PDtZ7-__Zzc3SD-mPTLm0q-gOi1FC8NdH4CoJqZGg |
link.rule.ids | 315,783,787,27938,27939,60220,61009 |
linkProvider | Library Specific Holdings |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1JTwIxFG4UD3oR14ii9uDNDM6-eDNGAgrEAyTcmm4vMYxgZDjor7dvlgga44HjZNJOp9v7Xvu97xFyJSBBwxxYwBNh-YknrQSVEX1fBDFIFcQOBif3B2Fn5D-Og_FSLAzSKtGHhkIoIt-rcXHjYXRFibtBNyY2hhuZWUkLEbp52iRbIYaNYhiHPfgW3nXzRKVYxMIyVRTPX9Ws2KcV9dJfu3Vugtp1IqvGF8yTSWuRiZb8_KHruN7f7ZHdEqHSu2JK7ZMNPT0g9RKt0nIvmB-SUf-5ez2-pRmfTyw0hoqijnia6rSkCFHTBqqKlPc0nXFFBRIppaYzoPCCaJfqgr5ODYbXryL9OCKj9sPwvmOVSRos6QZeZkVm1dtcx8YtgcgNgwjcSDtKa1DG9QTfVUIq6YAOwRFcxlzZjnD9GGKB0need0xq09lUnxAKOvGAg3Y8bpzGyBbcwCMwgFGYL8Wh3SCtamjYW6HFwZxS4rTqNIadxspOa5BkeQBZlh-CQJGxhHn_lG1Wo83KZT1nBktFeFPv26drVH1JtjvDfo_1uoOnM7KDr5DV5oRNUsveF_rcwJxMXOTz-AvG4-6X |
linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV07T8MwELagSIiF8hSFAh7YUEreDzYEVC3QqgOVull27JNQQ1vRdIBfjy9xBAUhho5RZMexfXffJZ-_I-RCQIKBObCAJ8LyEy-1ElRG9H0RxJDKIHbwcHKvH3aG_sMoqNiEc0OrxBwaSqGIwlejcc8kVIy4K8xiYh23kZiVtBCg66t1sqGRgI0b27P7X7q7blGnFJtY2KY6xPNXN0vhaUm89JezLiJQu05ENfaSeDJuLXLRSj9-yDqu9HI7ZNvgU3pTbqhdsqYme6RusCo1nmC-T4a9QfdydE1zPh9bGAolRRXxLFOZIQhRPQQqy4L3NJtySQXSKFNFp0DhBbEuVSV5nWoEr15F9n5Ahu3759uOZUo0WKkbeLkVaZu3uYp1UgKRGwYRuJFypFIgdeIJvitFKlMHVAiO4GnMpe0I148hFih853mHpDaZTtQRoaASDzgox-M6ZYxswTU4Ag0XhX5SHNoN0qpWhs1KJQ7mGIHTatIYThozk9Ygyff1Y3nxCQTKeiXM-6dts1psZox6zjSSivA_vW8fr9D1Odkc3LXZU7f_eEK28A5S2pywSWr520KdaoyTi7NiF38CD1_tTQ |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=MPI%2BX%3A+task-based+parallelisation+and+dynamic+load+balance+of+finite+element+assembly&rft.jtitle=International+journal+of+computational+fluid+dynamics&rft.au=Garcia-Gasulla%2C+Marta&rft.au=Houzeaux%2C+Guillaume&rft.au=Ferrer%2C+Roger&rft.au=Artigues%2C+Antoni&rft.date=2019-03-16&rft.pub=Taylor+%26+Francis+Ltd&rft.issn=1061-8562&rft.eissn=1029-0257&rft.volume=33&rft.issue=3&rft.spage=115&rft_id=info:doi/10.1080%2F10618562.2019.1617856&rft.externalDBID=NO_FULL_TEXT |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1061-8562&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1061-8562&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1061-8562&client=summon |