An implementation of direct linear equation solver using a many-core CPU for mechanical dynamic analysis
Published in | Journal of mechanical science and technology Vol. 31; no. 10; pp. 4637 - 4645 |
---|---|
Main Authors | Jung, Ji-Hyun; Bae, Dae-Sung |
Format | Journal Article |
Language | English |
Published | Seoul; Korean Society of Steel Construction; 01.10.2017; Springer Nature B.V; 대한기계학회 (Korean Society of Mechanical Engineers) |
Subjects | |
ISSN | 1738-494X 1976-3824 |
DOI | 10.1007/s12206-017-0910-x |
Abstract | This research proposes an effective implementation of a linear equation solver for implicit integration on a many-core CPU. Although the implementation is applied here to flexible body simulation in mechanical dynamics, it could also be used in a wide range of other fields. BFS-based nested dissection and its numerical factorization enable adaptive control of the operational range and more aggressive parallelization than traditional DFS-based nested dissection. Parallel efficiency improves when separators of various sizes are divided into blocks no larger than a certain size, and the study presents an experiment to identify the optimal maximum block size. Sparse matrices from mechanical dynamics software are numerically factorized, and the timing results show that CACHE memory mode performs better than FLAT mode. The experiment also suggests splitting the operational region in accordance with the MCDRAM size. The proposed solver performs comparably to the DSS included in MKL and is approximately 8 to 14 times faster than CHOLMOD, a part of SuiteSparse. |
---|---|
KCI Citation Count | 0 |
Author | Bae, Dae-Sung; Jung, Ji-Hyun |
Author_xml | 1. Jung, Ji-Hyun (Department of Mechanical Engineering, Hanyang University); 2. Bae, Dae-Sung (Department of Mechanical Engineering, Hanyang University; dsbae@hanyang.ac.kr) |
BackLink | https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART002267066$$DAccess content in National Research Foundation of Korea (NRF) |
ContentType | Journal Article |
Copyright | The Korean Society of Mechanical Engineers and Springer-Verlag GmbH Germany 2017 |
Discipline | Engineering |
EISSN | 1976-3824 |
EndPage | 4645 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 10 |
Keywords | Linear equation solver; Nested dissection; Mechanical dynamics; Implicit integration; Knights Landing |
PageCount | 9 |
PublicationDateYYYYMMDD | 2017-10-01 |
PublicationPlace | Seoul |
PublicationTitle | Journal of mechanical science and technology |
PublicationTitleAbbrev | J Mech Sci Technol |
PublicationYear | 2017 |
StartPage | 4637 |
SubjectTerms | Adaptive control; Central processing units; Computer memory; Computer simulation; Control; CPUs; Dissection; Dynamical Systems; Engineering; Flexible bodies; Industrial and Production Engineering; Linear equations; Mechanical Engineering; Separators; Sparse matrices; Vibration; 기계공학 (Mechanical engineering) |
URI | https://link.springer.com/article/10.1007/s12206-017-0910-x https://www.proquest.com/docview/2002033572 https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART002267066 |
Volume | 31 |
ispartofPNX | Journal of Mechanical Science and Technology, 2017, 31(10), , pp.4637-4645 |
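
The abstract mentions two ideas without giving the algorithm: processing nested-dissection separators in a BFS (level-by-level) manner, and dividing separators into blocks no larger than a maximum size. The following is a minimal sketch of one possible reading of those two ideas, not the authors' implementation. The tree layout, the names `split_into_blocks` and `levels_bottom_up`, the toy index sets, and the block size of 4 are all hypothetical and chosen only for illustration.

```python
def split_into_blocks(separator, max_block):
    """Split one separator's index list into consecutive blocks
    holding at most max_block indices each."""
    if max_block < 1:
        raise ValueError("max_block must be positive")
    return [separator[i:i + max_block]
            for i in range(0, len(separator), max_block)]


def levels_bottom_up(tree, root):
    """Group separator-tree nodes by depth with a breadth-first sweep
    and return the levels deepest first.

    tree maps a node id to the list of its child ids (assumed layout).
    Nodes on the same level do not depend on each other, so they are
    candidates for parallel factorization.
    """
    levels = []
    frontier = [root]
    while frontier:
        levels.append(frontier)
        frontier = [child for node in frontier
                    for child in tree.get(node, [])]
    return levels[::-1]


# Toy separator tree: node 0 is the top-level separator,
# nodes 3..6 are leaf subdomains (hypothetical data).
tree = {0: [1, 2], 1: [3, 4], 2: [5, 6]}
separators = {
    0: list(range(100, 110)),
    1: [50, 51, 52],
    2: [60, 61, 62, 63, 64],
    3: [0, 1], 4: [2, 3], 5: [4, 5], 6: [6, 7],
}

# One work list per level; each entry is (node id, block of matrix indices).
schedule = [
    [(node, block)
     for node in level
     for block in split_into_blocks(separators[node], max_block=4)]
    for level in levels_bottom_up(tree, root=0)
]

for depth, work in enumerate(schedule):
    print(f"step {depth}: {len(work)} independent block(s)")
```

In this sketch a smaller `max_block` yields more, finer-grained work items per level, which is the kind of trade-off the block-size experiment described in the abstract would explore; how blocks are actually factorized and scheduled in the paper is not stated in this record.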