Scaling of a Fast Fourier Transform and a pseudo-spectral fluid solver up to 196608 cores

Bibliographic Details
Published in Journal of Parallel and Distributed Computing, Vol. 113, pp. 77-91
Main Authors Chatterjee, Anando G.; Verma, Mahendra K.; Kumar, Abhishek; Samtaney, Ravi; Hadri, Bilel; Khurram, Rooh
Format Journal Article
Language English
Published Elsevier Inc 01.03.2018

Abstract In this paper we present scaling results of an FFT library, FFTK, and a pseudospectral code, Tarang, on grid resolutions up to $8192^3$ using 65536 cores of Blue Gene/P and 196608 cores of Cray XC40 supercomputers. We observe that communication dominates computation, more so on the Cray XC40. The computation time scales as $T_{\mathrm{comp}} \sim p^{-1}$, and the communication time as $T_{\mathrm{comm}} \sim n^{-\gamma_2}$, with $\gamma_2$ ranging from 0.7 to 0.9 for Blue Gene/P and from 0.43 to 0.73 for Cray XC40. FFTK and the fluid and convection solvers of Tarang exhibit weak as well as strong scaling nearly up to 196608 cores of Cray XC40. We perform a comparative study of the performance on the Blue Gene/P and Cray XC40 clusters.
• Scalability of Fast Fourier Transform on a maximum of 196608 cores.
• Implementation and scaling of a scalable pseudo-spectral code, Tarang.
• Comparative performance study of FFT and Tarang on Blue Gene/P and Cray XC40.
• Deduction of a Kolmogorov-like spectrum for Rayleigh–Bénard convection using simulation.
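The power-law scalings quoted in the abstract can be illustrated with a small sketch of how such an exponent is typically estimated: a straight-line least-squares fit in log-log space. This is not the authors' analysis code. The function name `fit_power_law`, the prefactor, and the timings below are hypothetical, and the data are synthetic values generated from an assumed exponent, not measurements from the paper.

```python
import math


def fit_power_law(xs, ts):
    """Fit t = c * x**(-gamma) by least squares on (log x, log t).

    Returns (gamma, c). Here x stands for whichever count the scaling
    is expressed in (p or n in the abstract).
    """
    lx = [math.log(x) for x in xs]
    lt = [math.log(t) for t in ts]
    count = len(lx)
    mean_x = sum(lx) / count
    mean_t = sum(lt) / count
    sxx = sum((a - mean_x) ** 2 for a in lx)
    sxt = sum((a - mean_x) * (b - mean_t) for a, b in zip(lx, lt))
    slope = sxt / sxx                      # slope of log t versus log x
    intercept = mean_t - slope * mean_x
    return -slope, math.exp(intercept)     # gamma is minus the slope


# Synthetic, illustrative timings only (NOT data from the paper):
# generated from an assumed exponent of 0.7 and an arbitrary prefactor.
counts = [256, 512, 1024, 2048, 4096]
assumed_gamma = 0.7
t_comm = [100.0 * x ** (-assumed_gamma) for x in counts]

gamma, prefactor = fit_power_law(counts, t_comm)
print(f"fitted exponent = {gamma:.3f}, prefactor = {prefactor:.1f}")
```

With real timings the points scatter around the line, so the fitted exponent carries an uncertainty. An exponent of 1 would correspond to ideal scaling of that cost, and smaller values mean the cost falls more slowly as resources are added.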
Author_xml – sequence: 1
  givenname: Anando G.
  surname: Chatterjee
  fullname: Chatterjee, Anando G.
  email: anandogc@iitk.ac.in
  organization: Department of Physics, Indian Institute of Technology Kanpur, Kanpur 208016, India
– sequence: 2
  givenname: Mahendra K.
  surname: Verma
  fullname: Verma, Mahendra K.
  email: mkv@iitk.ac.in
  organization: Department of Physics, Indian Institute of Technology Kanpur, Kanpur 208016, India
– sequence: 3
  givenname: Abhishek
  orcidid: 0000-0002-6026-5727
  surname: Kumar
  fullname: Kumar, Abhishek
  email: abhkr@iitk.ac.in
  organization: Department of Physics, Indian Institute of Technology Kanpur, Kanpur 208016, India
– sequence: 4
  givenname: Ravi
  surname: Samtaney
  fullname: Samtaney, Ravi
  email: ravi.samtaney@kaust.edu.sa
  organization: Mechanical Engineering, Division of Physical Science and Engineering, King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia
– sequence: 5
  givenname: Bilel
  surname: Hadri
  fullname: Hadri, Bilel
  email: bilel.hadri@kaust.edu.sa
  organization: KAUST Supercomputing Laboratory, King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia
– sequence: 6
  givenname: Rooh
  surname: Khurram
  fullname: Khurram, Rooh
  email: rooh.khurram@kaust.edu.sa
  organization: KAUST Supercomputing Laboratory, King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia
ContentType Journal Article
Copyright 2017 Elsevier Inc.
DOI 10.1016/j.jpdc.2017.10.014
Discipline Computer Science
EISSN 1096-0848
EndPage 91
ISSN 0743-7315
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Keywords Extreme-resolution turbulence simulation
Pseudospectral method
Turbulence simulation
Fast Fourier transform
Language English
ORCID 0000-0002-6026-5727
OpenAccessLink https://www.sciencedirect.com/science/article/pii/S0743731517302903
PageCount 15
PublicationCentury 2000
PublicationDate March 2018
2018-03-00
PublicationDateYYYYMMDD 2018-03-01
PublicationDate_xml – month: 03
  year: 2018
  text: March 2018
PublicationDecade 2010
PublicationTitle Journal of parallel and distributed computing
PublicationYear 2018
Publisher Elsevier Inc
Publisher_xml – name: Elsevier Inc
StartPage 77
SubjectTerms Extreme-resolution turbulence simulation
Fast Fourier transform
Pseudospectral method
Turbulence simulation
Title Scaling of a Fast Fourier Transform and a pseudo-spectral fluid solver up to 196608 cores
URI https://dx.doi.org/10.1016/j.jpdc.2017.10.014
Volume 113