NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Scalability of Parallel Spatial Direct Numerical Simulations on Intel Hypercube and IBM SP1 and SP2The implementation and performance of a parallel spatial direct numerical simulation (PSDNS) approach on the Intel iPSC/860 hypercube and IBM SP1 and SP2 parallel computers is documented. Spatially evolving disturbances associated with the laminar-to-turbulent transition in boundary-layer flows are computed with the PSDNS code. The feasibility of using the PSDNS to perform transition studies on these computers is examined. The results indicate that PSDNS approach can effectively be parallelized on a distributed-memory parallel machine by remapping the distributed data structure during the course of the calculation. Scalability information is provided to estimate computational costs to match the actual costs relative to changes in the number of grid points. By increasing the number of processors, slower than linear speedups are achieved with optimized (machine-dependent library) routines. This slower than linear speedup results because the computational cost is dominated by FFT routine, which yields less than ideal speedups. By using appropriate compile options and optimized library routines on the SP1, the serial code achieves 52-56 M ops on a single node of the SP1 (45 percent of theoretical peak performance). The actual performance of the PSDNS code on the SP1 is evaluated with a "real world" simulation that consists of 1.7 million grid points. One time step of this simulation is calculated on eight nodes of the SP1 in the same time as required by a Cray Y/MP supercomputer. For the same simulation, 32-nodes of the SP1 and SP2 are required to reach the performance of a Cray C-90. A 32 node SP1 (SP2) configuration is 2.9 (4.6) times faster than a Cray Y/MP for this simulation, while the hypercube is roughly 2 times slower than the Y/MP for this application. KEY WORDS: Spatial direct numerical simulations; incompressible viscous flows; spectral methods; finite differences; parallel computing.
Document ID
20040111318
Acquisition Source
Langley Research Center
Document Type
Other
Authors
Joslin, Ronald D.
(NASA Langley Research Center Hampton, VA, United States)
Hanebutte, Ulf R.
(Institute for Computer Applications in Science and Engineering Hampton, VA, United States)
Zubair, Mohammad
(Institute for Computer Applications in Science and Engineering Hampton, VA, United States)
Date Acquired
September 7, 2013
Publication Date
January 1, 1995
Subject Category
Computer Operations And Hardware
Funding Number(s)
CONTRACT_GRANT: NAS1-19480
Distribution Limits
Public
Copyright
Public Use Permitted.
No Preview Available