NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
The OpenMP Implementation of NAS Parallel Benchmarks and its PerformanceAs the new ccNUMA architecture became popular in recent years, parallel programming with compiler directives on these machines has evolved to accommodate new needs. In this study, we examine the effectiveness of OpenMP directives for parallelizing the NAS Parallel Benchmarks. Implementation details will be discussed and performance will be compared with the MPI implementation. We have demonstrated that OpenMP can achieve very good results for parallelization on a shared memory system, but effective use of memory and cache is very important.
Document ID
20000102377
Acquisition Source
Ames Research Center
Document Type
Preprint (Draft being sent to journal)
Authors
Jin, Hao-Qiang
(MRJ Technology Solutions Moffett Field, CA United States)
Frumkin, Michael
(MRJ Technology Solutions Moffett Field, CA United States)
Yan, Jerry
(MRJ Technology Solutions Moffett Field, CA United States)
Date Acquired
September 7, 2013
Publication Date
January 1, 1999
Subject Category
Computer Systems
Funding Number(s)
PROJECT: RTOP 509-10-31
CONTRACT_GRANT: NAS2-14303
Distribution Limits
Public
Copyright
Work of the US Gov. Public Use Permitted.
No Preview Available