NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Preliminary Evaluation of MapReduce for High-Performance Climate Data AnalysisMapReduce is an approach to high-performance analytics that may be useful to data intensive problems in climate research. It offers an analysis paradigm that uses clusters of computers and combines distributed storage of large data sets with parallel computation. We are particularly interested in the potential of MapReduce to speed up basic operations common to a wide range of analyses. In order to evaluate this potential, we are prototyping a series of canonical MapReduce operations over a test suite of observational and climate simulation datasets. Our initial focus has been on averaging operations over arbitrary spatial and temporal extents within Modern Era Retrospective- Analysis for Research and Applications (MERRA) data. Preliminary results suggest this approach can improve efficiencies within data intensive analytic workflows.
Document ID
20120009187
Acquisition Source
Goddard Space Flight Center
Document Type
Conference Paper
Authors
Duffy, Daniel Q.
(NASA Goddard Space Flight Center Greenbelt, MD, United States)
Schnase, John L.
(NASA Goddard Space Flight Center Greenbelt, MD, United States)
Thompson, John H.
(NASA Goddard Space Flight Center Greenbelt, MD, United States)
Freeman, Shawn M.
(Northrop Grumman Corp. United States)
Clune, Thomas L.
(NASA Goddard Space Flight Center Greenbelt, MD, United States)
Date Acquired
August 25, 2013
Publication Date
April 16, 2012
Subject Category
Computer Systems
Report/Patent Number
GSFC.CP.6024.2012
Meeting Information
Meeting: 28th IEEE Conference on Massive Data Storage (MSST 2012)
Location: Pacific Grove, CA
Country: United States
Start Date: April 16, 2012
End Date: April 20, 2012
Sponsors: Institute of Electrical and Electronics Engineers
Distribution Limits
Public
Copyright
Public Use Permitted.
No Preview Available