NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Benchmark Comparison of Cloud Analytics Methods Applied to Earth ObservationsEarth Observation data are a vital resource for studying long term changes, but the large data volumes can be challenging to analyze. Time series analysis in particular is hampered by the typical thin-time-slice file organization. We examine several potential solutions inspired in large part by the data-parallel methods that have arisen with cloud computing. These solutions include various combinations of data re-organization, spatial indexing, distributed storage and pre-computation that we term "Analytics Optimized Data Stores" (AODS). We find that even simple solutions (such as a data cube) produce more than an order of magnitude improvement; the best provide two to three orders of magnitude improvement. The most performant solutions have tradeoffs in terms of generality or storage footprint, but may nonetheless be useful components in data analytics frameworks where performance is critical.
Document ID
20180007362
Acquisition Source
Goddard Space Flight Center
Document Type
Book Chapter
Authors
Lynnes, Christopher
(NASA Goddard Space Flight Center Greenbelt, MD, United States)
Little, Michael M.
(NASA Goddard Space Flight Center Greenbelt, MD, United States)
Huang, Thomas
(Jet Propulsion Lab., California Inst. of Tech. Pasadena, CA, United States)
Jacob, Joseph Charles
(Jet Propulsion Lab., California Inst. of Tech. Pasadena, CA, United States)
Yang, Chaowei Phil
(George Mason Univ. Fairfax, VA, United States)
Hegde, Mahabaleshwara
(Adnet Systems, Inc. Greenbelt, MD, United States)
Zhang, Hailiang
(Adnet Systems, Inc. Greenbelt, MD, United States)
Date Acquired
October 30, 2018
Publication Date
October 26, 2018
Subject Category
Computer Systems
Report/Patent Number
GSFC-E-DAA-TN62404
Report Number: GSFC-E-DAA-TN62404
Funding Number(s)
OTHER: 2018-423-ESDIS
CONTRACT_GRANT: NNG12PL17C
CONTRACT_GRANT: NAS7-030010
Distribution Limits
Public
Copyright
Use by or on behalf of the US Gov. Permitted.
Keywords
parallel processing (computers)
time series analysis
distributed processing
No Preview Available