NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Data Quality Challenges for Analysis Ready Data (ARD)Data quality plays a critical role in research and applications. The Earth Science Information Partners (ESIP) Information Quality Cluster (IQC) defines four aspects of information quality: Science, Product, Stewardship, and Services. The ESIP IQC has become internationally recognized as an authoritative and responsive resource of information and guidance to data producers and distributors on how to implement data quality standards and best practices for their science data systems, datasets, and data/metadata dissemination services.

In recent years, cloud computing environments have provided scale-up capabilities such as data archives and services, enabling interdisciplinary science and applications. More value-added products are expected from data service providers, including Analysis Ready Data (ARD). ARD refers to data that has been preprocessed into a form that allows immediate analysis by the end user, processed to a minimum set of requirements and provides interoperability over time and across multiple datasets. Once a dataset has been developed from its original form to produce ARD, what quality characteristics should the derived dataset or ARD possess? Also, is it safe to assume that the quality of the ARD is consistent with the quality of the source data, or are there special attributes to an ARD that would warrant a secondary, independent quality assessment? What provenance (also called “data lineage”) information needs to be included in ARD? It is important to answer these questions, especially given the ease of use of ARD, and the consequent temptation by users to trust ARD without understanding the limitations or possible variations in quality compared to the source data. In this presentation, we will discuss data quality challenges for ARD products and services and introduce IQC for participation.
Document ID
20240001391
Acquisition Source
Goddard Space Flight Center
Document Type
Poster
Authors
Zhong Liu
(George Mason University Fairfax, Virginia, United States)
Robert Downs
(Columbia University New York, United States)
Ge Peng ORCID
(University of Alabama in Huntsville Huntsville, United States)
David F Moroni
(Jet Propulsion Laboratory La Cañada Flintridge, United States)
Hampapuram Ramapriyan
(Science Systems & Applications, Inc. Hampton, VA, USA)
Yaxing Wei
(Oak Ridge National Laboratory Oak Ridge, United States)
Chung-lin Shie
(University of Maryland, Baltimore County Baltimore, Maryland, United States)
Date Acquired
January 31, 2024
Subject Category
Computer Programming and Software
Meeting Information
Meeting: 23rd Meeting of the American Geophysical Union (AGU)
Location: San Francisco, CA
Country: US
Start Date: December 11, 2023
End Date: December 15, 2023
Sponsors: American Geophysical Union
Funding Number(s)
CONTRACT_GRANT: 80NSSC21M0236
CONTRACT_GRANT: 80GSFC23CA001
CONTRACT_GRANT: 80NM0018D0004
Distribution Limits
Public
Copyright
Use by or on behalf of the US Gov. Permitted.
Technical Review
Single Expert
Keywords
data quality
analysis ready data
No Preview Available