NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Interpretable Machine Learning Models for Autonomous Characterization of Analogue Ocean World Seawater Chemistry and Biosignature Potential Using Isotope Ratio DataBackground: Future missions to ocean worlds, such as Enceladus and Europa, will attempt to characterize the subsurface seawater chemistry and assess the potential for life. Such missions will be equipped with capabilities to precisely measure volatile isotopes in plumes, atmospheres, and exospheres. Motivation: While large isotopic fractionations can indicate a biological source, there are signatures resulting from abiotic geochemical processes that mimic isotopic biosignatures. While machine learning (ML) has the potential to disentangle competing effects and biotic mimicry, high-dimensional isotope ratio mass spectrometry (IRMS) data is likely to contain noise/irrelevant features and involve complex statistical interactions that make human inference and interpretation difficult. Further, ML predictions with as far-reaching implications as an extraterrestrial biosignature on an ocean world requires the use of interpretable models (i.e., not “black box” models) with physically and mathematically meaningful feature spaces along with false positive diagnostics. Methods: We use volatile CO2 IRMS data of analogue ocean world seawaters to validate an ML approach to provide biogeochemical context for biosignature detection. We employ a feature selection method called nearest-neighbor projected distance regression (NPDR) that detects statistical interactions and helps elucidate the mechanisms of the Random Forest classification models. Results: We train and validate predictive ML models on volatile CO2 IRMS data of analogue ocean world seawaters to predict major salt components (e.g., MgSO4, NaHCO3), pH, ionic strength, and the presence of biosignatures. Features derived from IRMS measurements are augmented with extracted time-series features. Our results show high test accuracy and interpretability, which is increased by interaction network visualization, sample-wise variable importance scores, and single-sample class probability estimates. We demonstrate an ML mission software solution that triggers autonomous data transmission and biogeochemical sample prediction.
Document ID
20230017897
Acquisition Source
Goddard Space Flight Center
Document Type
Poster
Authors
Lily A. Clough
(University of Tulsa Tulsa, Oklahoma, United States)
Brett A. McKinney
(University of Tulsa Tulsa, Oklahoma, United States)
Bethany P. Theiling
(Goddard Space Flight Center Greenbelt, Maryland, United States)
Victoria Da Poian
(Microtel LLC Washington D.C., District of Columbia, United States)
Jonathan D. Major
(University of South Florida Tampa, Florida, United States)
Lauren M. Seyler
(Stockton University Galloway, United States)
Date Acquired
December 7, 2023
Subject Category
Exobiology
Geosciences (General)
Meeting Information
Meeting: 23rd Meeting of the American Geophysical Union (AGU)
Location: San Francisco, CA
Country: US
Start Date: December 11, 2023
End Date: December 15, 2023
Sponsors: American Geophysical Union
Funding Number(s)
WBS: 811073.02.52.01.04.34
WBS: 981698.01.02.51.05.10.26
CONTRACT_GRANT: 80GSFC22C020
Distribution Limits
Public
Copyright
Portions of document may include copyright protected material.
Keywords
geochemistry
ocean worlds
astrobiology
machine learning
No Preview Available