NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Interpretable Machine Learning for Molecular Biosignatures: a Novel Single-Sample Feature Importance Method That Is Sensitive To Statistical InteractionsIsotope ratio mass spectrometry (IRMS) of volatiles (e.g., CO2) promises to be a powerful tool for potential biosignature detection for future missions to ocean worlds (OW) such as Europa and Enceladus. Machine learning (ML) methods for IRMS data could enable science autonomy by onboard prediction of seawater chemistry and biosignature presence. However, ML models are likely to be complex and involve statistical interactions between features (variables), which can make predictions seem opaque and enigmatic. For ML predictions as significant as extraterrestrial biosignatures, we must place extraordinary confidence in models. It is therefore essential that these models make interpretable predictions (i.e., human-understandable) and include false-prediction diagnostics. We achieve high accuracy and interpretability in ML biosignature and seawater chemistry models for OW through a nearest-neighbors feature selection tool that detects statistical interactions between predictors, constructs interaction networks for visualization of selected features working together to make a prediction, and reports single-sample feature importance scores for false-detection diagnostics. Here we develop a novel single-sample nearest-neighbors projected distance regression(ssNPDR) feature selection method that improves upon existing single-sample algorithms through the inclusion of statistical interactions while providing false-prediction diagnostics for ML models.
Document ID
20240005163
Acquisition Source
Goddard Space Flight Center
Document Type
Other - Online poster content PDF generated by conference iposter website
Authors
Lily A. Clough
(University of Tulsa Tulsa, Oklahoma, United States)
Victoria Da Poian
(Microtel LLC Washington D.C., District of Columbia, United States)
Bethany P. Theiling
(Goddard Space Flight Center Greenbelt, Maryland, United States)
Brett A. McKinney
(University of Tulsa Tulsa, Oklahoma, United States)
Date Acquired
April 25, 2024
Publication Date
May 7, 2024
Publication Information
Subject Category
Exobiology
Geosciences (General)
Meeting Information
Meeting: Astrobiology Science Conference (AbSciCon) 2024
Location: Providence, RI
Country: US
Start Date: May 7, 2024
End Date: May 10, 2024
Sponsors: American Geophysical Union
Funding Number(s)
WBS: 811073.02.52.01.04.34
WBS: 981698.01.02.51.05.10.26
CONTRACT_GRANT: 80GSFC22C020
Distribution Limits
Public
Copyright
Use by or on behalf of the US Gov. Permitted.
Keywords
geochemistry
ocean worlds
astrobiology
machine learning
No Preview Available