NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Developing Open-Source Training Materials for AI/ML and Space Biological Sciences Using NASA Cloud-Based DataArtificial Intelligence (AI) and Machine Learning (ML) has gained significant traction in the biological and biomedical research fields in the last two decades, in part thanks to an increasing culture of open data sharing and reuse. Due to its capability for identifying complex relationships and patterns, AI/ML methodology is particularly well suited to recognize and predict biological patterns from high-dimensional next-generation sequencing data (e.g. whole genome sequencing, transcriptomic sequencing), as well as from biological or medical imaging data (e.g. microscopy, computed tomography, ultrasound, magnetic resonance imaging, radiography). These methodologies hold particular promise for space biosciences research and automated space health monitoring systems.
However, there are many key considerations for properly training, validating, and testing a machine learning model in biological research or clinical application. Even with the positive culture of Open Science and data sharing, inexperienced researchers working quickly without proper checks can produce models that perform poorly outside of the immediate training dataset. Lessons learned from biological AI/ML research indicate that Open Science principles such as data sharing and open-source code must go hand-in-hand with publicly available, high-quality training curricula in best practices, with modules centered on real-life scientific use cases and data so future AI/ML practitioners gain experience on real problems.
Here we present the development of open-source training materials for AI/ML and space biosciences, as part of the NASA Transform to Open Science Training (TOPST) initiative. We develop 4 independent training programs, focused on the following topics: 1) Fundamentals of Machine Learning and Space Biosciences Domain, 2) Open Science, Artificial Intelligence, and Ethical Best Practices for Data Sharing and Analysis, 3) Using AI/ML Classification to Identify Gene Networks Affected By Space Exposure in Mouse Liver, and 4) Using Neural Networks to Find DNA Damage Patterns in Immune Cells after Radiation. All programs leverage cloud-based NASA biological datasets. The curriculum we present will enable worldwide access to training in AI/ML and scientific analysis.
Document ID
20230010666
Acquisition Source
Ames Research Center
Document Type
Presentation
Authors
James Andrew Casaletto
(Blue Marble Space Seattle, Washington, United States)
Lauren Marie Sanders
(Blue Marble Space Seattle, Washington, United States)
Sylvain V Costes
(Ames Research Center Mountain View, California, United States)
Ryan Thomas Scott
(Wyle (United States) El Segundo, California, United States)
Parag Avinash Vaishampayan
(Ames Research Center Mountain View, California, United States)
Date Acquired
July 20, 2023
Subject Category
Life Sciences (General)
Meeting Information
Meeting: AGU Fall Meeting 2023
Location: San Francisco, CA
Country: US
Start Date: December 11, 2023
End Date: December 15, 2023
Sponsors: American Geophysical Union
Funding Number(s)
CONTRACT_GRANT: 80NSSC18M0060
Distribution Limits
Public
Copyright
Public Use Permitted.
Technical Review
Single Expert

Available Downloads

There are no available downloads for this record.
No Preview Available