NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
NASA’S AI Foundation Models for Science: Current Initiatives, Workflow, Roadmap and Lessons LearnedNASA’s scientific data archives have grown rapidly, surpassing 150 petabytes and expected to exceed 500 petabytes by 2029. This expansion presents considerable challenges in data management and analysis. To address these, the Office of the Chief Science Data Officer (OCSDO) has implemented a strategy centered on AI foundation models (FMs) to enhance workflows across NASA’s five science divisions. These models, trained with self-supervised pretraining, create versatile, application-agnostic representations that can be adapted efficiently to specific tasks. The OCSDO’s ”5+1” strategy integrates tailored FMs for each science division with a large language model for cross-domain tasks. Notable initiatives include the INDUS language model suite covering all science divisions, the Prithvi-HLS model for optical remote sensing, and the Prithvi-WxC model for atmospheric analysis. These models reduce computational demands and data labeling needs while performing well on existing benchmarks. Current efforts focus on new models for heliophysics (SuryaSDO), lunar studies, and biological research, undertaken with new partnerships. The FM development process follows science standards prioritizing transparency, accuracy, and relevance. Key workflows encompass pretraining, adaptation, and inference.
Lessons learned from this work emphasize the value of balancing costs and benefits, fostering interdisciplinary collaboration, designing science use case-driven models, and maintaining rigorous validation. This paper details the design, implementation, and roadmap of NASA’s FMs, illustrating their potential role in advancing scientific discovery through AI-powered methodologies.
Document ID
20250000225
Acquisition Source
Marshall Space Flight Center
Document Type
Conference Paper
Authors
Rahul Ramachandran
(Marshall Space Flight Center Redstone Arsenal, United States)
Tsengdar Lee
(National Aeronautics and Space Administration Washington, United States)
Kevin Murphy
(National Aeronautics and Space Administration Washington, United States)
Date Acquired
January 8, 2025
Subject Category
Cybernetics, Artificial Intelligence and Robotics
Meeting Information
Meeting: International Geoscience and Remote Sensing Symposium (IGARSS)
Location: Brisbane
Country: AU
Start Date: August 3, 2025
End Date: August 8, 2025
Sponsors: Institute of Electrical and Electronics Engineers
Funding Number(s)
WBS: 182939.05.02.01.22
Distribution Limits
Public
Copyright
Work of the US Gov. Public Use Permitted.
Technical Review
Single Expert
Keywords
large language model
open science
artificial intelligence
foundation models
No Preview Available