NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
AI Foundation Models for Science: An Open Collaborative InitiativeFoundation Models (FMs), AI models designed to replace task-specific models, are increasingly being recognized for their versatility across numerous downstream applications. These models, trained using self-supervised techniques on any type of sequence data, circumvent the need for large annotated datasets, a major bottleneck in traditional AI model development. FMs can be applied to downstream tasks using few-shot learning and fine-tuning, significantly reducing the need for large labeled training datasets and computational resources.

However, the development of FMs requires substantial resources, including access to data and compute power, expertise in the latest models, and specialized scientific knowledge for systematic evaluation. It is challenging for a single group to possess all these capabilities. To address this, NASA IMPACT has initiated an open collaborative effort, leveraging partnerships with the private sector and other groups within and outside NASA, to jointly build FMs. The overarching goal is to develop a consistent and collaborative approach to building FMs for high-value science datasets. This initiative has fostered collaboration within NASA and with external partners, including IBM Research, Clark University, DOE’s ORNL, ESA, and USGS.

The effort focuses on identifying key datasets with a wide range of downstream applications, pretraining and building FMs using modified transformer architectures, evaluating compute infrastructure needs, and sharing models, pretraining and fine-tuning code, and data with the community. Furthermore, it aims to train the Earth science community to fine-tune these models for various downstream applications.

Our initial effort resulted in the creation of a 100 million parameter HLS Geospatial Model within six months, which was released on HuggingFace. We are now expanding our scope to include data from weather and climate models and investigating multimodal models. We invite those interested in participating in this effort to join us by sharing their use cases, expertise, or data.
Document ID
20230016489
Acquisition Source
Marshall Space Flight Center
Document Type
Presentation
Authors
Rahul Ramachandran
(National Aeronautics and Space Administration Washington D.C., District of Columbia, United States)
Sujit Roy
(University of Alabama in Huntsville Huntsville, Alabama, United States)
Kumar Ankur
(University of Alabama in Huntsville Huntsville, Alabama, United States)
Christopher Phillips
(University of Alabama in Huntsville Huntsville, Alabama, United States)
Iksha Gurung
(University of Alabama in Huntsville Huntsville, Alabama, United States)
Muthukumaran Ramasubramanian
(University of Alabama in Huntsville Huntsville, Alabama, United States)
Manil Maskey
(Marshall Space Flight Center Redstone Arsenal, Alabama, United States)
Pontus Olofsson
(Marshall Space Flight Center Redstone Arsenal, Alabama, United States)
Elizabeth Fancher
(Alion Science and Technology (United States) McLean, Virginia, United States)
Tsengdar Lee
(National Aeronautics and Space Administration Washington D.C., District of Columbia, United States)
Kevin Murphy
(National Aeronautics and Space Administration Washington D.C., District of Columbia, United States)
Dan Duffy
(Goddard Space Flight Center Greenbelt, Maryland, United States)
Mike Little
(LSP Technologies (United States) Dublin, Ohio, United States)
Raghu Ganti
(IBM Research – Thomas J. Watson Research Center Yorktown Heights, New York, United States)
Hamed Alemohammad
(Clark University Worcester, Massachusetts, United States)
Date Acquired
November 13, 2023
Subject Category
Documentation and Information Science
Cybernetics, Artificial Intelligence and Robotics
Meeting Information
Meeting: 23rd Meeting of the American Geophysical Union (AGU)
Location: San Francisco, CA
Country: US
Start Date: December 11, 2023
End Date: December 15, 2023
Sponsors: American Geophysical Union
Funding Number(s)
CONTRACT_GRANT: 80MSFC22M0004
Distribution Limits
Public
Copyright
Portions of document may include copyright protected material.
Technical Review
Single Expert
No Preview Available