NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Using Machine Learning to Estimate Surface-Level SO2 Concentrations from Satellite-Based Measurements Sulfur dioxide (SO2) is a criteria air pollutant due to its contributions to aerosol formation, rainfall acidification, and harm to human health. The placement of air quality monitoring sites is typically biased towards urban areas, leaving large areas with very limited monitoring data. The Ozone Monitoring Instrument (OMI) has been used to provide estimates of SO2 vertical column densities (VCDs) globally at spatial resolution of 10s of kms once per day. OMI SO2 VCDs have been previously used to estimate surface SO2 concentrations using chemical transport model (CTM) simulations. The CTMs use estimated emissions and assimilated meteorological data, and simulate the chemical and physical processes that determine the vertical profile of SO2, which can be used to derive a ratio between the surface concentrations and VCDs. These models are complex, computationally expensive, and have large uncertainties in the simulated surface-to-VCD ratio due to biases in emissions and relatively coarse resolution. Machine learning techniques are comparatively easier to use, much less computationally expensive to use after training, and can produce more accurate estimations of surface concentrations than the CTM-based method. The interpretation of machine learning models often poses challenges, and in some cases, non-physical variables unrelated to SO2 are used as predictors. In this work, we create an artificial neural network (ANN) to relate OMI retrievals and archived GEOS-FP boundary layer heights to surface SO2 concentrations from the ChinaHighAirPollutants ChinaHighSO2 dataset (CHAP; Wei et al., 2023) on a seasonal average timescale from 2013-2018. Our model only utilizes five variables that are directly relevant to the satellite retrieval, lifetime, and spatial distribution of SO2. The model was trained on 16 seasons (four of each) with independent validation (one of each season) and testing datasets (one of each season) to avoid overfitting. Our ANN generates surface SO2 concentrations that are sensitive (slope = 0.51) and consistent (r = 0.74) with the CHAP data, but are underpredicted by an average of 1.2 ppbv with a mean absolute error of 2.2 ppbv. These results are better than recent studies utilizing the CTM method. To our knowledge, this is the best performing machine learning model that only uses physical variables to predict surface SO2. Our work demonstrates that a carefully constructed, simple ML model can accurately estimate surface-based SO2 concentrations from satellite VCD measurements, and this technique has future promise to expend to newer, higher resolution satellites and other air pollutants.
Document ID
20240013917
Acquisition Source
Goddard Space Flight Center
Document Type
Abstract
Authors
Zachary Watson
(University of Alabama in Huntsville Huntsville, United States)
Sean W Freeman
(University of Alabama in Huntsville Huntsville, United States)
Can Li
(Goddard Space Flight Center Greenbelt, United States)
Joanna Joiner
(Goddard Space Flight Center Greenbelt, United States)
Shan-Hu Lee
(University of Alabama in Huntsville Huntsville, United States)
Date Acquired
November 4, 2024
Subject Category
Geosciences (General)
Meeting Information
Meeting: 105th American Meteorological Society (AMS) Annual Meeting
Location: Baltimore, MD
Country: US
Start Date: January 12, 2025
End Date: January 16, 2025
Sponsors: American Meteorological Society
Funding Number(s)
WBS: 583998.04.01.01
Distribution Limits
Public
Copyright
Use by or on behalf of the US Gov. Permitted.
Technical Review
NASA Peer Committee
Keywords
SO2, air quality, OMI, machine learning
No Preview Available