NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
A Global Land Cover Training Dataset From 1984 to 2020State-of-the-art cloud computing platforms such as Google Earth Engine (GEE) enable regional-to-global land cover and land cover change mapping with machine learning algorithms. However, collection of high-quality training data, which is necessary for accurate land cover mapping, remains costly and labor-intensive. To address this need, we created a global database of nearly 2 million training units spanning the period from 1984 to 2020 for seven primary and nine secondary land cover classes. Our training data collection approach leveraged GEE and machine learning algorithms to ensure data quality and biogeographic representation. We sampled the spectral-temporal feature space from Landsat imagery to efficiently allocate training data across global ecoregions and incorporated publicly available and collaborator-provided datasets to our database. To reflect the underlying regional class distribution and post-disturbance landscapes, we strategically augmented the database. We used a machine learning-based cross-validation procedure to remove potentially mis-labeled training units. Our training database is relevant for a wide array of studies such as land cover change, agriculture, forestry, hydrology, urban development, among many others.
Document ID
20230015858
Acquisition Source
Marshall Space Flight Center
Document Type
Accepted Manuscript (Version with final changes)
Authors
Radost Stanimirova ORCID
(Boston University Boston, Massachusetts, United States)
Katelyn Tarrio ORCID
(Boston University Boston, Massachusetts, United States)
Konrad Turlej
(Boston University Boston, Massachusetts, United States)
Kristina McAvoy
(Boston University Boston, Massachusetts, United States)
Sophia Stonebrook
(Boston University Boston, Massachusetts, United States)
Kai-Ting Hu
(Boston University Boston, Massachusetts, United States)
Paulo Arévalo
(Boston University Boston, Massachusetts, United States)
Eric L Bullock
(Boston University Boston, Massachusetts, United States)
Yingtong Zhang
(Boston University Boston, Massachusetts, United States)
Curtis E. Woodcock
(Boston University Boston, Massachusetts, United States)
Pontus Olofsson
(Marshall Space Flight Center Redstone Arsenal, Alabama, United States)
Zhe Zhu ORCID
(University of Connecticut Storrs, Connecticut, United States)
Christopher P Barber
(United States Geological Survey Reston, Virginia, United States)
Carlos M Souza Jr.
(Imazon Belém, Brazil)
Shijuan Chen
(Yale University New Haven, Connecticut, United States)
Jonathan A Wang ORCID
(University of Utah Salt Lake City, Utah, United States)
Foster Mensah ORCID
(University of Ghana Accra, Ghana)
Marco Calderón-Loor
(Deakin University Geelong, Victoria, Australia)
Michalis Hadjikakou ORCID
(Deakin University Geelong, Victoria, Australia)
Brett A Bryan ORCID
(Deakin University Geelong, Victoria, Australia)
Jordan Graesser
(Indigo Ag)
Dereje L Beyene
(Oromia Environmental Protection Authority)
Brian Mutasha
(Ministry of Green Economy and Environment)
Sylvester Siame
(Ministry of Green Economy and Environment)
Abel Siampale
(Ministry of Green Economy and Environment)
Mark A Friedl ORCID
(Boston University Boston, Massachusetts, United States)
Date Acquired
November 2, 2023
Publication Date
December 7, 2023
Publication Information
Publication: Nature Scientific Data
Publisher: Nature Research
Volume: 10
Issue Publication Date: December 7, 2023
e-ISSN: 2052-4463
Subject Category
Documentation and Information Science
Funding Number(s)
CONTRACT_GRANT: 80NSSC18K0994
Distribution Limits
Public
Copyright
Portions of document may include copyright protected material.
Technical Review
External Peer Committee
No Preview Available