Earth System Science Data

Mapping global non-floodplain wetlands
Lane CR, D'Amico E, Christensen JR, Golden HE, Wu Q and Rajib A
Non-floodplain wetlands - those located outside the floodplains - have emerged as integral components to watershed resilience, contributing hydrologic and biogeochemical functions affecting watershed-scale flooding extent, drought magnitude, and water-quality maintenance. However, the absence of a global dataset of non-floodplain wetlands limits their necessary incorporation into water quality and quantity management decisions and affects wetland-focused wildlife habitat conservation outcomes. We addressed this critical need by developing a publicly available "Global NFW" (Non-Floodplain Wetland) dataset, comprised of a global river-floodplain map at 90 m resolution coupled with a global ensemble wetland map incorporating multiple wetland-focused data layers. The floodplain, wetland, and non-floodplain wetland spatial data developed here were successfully validated within 21 large and heterogenous basins across the conterminous United States. We identified nearly 33 million potential non-floodplain wetlands with an estimated global extent of over 16×10 km. Non-floodplain wetland pixels comprised 53% of globally identified wetland pixels, meaning the majority of the globe's wetlands likely occur external to river floodplains and coastal habitats. The identified global NFWs were typically small (median 0.039 km), with a global median size ranging from 0.018-0.138 km. This novel geospatial Global NFW static dataset advances wetland conservation and resource-management goals while providing a foundation for global non-floodplain wetland functional assessments, facilitating non-floodplain wetland inclusion in hydrological, biogeochemical, and biological model development. The data are freely available through the United States Environmental Protection Agency's Environmental Dataset Gateway (https://gaftp.epa.gov/EPADataCommons/ORD/Global_NonFloodplain_Wetlands/, last access: 24 May 2023) and through https://doi.org/10.23719/1528331 (Lane et al., 2023a).
Improving intelligent dasymetric mapping population density estimates at 30 m resolution for the conterminous United States by excluding uninhabited areas
Baynes J, Neale A and Hultgren T
Population change impacts almost every aspect of global change from land use, to greenhouse gas emissions, to biodiversity conservation, to the spread of disease. Data on spatial patterns of population density help us understand patterns and drivers of human settlement and can help us quantify the exposure we face to natural disasters, pollution, and infectious disease. Human populations are typically recorded by national or regional units that can vary in shape and size. Using these irregularly sized units and ancillary data related to population dynamics, we can produce high-resolution gridded estimates of population density through intelligent dasymetric mapping (IDM). The gridded population density provides a more detailed estimate of how the population is distributed within larger units. Furthermore, we can refine our estimates of population density by specifying uninhabited areas which have impacts on the analysis of population density such as our estimates of human exposure. In this study, we used various geospatial datasets to expand the existing specification of uninhabited areas within the United States (US) Environmental Protection Agency's (EPA) EnviroAtlas Dasymetric Population Map for the conterminous United States (CONUS). When compared to the existing definition of uninhabited areas for the EnviroAtlas dasymetric population map, we found that IDM's population estimates for the US Census Bureau blocks improved across all states in the CONUS. We found that IDM performed better in states with larger urban areas than in states that are sparsely populated. We also updated the existing EnviroAtlas Intelligent Dasymetric Mapping toolbox and expanded its capabilities to accept uninhabited areas. The updated 30 m population density for the CONUS is available via the EPA's Environmental Dataset Gateway (Baynes et al., 2021, https://doi.org/10.23719/1522948) and the EPA's EnviroAtlas https://www.epa.gov/enviroatlas, last access: 15 June 2022; Pickard et al., 2015).
A global compilation of in situ aquatic high spectral resolution inherent and apparent optical property data for remote sensing applications
Casey KA, Rousseaux CS, Gregg WW, Boss E, Chase AP, Craig SE, Mouw CB, Reynolds RA, Stramski D, Ackleson SG, Bricaud A, Schaeffer B, Lewis MR and Maritorena S
Light emerging from natural water bodies and measured by radiometers contains information about the local type and concentrations of phytoplankton, non-algal particles and colored dissolved organic matter in the underlying waters. An increase in spectral resolution in forthcoming satellite and airborne remote sensing missions is expected to lead to new or improved capabilities for characterizing aquatic ecosystems. Such upcoming missions include NASA's Plankton, Aerosol, Cloud, ocean Ecosystem (PACE) mission; the NASA Surface Biology and Geology designated observable mission; and NASA Airborne Visible/Infrared Imaging Spectrometer - Next Generation (AVIRIS-NG) airborne missions. In anticipation of these missions, we present an organized dataset of geographically diverse, quality-controlled, high spectral resolution inherent and apparent optical property (IOP-AOP) aquatic data. The data are intended to be of use to increase our understanding of aquatic optical properties, to develop aquatic remote sensing data product algorithms, and to perform calibration and validation activities for forthcoming aquatic-focused imaging spectrometry missions. The dataset is comprised of contributions from several investigators and investigating teams collected over a range of geographic areas and water types, including inland waters, estuaries, and oceans. Specific in situ measurements include remote-sensing reflectance, irradiance reflectance, and coefficients describing particulate absorption, particulate attenuation, non-algal particulate absorption, colored dissolved organic matter absorption, phytoplankton absorption, total absorption, total attenuation, particulate backscattering, and total backscattering. The dataset can be downloaded from https://doi.org/10.1594/PANGAEA.902230 (Casey et al., 2019).
The Stratospheric Water and Ozone Satellite Homogenized (SWOOSH) database: a long-term database for climate studies
Davis SM, Rosenlof KH, Hassler B, Hurst DF, Read WG, Vömel H, Selkirk H, Fujiwara M and Damadeo R
In this paper, we describe the construction of the Stratospheric Water and Ozone Satellite Homogenized (SWOOSH) database, which includes vertically resolved ozone and water vapor data from a subset of the limb profiling satellite instruments operating since the 1980s. The primary SWOOSH products are zonal-mean monthly-mean time series of water vapor and ozone mixing ratio on pressure levels (12 levels per decade from 316 to 1 hPa). The SWOOSH pressure level products are provided on several independent zonal-mean grids (2.5, 5, and 10°), and additional products include two coarse 3-D griddings (30° long × 10° lat, 20° × 5°) as well as a zonal-mean isentropic product. SWOOSH includes both individual satellite source data as well as a merged data product. A key aspect of the merged product is that the source records are homogenized to account for inter-satellite biases and to minimize artificial jumps in the record. We describe the SWOOSH homogenization process, which involves adjusting the satellite data records to a "reference" satellite using coincident observations during time periods of instrument overlap. The reference satellite is chosen based on the best agreement with independent balloon-based sounding measurements, with the goal of producing a long-term data record that is both homogeneous (i.e., with minimal artificial jumps in time) and accurate (i.e., unbiased). This paper details the choice of reference measurements, homogenization, and gridding process involved in the construction of the combined SWOOSH product and also presents the ancillary information stored in SWOOSH that can be used in future studies of water vapor and ozone variability. Furthermore, a discussion of uncertainties in the combined SWOOSH record is presented, and examples of the SWOOSH record are provided to illustrate its use for studies of ozone and water vapor variability on interannual to decadal timescales. The version 2.5 SWOOSH data are publicly available at doi:10.7289/V5TD9VBX.
The Open-source Data Inventory for Anthropogenic Carbon dioxide (CO), version 2016 (ODIAC2016): A global, monthly fossil-fuel CO gridded emission data product for tracer transport simulations and surface flux inversions
Oda T, Maksyutov S and Andres RJ
The Open-source Data Inventory for Anthropogenic CO (ODIAC) is a global high-spatial resolution gridded emission data product that distributes carbon dioxide (CO) emissions from fossil fuel combustion. The emission spatial distributions are estimated at a 1×1 km spatial resolution over land using power plant profiles (emission intensity and geographical location) and satellite-observed nighttime lights. This paper describes the year 2016 version of the ODIAC emission data product (ODIAC2016) and presents analyses that help guiding data users, especially for atmospheric CO tracer transport simulations and flux inversion analysis. Since the original publication in 2011, we have made modifications to our emission modeling framework in order to deliver a comprehensive global gridded emission data product. Major changes from the 2011 publication are 1) the use of emissions estimates made by the Carbon Dioxide Information Analysis Center (CDIAC) at the Oak Ridge National Laboratory (ORNL) by fuel type (solid, liquid, gas, cement manufacturing, gas flaring and international aviation and marine bunkers), 2) the use of multiple spatial emission proxies by fuel type such as nightlight data specific to gas flaring and ship/aircraft fleet tracks and 3) the inclusion of emission temporal variations. Using global fuel consumption data, we extrapolated the CDIAC emissions estimates for the recent years and produced the ODIAC2016 emission data product that covers 2000-2015. Our emission data can be viewed as an extended version of CDIAC gridded emission data product, which should allow data users to impose global fossil fuel emissions in more comprehensive manner than original CDIAC product. Our new emission modeling framework allows us to produce future versions of ODIAC emission data product with a timely update. Such capability has become more significant given the CDIAC/ORNL's shutdown. ODIAC data product could play an important role to support carbon cycle science, especially modeling studies with space-based CO data collected near real time by ongoing carbon observing missions such as Japanese Greenhouse Observing SATellite (GOSAT), NASA's Orbiting Carbon Observatory 2 (OCO-2) and upcoming future missions. The ODIAC emission data product including the latest version of the ODIAC emission data (ODIAC2017, 2000-2016), is distributed from http://db.cger.nies.go.jp/dataset/ODIAC/ with a DOI.
The GEWEX Water Vapor Assessment archive of water vapour products from satellite observations and reanalyses
Schröder M, Lockhoff M, Fell F, Forsythe J, Trent T, Bennartz R, Borbas E, Bosilovich MG, Castelli E, Hersbach H, Kachi M, Kobayashi S, Kursinski ER, Loyola D, Mears C, Preusker R, Rossow WB and Saha S
The Global Energy and Water cycle Exchanges (GEWEX) Data and Assessments Panel (GDAP) initiated the GEWEX Water Vapor Assessment (G-VAP), which has the main objectives to quantify the current state of art in water vapour products being constructed for climate applications and to support the selection process of suitable water vapour products by GDAP for its production of globally consistent water and energy cycle products. During the construction of the G-VAP data archive, freely available and mature satellite and reanalysis data records with a minimum temporal coverage of 10 years were considered. The archive contains total column water vapour (TCWV) as well as specific humidity and temperature at four pressure levels (1000, 700, 500, 300 hPa) from 22 different data records. All data records were remapped to a regular longitude/latitude grid of 2°x2°. The archive consists of four different folders: 22 TCWV data records covering the period 2003-2008, 11 TCWV data records covering the period 1988-2008, as well as seven specific humidity and seven temperature data records covering the period 1988-2009. The G-VAP data archive is referenced under the following digital object identifier (doi): http://dx.doi.org/10.5676/EUM SAF CM/GVAP/V001. Within G-VAP, the characterisation of water vapour products is, among other ways, achieved through intercomparisons of the considered data records, as a whole and grouped into three classes of predominant retrieval condition: clear-sky, cloudy-sky and all-sky. Associated results are shown using the 22 TCWV data records. The standard deviations among the 22 TCWV data records have been analysed and exhibit distinct maxima over central Africa and the tropical warm pool (in absolute terms) as well as over the poles and mountain regions (in relative terms). The variability in TCWV within each class can be large and prohibits conclusions on systematic differences in TCWV between the classes.
Greenhouse gas observations from the Northeast Corridor tower network
Karion A, Callahan W, Stock M, Prinzivalli S, Verhulst KR, Kim J, Salameh PK, Lopez-Coto I and Whetstone J
We present the organization, structure, instrumentation, and measurements of the Northeast Corridor greenhouse gas observation network. This network of tower-based in situ carbon dioxide and methane observation stations was established in 2015 with the goal of quantifying emissions of these gases in urban areas in the northeastern United States. A specific focus of the network is the cities of Baltimore, MD, and Washington, DC, USA, with a high density of observation stations in these two urban areas. Additional observation stations are scattered throughout the northeastern US, established to complement other existing urban and regional networks and to investigate emissions throughout this complex region with a high population density and multiple metropolitan areas. Data described in this paper are archived at the National Institute of Standards and Technology and can be found at https://doi.org/10.18434/M32126 (Karion et al., 2019).
Fine-grained, spatiotemporal datasets measuring 200 years of land development in the United States
Uhl JH, Leyk S, McShane CM, Braswell AE, Connor DS and Balk D
The collection, processing, and analysis of remote sensing data since the early 1970s has rapidly improved our understanding of change on the Earth's surface. While satellite-based Earth observation has proven to be of vast scientific value, these data are typically confined to recent decades of observation and often lack important thematic detail. Here, we advance in this arena by constructing new spatially explicit settlement data for the United States that extend back to the early 19th century and are consistently enumerated at fine spatial and temporal granularity (i.e. 250m spatial and 5-year temporal resolution). We create these time series using a large, novel building-stock database to extract and map retrospective, fine-grained spatial distributions of built-up properties in the conterminous United States from 1810 to 2015. From our data extraction, we analyse and publish a series of gridded geospatial datasets that enable novel retrospective historical analysis of the built environment at an unprecedented spatial and temporal resolution. The datasets are part of the Historical Settlement Data Compilation for the United States (https://dataverse.harvard.edu/dataverse/hisdacus, last access: 25 January 2021) and are available at https://doi.org/10.7910/DVN/YSWMDR (Uhl and Leyk, 2020a), https://doi.org/10.7910/DVN/SJ213V (Uhl and Leyk, 2020b), and https://doi.org/10.7910/DVN/J6CYUJ (Uhl and Leyk, 2020c).