ESS Open Archive - ESS Open Archive

https://essopenarchive.org/

by author

by title

by keyword

agricultural

286

atmospheric sciences

3679

biological sciences

271

climatology (global change)

2287

ecology

395

education

974

environmental sciences

1570

geochemistry

975

geodesy

463

geography

750

geology

2125

geophysics

3372

health sciences

162

human society

242

hydrology

2398

indigenous studies

30

informatics

510

meteorology

1273

microbiology

186

oceanography

1879

paleontology

96

planetology

701

radioastronomy

90

soil sciences

270

solar system physics

899

Informed Neural Networks for Flood Forecasting with Limited Amount of Training Data

Kenji Komiya

and 4 more

November 08, 2023

This study presents a novel approach to improving the accuracy of flood forecast models with limited training data. Flood forecast information is crucial for early evacuation planning. However, the probability of flooding caused by continuous heavy rainfall is increasing, even in areas where we have not installed flood forecasts. New methods exist to provide flood forecasts, but they require long-term observations and regular updating of extensive data on the basin. Existing methods of providing new flood forecast information require long-term observations and regular updates of extensive data on the watershed. These requirements are related to the construction time and cost of providing flood forecasts. To address this issue, we propose Informed Neural Networks (INN) that integrate existing domain knowledge of river engineering to enhance the performance of flood forecasts with a limited amount of training data. We evaluated the performance of our proposed method with Japanese real-world river water levels and compared it to conventional methods such as artificial neural networks (ANN). Our results demonstrate that the INN can significantly improve forecasting accuracy with only a small amount of training data, comparable to conventional methods trained with eight times the amount of flood data. This study highlights the potential of INN as a novel approach for accurate and efficient flood forecasting with limited training data.

Physical, Social, and Biological Attributes for Improved Understanding and Prediction...

Yavar Pourmohamad

and 13 more

October 19, 2023

Wildfires are increasingly impacting social and environmental systems in the United States. The ability to mitigate the adverse effects of wildfires increases with understanding of the social, physical, and biological conditions that co-occurred with or caused the wildfire ignitions and contributed to the wildfire impacts. To this end, we developed the FPA FOD-Attributes dataset, which augments the sixth version of the Fire Program Analysis-Fire Occurrence Database (FPA FOD v6) with nearly 270 attributes that coincide with the date and location of each wildfire ignition in the United States. FPA FOD v6 contains information on location, jurisdiction, discovery time, cause, and final size of >2.3 million wildfires from 1992-2020 in the United States. For each wildfire, we added physical (e.g., weather, climate, topography, infrastructure), biological (e.g., land cover, normalized difference vegetation index), social (e.g., population density, social vulnerability index), and administrative (e.g., national and regional preparedness level, jurisdiction) attributes. This publicly available dataset can be used to answer numerous questions about the covariates associated with human- and lightning-caused wildfires. Furthermore, the FPA FOD-Attributes dataset can support descriptive, diagnostic, predictive, and prescriptive wildfire analytics, including development of machine learning models.

Response to NASA Request for Information on the NASA Public Access Plan

Matthew Giampoala

and 2 more

October 05, 2023

A document by Shelley Stall. Click on the document to view its contents.

Unsupervised Learning of Sea Surface Height Interpolation from Multi-variate Simulate...

Théo Archambault

and 4 more

October 17, 2023

Satellite-based remote sensing missions have revolutionized our understanding of the Ocean state and dynamics. Among them, spaceborne altimetry provides valuable measurements of Sea Surface Height (SSH), which is used to estimate surface geostrophic currents. However, due to the sensor technology employed, important gaps occur in SSH observations. Complete SSH maps are produced by the altimetry community using linear Optimal Interpolations (OI) such as the widely-used Data Unification and Altimeter Combination System (DUACS). However, OI is known for producing overly smooth fields and thus misses some mesostructures and eddies. On the other hand, Sea Surface Temperature (SST) products have much higher data coverage and SST is physically linked to geostrophic currents through advection. We design a realistic twin experiment to emulate the satellite observations of SSH and SST to evaluate interpolation methods. We introduce a deep learning network able to use SST information, and a trainable in two settings: one where we have no access to ground truth during training and one where it is accessible. Our investigation involves a comparative analysis of the aforementioned network when trained using either supervised or unsupervised loss functions. We assess the quality of SSH reconstructions and further evaluate the network's performance in terms of eddy detection and physical properties. We find that it is possible, even in an unsupervised setting to use SST to improve reconstruction performance compared to SST-agnostic interpolations. We compare our reconstructions to DUACS's and report a decrease of 41\% in terms of root mean squared error.

Computing the ecRad radiation scheme with half-precision arithmetic

Anton Pershin

and 4 more

September 30, 2023

Numerical simulations of weather and climate models are conventionally carried out using double-precision floating-point numbers throughout the vast majority of the code. At the same time, the urgent need of high-resolution forecasts given limited computational resources encourages development of much more efficient numerical codes. A number of recent studies has suggested the use of reduced numerical precision, including half-precision floating-point numbers increasingly supported by hardware, as a promising avenue. In this paper, the possibility of using half-precision calculations in the radiation scheme ecRad operationally used in the ECMWF's Integrated Forecasting System (IFS). By deliberately mixing half-, single- and double-precision variables, we develop a mixed-precision version of the Tripleclouds solver, the most computationally demanding part of the radiation scheme, where reduced-precision calculations are emulated by a Fortran software rpe. By employing two tools that estimate the dynamic range of model parameters and identify problematic areas of the model code using ensemble statistics, the code variables were assigned particular precision levels.It is demonstrated that heating rates computed by the mixed-precision code are reasonably close to those produced by the double-precision code. Moreover, it is shown that using the mixed-precision ecRad in OpenIFS has a very limited impact on the accuracy of a medium-range forecast in comparison to the original double-precision configuration. These results imply that mixed-precision arithmetic could successfully be used to accelerate the radiation scheme ecRad and, possibly, other parametrization schemes used in weather and climate models without harming the forecast accuracy.

AstroPortal: An ontology repository concept for astronomy, astronautics and other spa...

Robert Rovetto

and 1 more

September 30, 2023

A document by Robert Rovetto. Click on the document to view its contents.

Fermat's Last Theorem: A Historical and Mathematical Overview

Ijtihed Kilani

September 11, 2023

This paper offers a comprehensive examination of Fermat's Last Theorem , a statement in number theory that captivated mathematicians for over 350 years until its proof by Andrew Wiles in 1994. Beginning with historical context surrounding Pierre de Fermat and the theorem's formulation , the paper meticulously reviews the mathematical foundations underlying the theorem, including Diophantine equations, modular forms, and elliptic curves. Special attention is given to Wiles' groundbreaking use of the Taniyama-Shimura-Weil conjecture and Ribet's theorem to provide a complete proof, including the resolution of an initial flaw in the proof. Furthermore, the paper explores the theorem's far-reaching implications in number theory, algebraic geometry, cryptography, and computer science. The study reveals that Fermat's Last Theorem is not just an isolated mathematical problem but a testament to the depth, beauty, and inter-connectedness of mathematics, with broad impact across various scientific disciplines.

Generating interpretable rainfall-runoff models automatically from data

Travis Adrian Dantzer

and 1 more

September 11, 2023

A sudden surge of data has created new challenges in water management, spanning quality control, assimilation, and analysis. Few approaches are available to integrate growing volumes of data into interpretable results. Process-based hydrologic models have not been designed to consume large amounts of data. Alternatively, new machine learning tools can automate data analysis and forecasting, but their lack of interpretability and reliance on very large data sets limits the discovery of insights and may impact trust. To that end, we present a new approach, which seeks to strike a middle ground between process-, and data-based modeling. The contribution of this work is an automated and scalable methodology that discovers differential equations and latent state estimations within hydrologic systems using only rainfall and runoff measurements. We show how this enables automated tools to learn interpretable models of 6 to 18 parameters solely from measurements. We apply this approach to nearly 400 stream gaging sites across the US, showing how complex catchment dynamics can be reconstructed solely from rainfall and runoff measurements. We also show how the approach discovers surrogate models that can replicate the dynamics of a much more complex process-based model, but at a fraction of the computational complexity. We discuss how the resulting representation of watershed dynamics provides insight and computational efficiency to enable automated predictions across large sensor networks.

Navigating GDPR Compliance in AI: A Deep Dive into OpenAI’s ChatGPT — A Perspective f...

Alicia Colmenero-Fernandez

September 11, 2023

IntroductionIn an era increasingly dominated by artificial intelligence, the matter of informed consent has never been more crucial. A study conducted by Ipsos indicates a significant 11-point drop in internet trust since 2019[1]. Particularly in the European Union, there is growing concern about the handling of personal data. This article aims to shed light on the ways in which AI platforms, like OpenAI’s ChatGPT, fall short of meeting key guidelines established by the European Union’s General Data Protection Regulation (GDPR).Utilizing data from Enforcement Tracker , we present a graph illustrating the distribution of GDPR fines across various sectors. The Media, Telecoms, and Broadcasting sectors are particularly noteworthy, both for the number of violations and the scale of the fines imposed, signaling serious continuos non-compliance .

Comparison of Four Competing Invasion Percolation Models for Gas Flow in Porous Media

Ishani Banerjee

and 4 more

September 08, 2023

Numerous variations of Invasion-Percolation (IP) models can simulate multiphase flow in porous media across various scales (pore-scale IP to macroscopic IP); here, we are interested in gas flow in water-saturated porous media. This flow occurs either as continuous or discontinuous flow, depending on the flow rate and the porous medium’s nature. Literature suggests that IP models are well suited for the discontinuous gas flow regime; other flow regimes have not been explored. Our research compares four existing macroscopic IP models and ranks their performance in these “other” flow regimes. We test the models on a range of gas-injection in water-saturated sand experiments from transitional and continuous gas flow regimes. Using the light transmission technique, the experimental data is obtained as a time series of images in a 2-dimensional setup. To represent pore-scale heterogeneities, we ran each model version on several random realizations of the initial entry pressure field. We use a diffused version of the so-called Jaccard coefficient to rank the models against the experimental data. We average the Jaccard coefficient over all realizations per model version to evaluate each model and calibrate specific model parameters. Depending on the application domain, we observe that some macroscopic IP model versions are suitable in these previously unexplored flow regimes. Also, we identify that the initial entry pressure fields strongly affect the performance of these models. Our comparison method is not limited to gas-water systems in porous media but generalizes to any modelling situation accompanied by spatially and temporally highly resolved data.

Unveiling the Quantum Frontier: A Journey into the Mechanics and Wonders of Quantum C...

Andy Hou

September 11, 2023

The main purpose of this research paper is to examine how quantum computing functions at both a mechanical and logical level. Through entanglement to parallelism, crucial concepts intrinsic to qubits are explained, not at extremely heavy depth, but still go rather far in most subjects. There are also implications for the future of quantum computing, as scientists continue the development and research of quantum machines, in relation to the limits of physics. Some significant findings of the research done were that a. quantum computing has many problems with decoherence and noise which disturbs results and b. that quantum computing has many advantages to classical bit-oriented computing, with more efficiency and better allocation of memory.

Forecasting West Nile Virus Infections A Machine Learning Approach to Epidemiological...

Rachel Chen

and 3 more

July 03, 2023

Mosquitoes are vectors for a number of serious illnesses, such as Dengue, Zika, Malaria, and West Nile Virus. In the United States, West Nile Virus (WNV) is the leading mosquito-borne disease. As there are currently no vaccines to prevent WNV nor medications to cure it, government agencies must sustain financially taxing programs to monitor mosquito populations and WNV infections and share this data across various departments in an effort to prevent WNV outbreaks. In this study, we develop four machine learning models that forecast WNV infections in humans, enabling government and healthcare officials to take proactive action instead of reacting to real-time infection data. Our models take in open-access data describing ecological variables – such as temperature, humidity, wind, air quality index (AQI), and enhanced vegetation index (EVI) — and use that data to predict future WNV infections five weeks in advance. We then perform a comparative analysis of the two types of machine learning architectures – support vector machine (SVM) regressors and random forest (RF) regressors – represented across our four models to evaluate which is best suited for the task. Our results indicate RF regressors are best suited to the task of forecasting WNV infections; however, SVM regressors perform comparably well and even exceed RF regressors when the magnitude of error is unweighted. Additionally, our results contribute a new perspective on the usefulness of AQI and wind speed for predicting mosquito-borne infections. Our RF regressor’s feature importance results indicate that AQI and wind speed were of similar importance as EVI and humidity – ecological variables well-known to influence mosquito population dynamics. Our work provides valuable directions for future research and development of early warning systems for disease prevention efforts as our models’ ability to forecast WNV infections five weeks in advance provides critical lead time for government officials to pursue mosquito containment efforts and healthcare facilities to increase capacity, enabling proactive action in combating WNV.Link to abstract published at AGU's Fall 2022 Session

Dynamical mean-field approach to Ising models with impurities

Chun Tao

July 08, 2023

A document by Chun Tao. Click on the document to view its contents.

A multi-chemistry modelling framework to enable flexible and reproducible water quali...

Diogo Costa

and 5 more

June 19, 2023

This work advances the incorporation and cross-model deployment of multi-biogeochemistry and ecological simulations in existing process-based hydro-modelling tools. It aims to transform the current practice of water quality modelling as an isolated research effort into a more integrated and collaborative activity between science communities. Our approach, which we call “Open Water Quality” (OpenWQ), enables existing hydrological, hydrodynamic, and groundwater models to extend their capabilities to water quality simulations, which can be set up to examine a variety of water-related pollution problems. OpenWQ’s objective is to provide a flexible biogeochemical model representation that can be used to test different modelling hypotheses in a multi-disciplinary co-creative process. In this paper, we introduce the general approach used in OpenWQ. We detail aspects of its architecture that enable its coupling with existing models. This integration enables water quality models to benefit from advances made by hydrologic- and hydrodynamic-focused groups, strengthening collaboration between the hydrological, biogeochemistry, and soil science communities. We also detail innovative aspects of OpenWQ’s modules that enable biogeochemistry lab-like capabilities, where modellers can define the pollution problem(s) of interest, the appropriate complexity of the biogeochemistry routines, and test different modelling hypotheses. In a companion paper, we demonstrate how OpenWQ has been coupled to two hydrological models, the “Structure for Unifying Multiple Modelling Alternatives” (SUMMA) and the “Cold Regions Hydrological Model” (CRHM), demonstrating the innovative aspects of OpenWQ, the flexibility of its couplers and internal spatiotemporal data structures, and the versatile eco-modelling lab capabilities that can be used to study different pollution problems.

The PyHC Open Science Experiment: A PyHC session led by Rebecca Ringuette

Rebecca Ringuette

May 25, 2023

Ringuette, R., Niehof, J. T., Polson, S. A., Zheng, Y., Rastaetter, L., Antunes, A., … Drozdov, A. (2023, August 3). Comparing Magnetopause Crossings using Open Science. https://doi.org/10.17605/OSF.IO/V4DRT

Ethical and Responsible Use of AI/ML in the Earth, Space, and Environmental Sciences

Shelley Stall

and 23 more

April 12, 2023

A document by Shelley Stall. Click on the document to view its contents.

Long-term support is needed for crucial ground-based sensor networks

J. Gannon

and 5 more

April 11, 2023

Recently, many in the space weather community have taken up the cause to advocate for an orphan among our own. It’s an important fight – for ground-based sensor networks. Although ground-based sensors are used across all disciplines of space weather, in terms of long-term support, they have no single clear home in any United States agency or department. This has resulted in an ongoing struggle throughout the community to maintain important space weather sensors and networks.The Promoting Research and Observations of Space Weather to Improve the Forecasting of Tomorrow (PROSWIFT) Act of 2020 (Public Law 116-181) attempts to clarify Federal roles and responsibilities, stating that “… ground-based observations provide crucial data necessary to understand, forecast, and prepare for space weather phenomena”, which it defines as ”radars, lidars, magnetometers, neutron monitors, radio receivers, aurora and airglow imagers, spectrometers, interferometers, and solar observatories.”The data from this list of sensors and arrays support research across the space weather domains, including magnetospheric, ionospheric, and atmospheric science. Networks are run by governmental, academic, and commercial providers, and are used to support a range of end-users, from aviation to the power sector. Given the wide range of applications, it’s not surprising that no single entity has primary custody.In separate sections of PROSWIFT, sustainment of these instruments is assigned to “The Director of the National Science Foundation, the Director of the United States Geological Survey, the Secretary of the Air Force, and, as practicable in support of the Air Force, the Secretary of the Navy” who are directed to “maintain and improve ground-based observations of the Sun, as necessary and advisable”, and also to the National Oceanic and Atmospheric Administration (NOAA), as the civil operational space weather agency that is responsible for maintaining “ground-based… assets to provide observations needed for space weather forecasting, prediction, and warnings”.While PROSWIFT’s clarification of federal responsibilities is welcome, what is highlighted is a problem of the “ownership” of the issue of long-term sustainability of such varied instruments.We can start to unravel the ownership problem by understanding its history. One complication to an easy definition is that ground-based sensor networks support both space weather science and operations. The National Science Foundation (NSF) has a long history of supporting novel instrument development, small arrays of sensors placed for scientific research (fundamental research is the foundation of NSF’s mandate), and mid- and larger-scale facilities. But the needs of science do not necessarily intersect the needs of operations, and neither do their requirements in terms of engineering and support. Operational sensors, in many cases, are entirely different than scientific sensors.Like scientific arrays, operational sensors must provide the “right” data - accurate and relevant – but the delivery of those data must also be timely, consistent, and reliable. In other words, the data must be usable for space weather predictions, forecasts, and alerts. The United States Geological Survey (USGS) is one example of a federal provider of operational ground-based data. The commercial sector, by mandate of PROSWIFT, is another.Whether scientific or operational, ground-based networks need to be supported and maintained long-term to fulfill their missions. It is more expensive to shut down and rebuild an array than to keep it operating, and strategic planning is required to prioritize and balance needs across the space weather enterprise.Those taking up the initiative to support ground-based sensors span the space weather enterprise, reflecting the interdisciplinary and cross-sector need for these data. In addition to a myriad of white papers submitted to the Heliophysics Decadal Survey (e.g., Hartinger et al., and Bhatt et al.) and publications (see Engebretson and Zesta, 2017, and Bain et al., 2023), advisory groups such as the Space Weather Advisory Group (SWAG) and the National Academies Space Weather Roundtable, both put into place by the PROSWIFT Act itself, have taken up the cause. The SWAG, in a public meeting on March 20, 2023 (https://www.weather.gov/swag), called for a “paradigm shift”, agreeing upon a recommendation that there is a need “Provide long-term support for operational ground-based and airborne sensors and networks”.It’s clear that these data are crucial for space weather – both space weather research and operations. With the approach of solar maximum, and the associated rise in space weather hazard, what’s less clear is whether this problem will be solved in time. The community efforts have been effective in raising awareness about the dire situation facing many ground-based sensor networks. What is needed now is a mechanism to maintain these networks long-term, and advocacy for new Federal appropriations to support the organizations that take on the responsibility.

Where in the World is Ocean Carbon Data?

Mike Smit

and 3 more

April 16, 2023

Efforts to validate, monitor, and verify ocean-based carbon dioxide removal (CDR) will require a rich understanding of the ocean carbon system. Ocean observations anchor this understanding, but we know that some ongoing observations are precariously funded, that data products like SOCAT rely on volunteer effort, that regions essential to our understanding of the ocean carbon system are under-observed, and that some observation data is under-used. This presentation will be a progress report on our efforts to identify and document ocean carbon data flows using systematic literature reviews and examination of ocean data repositories. These data flows are essential to identify what data the scientific community already relies on; what data and observation gaps exist; and what data might be under-used. We examined variables of interest based on GOOS EOVs, including Oxygen (and supporting variables), Stable Carbon Isotopes (and supporting variables), Ocean Surface Stress (and supporting variables), and Ocean Surface Heat Flux (and supporting variables). Commonly observed supporting variables include O2, alkalinity, pCO2, pH, temperature, and near-surface air temperature, humidity, pressure, and wind speed.