Tao Huang

and 1 more

As all kinds of physics-based and data-driven models are emerging in hydrologic and hydraulic engineering, Bayesian model averaging (BMA) is one of the popular multi-model methods used to account for various uncertainty sources in the flood modeling process and generate robust ensemble predictions. The reliability of BMA parameters (weights and variances) determines the accuracy of BMA predictions. However, the uncertainty in BMA parameters with fixed values, which are usually obtained from Expectation-Maximization (EM) algorithm, has not been adequately investigated in BMA-related applications over the past few decades. Given the limitations of the commonly used EM algorithm, Metropolis-Hastings (M-H) algorithm, which is one of the most widely used algorithms in Markov Chain Monte Carlo (MCMC) method, is proposed to estimate BMA parameters. Both numerical experiments and one-dimensional HEC-RAS models are employed to examine the applicability of M-H algorithm with multiple independent Markov chains. The performances of EM and M-H algorithms are compared based on the daily water stage predictions from 10 model members. Results show that BMA weights estimated from both algorithms are comparable, while BMA variances obtained from M-H algorithm are closer to the given variances in the numerical experiment. Moreover, the normal proposal used in M-H algorithm can yield narrower distributions for BMA weights than those from the uniform proposal. Overall, MCMC approach with multiple chains can provide more information associated with the uncertainty of BMA parameters and its performance is better than the default EM algorithm in terms of multiple evaluation metrics as well as algorithm flexibility.

Tao Huang

and 1 more

As all kinds of physics-based and data-driven models are emerging in the fields of hydrologic and hydraulic engineering, Bayesian model averaging (BMA) is one of the popular multi-model methods used to account for the various uncertainty sources in the flood modeling process and generate robust ensemble predictions based on multiple competitive candidate models. The reliability of BMA parameters (weights and variances) determines the accuracy of BMA predictions. However, the uncertainty in the BMA parameters with fixed values, which are usually obtained from the Expectation-Maximization (EM) algorithm, has not been adequately investigated in BMA-related applications over the past few decades. Given the limitations of the commonly used EM algorithm, the Metropolis-Hastings (M-H) algorithm, which is one of the most widely used algorithms in the Markov Chain Monte Carlo (MCMC) method, is proposed to estimate the BMA parameters and quantify their associated uncertainty. Both numerical experiments and the one-dimensional HEC-RAS models are employed to examine the applicability of the M-H algorithm with multiple independent Markov chains. The performances of the EM and M-H algorithms in the BMA analysis are compared based on the daily water stage predictions from 10 model configurations. The results show that the BMA weights estimated from both algorithms are comparable, while the BMA variances obtained from the M-H MCMC algorithm are closer to the given variances in the numerical experiment. Moreover, the normal proposal distribution used in the M-H algorithm can yield narrower distributions for the BMA weights than those from the uniform prior. Overall, the MCMC approach with multiple chains can provide more information associated with the uncertainty of BMA parameters and its prediction performance is better than the default EM algorithm in terms of multiple evaluation metrics as well as algorithm flexibility.

Tao Huang

and 1 more

Evaluation of the performance of hydrologic and hydraulic models is a crucial step in the modeling process. Considering the limitations of single statistical metrics, such as the Nash Sutcliffe efficiency (NSE), the Kling Gupta efficiency (KGE), and the coefficient of determination (R2), which are widely used in the evaluation of model performance, an evaluation framework that incorporates multiple criteria and based on the generalized likelihood uncertainty estimation (GLUE) is proposed to demonstrate the uncertainty in the evaluation criteria and hence to quantify the overall uncertainty of flood models in a comprehensive way. This framework is applied to the one-dimensional HEC-RAS models of six reaches located in States of Indiana and Texas of the United States to quantify the uncertainty associated with the channel roughness and upstream flow input. Specifically, the effects of different prior distributions of the uncertainty sources, multiple high-flow scenarios, and various types of measurement errors (white noise, positive bias, and negative bias) in observations on the evaluation metrics are investigated by using the bootstrapping method and Monte Carlo simulations. The results show that the model performances based on the uniform and normal priors are comparable. The distributions of all the evaluation metrics in the framework are significantly different for the flood model under different high-flow scenarios, and it further indicates that the metrics are essentially random statistical variables. Additionally, the white-noise error in observations has the least impact on the metrics, while the positive and the negative biases would have opposite impacts, which depends on whether the model overestimated or underestimated the hydrologic variable.

Sayan Dey

and 4 more

Several studies have focused on the importance of river bathymetry (channel geometry) in hydrodynamic routing along individual reaches. However, its effect on other watershed processes such as infiltration and surface water (SW) – groundwater (GW) interactions has not been explored across large river networks. Surface and subsurface processes are interdependent, therefore, errors due to inaccurate representation of one watershed process can cascade across other hydraulic or hydrologic processes. This study hypothesizes that accurate bathymetric representation is not only essential for simulating channel hydrodynamics but also affects subsurface processes by impacting SW-GW interactions. Moreover, quantifying the effect of bathymetry on surface and subsurface hydrological processes across a river network can facilitate an improved understanding of how bathymetric characteristics affect these processes across large spatial domains. The study tests this hypothesis by developing physically-based distributed models capable of bidirectional coupling (SW-GW) with four configurations with progressively reduced levels of bathymetric representation. A comparison of hydrologic and hydrodynamic outputs shows that changes in channel geometry across the four configurations has a considerable effect on infiltration, lateral seepage, and location of water table across the entire river network. For example, when using bathymetry with inaccurate channel conveyance capacity but accurate channel depth, peak lateral seepage rate exhibited 58% error. The results from this study provide insights into the level of bathymetric detail required for accurately simulating flooding-related physical processes while also highlighting potential issues with ignoring bathymetry across lower order streams such as spurious backwater flow, inaccurate water table elevations, and incorrect inundation extents.

Tao Huang

and 1 more

Flood Insurance Rate Maps (FIRMs) managed by the Federal Emergency Management Agency (FEMA) have been providing ongoing flood information to most of the communities in the United States over the past half century. However, the uncertainty associated with the modeling of the FIRMs, some of which are created by using a single HEC-RAS one-dimensional (1D) steady flow model, may have adverse effects on the reliability of flood stage and inundation extent. Therefore, a systematic understating of the uncertainty in the modeling process of FIRMs is important and necessary. The Bayesian model averaging (BMA), which is a statistical approach that can combine estimations from multiple models and produce reliable probabilistic predictions, is applied to evaluating the uncertainty associated with the FIRMs. In this study, both the BMA and HBMA approaches are used to quantify the uncertainty within the detailed FEMA models of the Deep River and the Saint Marys River in the state of Indiana based on water stage predictions from 150 HEC-RAS 1D unsteady flow model configurations that incorporate four uncertainty sources including the bridges, channel roughness, floodplain roughness, and upstream flow input. The BMA weight and the variance for each model member are obtained given the ensemble predictions and the observed water stage data in the training period, and then the BMA prediction ability is validated for the observed data from the later period. The results indicate that BMA prediction is more robust than the original FEMA model as well as the ensemble mean. Different types of uncertainty coefficients based on the BMA prediction distribution are also proposed to evaluate the FEMA models. Furthermore, the HBMA framework shows that both the channel roughness and the upstream flow input have a larger impact on prediction variance than bridges, and hence provides some insights for modelers into the relative impact of individual uncertainty sources in the flood modeling process.

Rajesh Kalyanam

and 5 more

Scientific research is increasingly collaborative and globally distributed; research groups now rely on web- based scientific tools and data management systems to simplify their day-to-day collaborative workflows. However, such tools often lack seamless interfaces, requiring researchers to contend with manual data transfers, annotation and sharing. MyGeoHub is a web platform that supports out-of-the-box, seamless workflows involving data ingestion, metadata extraction, analysis, sharing and publication. MyGeoHub is built on the HUBzero cyberinfrastructure platform and adds general-purpose software building blocks (GABBs), for geospatial data management, visualization and analysis. A data management building block iData, processes geospatial files, extracting metadata for keyword and map-based search while enabling quick previews. iData is pervasive, allowing access through a web interface, scientific tools on MyGeoHub or even mobile field devices via a data service API. GABBs includes a Python map library as well as map widgets that in a few lines of code, generate complete geospatial visualization web interfaces for scientific tools. GABBs also includes powerful tools that can be used with no programming effort. The GeoBuilder tool provides an intuitive wizard for importing multi-variable, geo-located time series data (typical of sensor readings, GPS trackers) to build visualizations supporting data filtering and plotting. MyGeoHub has been used in tutorials at scientific conferences and educational activities for K-12 students. MyGeoHub is also constantly evolving; the recent addition of Jupyter and R Shiny notebook environments enable reproducible, richly interactive geospatial analyses and applications ranging from simple pre- processing to published tools. MyGeoHub is not a monolithic geospatial science gateway, instead it supports diverse needs ranging from just a feature-rich data management system, to complex scientific tools and workflows.

Tao Huang

and 1 more

The Natural Resources Conservation Service (NRCS, formerly the Soil Conservation Service, SCS) unit hydrograph (UH) is one of the most commonly used synthetic UH methods for hydrologic modeling and engineering design all over the world. However, previous studies have shown that the application of the NRCS UH method for some ungauged watersheds in the state of Indiana produced unrealistic flood predictions for both the peak discharge and the time to peak. The objective of this work is to customize the NRCS UH by analyzing the role of its two key parameters, namely, the peak rate factor (PRF) and the lag time, in creating the runoff hydrograph. Based on 120 rainfall-runoff events collected from 30 small watersheds in Indiana over the past two decades, the observed UHs are derived and the corresponding PRF and lag time are extracted. The observed UHs in Indiana show that the mean value of PRF is 371, which is lower than the standard PRF of 484, and the NRCS lag time equation tends to underestimate the “true” lag time. Moreover, a multiple linear regression method, especially the stepwise selection technique, is employed to relate the NRCS UH parameters to the most appropriate geomorphic attributes extracted from the study watersheds. Both the statewide and regional regression models show that the main channel slope is a major factor in determining the PRF and lag time. A customized Indiana unit hydrograph (INUH) is derived with updated parameters and the Gamma function. Validation results show that the INUH provides more reliable and accurate predictions in terms of the peak discharge and the time to peak than the original NRCS UH for the watersheds in Indiana.