Despite the proliferation of computer-based research on hydrology and water resources, such research is typically poorly reproducible. Published studies have low reproducibility due to incomplete availability of data and computer code, and a lack of documentation of workflow processes. This leads to a lack of transparency and efficiency because existing code can neither be quality controlled nor re-used. Given the commonalities between existing process-based hydrological models in terms of their required input data and preprocessing steps, open sharing of code can lead to large efficiency gains for the modeling community. Here we present a model configuration workflow that provides full reproducibility of the resulting model instantiations in a way that separates the model-agnostic preprocessing of specific datasets from the model-specific requirements that models impose on their input files. We use this workflow to create large-domain (global, continental) and local configurations of the Structure for Unifying Multiple Modeling Alternatives (SUMMA) hydrologic model connected to the mizuRoute routing model. These examples show how a relatively complex model setup over a large domain can be organized in a reproducible and structured way that has the potential to accelerate advances in hydrologic modeling for the community as a whole. We provide a tentative blueprint of how community modeling initiatives can be built on top of workflows such as this. We term our workflow the “Community Workflows to Advance Reproducibility in Hydrologic Modeling’‘ (CWARHM; pronounced “swarm”).

Zhi Li

and 8 more

Precipitation is an essential climate and forcing variable for modeling the global water cycle. Particularly, the Integrated Multi-satellitE Retrievals for GPM (IMERG) product retrospectively provides unprecedented two-decades of high-resolution satellite precipitation estimates (0.1-deg, 30-min) globally. The primary goal of this study is to examine the similarities and differences between the two latest and also arguably most popular GPM IMERG Early and Final Run (ER and FR) products systematically over the globe. The results reveal that: (1) ER systematically estimates 13.0% higher annual rainfall than FR, particularly over land (13.8%); (2) ER and FR show less difference with instantaneous rates (Root Mean Squared Difference: RMSD=2.38 mm/h and normalized RMSD: RMSD_norm=1.10), especially in Europe (RMSD=2.16 mm/h) and cold areas (RMSD_norm=0.87); and (3) with similar detectability of extreme events and timely data delivery, ER is favored for use in hydrometeorological applications, especially in early warning of flooding. Throughout this study, large discrepancies between ER and FR are found in inland water bodies, (semi) arid regions, and complex terrains, possibly owing to morphing differences and gauge corrections while magnified by surface emissivity and precipitation dynamics. The exploration of their similarities and differences provides a first-order global assessment of various hydrological utilities: FR is designed to be more suitable for retrospective hydroclimatology and water resource management, while the earliest available ER product, though not bias-corrected by ground gauges, shows suitable applicability in operational modeling setting for early rainfall-triggered hazard alerts.