Despite the proliferation of computer-based research on hydrology and water resources, such research is typically poorly reproducible. Published studies have low reproducibility due to incomplete availability of data and computer code, and a lack of documentation of workflow processes. This leads to a lack of transparency and efficiency because existing code can neither be quality controlled nor re-used. Given the commonalities between existing process-based hydrological models in terms of their required input data and preprocessing steps, open sharing of code can lead to large efficiency gains for the modeling community. Here we present a model configuration workflow that provides full reproducibility of the resulting model instantiations in a way that separates the model-agnostic preprocessing of specific datasets from the model-specific requirements that models impose on their input files. We use this workflow to create large-domain (global, continental) and local configurations of the Structure for Unifying Multiple Modeling Alternatives (SUMMA) hydrologic model connected to the mizuRoute routing model. These examples show how a relatively complex model setup over a large domain can be organized in a reproducible and structured way that has the potential to accelerate advances in hydrologic modeling for the community as a whole. We provide a tentative blueprint of how community modeling initiatives can be built on top of workflows such as this. We term our workflow the “Community Workflows to Advance Reproducibility in Hydrologic Modeling’‘ (CWARHM; pronounced “swarm”).

Andrew Bennett

and 1 more

Andrew Bennett

and 4 more

Water resources planning often uses streamflow predictions made by hydrologic models. These simulated predictions have systematic errors which limit their usefulness as input to water management models. To account for these errors, streamflow predictions are bias-corrected through statistical methods which adjust model predictions based on comparisons to reference datasets (such as observed streamflow). Existing bias-correction methods have several shortcomings when used to correct spatially-distributed streamflow predictions. First, existing bias-correction methods destroy the spatio-temporal consistency of the streamflow predictions, when these methods are applied independently at multiple sites across a river network. Second, bias-correction techniques are usually built on simple, time-invariant mappings between reference and simulated streamflow without accounting for the hydrologic processes which underpin the systematic errors. We describe improved bias-correction techniques which account for the river network topology and which allow for corrections that are process-conditioned. Further, we present a workflow that allows the user to select whether to apply these techniques separately or in conjunction. We evaluate four different bias-correction methods implemented with our workflow in the Yakima River Basin in the Pacific Northwestern United States. We find that all four methods reduce systematic bias in the simulated streamflow. The spatially-consistent bias-correction methods produce spatially-distributed streamflow as well as bias-corrected incremental streamflow, which is suitable for input to water management models. We also find that the process-conditioning methods improve the timing of the corrected streamflow when conditioned on daily minimum temperature, which we use as a proxy for snowmelt processes

Jane Harrell

and 2 more

The Columbia River basin is a large transboundary basin located in the Pacific Northwest. The basin spans seven US states and one Canadian province, encompassing a diverse range of hydroclimates. Strong seasonality and complex topography are projected to give rise to spatially heterogeneous climate effects on unregulated streamflow. The basin’s water resources are economically critical, and regulation across the domain is extensive. Many sensitivity studies have investigated climate impacts on the basin’s naturalized hydrology; however, few have considered the large role of regulation. This study investigates where and when regulation affects projected changes in streamflow by comparing climate outcomes across 80-member ensembles of unregulated and regulated streamflow projections at 75 sites across the basin. Unregulated streamflow projections are taken from an existing dataset of climate projections derived from Coupled Model Intercomparison Project version 5 Global Climate Models. Regulated streamflow projections were modeled by the US Army Corps of Engineers and the US Bureau of Reclamation by using these unregulated flows as input to hydro-regulation models that simulate operations based on current and historical water demands. Regulation dampens shifts in winter and summer streamflow volumes. Regulation generally attenuates changes in cool-season high flow extremes but amplifies shifts in warm-season and annual high flow extremes at historically snow-dominant headwater reservoirs. Regulation reduces dry-season low flow changes in headwater tributaries where regulation is large but elsewhere has little effect on changes in low flows. Results highlight the importance of accounting for water management in climate sensitivity analysis particularly in snow-dominant basins.

Bart Nijssen

and 2 more

The hydrology community is engaged in an intense debate regarding the merits of machine learning (ML) models over traditional models. These traditional models include both conceptual and process-based hydrological models (PBHMs). Many in the hydrologic community remain skeptical about the use of ML models, because they consider these models “black-box” constructs that do not allow for a direct mapping between model internals and hydrologic states. In addition, they argue that it is unclear how to encode a priori hydrological expertise into ML models. Yet at the same time, ML models now routinely outperform traditional hydrological models for tasks such as streamflow simulation and short-range forecasting. Not only that, they are demonstrably better at generalizing runoff behavior across sites and therefore better at making predictions in ungauged basins, a long-standing problem in hydrology. In recent model experiments, we have shown that ML turbulent heat flux parameterizations embedded in a PBHM outperform the process-based parameterization in that PBHM. In this case, the PBHM enforced energy and mass constraints, while the ML parameterization calculated the heat fluxes. While this approach provides an interesting proof-of-concept and perhaps acts as a bridge between traditional models and ML models, we argue that it is time to take a bigger leap than incrementally improving the existing generation of models. We need to construct a new generation of hydrologic and land surface models (LSMs) that takes advantage of ML technologies in which we directly encode the physical concepts and constraints that we know are important, while being able to flexibly ingest a wide variety of data sources directly. To be employed as LSMs in coupled earth system models, they will need to conserve mass and energy. These new models will take time to develop, but the time to start is now, since the basic building blocks exist and we know how to get started. If nothing else, it will advance the debate and undoubtedly lead to better understanding within the hydrology and land surface communities regarding the merits and demerits of the competing approaches. In this presentation, we will discuss some of these early studies, illustrate how ML models can offer hydrologic insight, and argue the case for the development of ML-based LSMs.

Bart Nijssen

and 4 more

In 2020, renewables became the second-largest source of electricity generation in the United States after natural gas (US EIA, 2021). In recent years, wind energy generation has overtaken hydropower as the dominant source of renewable generation in the United States, but hydropower continues to offer advantages, in particular large-scale storage, that makes it particularly valuable as a complement to other weather-driven renewables. This storage, in the form of reservoirs, is rarely managed exclusively to optimize hydropower generation. Instead, reservoirs are operated for flood control, ecosystem services, irrigation, water supply, navigation, and recreation as well as hydropower. Managing these competing demands in a changing climate with existing infrastructure creates difficult challenges, because all these demands are themselves subject to change as is the electricity demand itself. Yet many climate change impact studies continue to treat rivers as entirely natural systems and water resources infrastructure is ignored or treated as an afterthought. In this presentation, we will discuss recent climate change impact studies in both the northwestern and southeastern United States in which we quantified the effects of regulation on discharge and other variables. We will make the case that to develop new strategies for mitigating and adapting to climate change, it is paramount to account for humans as active agents in the hydrologic cycle. The first study focuses on the Columbia River Basin in the Pacific Northwest, the main hydropower producing region in the United States, and examines the effect of accounting for regulation on changes in high and low flow extremes. The second study focuses on the southeastern United States and evaluates the effects of regulation on estimated changes in flow, stream temperature, and habitat suitability. US EIA, 2021: Monthly Energy Review, July 2021. www.eia.gov/mer [Last accessed on 8/3/2021].

Andrew Bennett

and 2 more

While machine learning (ML) techniques have proven to have exceptional performance in prediction of variables that have long and varied observational records, it is not clear how to use such techniques to learn about intermediate processes which may not be readily observable. We build on previous work that found that encoding either known, or approximated, physical relationships into the machine learning framework can allow the learned model to implicitly represent processes that are not directly observed, but can be related to an observable quantity. Zhao et al. (2019) found that encoding a Penman-Monteith-like equation of latent heat in an artificial neural network could reliably predict the latent heat while providing an estimate of the resistance term, which is not readily observable at the landscape scale. Specifically, we advance this framework in two ways. First, we expand the physics-based layer to include the partitioning of both the latent and sensible heat fluxes among the vegetation and soil domains, each with their own resistance terms. Second, we couple a land-surface model (LSM), which provides information from simulated processes to the ML model. We thus effectively provide the ML model with both physics-informed inputs as well as maintain constraints such as mass and energy balance on outputs of the coupled ML-LSM simulations. Previously we found that coupling a LSM to the ML model could provide good predictions of bulk turbulent heat fluxes, and in this work we explore how incorporating the additional physics-based partitioning allows the model to learn more ecohydrologically-relevant dynamics in diverse biomes. Further, we explore what the model learned in predicting the unobserved resistance terms and what we can learn from the model itself. Zhao, W. L., Gentine, P., Reichstein, M., Zhang, Y., Zhou, S., Wen, Y., et al. (2019). Physics-Constrained Machine Learning of Evapotranspiration. Geophysical Research Letters, 46(24), 14496–14507. https://doi.org/10.1029/2019GL085291

Andrew Bennett

and 1 more

Machine learning techniques have proven useful at predicting many variables of hydrologic interest, and often out-perform traditional models for univariate predictions. However, demonstration of multivariate output deep learning models has not had the same success as the univariate case in the hydrologic sciences. Multivariate prediction is a clear area where machine learning still lags behind traditional processed based modeling efforts. Reasons for this include the lack of coincident data from multiple variables, which make it difficult to train multivariate deep-learning models, as well as the need to capture inter-variable covariances and satisfy physical constraints. For these reasons process-based hydrologic models are still used to simulate and make predictions for entire hydrologic systems. Therefore, we anticipate that future state of the art hydrologic models will couple machine learning with process based representations in a way that satisfies physical constraints and allows for a blending of theoretical and data driven approaches as they are most appropriate. In this presentation we will demonstrate that it is possible to train deep learning models to represent individual processes, forming an effective process-parameterization, that can be directly coupled with a physically based hydrologic model. We will develop a deep-learning representation of latent heat and couple it to a mass and energy balance conserving hydrologic model. We will demonstrate its performance characteristics compared to traditional methods of predicting latent heat. We will also compare how incorporation of this deep learning representation affects other major states and fluxes internal to the hydrologic model.

Yifan Cheng

and 4 more

River temperature is projected to increase in the southeastern United States (SEUS) due to climate change, exacerbating the invasion of warm-water species and reducing suitable habitats for cold- and cool-water species. However, the response of river thermal regimes to climate change is also influenced by human activities, especially dam construction and operation. Large dams impound deep reservoirs, expand water surface area and prolong water residence time, modifying the interaction of surface meteorology with river systems. During warm seasons, surface energy fluxes can only heat the top layer (epilimnion) in deep reservoirs with bottom layer (hypolimnion) remaining cold. This vertical temperature gradient is called thermal stratification. Cold hypolimnetic releases from stratified reservoirs changes downstream thermal regimes that can expel indigenous warm-water species yet provide an ideal habitat for introduced cold-water species. For example, multiple species of trout (Family: Salmonidae) have been introduced to tailwaters downstream of multiple dams operated by the Tennessee Valley Authority, which has become a popular and lucrative recreational fishing location in the SEUS. Previous research has shown that reservoir thermal stratification will be retained under climate change, but stronger surface energy fluxes warm downstream river temperature, suggesting there will be a future decline in cold-water species habitat and a corresponding increase in local warm-water species habitat. In this study, we used a physically-based modeling method to simulate river temperatures, explicitly considering the impact of thermal stratification. The SEUS has a highly regulated river system and diverse freshwater fish species. We mapped the suitable habitats for selected cold-water and warm-water fish species by comparing the simulated river temperature against their physiological constraints. Model experiments were designed to quantify the impacts of dam operation by simulating river temperature for both regulated and unregulated scenarios. Potential ecological consequences under climate change were analyzed through projected changes in river thermal regimes, e.g., shrinking habitats for cold-water species and restoring local warm-water species.

Andrew Bennett

and 2 more

The hydrologic cycle is a complex and dynamic system of interacting processes. Hydrologists seeking to understand and predict these systems develop models of varying complexity, and compare their output to observations to evaluate their performance or diagnose shortcomings within the models. Often, these analyses take into account only single variables or isolated aspects of the hydrologic system. To explore how process interactions affect model performance we have developed a general framework based on information theory and conditional probabilities. We compare how conditional mutual information and mean square errors are related in a variety of hydrometeorological conditions. By exploring different regions of phase space we can quantify model strengths and weaknesses in terms of both process accuracy as well as classical performance. By considering a range of conditions we can evaluate and compare models outside of their average behavior. We apply this analysis to physically-based models (based on SUMMA), statistical models, and observations from FluxNet towers at a number of hydro-climatically diverse sites. By focusing on how the turbulent heat fluxes are affected by shortwave radiation, air temperature, and relative humidity we go beyond simple error metrics and are able to reason about model behavior in a physically motivated way. We find that the statistically based models, while showing better performance in the mean field, often do not represent the underlying physics as well as the physically based models. The statistically based model’s over-reliance on shortwave radiation inputs limits their ability to reproduce more complex phenomena.

Yifan Cheng

and 3 more

Over 270 major dams have been constructed in the Southeastern United States (SEUS) during the past century, changing natural flow patterns and affecting stream temperatures. Projected increases in air temperature combined with changes in precipitation may result in water scarcity and affect maximum stream temperatures during the summer for some regions in the SEUS. Currently existing reservoirs mitigate water shortages during drought by releasing more water but reducing residence time, the ratio of reservoir volume to inflow. Regulating stream temperature in the summer can be done by either increasing residence time or releasing more water. In this study, we investigate the extent to which the current reservoir infrastructure can be used to mitigate the impacts of climate change under current reservoir regulations as well as the range of operating rules that could minimize climate change impacts on both streamflow and river temperature. We use the Variable Infiltration Capacity (VIC) hydrological model to simulate runoff, which is then used as input to a large-scale river routing-reservoir model (MOSART-WM) to simulate reservoir operations and produce regulated streamflow. VIC and MOSART-WM outputs are then used as input to a stream temperature model that accounts for thermal stratification in reservoirs (RBM-res). Climate change projections are based on two representative concentration pathways (RCPs) and multiple global climate models from the Coupled Model Intercomparison Project Phase 5 (CMIP5). We compare modeled changes with those from a model implementation that does not include any reservoirs and which therefore lacks any flow regulation (VIC->MOSART-RBM) to evaluate the resilience of current reservoir infrastructures. We also evaluate different reservoir operating rules (residence time versus low flow mitigation) to investigate the extent to which the current reservoir system can be used to mitigate the impacts of climate changes on both streamflow and stream temperature.