<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">HESS</journal-id><journal-title-group>
    <journal-title>Hydrology and Earth System Sciences</journal-title>
    <abbrev-journal-title abbrev-type="publisher">HESS</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Hydrol. Earth Syst. Sci.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1607-7938</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/hess-29-6257-2025</article-id><title-group><article-title>Fully differentiable, fully distributed rainfall-runoff modeling</article-title><alt-title>Fully differentiable, fully distributed rainfall-runoff modeling</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Scholz</surname><given-names>Fedor</given-names></name>
          <email>fedor.scholz@uni-tuebingen.de</email>
        <ext-link>https://orcid.org/0000-0001-8201-5924</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Traub</surname><given-names>Manuel</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Zarfl</surname><given-names>Christiane</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-2044-1335</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Scholten</surname><given-names>Thomas</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-4875-2602</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Butz</surname><given-names>Martin V.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-8120-8537</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>Neuro-Cognitive Modeling Group, University of Tübingen, Tübingen, Germany</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Environmental Systems Analysis, University of Tübingen, Tübingen, Germany</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Soil Science and Geomorphology, University of Tübingen, Tübingen, Germany</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Fedor Scholz (fedor.scholz@uni-tuebingen.de)</corresp></author-notes><pub-date><day>13</day><month>November</month><year>2025</year></pub-date>
      
      <volume>29</volume>
      <issue>21</issue>
      <fpage>6257</fpage><lpage>6283</lpage>
      <history>
        <date date-type="received"><day>23</day><month>December</month><year>2024</year></date>
           <date date-type="rev-request"><day>7</day><month>March</month><year>2025</year></date>
           <date date-type="rev-recd"><day>23</day><month>September</month><year>2025</year></date>
           <date date-type="accepted"><day>28</day><month>September</month><year>2025</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2025 Fedor Scholz et al.</copyright-statement>
        <copyright-year>2025</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025.html">This article is available from https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025.html</self-uri><self-uri xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025.pdf">The full text article is available as a PDF file from https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e130">Traditional hydrological modeling simulates rainfall-runoff process dynamics using process-based models (PBMs). PBMs are grounded in physical laws and therefore highly interpretable. As environmental systems are highly complex, though, subprocesses are sometimes hard or even impossible to identify and quantify. Data-driven approaches, like artificial neural networks (ANNs), offer an alternative. Such approaches can automatically discover hidden relationships within the data. As a result, superior model performance may be achieved. However, the uncovered relationships are hard to analyze within black-box ANNs and often fail to respect physical laws. Differentiable modeling calls for knowledge discovery by combining both approaches, benefiting from their respective advantages. In this work, we present a physically inspired, fully differentiable, and fully distributed model, which we term DRRAiNN (Distributed Rainfall-Runoff ArtIficial Neural Network). DRRAiNN is a neural network model that estimates river discharge at gauging stations based on meteorological forcings and elevation. Focusing on the Neckar river catchment in Southwest Germany, DRRAiNN is trained to predict daily water discharge measurements using data from 17 stations and from ten meteorological years only. DRRAiNN's performance is compared to the performance of the European Flood Awareness System (EFAS) reanalysis. Some instances of our model outperform EFAS at lead times of over <inline-formula><mml:math id="M1" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d in terms of the applied metrics for model performance. As DRRAiNN is fully differentiable and fully distributed, efficient source allocation algorithms can be used to identify the precipitation sources responsible for the water discharge dynamics at specific gauging stations. 
Besides DRRAiNN's potential to forecast upcoming water discharge dynamics, its full differentiability could be utilized to infer erosion sites from turbidity data, particularly when integrated with an appropriate erosion model.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e149">Accurate water flow forecasting plays a critical role in mitigating short-term flood impacts, such as preventing loss of life and reducing economic damage <xref ref-type="bibr" rid="bib1.bibx48" id="paren.1"/>. For example, simulating river discharge is a prerequisite for flood inundation modeling <xref ref-type="bibr" rid="bib1.bibx24" id="paren.2"/> and enables informed decision-making in water management such as dam operations <xref ref-type="bibr" rid="bib1.bibx66" id="paren.3"/>. Accuracy is not everything though. Hydrological models that respect physical laws are more likely to generalize well to new situations and to offer insights into the underlying processes that govern water movement. A solid understanding of the dynamics of water systems is necessary to estimate the impacts of environmental planning and to improve infrastructure design <xref ref-type="bibr" rid="bib1.bibx46 bib1.bibx3" id="paren.4"/>. It also enables a better assessment of how climate change may alter existing ecosystems in the future <xref ref-type="bibr" rid="bib1.bibx46 bib1.bibx67 bib1.bibx1" id="paren.5"/>. Additionally, models that respect physical laws can be used to infer the origins of observed discharge, thereby further facilitating the development of policies that mitigate the damages caused by floods. From a practical perspective, a good model should allow efficient calibration and perform well even if data are sparse, which is often the case for river discharge. Traditionally, these challenges have been addressed using physically based approaches that explicitly encode domain knowledge. These process-based models (PBMs) describe physical processes with mathematical equations derived from physical laws and observations <xref ref-type="bibr" rid="bib1.bibx6" id="paren.6"/>.</p>
      <p id="d2e171">Environmental hydrological processes are highly complex, involving numerous interacting variables that make the overall process highly heterogeneous <xref ref-type="bibr" rid="bib1.bibx37" id="paren.7"/>. Recent advances, such as the Multiscale Parameter Regionalization framework <xref ref-type="bibr" rid="bib1.bibx52" id="paren.8"/> and scalable transfer function approaches <xref ref-type="bibr" rid="bib1.bibx25" id="paren.9"/> have focused on improving parameterization and capturing spatial heterogeneity in PBMs to alleviate these issues. To reduce uncertainty and initialize PBMs adequately, data assimilation incorporates concrete observations into running models <xref ref-type="bibr" rid="bib1.bibx34 bib1.bibx9 bib1.bibx38" id="paren.10"/>. Such advancements in data assimilation can improve performance in both lumped <xref ref-type="bibr" rid="bib1.bibx39 bib1.bibx33 bib1.bibx34" id="paren.11"/> and distributed models <xref ref-type="bibr" rid="bib1.bibx51" id="paren.12"/>. However, significant challenges remain, as the involved processes and their interactions are in most cases only partially understood <xref ref-type="bibr" rid="bib1.bibx23" id="paren.13"/>, leading to high uncertainty and biases. Even if a process is known well in detail, certain input variables may simply be unobservable, such as underground topography. Additionally, environmental processes often occur at scales that differ substantially from those observed under laboratory conditions <xref ref-type="bibr" rid="bib1.bibx23 bib1.bibx55 bib1.bibx43" id="paren.14"/>.</p>
      <p id="d2e199">Complementary to PBMs, data-driven models have gained traction in recent years, driven by the increasing amount of available hydrological data <xref ref-type="bibr" rid="bib1.bibx59" id="paren.15"/>. Artificial neural networks (ANNs) are data-driven models that automatically learn relationships from large datasets. Given the superior performance of early data-driven approaches in hydrology, it is likely that the full potential of data-driven approaches remains untapped <xref ref-type="bibr" rid="bib1.bibx55 bib1.bibx43" id="paren.16"/>. However, despite achieving strong predictive performance, ANNs often fail to respect physical laws due to their purely data-driven nature. This calls for measures such as hybrid or physics-informed models that bias data-driven approaches toward physical plausibility. Furthermore, it is often criticized that developers of machine learning (ML) models do not put enough effort into the interpretation of their developed systems, failing to gain a better understanding of the system's internal dynamics <xref ref-type="bibr" rid="bib1.bibx41" id="paren.17"/>.</p>
      <p id="d2e211">One promising avenue to overcome these limitations involves leveraging ML to infer latent variables that are otherwise inaccessible to direct measurement. To give an example, a considerable portion of total discharge originates from subsurface flow. It is not yet possible to directly measure subsurface flow, making underground topography a latent driver of hydrological behavior <xref ref-type="bibr" rid="bib1.bibx55" id="paren.18"/>. We believe that these latent variables may contribute to poor model generalization across basins. ML, and especially ANNs, can support hydrological modeling in such cases, because they allow latent variables to be inferred retrospectively from observed dynamics <xref ref-type="bibr" rid="bib1.bibx7 bib1.bibx45" id="paren.19"/>. This motivates a key question we address in this paper: Given observed dynamics, in which areas did precipitation contribute to the measured discharge?</p>
      <p id="d2e221">Similar to subsurface flow, spatially distributed evapotranspiration patterns are challenging to measure directly. Model inversions of NNs <xref ref-type="bibr" rid="bib1.bibx59" id="paren.20"/> may therefore help to extend our understanding of the water cycle with ML. For a broader overview of ML applications in hydrology, we refer the reader to <xref ref-type="bibr" rid="bib1.bibx55" id="text.21"/> and <xref ref-type="bibr" rid="bib1.bibx59" id="text.22"/>.</p>
      <p id="d2e233">A combination of PBMs and ML-based approaches could leverage the advantages of both worlds. When combined with the goal of knowledge discovery, this approach is referred to as “differentiable modeling” <xref ref-type="bibr" rid="bib1.bibx56" id="paren.23"/>. It could result in well-performing interpretable models that automatically find new relationships in the data, respect physical laws, generalize well across different settings, and require comparatively little data. From the ML perspective, known relationships can be incorporated into differentiable models as constraints or inductive biases. Inductive biases encode prior assumptions about the data-generating process, effectively constraining the model's solution space. By doing so, they can improve performance, enhance generalization, and make learning more efficient. Furthermore, they guide the model towards discovering interpretable structures in the data, aligning its behavior with established principles <xref ref-type="bibr" rid="bib1.bibx8" id="paren.24"/>. A crucial challenge for the modeler is to find and incorporate those biases that restrict the solution space as much as possible without introducing incorrect or unjustified assumptions and without restricting the self-organizing power of NNs.</p>
      <p id="d2e242">In their seminal work, <xref ref-type="bibr" rid="bib1.bibx29" id="text.25"/> successfully used a long short-term memory (LSTM) network <xref ref-type="bibr" rid="bib1.bibx20" id="paren.26"/> for rainfall-runoff modeling at the basin scale <xref ref-type="bibr" rid="bib1.bibx29" id="paren.27"/>, demonstrating that purely data-driven models can outperform traditional methods. Since then, numerous studies have emerged, applying largely the same model to various datasets <xref ref-type="bibr" rid="bib1.bibx59" id="paren.28"/>. Notably, significant advancements to the model have also been made, including the incorporation of physical constraints <xref ref-type="bibr" rid="bib1.bibx30 bib1.bibx21" id="paren.29"/>, uncertainty estimation <xref ref-type="bibr" rid="bib1.bibx28" id="paren.30"/>, and the extension of modeling to multiple timescales <xref ref-type="bibr" rid="bib1.bibx14" id="paren.31"/>. Hybrid models such as neural ODEs, where differential equations of conceptual hydrological models are replaced by neural networks, have also been applied in this setting <xref ref-type="bibr" rid="bib1.bibx22" id="paren.32"/>. All of the aforementioned models are lumped, meaning that inputs are spatially aggregated over each catchment. These catchments are typically delineated using digital elevation models.</p>
      <p id="d2e270">Semi-distributed models partially leverage river network topology, providing a compromise between lumped and fully distributed representations. These include purely data-driven graph-based models <xref ref-type="bibr" rid="bib1.bibx70 bib1.bibx40 bib1.bibx60 bib1.bibx31 bib1.bibx61 bib1.bibx10" id="paren.33"/>, as well as hybrid approaches that integrate domain knowledge – for example, by using a differentiable Muskingum-Cunge routing model <xref ref-type="bibr" rid="bib1.bibx4 bib1.bibx74" id="paren.34"/>. These models typically divide the overall catchment into multiple subbasins connected via the river network, enabling limited spatial interaction. Within each subbasin, however, forcings are still spatially aggregated, similar to lumped models.</p>
      <p id="d2e279">In contrast, fully distributed models directly operate on a spatial grid. While there is a call for more fully distributed data-driven models for rainfall-runoff modeling <xref ref-type="bibr" rid="bib1.bibx43" id="paren.35"/>, most existing approaches remain limited in critical ways. Some hybrid models operate on a grid but restrict cell-to-cell communication to the direction of steepest descent <xref ref-type="bibr" rid="bib1.bibx71 bib1.bibx68" id="paren.36"/>. This strong assumption effectively transforms the grid into a directed graph, excluding physically plausible underground flows in other directions. CNN-LSTMs process gridded input data without explicit assumptions about flow directions <xref ref-type="bibr" rid="bib1.bibx64 bib1.bibx49 bib1.bibx32" id="paren.37"/>. However, these models separate spatial and temporal processing by flattening the convolutional neural network (CNN) outputs before passing them to an LSTM. As a result, spatial dependencies are not maintained across time steps. This limitation is addressed in <xref ref-type="bibr" rid="bib1.bibx44" id="text.38"/>, where a ConvLSTM <xref ref-type="bibr" rid="bib1.bibx57" id="paren.39"/> is used to jointly model space and time. Yet, before the final discharge prediction, the outputs of all grid cells are flattened into a single feature vector and passed through a fully connected layer. Similar global aggregation strategies can be found elsewhere <xref ref-type="bibr" rid="bib1.bibx75 bib1.bibx63 bib1.bibx49 bib1.bibx72 bib1.bibx5" id="paren.40"/>. Moving a step closer to physical plausibility, <xref ref-type="bibr" rid="bib1.bibx36" id="text.41"/> combined a ConvLSTM with ridge regression to learn which grid cells should contribute to discharge estimation at each station. This allowed the reconstruction of plausible underground flow paths between subbasins.
Since all of these distributed models aggregate the outputs of the spatial component globally over space, whether weighted or not, they lack the incentive to propagate water across the landscape in a physically plausible way.</p>
      <p id="d2e304">Our work builds on differentiable modeling to combine both process-based and data-based modeling, and to address the challenges of physical plausibility, interpretability, and latent variable inference. We present DRRAiNN (Distributed Rainfall-Runoff ArtIficial Neural Network), a physics-inspired, fully differentiable, fully distributed rainfall-runoff model. Our spatio-temporal ANN architecture estimates river discharge at gauging stations from gridded precipitation, solar radiation, elevation, and past discharge. DRRAiNN is fully distributed in the sense that it internally operates on a grid. However, its outputs are point-wise river discharge measurements at given gauging station locations. Its full differentiability allows gradients to flow seamlessly through the entire system, enabling end-to-end optimization of all its components with sparse discharge measurements being the only target variable. To avoid overfitting, and to improve interpretability and generalization, we incorporated several physics-inspired inductive biases into DRRAiNN. These include the modularization into a spatially fully distributed rainfall-runoff model and the utilization of a graph-based river discharge model. Additional architectural choices precondition DRRAiNN to encode distinct processes, such as lateral propagation of water across the landscape and local evapotranspiration. As a result, DRRAiNN turns into a gray-box deep learning model. Its model design encourages the development of sub-modules, which model surface and sub-surface water flow, water inflow into a river network, and water flow and discharge across the river network.</p>
      <p id="d2e308">Thanks to DRRAiNN's fully distributed and fully differentiable architecture, it is possible to answer spatially explicit questions, such as: Where is the true catchment area, including contributions from underground flows? In other words, DRRAiNN enables source allocations using gradient-based attribution methods like saliency maps <xref ref-type="bibr" rid="bib1.bibx58" id="paren.42"/>. These techniques can help to examine and understand internal model dynamics, enabling knowledge discovery.</p>
      <p id="d2e314">We demonstrate our approach on the Neckar River basin and compare performance against the operational European Flood Awareness System (EFAS).</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Methods</title>
      <p id="d2e325">We present DRRAiNN, a spatio-temporal ANN architecture that estimates river discharge from static attributes and meteorological forcings in a distributed manner. This section presents the methodology underlying our approach. We begin with a detailed description of our neural network architecture and discuss key design choices that inform its structure. We then describe the input and output data specifications, which are closely tied to the architectural design. Following this, we introduce our study area and experimental setup, present the benchmark model used for comparison, and conclude with the evaluation metrics employed to assess model performance.</p>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Model</title>
      <p id="d2e335">DRRAiNN's structure is grounded in the following data and structural information sources. The locations <inline-formula><mml:math id="M2" display="inline"><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> for estimations of discharge in the river network are determined by discharge gauging stations that provide observed discharge <inline-formula><mml:math id="M3" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> for time <inline-formula><mml:math id="M4" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> in <inline-formula><mml:math id="M5" display="inline"><mml:mrow><mml:mn mathvariant="normal">24</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> periods. The connectivity of stations, determined by the river network, is encoded in an adjacency matrix <inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>. 
Static maps <inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and meteorological forcings <inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> for hourly time points <inline-formula><mml:math id="M9" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> are encoded on a grid that spans a domain larger than the elevation-delineated catchment area to capture effective catchment contributions beyond topographic boundaries. Given static maps <inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mo>:</mml:mo><mml:mo>,</mml:mo><mml:mo>:</mml:mo></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, meteorological forcings <inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mo>:</mml:mo><mml:mo>,</mml:mo><mml:mo>:</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>:</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>s</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mi>T</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> over the whole duration <inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mi mathvariant="normal">…</mml:mi><mml:msub><mml:mi>t</mml:mi><mml:mi>s</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mi>T</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> in hours, and past discharge <inline-formula><mml:math id="M13" 
display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>:</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>s</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> over the tune-in period <inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mi mathvariant="normal">…</mml:mi><mml:msub><mml:mi>t</mml:mi><mml:mi>s</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> in days, DRRAiNN estimates discharge <inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>s</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>s</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mi>T</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> over a temporal future horizon of <inline-formula><mml:math id="M16" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> days via a function <inline-formula><mml:math id="M17" display="inline"><mml:mi>f</mml:mi></mml:math></inline-formula>, representing the learned spatio-temporal mapping implemented by the model:

            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M18" display="block"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">̃</mml:mo></mml:mover><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>s</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>s</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mi>T</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>f</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mo>:</mml:mo><mml:mo>,</mml:mo><mml:mo>:</mml:mo></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mo>:</mml:mo><mml:mo>,</mml:mo><mml:mo>:</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>:</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>s</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mi>T</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>:</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mi>s</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>

          Since surface and subsurface flow differ from river flow dynamics as described above, we model these subprocesses separately. Therefore, DRRAiNN consists of two components, the rainfall-runoff model and the discharge model. The rainfall-runoff model operates recurrently on a grid, rendering it fully distributed. It is supposed to model surface and subsurface flow, and evapotranspiration. The discharge model operates recurrently on a graph to model river flow inside of channels and output estimated discharge <inline-formula><mml:math id="M19" display="inline"><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">̃</mml:mo></mml:mover></mml:math></inline-formula> at the station locations. While DRRAiNN is fully distributed in its internal computation over a spatial grid, its outputs are only available at selected gauging stations.</p>
      <p id="d2e753">At each time step, DRRAiNN processes the sequence in an auto-regressive loop by first invoking the rainfall-runoff model, followed by the discharge model. The rainfall-runoff model receives gridded static maps <inline-formula><mml:math id="M20" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> and meteorological forcings <inline-formula><mml:math id="M21" display="inline"><mml:mi>F</mml:mi></mml:math></inline-formula> as input to model the catchment on a grid. It is primed to distinguish between two important subprocesses, namely surface and subsurface flow, which is mainly driven by topography, and evapotranspiration, which is mainly driven by solar radiation. It produces a latent representation, which we term runoff embedding, extracted at station locations and used as input to the discharge model. Despite being the main driver of discharge, it cannot be directly interpreted as runoff due to its self-organizing nature. The discharge model additionally receives an adjacency matrix <inline-formula><mml:math id="M22" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> that describes the connectivity between stations, static river segment features, and the (potentially estimated) discharge <inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mo>:</mml:mo><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> from the previous time step. It then estimates discharge <inline-formula><mml:math id="M24" display="inline"><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover></mml:math></inline-formula> for each station, from which the training loss is computed.</p>
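      <p>The control flow described above can be illustrated with a minimal sketch. It is not the DRRAiNN implementation: the function <monospace>rollout</monospace> and the two stand-in components <monospace>toy_runoff</monospace> and <monospace>toy_discharge</monospace> are hypothetical names, a single scalar per time step stands in for the gridded forcings, both components are stateless, and one shared time step is assumed for forcings and discharge. The sketch only shows the order of the two calls and the switch from observed to estimated discharge after the tune-in period.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from typing import Callable, List, Sequence

Vector = List[float]  # one value per gauging station


def rollout(
    runoff_step: Callable[[float], Vector],
    discharge_step: Callable[[Vector, Vector], Vector],
    forcings: Sequence[float],
    observed_q: Sequence[Vector],
) -> List[Vector]:
    """Tune in on observed discharge, then forecast auto-regressively.

    forcings covers steps 0 .. n_obs + T - 1, observed_q covers steps
    0 .. n_obs - 1. Returns the T estimates after the last observation.
    """
    n_obs = len(observed_q)
    estimates: List[Vector] = []
    q_prev = observed_q[0]
    for t in range(1, len(forcings)):
        embedding = runoff_step(forcings[t])       # rainfall-runoff model first
        q_est = discharge_step(embedding, q_prev)  # then the discharge model
        if t < n_obs:
            q_prev = observed_q[t]   # tune-in: feed the observation
        else:
            estimates.append(q_est)  # forecast: keep the estimate ...
            q_prev = q_est           # ... and feed it back as input
    return estimates


# Hypothetical stand-ins for the two learned components (two stations).
def toy_runoff(precip: float) -> Vector:
    return [precip, precip]


def toy_discharge(embedding: Vector, q_prev: Vector) -> Vector:
    return [0.5 * q + e for q, e in zip(q_prev, embedding)]


forecast = rollout(
    toy_runoff,
    toy_discharge,
    forcings=[0.0, 1.0, 2.0, 3.0, 4.0],
    observed_q=[[10.0, 20.0], [12.0, 18.0], [8.0, 16.0]],
)
print(forecast)
```
]]></preformat>
      <p>With three observed steps and five forcing steps, the sketch returns two forecast steps. In a trained model, both components would additionally carry recurrent state from one step to the next.</p>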
      <p id="d2e807">We implement DRRAiNN in PyTorch <xref ref-type="bibr" rid="bib1.bibx47" id="paren.43"/>. In the following, we provide a more detailed description of DRRAiNN's components. See Fig. <xref ref-type="fig" rid="F1"/> for a depiction of the overall model.</p>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e818">Schematic overview of the DRRAiNN architecture. The gridded rainfall-runoff model has two main tasks: to model the redistribution of precipitation across the landscape, and to model evapotranspiration based on solar radiation. It receives precipitation as its main input to a point-wise LSTM, whose hidden states, but not cell states, are updated using a ConvNeXtBlock. The ConvNeXtBlock weights are not fixed but dynamically generated by hypernetworks (indicated by red arrows). The depth-wise convolution (DWConv), responsible for lateral water propagation, receives its weights from a CNN that takes elevation as input and shares the same receptive field as the DWConv. The point-wise convolutions (PWConv1 and PWConv2), used to model local evapotranspiration processes, receive their weights from an MLP that takes solar radiation as input. The LSTM hidden state is further processed by a linear layer before being passed to the discharge model. This graph-based discharge model aggregates information at the gauging stations, incorporating the last (possibly inferred) discharge values, elevation differences between stations, and river segment lengths. The arrows between Segment- and StationGRU indicate the temporal information exchange between these kernels: station outputs provide lateral inputs to segment kernels in the next time step, creating a recurrent communication pattern. The output of the discharge model is the estimated discharge at each station.    </p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f01.png"/>

        </fig>

<sec id="Ch1.S2.SS1.SSS1">
  <label>2.1.1</label><title>Rainfall-Runoff Model</title>
      <p id="d2e834">The rainfall-runoff model consists of a position-wise LSTM and a CNN that are called in each time step. This renders the rainfall-runoff model local in space and time. Only spatially local and temporally previous information is used to update internal states.</p>
      <p id="d2e837">The position-wise LSTM (PWLSTM) is responsible for modeling the temporal relationships in the data and therefore maintains a hidden and a cell state for each grid cell. The gating mechanism regulates when and how the cell state is updated, allowing the model to retain information over extended time periods. This can be particularly useful for implicitly modeling slowly evolving hydrological states such as soil moisture or groundwater levels, which change more gradually than overland flow. The LSTM receives precipitation as input to update its hidden and cell states. It has a hidden size of <inline-formula><mml:math id="M25" display="inline"><mml:mn mathvariant="normal">4</mml:mn></mml:math></inline-formula> (see Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/> for hidden sizes <inline-formula><mml:math id="M26" display="inline"><mml:mn mathvariant="normal">2</mml:mn></mml:math></inline-formula> and <inline-formula><mml:math id="M27" display="inline"><mml:mn mathvariant="normal">6</mml:mn></mml:math></inline-formula>). Importantly, the weights of the LSTM are shared throughout the gridded area. As a result, while the LSTM at each grid cell maintains individual hidden and cell state values, the temporal processing principle is identical everywhere. The underlying assumption is that the same physics unfolds everywhere, although it may be parameterized locally.</p>
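      <p>The weight sharing described above can be illustrated with a minimal sketch in plain Python. It is not the authors' PyTorch implementation: the gate ordering, the random toy weights, the scalar precipitation input, and the names <monospace>lstm_cell</monospace> and <monospace>pwlstm_step</monospace> are assumptions made for illustration. Only the hidden size of 4 is taken from the text.</p>
      <preformat preformat-type="code">
```python
import math
import random


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def lstm_cell(x, h, c, W, U, b):
    """One LSTM step for a single grid cell.

    x: scalar input (e.g. standardized precipitation)
    h, c: hidden and cell state, lists of length H
    W: input weights (length 4*H), U: recurrent weights (4*H rows of length H)
    b: biases (length 4*H); assumed gate order: input, forget, candidate, output
    """
    H = len(h)
    pre = [W[k] * x + sum(U[k][j] * h[j] for j in range(H)) + b[k]
           for k in range(4 * H)]
    i = [sigmoid(v) for v in pre[0:H]]
    f = [sigmoid(v) for v in pre[H:2 * H]]
    g = [math.tanh(v) for v in pre[2 * H:3 * H]]
    o = [sigmoid(v) for v in pre[3 * H:4 * H]]
    c_new = [f[k] * c[k] + i[k] * g[k] for k in range(H)]
    h_new = [o[k] * math.tanh(c_new[k]) for k in range(H)]
    return h_new, c_new


def pwlstm_step(precip, hidden, cell, params):
    """Apply the same cell (shared params) independently at every grid cell."""
    for r in range(len(precip)):
        for q in range(len(precip[0])):
            hidden[r][q], cell[r][q] = lstm_cell(
                precip[r][q], hidden[r][q], cell[r][q], *params)
    return hidden, cell


H = 4  # hidden size stated in the text
rng = random.Random(0)
params = (
    [rng.uniform(-0.5, 0.5) for _ in range(4 * H)],                      # W (toy values)
    [[rng.uniform(-0.5, 0.5) for _ in range(H)] for _ in range(4 * H)],  # U (toy values)
    [0.0] * (4 * H),                                                     # b
)
precip = [[0.0, 1.0], [1.0, 0.0]]  # toy 2 x 2 grid
hidden = [[[0.0] * H for _ in range(2)] for _ in range(2)]
cell = [[[0.0] * H for _ in range(2)] for _ in range(2)]
hidden, cell = pwlstm_step(precip, hidden, cell, params)
```
</preformat>
      <p>Because the parameters are shared, grid cells that receive the same input from the same initial state end up with identical states, while cells with different inputs diverge.</p>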
      <p id="d2e863">The CNN models spatial relationships such as the propagation of water flow across the landscape and evapotranspiration. It receives and updates the hidden state <inline-formula><mml:math id="M28" display="inline"><mml:mi>h</mml:mi></mml:math></inline-formula> of the PWLSTM to model spatial interactions, while leaving the PWLSTM's cell states untouched to preserve temporal memory. Surface and subsurface flow are spatially extended processes, whereas evapotranspiration is primarily a local phenomenon, occurring independently at each grid cell. To reflect this distinction, we separate the CNN's treatment of these processes using different convolution types and input sources, introducing an inductive bias into the architecture.</p>
      <p id="d2e873">More precisely, the CNN is based on a modified ConvNeXt block <xref ref-type="bibr" rid="bib1.bibx35" id="paren.44"/>. A ConvNeXt block consists of three layers, namely a depth-wise convolutional layer (DWConv) with kernel size <inline-formula><mml:math id="M29" display="inline"><mml:mrow><mml:mn mathvariant="normal">7</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">7</mml:mn></mml:mrow></mml:math></inline-formula> followed by a position-wise inverted bottleneck given by two linear layers (PWConv1 and PWConv2). This way, ConvNeXt decouples spatial and channel-wise information flow. We apply the SiLU activation function after the convolutional and between the linear layers <xref ref-type="bibr" rid="bib1.bibx19" id="paren.45"/>. In contrast to its original formulation, the weights of our ConvNeXt block are not static but location-dependent. They are parameterized by other neural networks, turning this network component into a hypernetwork <xref ref-type="bibr" rid="bib1.bibx62" id="paren.46"/>. This means that the ConvNeXt block can behave differently at each location on the grid. Calling DWConv results in the following operation:

              <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M30" display="block"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow><mml:mn mathvariant="normal">3</mml:mn></mml:munderover><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow><mml:mn mathvariant="normal">3</mml:mn></mml:munderover><mml:msub><mml:mi>w</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>m</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>⋅</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>+</mml:mo><mml:mi>m</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>+</mml:mo><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

            where <inline-formula><mml:math id="M31" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> is the output, <inline-formula><mml:math id="M32" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> the input, <inline-formula><mml:math id="M33" display="inline"><mml:mi>w</mml:mi></mml:math></inline-formula> are the weights produced by the hypernetwork, <inline-formula><mml:math id="M34" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> is the considered channel, and <inline-formula><mml:math id="M35" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M36" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula> are coordinates. We can still call this operation a convolution if we regard the input variables together with the weight-generating networks as the kernel. Calling PWConv1 and PWConv2 results in the following operation:

              <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M37" display="block"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mtext>out</mml:mtext></mml:msub></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:munder><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mtext>in</mml:mtext></mml:msub></mml:mrow></mml:munder><mml:msub><mml:mi>w</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mtext>out</mml:mtext></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mtext>in</mml:mtext></mml:msub></mml:mrow></mml:msub><mml:mo>⋅</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mtext>in</mml:mtext></mml:msub></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

            where the sum runs over the input channels. Each layer of the ConvNeXt block is parameterized by a distinct hypernetwork, tailored to the type of process it represents. The weights of DWConv are produced by a CNN that has the same kernel size as DWConv itself. The weights of PWConv1 and PWConv2 are produced by position-wise multi-layer perceptrons (MLPs). By using different input variables for the different hypernetworks, we can distinguish between local and spatially extended processes. How water propagates across the landscape depends mainly on the topography, which is why we generate the weights of DWConv from elevation. Before feeding the elevation into the hypernetwork, we subtract the elevation of the center cell from the elevations of all other cells within each receptive field, because flow direction depends on elevation gradients rather than on absolute elevation. Evapotranspiration, on the other hand, is a local process and is therefore best captured by the position-wise components. This is why we generate the weights of PWConv1 and PWConv2 from solar radiation. See Fig. <xref ref-type="fig" rid="F2"/> for an illustration.</p>
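      <p>Equations (2) and (3) and the elevation centering can be written out as a plain-Python sketch. The weights that the hypernetworks would generate are replaced by hand-written toy values, and zero padding at the grid border is an assumption of this sketch, since the text does not state how borders are treated. The function names are hypothetical.</p>
      <preformat preformat-type="code">
```python
def center_elevation(elev, i, j, radius=3):
    """Elevation window around (i, j) relative to the center cell.

    Cells outside the grid are filled with 0.0 (an assumption of this sketch).
    """
    rows, cols = len(elev), len(elev[0])
    return [[(elev[i + m][j + n] - elev[i][j])
             if (i + m) in range(rows) and (j + n) in range(cols) else 0.0
             for n in range(-radius, radius + 1)]
            for m in range(-radius, radius + 1)]


def dwconv(x, w, radius=3):
    """Eq. (2): y[i][j][c] = sum over m, n of w[i][j][m][n][c] * x[i+m][j+n][c]."""
    rows, cols, chans = len(x), len(x[0]), len(x[0][0])
    y = [[[0.0] * chans for _ in range(cols)] for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            for m in range(-radius, radius + 1):
                for n in range(-radius, radius + 1):
                    a, b = i + m, j + n
                    if a in range(rows) and b in range(cols):  # zero padding
                        for c in range(chans):
                            y[i][j][c] += w[i][j][m + radius][n + radius][c] * x[a][b][c]
    return y


def pwconv(x, w):
    """Eq. (3): y[i][j][c_out] = sum over c_in of w[i][j][c_out][c_in] * x[i][j][c_in]."""
    return [[[sum(w_oc[ci] * x_ij[ci] for ci in range(len(x_ij))) for w_oc in w_ij]
             for x_ij, w_ij in zip(x_row, w_row)]
            for x_row, w_row in zip(x, w)]


# Toy example: 3 x 3 grid, one channel, 3 x 3 kernel (radius 1).
x = [[[float(3 * i + j)] for j in range(3)] for i in range(3)]  # values 0..8
identity = [[[0.0], [0.0], [0.0]], [[0.0], [1.0], [0.0]], [[0.0], [0.0], [0.0]]]
box = [[[1.0] for _ in range(3)] for _ in range(3)]
# Location-dependent weights: the center cell sums its neighborhood, all others copy.
w = [[box if (i, j) == (1, 1) else identity for j in range(3)] for i in range(3)]
y = dwconv(x, w, radius=1)

# Position-wise layer on a 1 x 1 grid, mapping 2 input channels to 3 output channels.
z = pwconv([[[1.0, 2.0]]], [[[[1.0, 1.0], [0.0, 2.0], [1.0, -1.0]]]])

elev = [[100.0, 110.0, 120.0], [105.0, 115.0, 125.0], [110.0, 120.0, 130.0]]
rel = center_elevation(elev, 1, 1, radius=1)
```
</preformat>
      <p>In this toy setting the kernel differs between locations, which a standard convolution with a single shared kernel cannot express. In DRRAiNN, the entries of <monospace>w</monospace> would instead be produced by the hypernetworks from elevation and solar radiation.</p>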

      <fig id="F2"><label>Figure 2</label><caption><p id="d2e1123">Illustration of the hypernetworks used in DRRAiNN. In both panels, the dark gray cells represent locations whose hidden states are updated based on information from the light gray cells. The weights for these updates are generated by separate neural networks that share the same receptive field but receive different types of input data. Left: A CNN takes elevation as input and produces the weights for the depth-wise convolution, which models lateral water propagation. Right: An MLP takes solar radiation as input and produces the weights for the point-wise convolution, which models localized evapotranspiration.  </p></caption>
            <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f02.png"/>

          </fig>

      <p id="d2e1132">Lastly, the runoff embeddings are extracted at the station locations, fed through a single linear layer, and passed to the discharge model. We also tried aggregating the hidden states of all cells along the corresponding upstream river segment instead, but this variant tended to overfit in preliminary experiments.</p>
</sec>
<sec id="Ch1.S2.SS1.SSS2">
  <label>2.1.2</label><title>Discharge Model</title>
      <p id="d2e1143">Our discharge model is a recurrent graph neural network called DISTANA <xref ref-type="bibr" rid="bib1.bibx27" id="paren.47"/>, with the graph structure defined by the actual river network and the stations. DISTANA maintains two types of recurrent units: station and segment kernels, both implemented as Gated Recurrent Units (GRUs, <xref ref-type="bibr" rid="bib1.bibx11" id="altparen.48"/>) with a hidden size of <inline-formula><mml:math id="M38" display="inline"><mml:mn mathvariant="normal">8</mml:mn></mml:math></inline-formula> (see Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/> for hidden sizes <inline-formula><mml:math id="M39" display="inline"><mml:mn mathvariant="normal">4</mml:mn></mml:math></inline-formula> and <inline-formula><mml:math id="M40" display="inline"><mml:mn mathvariant="normal">16</mml:mn></mml:math></inline-formula>, and a version in which the GRUs are replaced with LSTMs). Station kernels are placed at the gauging stations, while segment kernels are located on segments between stations. These kernels communicate with each other via lateral connections with <inline-formula><mml:math id="M41" display="inline"><mml:mn mathvariant="normal">4</mml:mn></mml:math></inline-formula> channels (Fig. <xref ref-type="fig" rid="F1"/>). In each time step, the segment kernels are updated first, followed by the station kernels, which then estimate the discharge <inline-formula><mml:math id="M42" display="inline"><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">̃</mml:mo></mml:mover></mml:math></inline-formula> at their respective locations. The segment kernels first concatenate the previous output of the upstream station kernels with static river segment attributes – specifically the altitude difference and segment length. After applying the GRU, the output is multiplied by the adjacency matrix, which is derived from the river network topology and station positions. 
The segment kernels thereby sum up information from upstream station kernels. The output of the segment kernels serves as input for the station kernels. The station kernels work similarly. They first concatenate the last output of the segment kernels with the last (potentially inferred) discharge and the output of the rainfall-runoff model. After applying the GRU, the output is split into the estimated discharge <inline-formula><mml:math id="M43" display="inline"><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover></mml:math></inline-formula> and the input for the segment kernels in the next time step. This creates a bidirectional information flow between station and segment kernels across time steps, where station outputs serve as lateral inputs to segment kernels in subsequent time steps.</p>
      <p id="d2e1205">Although DRRAiNN receives hourly meteorological forcings <inline-formula><mml:math id="M44" display="inline"><mml:mi>F</mml:mi></mml:math></inline-formula>, it produces discharge estimates at a daily resolution. During the initial <inline-formula><mml:math id="M45" display="inline"><mml:mn mathvariant="normal">10</mml:mn></mml:math></inline-formula> d tune-in phase of each sequence, we feed the same observed discharge value <inline-formula><mml:math id="M46" display="inline"><mml:mi>Q</mml:mi></mml:math></inline-formula> into DRRAiNN for each hourly step within the day.</p>
</sec>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Data</title>
      <p id="d2e1238">The input data for DRRAiNN consists of radar-based precipitation, elevation for above-ground topography, solar radiation, and river discharge data. Preliminary experiments showed no improvement when including temperature; therefore, we exclude it following Occam's razor.</p>
      <p id="d2e1241">For precipitation, we use the radar-based precipitation product RADOLAN provided by the Deutscher Wetterdienst (DWD) <xref ref-type="bibr" rid="bib1.bibx50" id="paren.49"/>. The data cover a <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:mn mathvariant="normal">900</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">900</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> domain that includes all of Germany, on a grid with a spatial resolution of <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> and a temporal resolution of <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>. This grid defines the spatial resolution at which our model operates. Because of its long-tailed distribution, RADOLAN data is log-standardized before being passed to the model. Specifically, we add <inline-formula><mml:math id="M50" display="inline"><mml:mn mathvariant="normal">1</mml:mn></mml:math></inline-formula> and take the logarithm, then standardize the transformed data using its mean and standard deviation. We replace missing values with zero, which is the standardized mean.</p>
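      <p>The log-standardization can be sketched in a few lines of plain Python. Representing missing values as <monospace>None</monospace>, using the population standard deviation, and the toy precipitation values are assumptions of this sketch.</p>
      <preformat preformat-type="code">
```python
import math
import statistics


def fit_log_standardizer(values):
    """Return mean and standard deviation of log(1 + x) over non-missing values."""
    logs = [math.log1p(v) for v in values if v is not None]
    return statistics.fmean(logs), statistics.pstdev(logs)


def log_standardize(values, mean, std):
    """Apply log(1 + x), standardize, and map missing values (None) to 0.0."""
    return [0.0 if v is None else (math.log1p(v) - mean) / std for v in values]


precip = [0.0, 0.0, 0.5, None, 2.0, 12.0]  # toy values with one gap
mean, std = fit_log_standardizer(precip)
z = log_standardize(precip, mean, std)
```
</preformat>
      <p>After the transformation, the non-missing values have zero mean and unit standard deviation, so filling gaps with zero is equivalent to filling them with the mean.</p>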
      <p id="d2e1306">For static topography information, we use the digital elevation model (DEM) EU-DEM v1.1 provided by the Copernicus Land Monitoring Service of the European Environment Agency <xref ref-type="bibr" rid="bib1.bibx13" id="paren.50"/>. We also use the DEM to compute the altitude differences between adjacent discharge gauging stations. Elevation values and the derived altitude differences are standardized before being passed to the model, i.e., we subtract their mean and divide by their standard deviation.</p>
      <p id="d2e1312">For solar radiation, we use surface short-wave downward radiation (SSRD) from the ERA5 data set <xref ref-type="bibr" rid="bib1.bibx12" id="paren.51"/>. It comes with a temporal resolution of <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> and a relatively coarse spatial resolution of <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.25</mml:mn><mml:mi mathvariant="italic">°</mml:mi><mml:mo>×</mml:mo><mml:mn mathvariant="normal">0.25</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula>. Like the precipitation data, solar radiation data is log-standardized. We use rasterio <xref ref-type="bibr" rid="bib1.bibx16" id="paren.52"/> to transform and reproject the DEM and solar radiation data to match the RADOLAN coordinate reference system.</p>
      <p id="d2e1350">The topology of our river network is derived from the AWGN data set <xref ref-type="bibr" rid="bib1.bibx2" id="paren.53"/>. We use it to compute the adjacency matrix, which describes which stations are connected via river segments, and the corresponding river segment lengths.</p>
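      <p>As an illustration of this step, the following sketch builds an adjacency matrix and a matching matrix of segment lengths from a list of station-to-station segments. The station names, the segment lengths, and the convention that rows index the downstream station are hypothetical and are not taken from the AWGN data set.</p>
      <preformat preformat-type="code">
```python
def build_adjacency(stations, segments):
    """Build adjacency and segment-length matrices.

    segments maps an upstream station to (downstream station, length in km).
    A[d][u] = 1.0 if station u drains directly into station d.
    """
    index = {name: k for k, name in enumerate(stations)}
    n = len(stations)
    A = [[0.0] * n for _ in range(n)]
    L = [[0.0] * n for _ in range(n)]
    for upstream, (downstream, length_km) in segments.items():
        A[index[downstream]][index[upstream]] = 1.0
        L[index[downstream]][index[upstream]] = length_km
    return A, L


# Hypothetical toy network: s0 and s1 join at s2, which drains into the outlet s3.
stations = ["s0", "s1", "s2", "s3"]
segments = {"s0": ("s2", 12.5), "s1": ("s2", 8.0), "s2": ("s3", 20.0)}
A, L = build_adjacency(stations, segments)
```
</preformat>
      <p>With this convention, the row sums of <monospace>A</monospace> give the number of stations directly upstream of each station.</p>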
      <p id="d2e1356">Finally, we use discharge measurement data to tune in the discharge model and, more importantly, as the only target variable to train, validate, and test our model. We use data collected and provided by the German Federal Institute of Hydrology via the Global Runoff Data Centre <xref ref-type="bibr" rid="bib1.bibx17" id="paren.54"/>. The data set contains observed daily river discharge from gauging stations worldwide, including those in Germany. Because some of the recorded station locations are wrong, we correct them manually. We then align each station location to the nearest river segment (snapping). If the resulting shift exceeds a predefined threshold of <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, the station is excluded. If two stations are very close to each other, one of them is discarded. Due to its long-tailed distribution, discharge data is log-standardized on a per-station basis before being passed to the model. We add <inline-formula><mml:math id="M54" display="inline"><mml:mn mathvariant="normal">1</mml:mn></mml:math></inline-formula> and take the logarithm, then standardize the data using station-wise means and standard deviations. We replace missing values with zero, which is the standardized mean of the corresponding station.</p>
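      <p>The snapping rule can be sketched as follows, under simplifying assumptions: coordinates are planar and given in kilometers, the river is represented by a list of points rather than by line geometries, and the 1 km threshold is applied to the snapping distance. The function name and the coordinates are hypothetical.</p>
      <preformat preformat-type="code">
```python
import math


def snap_station(station_xy, river_points, max_shift_km=1.0):
    """Return the nearest river point, or None if it is farther than max_shift_km."""
    nearest = min(river_points, key=lambda p: math.dist(p, station_xy))
    if math.dist(nearest, station_xy) > max_shift_km:
        return None  # the station would be excluded
    return nearest


river_points = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]  # toy river along the x axis
kept = snap_station((1.2, 0.3), river_points)        # about 0.36 km from (1.0, 0.0)
dropped = snap_station((1.0, 1.5), river_points)     # 1.5 km away
```
</preformat>
      <p>A production workflow would typically project the coordinates first and measure the distance to the river line itself rather than to its vertices.</p>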
      <p id="d2e1381">Our choice of input datasets was guided by temporal resolution, data provenance, and practical availability. Although the European Flood Awareness System (EFAS) employs EMO-1 for precipitation input, we opted for RADOLAN due to important differences: EMO-1 offers a coarser <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:mn mathvariant="normal">6</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> resolution and is interpolated from sparse station data, in contrast to RADOLAN's direct radar-based observations. Although we expect only minor differences in performance in some settings, radar-derived datasets like RADOLAN provide finer spatial and temporal resolution, which is advantageous for distributed models. Similarly, we chose ERA5 for solar radiation data due to its gridded format and hourly resolution. Alternative datasets, such as those provided by DWD, are available either as station-wise hourly data, which lack the required grid format, or as gridded data aggregated monthly, which do not meet our temporal requirements. Daily datasets like E-OBS may suffice if subdaily temporal patterns are encoded separately, but this would require additional preprocessing. A transition toward operational flood forecasting would place increased importance on the choice of precipitation forecast products <xref ref-type="bibr" rid="bib1.bibx26" id="paren.55"/>. Ultimately, all data products entail inherent uncertainties and errors, and our choices reflect a balance between data availability, temporal resolution, and the specific requirements of our model.</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Study site</title>
      <p id="d2e1407">The Neckar river network in Southwest Germany spans a catchment area of <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:mn mathvariant="normal">14</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">000</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:math></inline-formula> with a mean elevation of <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:mn mathvariant="normal">460</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>. According to ERA5, mean temperatures in this region were <inline-formula><mml:math id="M58" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.95</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="italic">°</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:mrow></mml:math></inline-formula> during winter and <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:mn mathvariant="normal">17.95</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="italic">°</mml:mi><mml:mi mathvariant="normal">C</mml:mi></mml:mrow></mml:math></inline-formula> during summer in our training period. Our dataset includes measurements from <inline-formula><mml:math id="M60" display="inline"><mml:mn mathvariant="normal">17</mml:mn></mml:math></inline-formula> gauging stations distributed across the river network (see Fig. <xref ref-type="fig" rid="F3"/>). 
At the most downstream station in Rockenau, discharge during the training period ranged from <inline-formula><mml:math id="M61" display="inline"><mml:mn mathvariant="normal">29.5</mml:mn></mml:math></inline-formula> to <inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:mn mathvariant="normal">1690</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mrow></mml:math></inline-formula> with a mean of <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:mn mathvariant="normal">133.3</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e1531">The catchment features a highly heterogeneous landscape, including narrow and wide valleys, diverse geology (e.g., limestone, sandstone), different soil textures (e.g., clay, marl), and subsurface structures such as karst systems and pore water aquifers. This makes modeling the Neckar river network challenging. For example, south of Pforzheim, underground flows route water toward the east, whereas the elevation model suggests a different flow direction <xref ref-type="bibr" rid="bib1.bibx65" id="paren.56"/>. Such routing through latent subsurface structures cannot be inferred from a digital elevation model alone.</p>
      <p id="d2e1537">By restricting the domain to the Neckar river network, we end up with an area of size <inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:mn mathvariant="normal">200</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">200</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>. Following the transformations described above, all gridded data is reduced from a <inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> grid to a <inline-formula><mml:math id="M66" display="inline"><mml:mrow><mml:mn mathvariant="normal">4</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> grid by taking the mean. This results in a <inline-formula><mml:math id="M67" display="inline"><mml:mrow><mml:mn mathvariant="normal">50</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula> grid covering the study area. 
We train our model on hydrological years 2006–2015, validate on 2016–2018, and test on <inline-formula><mml:math id="M68" display="inline"><mml:mn mathvariant="normal">2019</mml:mn></mml:math></inline-formula>. Forcings <inline-formula><mml:math id="M69" display="inline"><mml:mi>F</mml:mi></mml:math></inline-formula> are provided at hourly resolution, while discharge is provided at daily resolution.</p>
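      <p>The spatial coarsening can be sketched as a block-wise mean in plain Python. The sketch assumes non-overlapping blocks of 4 by 4 cells, which is the most direct reading of the text, and uses a synthetic grid in place of real data.</p>
      <preformat preformat-type="code">
```python
def block_mean(grid, factor=4):
    """Average non-overlapping factor x factor blocks of a 2-D grid."""
    rows, cols = len(grid), len(grid[0])
    assert rows % factor == 0 and cols % factor == 0
    return [[sum(grid[r + a][c + b] for a in range(factor) for b in range(factor))
             / factor ** 2
             for c in range(0, cols, factor)]
            for r in range(0, rows, factor)]


# Synthetic 200 x 200 grid (1 km cells) reduced to 50 x 50 (4 km cells).
fine = [[float(r * 200 + c) for c in range(200)] for r in range(200)]
coarse = block_mean(fine)
```
</preformat>
      <p>For a 200 by 200 input, the result has 50 rows and 50 columns, which matches the grid size stated above.</p>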

      <fig id="F3"><label>Figure 3</label><caption><p id="d2e1630">The study area used in this work is the Neckar River catchment in Southwest Germany.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f03.png"/>

        </fig>

</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Experimental setup</title>
      <p id="d2e1648">We train DRRAiNN on sequences of <inline-formula><mml:math id="M70" display="inline"><mml:mn mathvariant="normal">20</mml:mn></mml:math></inline-formula> d (<inline-formula><mml:math id="M71" display="inline"><mml:mn mathvariant="normal">480</mml:mn></mml:math></inline-formula> hourly time steps), using the first <inline-formula><mml:math id="M72" display="inline"><mml:mn mathvariant="normal">10</mml:mn></mml:math></inline-formula> d as a warm-up phase. During this phase, we feed the model observed discharge values to initialize and align its hidden states with the true system dynamics. This procedure resembles data assimilation in traditional hydrological models, where observations are used to update model states and reduce uncertainty. In ML terms, this corresponds to teacher forcing. The warm-up phase allows the rainfall-runoff component of DRRAiNN to infer latent hydrological states, such as soil moisture or aquifer recharge, through its hidden state representations. This alignment helps the model transition smoothly to predictive, open-loop mode, where future discharge is estimated without access to ground-truth values.</p>
      <p id="d2e1672">After the warm-up phase, DRRAiNN transitions into open-loop mode for the remaining <inline-formula><mml:math id="M73" display="inline"><mml:mn mathvariant="normal">10</mml:mn></mml:math></inline-formula> d of each sequence. In this predictive mode, the discharge model feeds its own previous discharge estimates back as inputs for subsequent time steps. The rainfall-runoff model, in contrast, continues to receive observed precipitation and solar radiation as inputs throughout the sequence. While informative, this setup does not reflect realistic operational conditions for discharge forecasting. Precipitation forecasting, in particular, remains a major challenge. Currently, no algorithm can accurately predict precipitation <inline-formula><mml:math id="M74" display="inline"><mml:mn mathvariant="normal">10</mml:mn></mml:math></inline-formula> d ahead at a spatial resolution of <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:mn mathvariant="normal">4</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>. However, this setup is well suited for knowledge discovery concerning hydrologic processes, which is the primary focus of this work. We leave the evaluation of DRRAiNN under realistic, forecast-based conditions for future work.</p>
      <p id="d2e1709">We use the mean squared error (MSE) computed on station-wise standardized discharge data as both the training and validation loss. Standardization ensures that stations with larger discharge values do not dominate the loss, promoting balanced learning across all stations. Training is performed using truncated backpropagation through time (TBPTT) with a truncation length that increases over the course of training. Initially, we backpropagate the loss over <inline-formula><mml:math id="M76" display="inline"><mml:mn mathvariant="normal">1</mml:mn></mml:math></inline-formula> d (24 time steps), which helps DRRAiNN focus on short-term temporal relationships and stabilizes learning. Longer truncation lengths in later epochs enable the model to learn longer-term dependencies. The schedule of truncation lengths and batch sizes is shown in Table <xref ref-type="table" rid="T1"/>. Training takes less than <inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:mn mathvariant="normal">8</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> on a single NVIDIA A100 GPU. A forward simulation of a <inline-formula><mml:math id="M78" display="inline"><mml:mn mathvariant="normal">20</mml:mn></mml:math></inline-formula> d sequence takes approximately <inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:mn mathvariant="normal">4</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">s</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>.</p>

<table-wrap id="T1"><label>Table 1</label><caption><p id="d2e1756">Truncation length schedule in days for TBPTT.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">#Epochs</oasis:entry>
         <oasis:entry colname="col2">Truncation length (d)</oasis:entry>
         <oasis:entry colname="col3">Batch size</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">10</oasis:entry>
         <oasis:entry colname="col2">1</oasis:entry>
         <oasis:entry colname="col3">256</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4</oasis:entry>
         <oasis:entry colname="col2">2</oasis:entry>
         <oasis:entry colname="col3">128</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2</oasis:entry>
         <oasis:entry colname="col2">4</oasis:entry>
         <oasis:entry colname="col3">64</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1</oasis:entry>
         <oasis:entry colname="col2">10</oasis:entry>
         <oasis:entry colname="col3">32</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1</oasis:entry>
         <oasis:entry colname="col2">20</oasis:entry>
         <oasis:entry colname="col3">32</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>
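
      <p>The loss and the staged schedule described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the names <monospace>SCHEDULE</monospace>, <monospace>per_epoch_settings</monospace>, and <monospace>standardized_mse</monospace> are hypothetical, the schedule values are copied from Table 1, and the conversion of one day into 24 time steps follows the text.</p>
<preformat preformat-type="code" xml:space="preserve"><![CDATA[
```python
import numpy as np

# Stages copied from Table 1: (number of epochs, truncation length in days, batch size).
SCHEDULE = [(10, 1, 256), (4, 2, 128), (2, 4, 64), (1, 10, 32), (1, 20, 32)]
STEPS_PER_DAY = 24  # the text equates a 1 d sequence with 24 time steps


def per_epoch_settings(schedule=SCHEDULE):
    """Expand the stages into one (truncation_steps, batch_size) pair per epoch."""
    settings = []
    for n_epochs, trunc_days, batch_size in schedule:
        settings.extend([(trunc_days * STEPS_PER_DAY, batch_size)] * n_epochs)
    return settings


def standardized_mse(pred, obs, station_mean, station_std):
    """MSE on station-wise standardized discharge.

    pred and obs have shape (time, stations); station_mean and station_std
    have shape (stations,) and are assumed to be precomputed per station.
    """
    pred_z = (pred - station_mean) / station_std
    obs_z = (obs - station_mean) / station_std
    return float(np.mean((pred_z - obs_z) ** 2))
```
]]></preformat>
      <p>Because the residuals are divided by each station's standard deviation, an error of one standard deviation contributes equally to the loss at a small and at a large river.</p>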

      <p id="d2e1848">To improve generalization and account for model variability due to random initialization, we train five independent instances of DRRAiNN per experiment, each initialized with a different seed. We report test results based on the three runs with the lowest validation loss out of five seeds. This selection procedure is applied consistently to both the primary model and all ablation variants. We use the Ranger optimizer <xref ref-type="bibr" rid="bib1.bibx69" id="paren.57"/> with a learning rate of <inline-formula><mml:math id="M80" display="inline"><mml:mn mathvariant="normal">0.0025</mml:mn></mml:math></inline-formula> to optimize the <inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:mn mathvariant="normal">30</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">600</mml:mn></mml:mrow></mml:math></inline-formula> parameters in DRRAiNN. To stabilize training, we clip the gradient if its norm exceeds <inline-formula><mml:math id="M82" display="inline"><mml:mn mathvariant="normal">1</mml:mn></mml:math></inline-formula>, thereby preventing large parameter updates in steep regions of the loss surface. We use hydra to manage experiment configurations <xref ref-type="bibr" rid="bib1.bibx73" id="paren.58"/>.</p>
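      <p>The run selection and gradient clipping steps can be illustrated with a short sketch. It assumes that the clipping acts on the joint L2 norm of all parameter gradients, which the text does not state explicitly; the function names and any validation losses passed in are hypothetical.</p>
<preformat preformat-type="code" xml:space="preserve"><![CDATA[
```python
import numpy as np


def clip_by_global_norm(grads, max_norm=1.0):
    """Rescale a list of gradient arrays so their joint L2 norm is at most max_norm.

    Assumption: the norm is taken over all parameters jointly.
    """
    total_norm = float(np.sqrt(sum(np.sum(g ** 2) for g in grads)))
    if total_norm <= max_norm:
        return [g.copy() for g in grads]
    return [g * (max_norm / total_norm) for g in grads]


def select_best_runs(val_losses, k=3):
    """Return the seeds of the k runs with the lowest validation loss.

    val_losses maps a seed to its validation loss (hypothetical values).
    """
    return sorted(val_losses, key=val_losses.get)[:k]
```
]]></preformat>
      <p>Selecting runs by validation loss rather than test performance keeps the test set untouched by the selection procedure.</p>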
      <p id="d2e1882">To increase the size of the training data set and improve generalization, we apply data augmentation. CNNs are not inherently invariant to rotations and reflections, meaning a model trained on the original data orientation may perform poorly when presented with rotated or reflected versions of the same physical scenario. The symmetry group of the square contains eight elements: the identity, rotations by <inline-formula><mml:math id="M83" display="inline"><mml:mn mathvariant="normal">90</mml:mn></mml:math></inline-formula>, <inline-formula><mml:math id="M84" display="inline"><mml:mn mathvariant="normal">180</mml:mn></mml:math></inline-formula>, and <inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:mn mathvariant="normal">270</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula>, and reflections across the <inline-formula><mml:math id="M86" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis, the <inline-formula><mml:math id="M87" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis, and both diagonals. For each training sequence, we sample one symmetry uniformly at random and apply it to the spatial variables in each time step. We ensure physical consistency by tapping into the runoff embeddings at the transformed station locations. The river discharge model's graph structure remains unchanged by this augmentation, as graph neural networks are inherently invariant to such spatial transformations.</p>
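      <p>A minimal sketch of this augmentation is given below. It generates the eight symmetries of the square as four rotations, each optionally followed by a mirror, and moves the station locations with the grid. The function names and the array layout (time, rows, columns) are assumptions made for illustration and are not taken from the DRRAiNN code.</p>
<preformat preformat-type="code" xml:space="preserve"><![CDATA[
```python
import numpy as np


def d4_transform(field, k):
    """Apply the k-th of the eight square symmetries to the last two axes.

    k in 0..3 are rotations by multiples of 90 degrees; k in 4..7 are the
    same rotations followed by a left-right mirror, which yields the four
    reflections.
    """
    if not 0 <= k < 8:
        raise ValueError("k must be in 0..7")
    out = np.rot90(field, k % 4, axes=(-2, -1))
    if k >= 4:
        out = np.flip(out, axis=-1)
    return out


def transform_station(row, col, shape, k):
    """Return the grid position of a station after applying symmetry k."""
    marker = np.zeros(shape, dtype=bool)
    marker[row, col] = True
    new_row, new_col = np.argwhere(d4_transform(marker, k))[0]
    return int(new_row), int(new_col)


def augment_sequence(fields, stations, rng):
    """Sample one symmetry and apply it to all time steps and all stations.

    fields has shape (time, rows, cols); stations is a list of (row, col).
    """
    k = int(rng.integers(0, 8))
    new_fields = d4_transform(fields, k)
    new_stations = [transform_station(r, c, fields.shape[-2:], k) for r, c in stations]
    return new_fields, new_stations
```
]]></preformat>
      <p>Tracking the stations through a marker array avoids hand-written index formulas for each of the eight cases, at the cost of one small temporary array per station.</p>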
</sec>
<sec id="Ch1.S2.SS5">
  <label>2.5</label><title>Benchmark model: European Flood Awareness System</title>
      <p id="d2e1931">To provide context for DRRAiNN's performance, we compare it to the European Flood Awareness System (EFAS), an established and operational distributed process-based model. We use publicly available EFAS reanalysis data, which eliminates the need to tune EFAS ourselves. This avoids potential biases that could arise from allocating unequal tuning effort to the benchmark model versus our own model. While DRRAiNN achieves higher performance than EFAS in many scenarios, our primary aim is to demonstrate the potential of distributed neural networks for river discharge estimation, rather than merely outperforming EFAS.</p>
      <p id="d2e1934">EFAS simulates runoff on an approximately <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:mn mathvariant="normal">1.5</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">1.5</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> grid with a temporal resolution of <inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:mn mathvariant="normal">6</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>. It receives as inputs static maps describing topography, river networks, soil, and vegetation, as well as meteorological forcings such as precipitation, temperature, and potential evaporation.</p>
      <p id="d2e1969">While EFAS serves as a useful benchmark, the comparison to DRRAiNN is not perfectly fair due to fundamental differences in the input and output variables. Both models receive gridded meteorological forcings, but DRRAiNN additionally receives discharge measurements during the tune-in period. In contrast, EFAS does not use discharge measurements as input but relies on them for offline model calibration. Furthermore, DRRAiNN produces discharge estimates only at gauging station locations, whereas EFAS generates discharge predictions across the entire spatial grid. EFAS also relies on additional input variables not used by DRRAiNN, such as soil type, vegetation, temperature, and potential evapotranspiration. While this makes EFAS a powerful tool, it also limits its applicability in regions lacking such detailed input data. Another difference lies in the precipitation data used: EFAS relies on EMO-1, a <inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:mn mathvariant="normal">6</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> product interpolated from weather station data, whereas DRRAiNN uses RADOLAN, a radar-based dataset offering higher spatial and temporal resolution. As a result, a strict like-for-like comparison between EFAS and DRRAiNN is not possible. Nonetheless, EFAS serves as a baseline that contextualizes the expected performance range of DRRAiNN. We thus emphasize that our goal is not a head-to-head performance comparison but a reference point for situating DRRAiNN's performance relative to alternative state-of-the-art forecasting approaches.</p>
</sec>
<sec id="Ch1.S2.SS6">
  <label>2.6</label><title>Evaluation</title>
      <p id="d2e1992">Besides visualizing hydrographs for selected gauging stations, we evaluate DRRAiNN using four standard metrics in hydrology: Kling-Gupta efficiency (KGE, <xref ref-type="bibr" rid="bib1.bibx18" id="altparen.59"/>), Nash-Sutcliffe efficiency (NSE, <xref ref-type="bibr" rid="bib1.bibx42" id="altparen.60"/>), Pearson's correlation coefficient (PCC), and the mean absolute error (MAE). We report all four metrics because each highlights different aspects of model performance, and no single metric is free from limitations <xref ref-type="bibr" rid="bib1.bibx18" id="paren.61"/>. MAE is particularly intuitive, as it is expressed in the same unit as discharge and directly quantifies the average deviation between predictions and observations. However, because it lacks normalization, stations with larger discharge magnitudes contribute disproportionately to the overall MAE. PCC quantifies the strength of linear association between the observed and estimated discharges. While it captures shared variability, it is insensitive to systematic differences in scale or bias. To also capture the scale, the NSE was developed, which can be seen as a mean squared error that is weighted by the variance of the observed discharge. While the NSE does incorporate bias, it does so in normalized form scaled by the standard deviation of the target variable, which can mask individual contributions of different error components. The KGE was therefore developed to independently evaluate correlation, bias, and variability as separate components. When computing KGE and NSE values, we use station-wise means and variances calculated from the training data set, following the approach in <xref ref-type="bibr" rid="bib1.bibx30" id="text.62"/>. For KGE, NSE, and PCC, higher values indicate better performance, with a maximum of <inline-formula><mml:math id="M91" display="inline"><mml:mn mathvariant="normal">1</mml:mn></mml:math></inline-formula> representing a perfect match. 
In contrast, lower values of MAE are better, with <inline-formula><mml:math id="M92" display="inline"><mml:mn mathvariant="normal">0</mml:mn></mml:math></inline-formula> indicating a perfect fit.</p>
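      <p>For reference, the four metrics can be written compactly as below. This sketch uses the commonly used formulations (KGE built from correlation, variability ratio, and bias ratio); whether the study uses exactly this KGE variant is an assumption. The optional <monospace>obs_mean</monospace> argument is a hypothetical way to pass in a station mean computed on the training data, as described above.</p>
<preformat preformat-type="code" xml:space="preserve"><![CDATA[
```python
import numpy as np


def pcc(sim, obs):
    """Pearson's correlation coefficient between simulated and observed discharge."""
    return float(np.corrcoef(sim, obs)[0, 1])


def mae(sim, obs):
    """Mean absolute error in the unit of the discharge."""
    return float(np.mean(np.abs(sim - obs)))


def nse(sim, obs, obs_mean=None):
    """Nash-Sutcliffe efficiency; obs_mean defaults to the mean of obs."""
    ref = np.mean(obs) if obs_mean is None else obs_mean
    return float(1.0 - np.sum((sim - obs) ** 2) / np.sum((obs - ref) ** 2))


def kge(sim, obs):
    """Kling-Gupta efficiency from correlation, variability ratio, and bias ratio."""
    r = pcc(sim, obs)
    alpha = np.std(sim) / np.std(obs)    # variability ratio
    beta = np.mean(sim) / np.mean(obs)   # bias ratio
    return float(1.0 - np.sqrt((r - 1) ** 2 + (alpha - 1) ** 2 + (beta - 1) ** 2))
```
]]></preformat>
      <p>A constant offset illustrates the different sensitivities: it leaves PCC at 1 but lowers both NSE and KGE, while MAE reports the offset itself.</p>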
      <p id="d2e2022">During open-loop inference, we evaluate metrics separately for each open-loop step, where the first step resembles closed-loop estimation. This allows us to assess how model performance degrades with increasing lead times. Although DRRAiNN was trained only on sequences that span <inline-formula><mml:math id="M93" display="inline"><mml:mn mathvariant="normal">20</mml:mn></mml:math></inline-formula> d, we evaluate it on <inline-formula><mml:math id="M94" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d sequences to investigate its ability to generalize beyond the training horizon. Additionally, we plot the performance of the models against the mean discharge of the different stations to identify potential systematic dependencies between flow magnitude and model accuracy. In all cases, we exclude the initial <inline-formula><mml:math id="M95" display="inline"><mml:mn mathvariant="normal">10</mml:mn></mml:math></inline-formula> d tune-in period before calculating metrics and producing plots.</p>
      <p id="d2e2046">With knowledge discovery being the main motivation of this work, we also test DRRAiNN for physical plausibility. A physically implausible model might learn spurious relationships in the data. It could, for example, exploit the DEM to encode local biases that lead to gains or losses of water not driven by meteorological forcings. By retrospectively inferring catchment areas from observed dynamics, we assess whether the rainfall-runoff model successfully propagates water across the landscape. The procedure is as follows: after a forward pass, we compute saliency maps by taking the gradient of the final discharge estimate with respect to the precipitation inputs. These maps tell us to what extent the model's output depends on the precipitation in each grid cell and time step. We multiply this gradient by the precipitation itself to focus the analysis on cells in which precipitation occurred. To examine how the attributions change over time, we split the sequence into subsequences of <inline-formula><mml:math id="M96" display="inline"><mml:mn mathvariant="normal">5</mml:mn></mml:math></inline-formula> d, over which we take the mean. We do this for each station separately and visualize the resulting attributions to identify which areas contribute most to discharge estimation at each station. To reduce noise, we repeat this process across all test sequences and average the resulting attribution maps.</p>
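      <p>The attribution procedure can be illustrated on a toy scale. In the sketch below, the gradient is approximated by central finite differences instead of backpropagation, purely to keep the example free of an autodiff library; this is a substitution for illustration and not the method used for DRRAiNN. The function names, the generic <monospace>model</monospace> callable, and the array layout (time, rows, columns) are assumptions.</p>
<preformat preformat-type="code" xml:space="preserve"><![CDATA[
```python
import numpy as np


def gradient_times_input(model, precip, eps=1e-4):
    """Gradient-times-input attribution of a scalar model output.

    model maps a precipitation array to a single discharge estimate. The
    gradient is approximated by central differences (assumption: a stand-in
    for backpropagation), then multiplied elementwise by the input.
    """
    precip = np.asarray(precip, dtype=float)
    grad = np.zeros_like(precip)
    for idx in np.ndindex(precip.shape):
        up = precip.copy()
        down = precip.copy()
        up[idx] += eps
        down[idx] -= eps
        grad[idx] = (model(up) - model(down)) / (2.0 * eps)
    return grad * precip


def block_means(attribution, steps_per_block):
    """Average attributions over consecutive blocks of time steps (axis 0)."""
    n_blocks = attribution.shape[0] // steps_per_block
    trimmed = attribution[: n_blocks * steps_per_block]
    blocks = trimmed.reshape(n_blocks, steps_per_block, *attribution.shape[1:])
    return blocks.mean(axis=1)
```
]]></preformat>
      <p>Multiplying by the input sets the attribution to zero wherever no precipitation fell, which is the masking effect described in the text.</p>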
      <p id="d2e2056">We compare the resulting attributions with catchment areas delineated from elevation data using standard hydrological techniques. To evaluate their agreement quantitatively when comparing DRRAiNN to the ablated models, we employ the following measure: for each station, the attributions are rescaled to lie between 0 and 1 using min-max scaling. We then compute the Wasserstein distance between the attribution values inside the delineated catchment area and those outside it. A higher Wasserstein distance indicates greater differentiation between attributions inside and outside elevation-delineated catchment areas. This provides one measure of how well the model's learned patterns correspond to topographically defined flow boundaries, though deviations may reflect either model limitations or genuine subsurface flow processes not captured by surface topography alone. This quantitative measure complements the qualitative comparison, providing stronger evidence for our model's ability to propagate water across the landscape in a physically plausible way. Specifically, a high value indicates that the model has implicitly learned the topographic structure of flow direction – i.e., that water generally flows downhill – solely from observed discharge dynamics.</p>
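      <p>The agreement measure can be sketched as follows, assuming the one-dimensional Wasserstein-1 distance between the two empirical distributions of attribution values and a boolean catchment mask on the same grid as the attribution map. The function names are hypothetical, and the distance is implemented directly in NumPy to keep the example self-contained.</p>
<preformat preformat-type="code" xml:space="preserve"><![CDATA[
```python
import numpy as np


def min_max_scale(values):
    """Rescale values linearly to the interval [0, 1]."""
    lo, hi = np.min(values), np.max(values)
    return (values - lo) / (hi - lo)


def wasserstein_1d(u, v):
    """Wasserstein-1 distance between two 1-D empirical distributions.

    Computed as the integral of the absolute difference between the two
    empirical cumulative distribution functions.
    """
    u_sorted = np.sort(u)
    v_sorted = np.sort(v)
    all_values = np.sort(np.concatenate([u, v]))
    deltas = np.diff(all_values)
    u_cdf = np.searchsorted(u_sorted, all_values[:-1], side="right") / u.size
    v_cdf = np.searchsorted(v_sorted, all_values[:-1], side="right") / v.size
    return float(np.sum(np.abs(u_cdf - v_cdf) * deltas))


def catchment_separation(attribution, catchment_mask):
    """Distance between scaled attributions inside and outside the catchment."""
    scaled = min_max_scale(attribution)
    return wasserstein_1d(scaled[catchment_mask], scaled[~catchment_mask])
```
]]></preformat>
      <p>Because the attributions are scaled to the unit interval first, the resulting distance lies between 0 and 1, which makes values comparable across stations and model variants.</p>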
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results</title>
      <p id="d2e2068">To evaluate DRRAiNN, we first present hydrographs and compare performance with EFAS to contextualize DRRAiNN's results. We furthermore show that DRRAiNN can retrospectively infer catchment-like structures, thus demonstrating how full differentiability supports physical interpretability.</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e2073">Hydrographs showing observed discharge, EFAS simulations, and predictions from one of five DRRAiNN model instances for lead times of up to <inline-formula><mml:math id="M97" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d. The four panels show the stations with the lowest <bold>(a)</bold> and highest <bold>(d)</bold> mean discharge, as well as the stations where EFAS <bold>(b)</bold> and DRRAiNN <bold>(c)</bold> achieve the best average KGE at one-day lead times on the validation set. We selected the sequence from the test set with the highest discharge variance, as variance likely serves as a proxy for prediction difficulty.</p></caption>
        <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f04.png"/>

      </fig>

<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Hydrographs</title>
      <p id="d2e2108">EFAS produces hydrographs that match both the shape and magnitude of observed discharge, making it a strong benchmark (Fig. <xref ref-type="fig" rid="F4"/>). As EFAS produces gridded outputs, we extract the outputs from the EFAS grid cells that correspond to the station locations to enable meaningful comparisons.</p>
      <p id="d2e2113">DRRAiNN also produces plausible hydrographs that closely match the observed discharges. This includes both low flows (Fig. <xref ref-type="fig" rid="F4"/>a) and high flows (Fig. <xref ref-type="fig" rid="F4"/>d). Since DRRAiNN operates autoregressively – using its own discharge estimates as input in the next time step – errors can accumulate over time, leading to a gradual decline in accuracy. Nonetheless, it is notable that the model is generally able to capture peaks even after almost <inline-formula><mml:math id="M98" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d, despite being trained only on <inline-formula><mml:math id="M99" display="inline"><mml:mn mathvariant="normal">20</mml:mn></mml:math></inline-formula> d sequences.</p>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e2136">Performances of the best three out of five DRRAiNN model instances, compared to EFAS across different metrics and lead times up to <inline-formula><mml:math id="M100" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d. Results are averaged across all stations. Each line style corresponds to a distinct DRRAiNN instance. </p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f05.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Performance</title>
      <p id="d2e2161">Overall, DRRAiNN outperforms EFAS in all considered metrics at short lead times and, for KGE, NSE, and PCC, over the entire estimation horizon (Fig. <xref ref-type="fig" rid="F5"/>). Since EFAS does not incorporate discharge values during inference, its performance does not depend on lead time, and we report its mean performance as a constant. As described above, DRRAiNN's autoregressive nature causes errors to accumulate over time, leading to a gradual decline in performance at longer lead times.</p>
      <p id="d2e2166">The KGE plot (Fig. <xref ref-type="fig" rid="F5"/>a) indicates that DRRAiNN is able to maintain strong performance over time. Averaged over the seeds, the KGE starts at about <inline-formula><mml:math id="M101" display="inline"><mml:mn mathvariant="normal">0.71</mml:mn></mml:math></inline-formula> and stays above that of EFAS during the entire estimation horizon of <inline-formula><mml:math id="M102" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d, even though the model was trained only on <inline-formula><mml:math id="M103" display="inline"><mml:mn mathvariant="normal">20</mml:mn></mml:math></inline-formula> d sequences. In contrast, the NSE plot (Fig. <xref ref-type="fig" rid="F5"/>b) shows a gradual decline in performance, with a decrease from <inline-formula><mml:math id="M104" display="inline"><mml:mn mathvariant="normal">0.72</mml:mn></mml:math></inline-formula> to <inline-formula><mml:math id="M105" display="inline"><mml:mn mathvariant="normal">0.62</mml:mn></mml:math></inline-formula> over the estimation horizon. Nevertheless, even after <inline-formula><mml:math id="M106" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d, all seeds show higher NSE values than EFAS. The PCC plot (Fig. <xref ref-type="fig" rid="F5"/>c) shows a strong linear relationship between observed and estimated discharges, with an average value of about <inline-formula><mml:math id="M107" display="inline"><mml:mn mathvariant="normal">0.9</mml:mn></mml:math></inline-formula> at the start. DRRAiNN captures this relationship better than EFAS over the entire estimation horizon. Note that the linear correlation also enters both KGE and NSE. As the MAE allows direct interpretation, its plot (Fig. 
<xref ref-type="fig" rid="F5"/>d) shows that EFAS is off by about <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:mn mathvariant="normal">6.5</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mrow></mml:math></inline-formula> on average, while DRRAiNN, with <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.9</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mrow></mml:math></inline-formula> on average on the first day, produces a considerably smaller error. After about <inline-formula><mml:math id="M110" display="inline"><mml:mn mathvariant="normal">25</mml:mn></mml:math></inline-formula> d, however, EFAS yields a lower MAE on average.</p>
      <p id="d2e2283">All metrics reveal differences in performance across the model instances trained with different random seeds. However, the relative ranking of model instances varies depending on the specific metric and lead time. Some seeds perform better during the initial days, while others perform better at greater lead times: for example, in the KGE plot (Fig. <xref ref-type="fig" rid="F5"/>a), the ranking changes after about <inline-formula><mml:math id="M111" display="inline"><mml:mn mathvariant="normal">42</mml:mn></mml:math></inline-formula> d. The differences between instances are due only to random weight initialization and the order of batches. These stochastic factors may lead some instances to start training with a larger bias towards capturing short-term relationships in the data, while others start with a larger bias towards capturing long-term relationships.</p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e2298">Performances of the best three out of five DRRAiNN model instances and EFAS at a <inline-formula><mml:math id="M112" display="inline"><mml:mn mathvariant="normal">1</mml:mn></mml:math></inline-formula> d lead time across different metrics and stations. The <inline-formula><mml:math id="M113" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>-axis shows the logarithmic mean discharge at each station. Blue vertical lines depict the standard deviation across DRRAiNN seeds. Dashed lines represent linear regressions between the log-mean discharge and corresponding metric. </p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f06.png"/>

        </fig>

      <p id="d2e2321">The plots in Fig. <xref ref-type="fig" rid="F6"/> show that some stations consistently yield more accurate discharge estimates than others. This observation holds across all evaluation metrics. Which stations are harder to estimate, however, differs across the metrics, reflecting the distinct sensitivities of each metric, as discussed previously. Interestingly, the different DRRAiNN instances and EFAS show partial agreement on which stations are more difficult to model. For example, the KGE values in Fig. <xref ref-type="fig" rid="F6"/>a show that Altensteig and Stein are consistently easier to estimate, while Oppenweiler, Bad Imnau, and Murr are among the most challenging. The reasons for these differences in difficulty – such as differences in data quality, catchment size, land cover, or upstream complexity – could be analyzed in future work.</p>
      <p id="d2e2328">The regression lines indicate whether model performance correlates with average discharge levels across stations. We performed linear regression; the regression lines appear exponential due to the logarithmic scaling of the <inline-formula><mml:math id="M114" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>-axis. All metrics, except MAE, show that both models tend to perform better at stations with higher discharges. This effect is more pronounced in EFAS, while our model exhibits a more balanced behavior. Both DRRAiNN and EFAS produce significantly larger MAEs with increased mean discharge (Fig. <xref ref-type="fig" rid="F6"/>d). This is expected since MAE does not account for the stations' mean discharges or their variability in discharge, unlike the other metrics.</p>

      <fig id="F7" specific-use="star"><label>Figure 7</label><caption><p id="d2e2342">Attribution maps of precipitation for discharge estimation at selected stations and time intervals, averaged over all test set sequences. Darker colors indicate grid cells where precipitation has a stronger influence on the estimated discharge at the corresponding station. For comparison, traditional catchment areas delineated from elevation data are outlined in red. This juxtaposition highlights the agreement between data-driven attributions and physically derived catchment boundaries. The attribution method is described in detail in Sect. <xref ref-type="sec" rid="Ch1.S2.SS6"/> of the main text. </p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f07.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Catchment area inference</title>
      <p id="d2e2361">We observe that DRRAiNN implicitly infers catchment areas that align with topographically expected ones, as shown in Fig. <xref ref-type="fig" rid="F7"/>. Darker areas indicate regions with higher importance of precipitation for estimating discharge at the corresponding station. These attribution patterns spatially overlap with the catchment areas delineated from elevation alone (depicted in red). The first four columns visualize attributions for subsequences of <inline-formula><mml:math id="M115" display="inline"><mml:mn mathvariant="normal">5</mml:mn></mml:math></inline-formula> d length to illustrate temporal changes in spatial influence. The area of influence tends to increase in size the further we look into the past. This suggests that DRRAiNN propagates encoded water quantities along the landscape in a manner that aligns, at least to some extent, with physical flow processes. The last column shows attributions averaged over the whole <inline-formula><mml:math id="M116" display="inline"><mml:mn mathvariant="normal">20</mml:mn></mml:math></inline-formula> d sequences.</p>
      <p id="d2e2380">In the case of Pforzheim, DRRAiNN assigns low importance to an area in the lower right part, despite its inclusion in the delineated catchment area. This discrepancy could be related to known underground flows near Pforzheim, as reported in <xref ref-type="bibr" rid="bib1.bibx65" id="text.63"/>. In the absence of subsurface flows, water would be expected to pass through Pforzheim; however, due to the presence of underground flow paths, it instead moves towards the southeast, entering the Neckar River network via an alternative route. Our results suggest that DRRAiNN may have detected these unobservable underground flows from precipitation and discharge dynamics. However, this hypothesis requires further investigation.</p>
      <p id="d2e2386">Note that these results primarily serve as a proof of principle: We present results from the seed producing the clearest attributions; others yielded qualitatively worse results. However, it is important to keep in mind that DRRAiNN is trained on daily discharge measurements. Learning sharp catchment delineations would require the training data set to contain sequences in which it rained within the area, but not outside of it, over the extent of a <inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:mn mathvariant="normal">24</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> period. As precipitation is very dynamic on this time scale, the chances for this are relatively low. In the future, we expect sharper results if we go from daily to hourly discharge data.</p>
</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><title>Ablations</title>
      <p id="d2e2409">To assess both the contributions of specific architectural components and the model's use of topographic information, we conducted a series of ablations on DRRAiNN (Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/>). First, we examined whether DRRAiNN utilizes the DEM primarily for encoding flow physics or as positional information by training, validating, and testing it on a rotated DEM. This resulted in slightly worse performance and reduced alignment between attribution maps and elevation-delineated catchments (Appendix <xref ref-type="sec" rid="App1.Ch1.S1.SS1"/>). Next, we evaluated the model's inductive bias in distinguishing between spatially extended and local processes (Appendix <xref ref-type="sec" rid="App1.Ch1.S1.SS2"/>). Last, we removed the hypernetworks to examine their impact (Appendix <xref ref-type="sec" rid="App1.Ch1.S1.SS3"/>). These two architectural ablations led to performance degradation across most metrics and lead times. However, the differences were not always significant. Notably, both ablated models show reduced correspondence between their attribution patterns and elevation-delineated catchment boundaries. The attribution maps either exhibit less coherent spatial structures or show systematic deviations from topographically expected patterns, suggesting that these architectural components contribute to the model's ability to learn topographically consistent flow representations.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Discussion</title>
      <p id="d2e2429">We introduce DRRAiNN, a fully differentiable, fully distributed neural network architecture for estimating river discharge from past discharge, gridded elevation maps, and gridded precipitation and solar radiation. DRRAiNN demonstrates better performance than EFAS on lead times of up to <inline-formula><mml:math id="M118" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d. This indicates that DRRAiNN can produce valid estimations far into the future despite being trained on sequences of only <inline-formula><mml:math id="M119" display="inline"><mml:mn mathvariant="normal">20</mml:mn></mml:math></inline-formula> d, including a tune-in period of <inline-formula><mml:math id="M120" display="inline"><mml:mn mathvariant="normal">10</mml:mn></mml:math></inline-formula> d.</p>
      <p id="d2e2453">Our analysis reveals that the difficulty of discharge estimation varies across gauging stations. Interestingly, both DRRAiNN and EFAS consistently struggle with the same stations, suggesting that the difficulty is intrinsic to the stations and their associated data rather than specific to the model architecture. Several factors likely contribute to this variability. For example, stations affected by unobserved variables such as complex subsurface topography, land cover heterogeneity, or anthropogenic factors (e.g., dam operations) may be inherently harder to model. Furthermore, spatial variations in the quality of input data could contribute to discrepancies in performance. Future investigations using attribution techniques could offer deeper insights into these station-specific variations and guide the development of architectural modifications or regularization to address these challenges effectively.</p>
      <p id="d2e2456">Our ablation studies show the benefits of distinguishing between spatially extended and local processes, and of incorporating hypernetworks. The reduced performance and the failure of the ablated models to produce realistic catchment areas suggest that these components encode crucial hydrological processes, such as water movement across complex topographies. More generally, incorporating appropriate inductive biases can both improve model interpretability and reduce the risk of learning spurious correlations.</p>
      <p id="d2e2459">Interestingly, the model instance that produces attribution maps with the strongest correspondence to elevation-delineated catchments is not the one with the best predictive performance. This points to a trade-off between optimizing for predictive accuracy and producing attribution patterns that align with topographically-expected flow boundaries. This suggests that conventional performance metrics, while effective at evaluating predictive accuracy, may not fully capture whether the model's internal representations correspond to our expectations based on surface topography and established hydrological understanding.</p>
      <p id="d2e2463">Increasing the amount of training data generally enhances performance in ML. Currently, DRRAiNN is not designed for scalability, as its application is expected to require retraining in each specific context. A natural step toward improving adaptability would be training DRRAiNN on hourly discharge data. This could improve performance and attribution quality, potentially enabling the model to trace the origins of individual discharge peaks. Traditional PBMs rely on a wider range of input variables, and feeding these variables to DRRAiNN as additional inputs could also improve its performance. These variables include land cover, parent material, soil texture, vegetation, temperature, and potential evapotranspiration, among others. Interpretability methods can then be used to perform a sensitivity analysis, revealing which input variables are important when and, due to our model being fully distributed, where. These methods may also provide insights into the model's internal representations, potentially uncovering links to real-world hydrological variables.</p>
      <p id="d2e2466">Several strategies can be employed to investigate DRRAiNN's spatial generalization capabilities. One approach is to leave out individual stations within a river network during training to evaluate generalization within hydrologically connected regions. A more demanding test of generalization would involve training and testing on different river networks. By testing it on catchments that are not part of the training data, we can systematically assess its ability to generalize to unseen regions. Ultimately, we aim to apply DRRAiNN to diverse catchments across Germany, Europe, or globally. Due to DRRAiNN's data-driven nature, discharge measurements will always be needed for training. However, recent advances in remote sensing may enable the application of DRRAiNN to ungauged river networks <xref ref-type="bibr" rid="bib1.bibx15" id="paren.64"/>.</p>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Conclusions</title>
      <p id="d2e2481">In this paper, we introduced DRRAiNN, a fully distributed neural network architecture that estimates river discharge from precipitation, solar radiation, elevation maps, and past discharge measurements from gauging stations. Despite being trained on sparse target data – namely daily discharge observations from <inline-formula><mml:math id="M121" display="inline"><mml:mn mathvariant="normal">17</mml:mn></mml:math></inline-formula> stations over ten years – DRRAiNN outperforms the operational benchmark model EFAS in terms of KGE and NSE across various lead times. Beyond its predictive accuracy, DRRAiNN provides physically interpretable attributions, enabling the identification of precipitation sources contributing to discharge at specific stations. Our analyses highlight the importance of incorporating hydrologically meaningful constraints, or inductive biases. These biases not only enhance interpretability but also help the model align more closely with physical principles, as evidenced by its ability to delineate realistic catchment areas.</p>
      <p id="d2e2491">With its predictive performance, interpretability, and physical consistency, DRRAiNN represents a promising step forward in the application of neural networks to distributed hydrological modeling.</p>
</sec>

      
      </body>
    <back><app-group>

<app id="App1.Ch1.S1">
  <label>Appendix A</label><title>Ablations</title>
<sec id="App1.Ch1.S1.SS1">
  <label>A1</label><title>Rotated elevation map</title>
      <p id="d2e2512">We aim to assess whether DRRAiNN utilizes the elevation map in a physically plausible way – specifically, to propagate water downhill across the landscape. An alternative would be that DRRAiNN leverages the elevation map primarily as a positional encoding, allowing it to orient itself within the landscape and learn location-specific biases. In practice, both mechanisms are likely at play to some degree.</p>
      <p id="d2e2515">To examine this, we train, validate, and test DRRAiNN using the same elevation map as before, but rotated by <inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:mn mathvariant="normal">180</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula>. This setup preserves the statistics of the elevation map, ensuring a fair comparison.</p>
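      <p>As a hedged illustration of why a 180 degree rotation leaves the elevation statistics untouched, the following minimal Python sketch rotates a toy grid and checks that the same values occur before and after. The function names <monospace>rotate_180</monospace> and <monospace>flatten</monospace> and the toy elevations are assumptions made for this example; they are not taken from the DRRAiNN code base.</p>
      <preformat><![CDATA[
```python
from statistics import mean


def rotate_180(grid):
    """Rotate a 2D grid (list of rows) by 180 degrees."""
    # Reversing the row order and each row is equivalent to a half turn.
    return [list(reversed(row)) for row in reversed(grid)]


def flatten(grid):
    """Return all cell values of a 2D grid as a flat list."""
    return [value for row in grid for value in row]


# Toy elevation map in metres (hypothetical values).
elevation = [
    [510.0, 480.0, 450.0],
    [470.0, 430.0, 400.0],
    [440.0, 390.0, 350.0],
]
rotated = rotate_180(elevation)

# The same values occur in both maps, so global statistics are identical;
# only their spatial arrangement relative to forcings and stations changes.
same_values = sorted(flatten(elevation)) == sorted(flatten(rotated))
mean_difference = abs(mean(flatten(elevation)) - mean(flatten(rotated)))
```
]]></preformat>
      <p>In this sketch, the rotated map keeps the value distribution of the original but no longer places valleys and ridges where the rivers and gauging stations actually are, which is the property the ablation relies on.</p>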

      <fig id="FA1" specific-use="star"><label>Figure A1</label><caption><p id="d2e2530">Performances of the best three out of five DRRAiNN model instances and DRRAiNN model instances on a rotated elevation map, compared to EFAS across different metrics and lead times up to <inline-formula><mml:math id="M123" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d. Results are averaged across all stations. Each line style corresponds to a distinct DRRAiNN instance.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f08.png"/>

        </fig>

      <p id="d2e2547">For most metrics and lead times, DRRAiNN performs better when trained and tested on the original elevation map than on the rotated one (Fig. <xref ref-type="fig" rid="FA1"/>). Nonetheless, its continued superior performance relative to EFAS – even with the rotated DEM – supports the hypothesis that DRRAiNN leverages elevation, at least in part, as a positional encoding. Remarkably, this still enables it to reconstruct plausible catchment areas to some extent (Fig. <xref ref-type="fig" rid="FA2"/>). However, our quantitative analysis (Fig. <xref ref-type="fig" rid="FA3"/>) shows that catchment areas are more accurately reconstructed when DRRAiNN is executed on the original DEM. This suggests that our original model's use of the elevation map goes beyond mere positional encoding, incorporating hydrologically meaningful information.</p>
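      <p>The quantitative comparison in Fig. <xref ref-type="fig" rid="FA3"/> is based on Wasserstein distances between normalized attributions inside and outside the delineated catchment areas. A minimal sketch of such a score is given below. It assumes a one-dimensional Wasserstein-1 distance between the two sets of cell values and a normalization by the maximum attribution; both choices, as well as the names <monospace>wasserstein_1d</monospace> and <monospace>catchment_agreement</monospace> and the toy data, are illustrative assumptions and may differ from the published implementation.</p>
      <preformat><![CDATA[
```python
from bisect import bisect_right


def wasserstein_1d(u, v):
    """Wasserstein-1 distance between two 1D empirical samples,
    computed as the integral of |CDF_u(x) - CDF_v(x)| over x."""
    u_sorted, v_sorted = sorted(u), sorted(v)
    points = sorted(u_sorted + v_sorted)
    distance = 0.0
    for left, right in zip(points, points[1:]):
        # Empirical CDFs are constant on the interval [left, right).
        cdf_u = bisect_right(u_sorted, left) / len(u_sorted)
        cdf_v = bisect_right(v_sorted, left) / len(v_sorted)
        distance += abs(cdf_u - cdf_v) * (right - left)
    return distance


def catchment_agreement(attribution, inside_mask):
    """Distance between normalized attributions inside and outside a catchment.

    Assumes a non-zero maximum attribution and at least one cell
    on each side of the catchment boundary.
    """
    peak = max(max(row) for row in attribution)  # assumed normalization
    inside, outside = [], []
    for attr_row, mask_row in zip(attribution, inside_mask):
        for value, is_inside in zip(attr_row, mask_row):
            (inside if is_inside else outside).append(value / peak)
    return wasserstein_1d(inside, outside)


# Toy attribution map and catchment mask (hypothetical values).
attribution = [[0.0, 0.1], [0.4, 0.5]]
inside_mask = [[False, False], [True, True]]
score = catchment_agreement(attribution, inside_mask)
```
]]></preformat>
      <p>Under these assumptions, a larger score means that attributions inside the catchment are distributed differently from those outside, which corresponds to the reading of the figure in which a higher distance indicates better agreement with the delineated catchment.</p>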
</sec>
<sec id="App1.Ch1.S1.SS2">
  <label>A2</label><title>All LSTM</title>
      <p id="d2e2564">A key inductive bias in DRRAiNN is the explicit separation between spatially extended processes and local processes. Lateral water movement across the landscape is a spatially extended process primarily driven by elevation. Evapotranspiration, on the other hand, is a local process that is largely influenced by solar radiation. We encode this distinction into DRRAiNN by assigning these processes to different components of the ConvNeXt block: the DWConv is parameterized by a CNN that receives elevation as input, while PWConv1 and PWConv2 are parameterized by an MLP that receives solar radiation. In this ablation, we discard this bias by feeding the elevation and solar radiation – together with precipitation – directly into the PWLSTM. Consequently, the relativity bias, realized by subtracting the elevation of the center cell from the elevations of all other cells within each receptive field of the hypernetwork, is also removed.</p>
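      <p>The relativity bias described above can be sketched as follows. The example extracts the receptive field around a cell and subtracts the elevation of the center cell, so that only elevation differences remain. The function name <monospace>relative_elevation_patch</monospace>, the <monospace>radius</monospace> parameter, and the restriction to interior cells (no padding) are assumptions made for this illustration, not details of the DRRAiNN implementation.</p>
      <preformat><![CDATA[
```python
def relative_elevation_patch(elevation, row, col, radius=1):
    """Return the square neighbourhood around (row, col) with the
    elevation of the centre cell subtracted from every cell."""
    n_rows, n_cols = len(elevation), len(elevation[0])
    if not (radius <= row < n_rows - radius and radius <= col < n_cols - radius):
        # Border handling (padding) is left out of this sketch.
        raise ValueError("receptive field must lie fully inside the grid")
    center = elevation[row][col]
    return [
        [elevation[r][c] - center for c in range(col - radius, col + radius + 1)]
        for r in range(row - radius, row + radius + 1)
    ]


# Toy elevation map (hypothetical values) and the same map lifted by 100 m.
elevation = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
lifted = [[value + 100 for value in row] for row in elevation]

patch = relative_elevation_patch(elevation, 1, 1)
lifted_patch = relative_elevation_patch(lifted, 1, 1)
```
]]></preformat>
      <p>Both patches are identical in this sketch, which illustrates the intent of the bias: the input to the hypernetwork depends on local elevation differences rather than on absolute height.</p>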
      <p id="d2e2567">We observe a significant performance drop for all metrics except MAE (Fig. <xref ref-type="fig" rid="FA4"/>). In addition, the inferred catchment areas appear less plausible compared to those produced by DRRAiNN (Fig. <xref ref-type="fig" rid="FA5"/>), a finding that is supported quantitatively (Fig. <xref ref-type="fig" rid="FA6"/>). These results demonstrate that explicitly distinguishing between spatially extended and local processes benefits DRRAiNN in terms of both predictive accuracy and physical plausibility.</p>
</sec>
<sec id="App1.Ch1.S1.SS3">
  <label>A3</label><title>No hypernetworks</title>
      <p id="d2e2584">Here, we train DRRAiNN without hypernetworks to assess their contribution. To stay close to the original architecture, we preserve the inductive bias that distinguishes between the spatially extended process of water propagation and the local process of evapotranspiration. Specifically, the elevation map is concatenated with the hidden state, passed through a position-wise linear layer, and then fed into the DWConv. This step is necessary because DWConv requires the input and output channels to be of equal size. As a result, the relativity bias, realized by subtracting the elevation of the center cell from the elevations of all other cells within each receptive field of the hypernetwork, is also removed. For solar radiation, we concatenate it with the hidden state and feed the result directly into PWConv1.</p>
      <p id="d2e2587">Removing the hypernetworks from DRRAiNN results in decreased performance for KGE and NSE (Fig. <xref ref-type="fig" rid="FA7"/>a and b). For PCC and MAE, we do not observe a systematic difference (Fig. <xref ref-type="fig" rid="FA7"/>c and d). The ablated model produces less plausible attribution maps than DRRAiNN (Fig. <xref ref-type="fig" rid="FA8"/>), a finding that is supported quantitatively (Fig. <xref ref-type="fig" rid="FA9"/>).</p><fig id="FA2"><label>Figure A2</label><caption><p id="d2e2600">Attribution maps of precipitation for discharge estimation at selected stations and time intervals, averaged over all test set sequences with a rotated elevation map. Brighter colors indicate grid cells where precipitation has a stronger influence on the estimated discharge at the corresponding station. For comparison, traditional catchment areas delineated from elevation data are outlined in red. </p></caption>
          
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f09.png"/>

        </fig>

<fig id="FA3"><label>Figure A3</label><caption><p id="d2e2615">Wasserstein distances between normalized attributions inside and outside the catchment areas delineated from the digital elevation model. A higher distance indicates better agreement between inferred and delineated catchment areas, suggesting more physically realistic model behavior. Standard deviations are computed across the different gauging stations. </p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f10.png"/>

        </fig>

      <fig id="FA4"><label>Figure A4</label><caption><p id="d2e2626">Performances of the best three out of five DRRAiNN model instances and DRRAiNN model instances where all forcings are fed into the PWLSTM, compared to EFAS across different metrics and lead times up to <inline-formula><mml:math id="M124" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d. Results are averaged across all stations. Each line style corresponds to a distinct DRRAiNN instance. </p></caption>
          
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f11.png"/>

        </fig>

<fig id="FA5"><label>Figure A5</label><caption><p id="d2e2647">Attribution maps of precipitation for discharge estimation at selected stations and time intervals, averaged over all test set sequences when all forcings are fed into the PWLSTM. Brighter colors indicate grid cells where precipitation has a stronger influence on the estimated discharge at the corresponding station. For comparison, traditional catchment areas delineated from elevation data are outlined in red. </p></caption>
          
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f12.png"/>

        </fig>

<fig id="FA6"><label>Figure A6</label><caption><p id="d2e2662">Wasserstein distances between normalized attributions inside and outside the catchment areas delineated from the digital elevation model. A higher distance indicates better agreement between inferred and delineated catchment areas, suggesting more physically realistic model behavior. Standard deviations are computed across the different gauging stations. </p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f13.png"/>

        </fig>

      <fig id="FA7"><label>Figure A7</label><caption><p id="d2e2673">Performances of the best three out of five original DRRAiNN model instances and DRRAiNN model instances without hypernetworks, compared to EFAS across different metrics and lead times up to <inline-formula><mml:math id="M125" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d. Results are averaged across all stations. Each line style corresponds to a distinct DRRAiNN instance. </p></caption>
          
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f14.png"/>

        </fig>

<fig id="FA8"><label>Figure A8</label><caption><p id="d2e2694">Attribution maps of precipitation for discharge estimation at selected stations and time intervals, averaged over all test set sequences without hypernetworks. Brighter colors indicate grid cells where precipitation has a stronger influence on the estimated discharge at the corresponding station. For comparison, traditional catchment areas delineated from elevation data are outlined in red. </p></caption>
          
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f15.png"/>

        </fig>

<fig id="FA9"><label>Figure A9</label><caption><p id="d2e2709">Wasserstein distances between normalized attributions inside and outside the catchment areas delineated from the digital elevation model. A higher distance indicates better agreement between inferred and delineated catchment areas, suggesting more physically realistic model behavior. Standard deviations are computed across the different gauging stations. </p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f16.png"/>

        </fig>

</sec>
</app>

<app id="App1.Ch1.S2">
  <label>Appendix B</label><title>Alternative hyperparameters</title>
      <p id="d2e2727">In this appendix, we report the performance of DRRAiNN under alternative hyperparameter settings. In the default configuration, the LSTM in the rainfall-runoff model has a hidden size of <inline-formula><mml:math id="M126" display="inline"><mml:mn mathvariant="normal">4</mml:mn></mml:math></inline-formula>, and the GRU in the discharge model has a hidden size of <inline-formula><mml:math id="M127" display="inline"><mml:mn mathvariant="normal">8</mml:mn></mml:math></inline-formula>. Here, we examine DRRAiNN's performance using both smaller and larger hidden sizes. Additionally, we assess the impact of replacing the GRUs in the discharge model with LSTMs.</p>
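      <p>To give a rough sense of how these hyperparameters change model size, the sketch below counts the trainable parameters of a textbook single-layer LSTM cell (four gated transformations) and GRU cell (three gated transformations). It assumes one bias vector per transformation and a placeholder input size of 1; the actual input sizes and any additional projections in DRRAiNN are not given here, so the numbers only illustrate the scaling with hidden size and are not parameter counts of the published model.</p>
      <preformat><![CDATA[
```python
def recurrent_cell_parameters(input_size, hidden_size, n_transforms):
    """Parameters of a gated recurrent cell with `n_transforms` affine maps,
    each acting on the concatenated input and hidden state (one bias each)."""
    per_transform = hidden_size * (input_size + hidden_size) + hidden_size
    return n_transforms * per_transform


def lstm_parameters(input_size, hidden_size):
    """Textbook LSTM cell: input, forget, and output gates plus cell candidate."""
    return recurrent_cell_parameters(input_size, hidden_size, n_transforms=4)


def gru_parameters(input_size, hidden_size):
    """Textbook GRU cell: update and reset gates plus hidden candidate."""
    return recurrent_cell_parameters(input_size, hidden_size, n_transforms=3)


INPUT_SIZE = 1  # placeholder assumption, not taken from the paper

# Hidden sizes examined in this appendix.
lstm_counts = {h: lstm_parameters(INPUT_SIZE, h) for h in (2, 4, 6)}
gru_counts = {h: gru_parameters(INPUT_SIZE, h) for h in (4, 8, 16)}
lstm_in_discharge_model = lstm_parameters(INPUT_SIZE, 8)
```
]]></preformat>
      <p>Under these assumptions, the count grows roughly quadratically with the hidden size, and an LSTM cell has one third more parameters than a GRU cell of the same size.</p>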
<sec id="App1.Ch1.S2.SS1">
  <label>B1</label><title>Rainfall-runoff model with hidden size 2</title>
      <p id="d2e2751">Figure <xref ref-type="fig" rid="FB1"/> shows that reducing the hidden size of the rainfall-runoff model from <inline-formula><mml:math id="M128" display="inline"><mml:mn mathvariant="normal">4</mml:mn></mml:math></inline-formula> to <inline-formula><mml:math id="M129" display="inline"><mml:mn mathvariant="normal">2</mml:mn></mml:math></inline-formula> still yields a competitive model. On average, it performs slightly worse during the initial days. However, due to the variance in performance across different seeds, additional experiments are required to draw a more definitive conclusion.</p>
</sec>
<sec id="App1.Ch1.S2.SS2">
  <label>B2</label><title>Rainfall-runoff model with hidden size 6</title>
      <p id="d2e2778">Figure <xref ref-type="fig" rid="FB2"/> shows that increasing the hidden size of the rainfall-runoff model from <inline-formula><mml:math id="M130" display="inline"><mml:mn mathvariant="normal">4</mml:mn></mml:math></inline-formula> to <inline-formula><mml:math id="M131" display="inline"><mml:mn mathvariant="normal">6</mml:mn></mml:math></inline-formula> slightly decreases performance on the NSE and PCC metrics, while KGE remains largely unaffected. Since no significant improvement is observed, we argue that the smaller model should be preferred, following Occam's razor.</p>
</sec>
<sec id="App1.Ch1.S2.SS3">
  <label>B3</label><title>Discharge model with hidden size 4</title>
      <p id="d2e2805">Figure <xref ref-type="fig" rid="FB3"/> shows that reducing the hidden size of the discharge model from <inline-formula><mml:math id="M132" display="inline"><mml:mn mathvariant="normal">8</mml:mn></mml:math></inline-formula> to <inline-formula><mml:math id="M133" display="inline"><mml:mn mathvariant="normal">4</mml:mn></mml:math></inline-formula> significantly reduces performance across all metrics and lead times.</p>
</sec>
<sec id="App1.Ch1.S2.SS4">
  <label>B4</label><title>Discharge model with hidden size 16</title>
      <p id="d2e2833">Figure <xref ref-type="fig" rid="FB4"/> shows that increasing the hidden size of the discharge model from <inline-formula><mml:math id="M134" display="inline"><mml:mn mathvariant="normal">8</mml:mn></mml:math></inline-formula> to <inline-formula><mml:math id="M135" display="inline"><mml:mn mathvariant="normal">16</mml:mn></mml:math></inline-formula> leads to mixed results. While KGE appears to deteriorate, NSE and PCC show slight improvements, particularly at longer lead times. Since no significant overall improvement can be observed, we argue that opting for the smaller model aligns better with Occam's razor.</p>
</sec>
<sec id="App1.Ch1.S2.SS5">
  <label>B5</label><title>Discharge model with LSTM</title>
      <p id="d2e2860">Figure <xref ref-type="fig" rid="FB5"/> shows that replacing the GRUs in the discharge model with LSTMs significantly reduces performance across all metrics and almost all lead times. This suggests that model complexity should reflect the complexity of the underlying dynamics: river flow tends to follow simpler dynamics than surface and subsurface flow, which we model with an LSTM. Moreover, water typically resides in channels for shorter periods compared to its residence time below ground. This may explain the superior performance of GRUs in the discharge model, though further investigation is warranted.</p><fig id="FB1"><label>Figure B1</label><caption><p id="d2e2867">Performances of the best three out of five original DRRAiNN model instances and DRRAiNN model instances with a hidden size of <inline-formula><mml:math id="M136" display="inline"><mml:mn mathvariant="normal">2</mml:mn></mml:math></inline-formula> in the rainfall-runoff model, compared to EFAS across different metrics and lead times up to <inline-formula><mml:math id="M137" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d. Results are averaged across all stations. Each line style corresponds to a distinct DRRAiNN instance. </p></caption>
          
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f17.png"/>

        </fig>

      <fig id="FB2"><label>Figure B2</label><caption><p id="d2e2894">Performances of the best three out of five original DRRAiNN model instances and DRRAiNN model instances with a hidden size of <inline-formula><mml:math id="M138" display="inline"><mml:mn mathvariant="normal">6</mml:mn></mml:math></inline-formula> in the rainfall-runoff model, compared to EFAS across different metrics and lead times up to <inline-formula><mml:math id="M139" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d. Results are averaged across all stations. Each line style corresponds to a distinct DRRAiNN instance. </p></caption>
          
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f18.png"/>

        </fig>

<fig id="FB3"><label>Figure B3</label><caption><p id="d2e2923">Performances of the best three out of five original DRRAiNN model instances and DRRAiNN model instances with a hidden size of <inline-formula><mml:math id="M140" display="inline"><mml:mn mathvariant="normal">4</mml:mn></mml:math></inline-formula> in the discharge model, compared to EFAS across different metrics and lead times up to <inline-formula><mml:math id="M141" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d. Results are averaged across all stations. Each line style corresponds to a distinct DRRAiNN instance. </p></caption>
          
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f19.png"/>

        </fig>

      <fig id="FB4"><label>Figure B4</label><caption><p id="d2e2950">Performances of the best three out of five original DRRAiNN model instances and DRRAiNN model instances with a hidden size of <inline-formula><mml:math id="M142" display="inline"><mml:mn mathvariant="normal">16</mml:mn></mml:math></inline-formula> in the discharge model, compared to EFAS across different metrics and lead times up to <inline-formula><mml:math id="M143" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d. Results are averaged across all stations. Each line style corresponds to a distinct DRRAiNN instance. </p></caption>
          
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f20.png"/>

        </fig>

<fig id="FB5"><label>Figure B5</label><caption><p id="d2e2978">Performances of the best three out of five original DRRAiNN model instances and DRRAiNN model instances with LSTMs instead of GRUs in the discharge model, compared to EFAS across different metrics and lead times up to <inline-formula><mml:math id="M144" display="inline"><mml:mn mathvariant="normal">50</mml:mn></mml:math></inline-formula> d. Results are averaged across all stations. Each line style corresponds to a distinct DRRAiNN instance. </p></caption>
          
          <graphic xlink:href="https://hess.copernicus.org/articles/29/6257/2025/hess-29-6257-2025-f21.png"/>

        </fig>

</sec>
</app>
  </app-group><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d2e3001">The preprocessed data sets can be found at <ext-link xlink:href="https://doi.org/10.5281/zenodo.13970575" ext-link-type="DOI">10.5281/zenodo.13970575</ext-link> <xref ref-type="bibr" rid="bib1.bibx53" id="paren.65"/>. The code can be found at <ext-link xlink:href="https://doi.org/10.5281/zenodo.13992583" ext-link-type="DOI">10.5281/zenodo.13992583</ext-link> <xref ref-type="bibr" rid="bib1.bibx54" id="paren.66"/>.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e3019">All authors contributed to the conceptualization of the paper. FS, MT, and MB designed the model architecture. FS developed the code and performed the experiments. FS prepared the manuscript with contributions from all co-authors.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e3025">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e3031">Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e3039">We thank the reviewers for their very constructive criticism, feedback, and suggestions. The authors thank the International Max Planck Research School for Intelligent Systems (IMPRS-IS) for supporting Fedor Scholz and Manuel Traub. ChatGPT was partially used to improve the writing style of the manuscript.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e3044">This work received funding from the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy – EXC number 2064/1 – Project number 390727645 as well as from the Cyber Valley in Tübingen, CyVy-RF-2020-15. This open-access publication was funded by the Open Access Publication Fund of the University of Tübingen.</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e3054">This paper was edited by Daniel Klotz and reviewed by Shijie Jiang and Peter Nelemans.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Al Hossain et al.(2015)Al Hossain, Ahmed, Aktar, Fida, Khan, Islam, Yazdan, Noor, and Rahaman</label><mixed-citation>Al Hossain, B. M. T., Ahmed, T., Aktar, M. N., Fida, M., Khan, A., Islam, A., Yazdan, M. M. S., Noor, F., and Rahaman, A. Z.: Climate Change Impacts on Water Availability in the Meghna Basin, in: Proceedings of the 5th International Conference on Water and Flood Management (ICWFM-2015), Dhaka, Bangladesh, 6–8, ISBN 9789843388018, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>AWGN(2023)</label><mixed-citation>AWGN: Amtliches Digitales Wasserwirtschaftliches Gewässernetz (AWGN), <uri>https://www.lubw.baden-wuerttemberg.de/wasser/awgn</uri> (last access: 23 July 2024), 2023.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Bharati et al.(2011)Bharati, Lacombe, Gurung, Jayakody, Hoanh, and Smakhtin</label><mixed-citation>Bharati, L., Lacombe, G., Gurung, P., Jayakody, P., Hoanh, C. T., and Smakhtin, V.: The Impacts of Water Infrastructure and Climate Change on the Hydrology of the Upper Ganges River Basin, International Water Management Institute (Research Report 142), <ext-link xlink:href="https://doi.org/10.5337/2011.210" ext-link-type="DOI">10.5337/2011.210</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Bindas et al.(2024)Bindas, Tsai, Liu, Rahmani, Feng, Bian, Lawson, and Shen</label><mixed-citation>Bindas, T., Tsai, W.-P., Liu, J., Rahmani, F., Feng, D., Bian, Y., Lawson, K., and Shen, C.: Improving River Routing Using a Differentiable Muskingum-Cunge Model and Physics-Informed Machine Learning, Water Resources Research, 60, e2023WR035337, <ext-link xlink:href="https://doi.org/10.1029/2023WR035337" ext-link-type="DOI">10.1029/2023WR035337</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Börgel et al.(2025)Börgel, Karsten, Rummel, and Gräwe</label><mixed-citation>Börgel, F., Karsten, S., Rummel, K., and Gräwe, U.: From weather data to river runoff: using spatiotemporal convolutional networks for discharge forecasting, Geosci. Model Dev., 18, 2005–2019, <ext-link xlink:href="https://doi.org/10.5194/gmd-18-2005-2025" ext-link-type="DOI">10.5194/gmd-18-2005-2025</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Brutsaert(2023)</label><mixed-citation>Brutsaert, W.: Hydrology: An Introduction (2nd ed.), Cambridge University Press, Cambridge, UK, ISBN 9781107135277, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Butz et al.(2019)Butz, Bilkey, Humaidan, Knott, and Otte</label><mixed-citation>Butz, M. V., Bilkey, D., Humaidan, D., Knott, A., and Otte, S.: Learning, planning, and control in a monolithic neural event inference architecture, Neural Networks, 117, 135–144, <ext-link xlink:href="https://doi.org/10.1016/j.neunet.2019.05.001" ext-link-type="DOI">10.1016/j.neunet.2019.05.001</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Butz et al.(2025)Butz, Mittenbühler, Schwöbel, Achimova, Gumbsch, Otte, and Kiebel</label><mixed-citation>Butz, M. V., Mittenbühler, M., Schwöbel, S., Achimova, A., Gumbsch, C., Otte, S., and Kiebel, S.: Contextualizing predictive minds, Neuroscience &amp; Biobehavioral Reviews, 168, 105948, <ext-link xlink:href="https://doi.org/10.1016/j.neubiorev.2024.105948" ext-link-type="DOI">10.1016/j.neubiorev.2024.105948</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Camporese and Girotto(2022)</label><mixed-citation>Camporese, M. and Girotto, M.: Recent advances and opportunities in data assimilation for physics-based hydrological modeling, Frontiers in Water, 4, 948832, <ext-link xlink:href="https://doi.org/10.3389/frwa.2022.948832" ext-link-type="DOI">10.3389/frwa.2022.948832</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Chen et al.(2022)Chen, Zwart, and Jia</label><mixed-citation>Chen, S., Zwart, J. A., and Jia, X.: Physics-Guided Graph Meta Learning for Predicting Water Temperature and Streamflow in Stream Networks, in: KDD 22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD, 22, 2752–2761, ISBN 9781450393850, <ext-link xlink:href="https://doi.org/10.1145/3534678.3539115" ext-link-type="DOI">10.1145/3534678.3539115</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Cho et al.(2014)Cho, van Merrienboer, Bahdanau, and Bengio</label><mixed-citation>Cho, K., van Merrienboer, B., Bahdanau, D., and Bengio, Y.: On the Properties of Neural Machine Translation: Encoder–Decoder Approaches, in: Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation, <ext-link xlink:href="https://doi.org/10.3115/v1/w14-4012" ext-link-type="DOI">10.3115/v1/w14-4012</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Copernicus Climate Change Service(2023)</label><mixed-citation>Copernicus Climate Change Service, Climate Data Store: ERA5 hourly data on single levels from 1940 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS), <ext-link xlink:href="https://doi.org/10.24381/cds.adbb2d47" ext-link-type="DOI">10.24381/cds.adbb2d47</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>EU-DEM(2016)</label><mixed-citation>EU-DEM: EU-DEM v1.1, Dataset, <uri>https://land.copernicus.eu/imagery-in-situ/eu-dem/eu-dem-v1.1</uri> (last access: 6 September 2023), 2016.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Gauch et al.(2021)Gauch, Kratzert, Klotz, Nearing, Lin, and Hochreiter</label><mixed-citation>Gauch, M., Kratzert, F., Klotz, D., Nearing, G., Lin, J., and Hochreiter, S.: Rainfall–runoff prediction at multiple timescales with a single Long Short-Term Memory network, Hydrol. Earth Syst. Sci., 25, 2045–2062, <ext-link xlink:href="https://doi.org/10.5194/hess-25-2045-2021" ext-link-type="DOI">10.5194/hess-25-2045-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Gigi et al.(2019)Gigi, Elidan, Hassidim, Matias, Moshe, Nevo, Shalev, and Wiesel</label><mixed-citation>Gigi, Y., Elidan, G., Hassidim, A., Matias, Y., Moshe, Z., Nevo, S., Shalev, G., and Wiesel, A.: Towards global remote discharge estimation: Using the few to estimate the many, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1901.00786" ext-link-type="DOI">10.48550/arXiv.1901.00786</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Gillies et al.(2013)</label><mixed-citation>Gillies, S. and others: Rasterio: geospatial raster I/O for Python programmers, GitHub, <uri>https://github.com/rasterio/rasterio</uri> (last access: 21 June 2024), 2013.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>GRDC(2024)</label><mixed-citation>GRDC: Global Runoff Data Centre, <uri>https://grdc.bafg.de/</uri> (last access: 22 October 2024), 2024.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Gupta et al.(2009)Gupta, Kling, Yilmaz, and Martinez</label><mixed-citation>Gupta, H. V., Kling, H., Yilmaz, K. K., and Martinez, G. F.: Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling, Journal of Hydrology, 377, 80–91, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2009.08.003" ext-link-type="DOI">10.1016/j.jhydrol.2009.08.003</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Hendrycks and Gimpel(2016)</label><mixed-citation>Hendrycks, D. and Gimpel, K.: Gaussian Error Linear Units (GELUs), arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1606.08415" ext-link-type="DOI">10.48550/arXiv.1606.08415</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Hochreiter and Schmidhuber(1997)</label><mixed-citation>Hochreiter, S. and Schmidhuber, J.: Long Short-Term Memory, Neural Computation, 9, 1735–1780, <ext-link xlink:href="https://doi.org/10.1162/neco.1997.9.8.1735" ext-link-type="DOI">10.1162/neco.1997.9.8.1735</ext-link>, 1997.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Hoedt et al.(2021)Hoedt, Kratzert, Klotz, Halmich, Holzleitner, Nearing, Hochreiter, and Klambauer</label><mixed-citation>Hoedt, P.-J., Kratzert, F., Klotz, D., Halmich, C., Holzleitner, M., Nearing, G., Hochreiter, S., and Klambauer, G.: MC-LSTM: Mass-Conserving LSTM, Proceedings of Machine Learning Research, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2101.05186" ext-link-type="DOI">10.48550/arXiv.2101.05186</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Höge et al.(2022)Höge, Scheidegger, Baity-Jesi, Albert, and Fenicia</label><mixed-citation>Höge, M., Scheidegger, A., Baity-Jesi, M., Albert, C., and Fenicia, F.: Improving hydrologic models for predictions and process understanding using neural ODEs, Hydrol. Earth Syst. Sci., 26, 5085–5102, <ext-link xlink:href="https://doi.org/10.5194/hess-26-5085-2022" ext-link-type="DOI">10.5194/hess-26-5085-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Hrachowitz et al.(2013)Hrachowitz, Savenije, Blöschl, McDonnell, Sivapalan, Pomeroy, Arheimer, Blume, Clark, Ehret, Fenicia, Freer, Gelfan, Gupta, Hughes, Hut, Montanari, Pande, Tetzlaff, Troch, Uhlenbrook, Wagener, Winsemius, Woods, Zehe, and Cudennec</label><mixed-citation>Hrachowitz, M., Savenije, H., Blöschl, G., McDonnell, J., Sivapalan, M., Pomeroy, J., Arheimer, B., Blume, T., Clark, M., Ehret, U., Fenicia, F., Freer, J., Gelfan, A., Gupta, H., Hughes, D., Hut, R., Montanari, A., Pande, S., Tetzlaff, D., Troch, P., Uhlenbrook, S., Wagener, T., Winsemius, H., Woods, R., Zehe, E., and Cudennec, C.: A decade of Predictions in Ungauged Basins (PUB) – a review, Hydrological Sciences Journal, 58, 1198–1255, <ext-link xlink:href="https://doi.org/10.1080/02626667.2013.803183" ext-link-type="DOI">10.1080/02626667.2013.803183</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Hunter et al.(2007)Hunter, Bates, Horritt, and Wilson</label><mixed-citation>Hunter, N. M., Bates, P. D., Horritt, M. S., and Wilson, M. D.: Simple spatially-distributed models for predicting flood inundation: A review, Geomorphology, 90, 208–225, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Imhoff et al.(2020)Imhoff, Van Verseveld, Van Osnabrugge, and Weerts</label><mixed-citation>Imhoff, R., Van Verseveld, W., Van Osnabrugge, B., and Weerts, A.: Scaling point-scale (pedo) transfer functions to seamless large-domain parameter estimates for high-resolution distributed hydrologic modeling: An example for the Rhine River, Water Resources Research, 56, e2019WR026807, <ext-link xlink:href="https://doi.org/10.1029/2019WR026807" ext-link-type="DOI">10.1029/2019WR026807</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Imhoff et al.(2022)Imhoff, Brauer, van Heeringen, Uijlenhoet, and Weerts</label><mixed-citation>Imhoff, R. O., Brauer, C. C., van Heeringen, K.-J., Uijlenhoet, R., and Weerts, A. H.: Large-sample evaluation of radar rainfall nowcasting for flood early warning, Water Resources Research, 58, e2021WR031591, <ext-link xlink:href="https://doi.org/10.1029/2021WR031591" ext-link-type="DOI">10.1029/2021WR031591</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Karlbauer et al.(2019)Karlbauer, Otte, Lensch, Scholten, Wulfmeyer, and Butz</label><mixed-citation>Karlbauer, M., Otte, S., Lensch, H., Scholten, T., Wulfmeyer, V., and Butz, M. V.: A distributed neural network architecture for robust non-linear spatio-temporal prediction, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1912.11141" ext-link-type="DOI">10.48550/arXiv.1912.11141</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Klotz et al.(2022)Klotz, Kratzert, Gauch, Keefe Sampson, Brandstetter, Klambauer, Hochreiter, and Nearing</label><mixed-citation>Klotz, D., Kratzert, F., Gauch, M., Keefe Sampson, A., Brandstetter, J., Klambauer, G., Hochreiter, S., and Nearing, G.: Uncertainty estimation with deep learning for rainfall–runoff modeling, Hydrol. Earth Syst. Sci., 26, 1673–1693, <ext-link xlink:href="https://doi.org/10.5194/hess-26-1673-2022" ext-link-type="DOI">10.5194/hess-26-1673-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Kratzert et al.(2018)Kratzert, Klotz, Brenner, Schulz, and Herrnegger</label><mixed-citation>Kratzert, F., Klotz, D., Brenner, C., Schulz, K., and Herrnegger, M.: Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks, Hydrol. Earth Syst. Sci., 22, 6005–6022, <ext-link xlink:href="https://doi.org/10.5194/hess-22-6005-2018" ext-link-type="DOI">10.5194/hess-22-6005-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Kratzert et al.(2019)Kratzert, Klotz, Shalev, Klambauer, Hochreiter, and Nearing</label><mixed-citation>Kratzert, F., Klotz, D., Shalev, G., Klambauer, G., Hochreiter, S., and Nearing, G.: Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets, Hydrol. Earth Syst. Sci., 23, 5089–5110, <ext-link xlink:href="https://doi.org/10.5194/hess-23-5089-2019" ext-link-type="DOI">10.5194/hess-23-5089-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Kratzert et al.(2021)Kratzert, Klotz, Gauch, Klingler, Nearing, and Hochreiter</label><mixed-citation>Kratzert, F., Klotz, D., Gauch, M., Klingler, C., Nearing, G., and Hochreiter, S.: Large-scale river network modeling using Graph Neural Networks, in: EGU General Assembly Conference Abstracts, EGU21-13375, <ext-link xlink:href="https://doi.org/10.5194/egusphere-egu21-13375" ext-link-type="DOI">10.5194/egusphere-egu21-13375</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Li et al.(2022)Li, Zhang, and Krebs</label><mixed-citation>Li, P., Zhang, J., and Krebs, P.: Prediction of flow based on a CNN-LSTM combined deep learning approach, Water, 14, 993, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Liu and Gupta(2007)</label><mixed-citation>Liu, Y. and Gupta, H. V.: Uncertainty in hydrologic modeling: Toward an integrated data assimilation framework, Water Resources Research, 43, W07401, <ext-link xlink:href="https://doi.org/10.1029/2006WR005756" ext-link-type="DOI">10.1029/2006WR005756</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Liu et al.(2012)Liu, Weerts, Clark, Hendricks Franssen, Kumar, Moradkhani, Seo, Schwanenberg, Smith, Van Dijk, Van Velzen, He, Lee, Noh, Rakovec, and Restrepo</label><mixed-citation>Liu, Y., Weerts, A. H., Clark, M., Hendricks Franssen, H.-J., Kumar, S., Moradkhani, H., Seo, D.-J., Schwanenberg, D., Smith, P., van Dijk, A. I. J. M., van Velzen, N., He, M., Lee, H., Noh, S. J., Rakovec, O., and Restrepo, P.: Advancing data assimilation in operational hydrologic forecasting: progresses, challenges, and emerging opportunities, Hydrol. Earth Syst. Sci., 16, 3863–3887, <ext-link xlink:href="https://doi.org/10.5194/hess-16-3863-2012" ext-link-type="DOI">10.5194/hess-16-3863-2012</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Liu et al.(2022)Liu, Mao, Wu, Feichtenhofer, Darrell, and Xie</label><mixed-citation>Liu, Z., Mao, H., Wu, C.-Y., Feichtenhofer, C., Darrell, T., and Xie, S.: A ConvNet for the 2020s, in: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), <ext-link xlink:href="https://doi.org/10.1109/cvpr52688.2022.01167" ext-link-type="DOI">10.1109/cvpr52688.2022.01167</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Longyang et al.(2024)Longyang, Choi, Tennant, Hill, Ashmead, Neilson, Newell, McNamara, and Xu</label><mixed-citation>Longyang, Q., Choi, S., Tennant, H., Hill, D., Ashmead, N., Neilson, B. T., Newell, D. L., McNamara, J. P., and Xu, T.: Explainable Spatially Distributed Hydrologic Modeling of a Snow Dominated Mountainous Karst Watershed Using Attention, Authorea Preprints, <ext-link xlink:href="https://doi.org/10.22541/essoar.171536019.93198716/v1" ext-link-type="DOI">10.22541/essoar.171536019.93198716/v1</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Marçais and de Dreuzy(2017)</label><mixed-citation>Marçais, J. and de Dreuzy, J.-R.: Prospective interest of deep learning for hydrological inference, Groundwater, 55, 688–692, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Montzka et al.(2012)Montzka, Pauwels, Franssen, Han, and Vereecken</label><mixed-citation>Montzka, C., Pauwels, V. R., Franssen, H.-J. H., Han, X., and Vereecken, H.: Multivariate and multiscale data assimilation in terrestrial systems: A review, Sensors, 12, 16291–16333, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Moradkhani et al.(2005)Moradkhani, Hsu, Gupta, and Sorooshian</label><mixed-citation>Moradkhani, H., Hsu, K.-L., Gupta, H., and Sorooshian, S.: Uncertainty assessment of hydrologic model states and parameters: Sequential data assimilation using the particle filter, Water Resources Research, 41, W05012, <ext-link xlink:href="https://doi.org/10.1029/2004WR003604" ext-link-type="DOI">10.1029/2004WR003604</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Moshe et al.(2020)Moshe, Metzger, Kratzert, Morin, Nevo, Elidan, and Elyaniv</label><mixed-citation>Moshe, Z., Metzger, A., Kratzert, F., Morin, E., Nevo, S., Elidan, G., and Elyaniv, R.: HydroNets: Leveraging River Network Structure and Deep Neural Networks for Hydrologic Modeling, EGU General Assembly 2020, Online, 4–8 May 2020, EGU2020-4135, <ext-link xlink:href="https://doi.org/10.5194/egusphere-egu2020-4135" ext-link-type="DOI">10.5194/egusphere-egu2020-4135</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Muñoz-Carpena et al.(2023)Muñoz-Carpena, Carmona-Cabrero, Yu, Fox, and Batelaan</label><mixed-citation>Muñoz-Carpena, R., Carmona-Cabrero, A., Yu, Z., Fox, G., and Batelaan, O.: Convergence of mechanistic modeling and artificial intelligence in hydrologic science and engineering, PLOS Water, 2, e0000059, <ext-link xlink:href="https://doi.org/10.1371/journal.pwat.0000212" ext-link-type="DOI">10.1371/journal.pwat.0000212</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Nash and Sutcliffe(1970)</label><mixed-citation>Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual models part I – A discussion of principles, Journal of Hydrology, 10, 282–290, 1970.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Nearing et al.(2020)Nearing, Kratzert, Sampson, Pelissier, Klotz, Frame, Prieto, and Gupta</label><mixed-citation>Nearing, G., Kratzert, F., Sampson, A. K., Pelissier, C., Klotz, D., Frame, J., Prieto, C., and Gupta, H.: What Role Does Hydrological Science Play in the Age of Machine Learning?, Water Resources Research, 57, e2020WR028091, <ext-link xlink:href="https://doi.org/10.31223/osf.io/3sx6g" ext-link-type="DOI">10.31223/osf.io/3sx6g</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Oddo et al.(2024)Oddo, Bolten, Kumar, and Cleary</label><mixed-citation>Oddo, P. C., Bolten, J. D., Kumar, S. V., and Cleary, B.: Deep Convolutional LSTM for improved flash flood prediction, Frontiers in Water, 6, 1346104, <ext-link xlink:href="https://doi.org/10.3389/frwa.2024.1346104" ext-link-type="DOI">10.3389/frwa.2024.1346104</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Otte et al.(2020)Otte, Karlbauer, and Butz</label><mixed-citation>Otte, S., Karlbauer, M., and Butz, M. V.: Active Tuning, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2010.03958" ext-link-type="DOI">10.48550/arXiv.2010.03958</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Palmer et al.(2008)Palmer, Reidy Liermann, Nilsson, Flörke, Alcamo, Lake, and Bond</label><mixed-citation>Palmer, M. A., Reidy Liermann, C. A., Nilsson, C., Flörke, M., Alcamo, J., Lake, P. S., and Bond, N.: Climate change and the world's river basins: anticipating management options, Frontiers in Ecology and the Environment, 6, 81–89, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Paszke et al.(2019)Paszke, Gross, Massa, Lerer, Bradbury, Chanan, Killeen, Lin, Gimelshein, Antiga, Desmaison, Kopf, Yang, DeVito, Raison, Tejani, Chilamkurthy, Steiner, Fang, Bai, and Chintala</label><mixed-citation>Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., Desmaison, A., Kopf, A., Yang, E., DeVito, Z., Raison, M., Tejani, A., Chilamkurthy, S., Steiner, B., Fang, L., Bai, J., and Chintala, S.: PyTorch: An Imperative Style, High-Performance Deep Learning Library, Advances in Neural Information Processing Systems, 12, <ext-link xlink:href="https://doi.org/10.48550/arXiv.1912.01703" ext-link-type="DOI">10.48550/arXiv.1912.01703</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Pilon(2002)</label><mixed-citation>Pilon, P. J.: Guidelines for reducing flood losses, Tech. rep., United Nations International Strategy for Disaster Reduction (UNISDR), <uri>https://www.un.org/esa/sustdev/publications/flood_guidelines.pdf</uri> (last access: 11 November 2025), 2002.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Pokharel and Roy(2024)</label><mixed-citation>Pokharel, S. and Roy, T.: A parsimonious setup for streamflow forecasting using CNN-LSTM, Journal of Hydroinformatics, jh2024114, <ext-link xlink:href="https://doi.org/10.2166/hydro.2024.114" ext-link-type="DOI">10.2166/hydro.2024.114</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx50"><label>RADOLAN(2016)</label><mixed-citation>RADOLAN: RADOLAN/RADVOR, <uri>https://opendata.dwd.de/climate_environment/CDC/grids_germany/hourly/radolan/</uri> (last access: 6 September 2023), 2016.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>Rakovec et al.(2012)Rakovec, Weerts, Hazenberg, Torfs, and Uijlenhoet</label><mixed-citation>Rakovec, O., Weerts, A. H., Hazenberg, P., Torfs, P. J. J. F., and Uijlenhoet, R.: State updating of a distributed hydrological model with Ensemble Kalman Filtering: effects of updating frequency and observation network density on forecast accuracy, Hydrol. Earth Syst. Sci., 16, 3435–3449, <ext-link xlink:href="https://doi.org/10.5194/hess-16-3435-2012" ext-link-type="DOI">10.5194/hess-16-3435-2012</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx52"><label>Samaniego et al.(2010)Samaniego, Kumar, and Attinger</label><mixed-citation>Samaniego, L., Kumar, R., and Attinger, S.: Multiscale parameter regionalization of a grid-based hydrologic model at the mesoscale, Water Resources Research, 46, W05523, <ext-link xlink:href="https://doi.org/10.1029/2008WR007327" ext-link-type="DOI">10.1029/2008WR007327</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx53"><label>Scholz et al.(2024a)Scholz, Traub, Zarfl, Scholten, and Butz</label><mixed-citation>Scholz, F., Traub, M., Zarfl, C., Scholten, T., and Butz, M. V.: Fully differentiable, fully distributed River Discharge Prediction: data sets, Zenodo [data set], <ext-link xlink:href="https://doi.org/10.5281/zenodo.13970575" ext-link-type="DOI">10.5281/zenodo.13970575</ext-link>, 2024a.</mixed-citation></ref>
      <ref id="bib1.bibx54"><label>Scholz et al.(2024b)Scholz, Traub, Zarfl, Scholten, and Butz</label><mixed-citation>Scholz, F., Traub, M., Zarfl, C., Scholten, T., and Butz, M. V.: Fully differentiable, fully distributed River Discharge Prediction: code, Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/zenodo.13992583" ext-link-type="DOI">10.5281/zenodo.13992583</ext-link>, 2024b.</mixed-citation></ref>
      <ref id="bib1.bibx55"><label>Shen(2018)</label><mixed-citation>Shen, C.: A transdisciplinary review of deep learning research and its relevance for water resources scientists, Water Resources Research, 54, 8558–8593, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx56"><label>Shen et al.(2023)Shen, Appling, Gentine, Bandai, Gupta, Tartakovsky, Baity-Jesi, Fenicia, Kifer, Li, Liu, Ren, Zheng, Harman, Clark, Farthing, Feng, Kumar, Aboelyazeed, Rahmani, Song, Beck, Bindas, Dwivedi, Fang, Höge, Rackauckas, Mohanty, Roy, Xu, and Lawson</label><mixed-citation>Shen, C., Appling, A. P., Gentine, P., Bandai, T., Gupta, H., Tartakovsky, A., Baity-Jesi, M., Fenicia, F., Kifer, D., Li, L., Liu, X., Ren, W., Zheng, Y., Harman, C. J., Clark, M., Farthing, M., Feng, D., Kumar, P., Aboelyazeed, D., Rahmani, F., Song, Y., Beck, H. E., Bindas, T., Dwivedi, D., Fang, K., Höge, M., Rackauckas, C., Mohanty, B., Roy, T., Xu, C., and Lawson, K.: Differentiable modelling to unify machine learning and physical models for geosciences, Nature Reviews Earth &amp; Environment, 4, 552–567, <ext-link xlink:href="https://doi.org/10.1038/s43017-023-00450-9" ext-link-type="DOI">10.1038/s43017-023-00450-9</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx57"><label>Shi et al.(2015)Shi, Chen, Wang, Yeung, Wong, and Woo</label><mixed-citation>Shi, X., Chen, Z., Wang, H., Yeung, D.-Y., Wong, W.-k., and Woo, W.-c.: Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting, Advances in neural information processing systems, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1506.04214" ext-link-type="DOI">10.48550/arXiv.1506.04214</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx58"><label>Simonyan et al.(2013)Simonyan, Vedaldi, and Zisserman</label><mixed-citation>Simonyan, K., Vedaldi, A., and Zisserman, A.: Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1312.6034" ext-link-type="DOI">10.48550/arXiv.1312.6034</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx59"><label>Sit et al.(2020)Sit, Demiray, Xiang, Ewing, Sermet, and Demir</label><mixed-citation>Sit, M., Demiray, B., Xiang, Z., Ewing, G., Sermet, Y., and Demir, I.: A Comprehensive Review of Deep Learning Applications in Hydrology and Water Resources, <ext-link xlink:href="https://doi.org/10.31223/osf.io/xs36g" ext-link-type="DOI">10.31223/osf.io/xs36g</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx60"><label>Sit et al.(2021)Sit, Demiray, and Demir</label><mixed-citation>Sit, M., Demiray, B., and Demir, I.: Short-term Hourly Streamflow Prediction with Graph Convolutional GRU Networks, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2107.07039" ext-link-type="DOI">10.48550/arXiv.2107.07039</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx61"><label>Sun et al.(2022)Sun, Jiang, Yang, Xie, and Chen</label><mixed-citation>Sun, A. Y., Jiang, P., Yang, Z.-L., Xie, Y., and Chen, X.: A graph neural network (GNN) approach to basin-scale river network learning: the role of physics-based connectivity and data fusion, Hydrol. Earth Syst. Sci., 26, 5163–5184, <ext-link xlink:href="https://doi.org/10.5194/hess-26-5163-2022" ext-link-type="DOI">10.5194/hess-26-5163-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx62"><label>Traub et al.(2024)Traub, Becker, Sauter, Otte, and Butz</label><mixed-citation>Traub, M., Becker, F., Sauter, A., Otte, S., and Butz, M. V.: Loci-segmented: improving scene segmentation learning, in: International Conference on Artificial Neural Networks, 45–61, Springer, <ext-link xlink:href="https://doi.org/10.1007/978-3-031-72338-4_4" ext-link-type="DOI">10.1007/978-3-031-72338-4_4</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx63"><label>Tyson et al.(2023)Tyson, Longyang, Neilson, Zeng, and Xu</label><mixed-citation>Tyson, C., Longyang, Q., Neilson, B. T., Zeng, R., and Xu, T.: Effects of meteorological forcing uncertainty on high-resolution snow modeling and streamflow prediction in a mountainous karst watershed, Journal of Hydrology, 619, 129304, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2023.129304" ext-link-type="DOI">10.1016/j.jhydrol.2023.129304</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx64"><label>Ueda et al.(2024)Ueda, Tanouchi, Egusa, and Yoshihiro</label><mixed-citation>Ueda, F., Tanouchi, H., Egusa, N., and Yoshihiro, T.: A Transfer Learning Approach Based on Radar Rainfall for River Water-Level Prediction, Water, 16, 607, <ext-link xlink:href="https://doi.org/10.3390/w16040607" ext-link-type="DOI">10.3390/w16040607</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx65"><label>Ufrecht(2002)</label><mixed-citation>Ufrecht, W.: Hydrogeologische Modelle – ein Leitfaden mit Fallbeispielen, Schriftenreihe der Deutschen Geologischen Gesellschaft, 24, Schweizerbart Science Publishers, Stuttgart, Germany, <uri>http://www.schweizerbart.de//publications/detail/artno/171902400/Schriftenreihe_der_Dt_Ges_f_Geowissen</uri> (last access: 11 November 2025), 2002.</mixed-citation></ref>
      <ref id="bib1.bibx66"><label>Valeriano et al.(2010)Valeriano, Koike, Yang, and Yang</label><mixed-citation>Valeriano, O. C. S., Koike, T., Yang, K., and Yang, D.: Optimal dam operation during flood season using a distributed hydrological model and a heuristic algorithm, Journal of Hydrologic Engineering, 15, 580–586, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx67"><label>Van Vliet et al.(2013)Van Vliet, Franssen, Yearsley, Ludwig, Haddeland, Lettenmaier, and Kabat</label><mixed-citation>Van Vliet, M. T., Franssen, W. H., Yearsley, J. R., Ludwig, F., Haddeland, I., Lettenmaier, D. P., and Kabat, P.: Global river discharge and water temperature under climate change, Global Environmental Change, 23, 450–464, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx68"><label>Wang et al.(2024)Wang, Jiang, Zheng, Han, Kumar, Rakovec, and Li</label><mixed-citation>Wang, C., Jiang, S., Zheng, Y., Han, F., Kumar, R., Rakovec, O., and Li, S.: Distributed Hydrological Modeling With Physics-Encoded Deep Learning: A General Framework and Its Application in the Amazon, Water Resources Research, 60, e2023WR036170, <ext-link xlink:href="https://doi.org/10.1029/2023WR036170" ext-link-type="DOI">10.1029/2023WR036170</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx69"><label>Wright(2019)</label><mixed-citation>Wright, L.: Ranger – a synergistic optimizer, GitHub, <uri>https://github.com/lessw2020/Ranger-Deep-Learning-Optimizer</uri> (last access: 22 August 2024), 2019.</mixed-citation></ref>
      <ref id="bib1.bibx70"><label>Xiang and Demir(2020)</label><mixed-citation>Xiang, Z. and Demir, I.: Distributed long-term hourly streamflow predictions using deep learning – A case study for State of Iowa, Environmental Modelling &amp; Software, 131, 104761, <ext-link xlink:href="https://doi.org/10.1016/j.envsoft.2020.104761" ext-link-type="DOI">10.1016/j.envsoft.2020.104761</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx71"><label>Xiang and Demir(2022)</label><mixed-citation>Xiang, Z. and Demir, I.: Fully distributed rainfall-runoff modeling using spatial-temporal graph neural network, EarthArXiv, <ext-link xlink:href="https://doi.org/10.31223/X57P74" ext-link-type="DOI">10.31223/X57P74</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx72"><label>Xu et al.(2022)Xu, Longyang, Tyson, Zeng, and Neilson</label><mixed-citation>Xu, T., Longyang, Q., Tyson, C., Zeng, R., and Neilson, B. T.: Hybrid physically based and deep learning modeling of a snow dominated, mountainous, karst watershed, Water Resources Research, 58, e2021WR030993, <ext-link xlink:href="https://doi.org/10.1029/2021WR030993" ext-link-type="DOI">10.1029/2021WR030993</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx73"><label>Yadan(2019)</label><mixed-citation>Yadan, O.: Hydra – A framework for elegantly configuring complex applications, GitHub, <uri>https://github.com/facebookresearch/hydra</uri> (last access: 28 October 2024), 2019.</mixed-citation></ref>
      <ref id="bib1.bibx74"><label>Zhong et al.(2024)Zhong, Lei, Li, and Jiang</label><mixed-citation>Zhong, L., Lei, H., Li, Z., and Jiang, S.: Advancing streamflow prediction in data-scarce regions through vegetation-constrained distributed hybrid ecohydrological models, Journal of Hydrology, 645, 132165, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2024.132165" ext-link-type="DOI">10.1016/j.jhydrol.2024.132165</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx75"><label>Zhu et al.(2023)Zhu, Wei, Zhang, Xu, and Qin</label><mixed-citation>Zhu, S., Wei, J., Zhang, H., Xu, Y., and Qin, H.: Spatiotemporal deep learning rainfall-runoff forecasting combined with remote sensing precipitation products in large scale basins, Journal of Hydrology, 616, 128727, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2022.128727" ext-link-type="DOI">10.1016/j.jhydrol.2022.128727</ext-link>, 2023.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Fully differentiable, fully distributed rainfall-runoff modeling</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Al Hossain et al.(2015)Al Hossain, Ahmed, Aktar, Fida, Khan, Islam,
Yazdan, Noor, and Rahaman</label><mixed-citation>
      
Al Hossain, B. M. T., Ahmed, T., Aktar, M. N., Fida, M., Khan, A., Islam, A.,
Yazdan, M. M. S., Noor, F., and Rahaman, A. Z.: Climate Change Impacts on
Water Availability in the Meghna Basin, in: Proceedings of the 5th
International Conference on Water and Flood Management (ICWFM-2015), Dhaka,
Bangladesh, 6–8, ISBN 9789843388018, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>AWGN(2023)</label><mixed-citation>
      
AWGN: Amtliches Digitales Wasserwirtschaftliches Gewässernetz (AWGN),
<a href="https://www.lubw.baden-wuerttemberg.de/wasser/awgn" target="_blank"/> (last access: 23 July 2024), 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Bharati et al.(2011)Bharati, Lacombe, Gurung, Jayakody, Hoanh, and
Smakhtin</label><mixed-citation>
      
Bharati, L., Lacombe, G., Gurung, P., Jayakody, P., Hoanh, C. T., and Smakhtin,   V.: The Impacts of Water Infrastructure and Climate Change on the Hydrology of the Upper Ganges River Basin, International Water Management Institute (Research Report 142), <a href="https://doi.org/10.5337/2011.210" target="_blank">https://doi.org/10.5337/2011.210</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Bindas et al.(2024)Bindas, Tsai, Liu, Rahmani, Feng, Bian, Lawson,
and Shen</label><mixed-citation>
      
Bindas, T., Tsai, W.-P., Liu, J., Rahmani, F., Feng, D., Bian, Y., Lawson, K.,
and Shen, C.: Improving River Routing Using a Differentiable Muskingum-Cunge
Model and Physics-Informed Machine Learning, Water Resources Research, 60,
e2023WR035337, <a href="https://doi.org/10.1029/2023WR035337" target="_blank">https://doi.org/10.1029/2023WR035337</a>,  2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Börgel et al.(2025)Börgel, Karsten, Rummel, and
Gräwe</label><mixed-citation>
      
Börgel, F., Karsten, S., Rummel, K., and Gräwe, U.: From weather data to river runoff: using spatiotemporal convolutional networks for discharge forecasting, Geosci. Model Dev., 18, 2005–2019, <a href="https://doi.org/10.5194/gmd-18-2005-2025" target="_blank">https://doi.org/10.5194/gmd-18-2005-2025</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Brutsaert(2023)</label><mixed-citation>
      
Brutsaert, W.: Hydrology: An Introduction (2nd ed.), Cambridge University Press, Cambridge, UK, ISBN 9781107135277, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Butz et al.(2019)Butz, Bilkey, Humaidan, Knott, and Otte</label><mixed-citation>
      
Butz, M. V., Bilkey, D., Humaidan, D., Knott, A., and Otte, S.: Learning,
planning, and control in a monolithic neural event inference architecture,
Neural Networks, 117, 135–144, <a href="https://doi.org/10.1016/j.neunet.2019.05.001" target="_blank">https://doi.org/10.1016/j.neunet.2019.05.001</a>,  2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Butz et al.(2025)Butz, Mittenbühler, Schwöbel, Achimova,
Gumbsch, Otte, and Kiebel</label><mixed-citation>
      
Butz, M. V., Mittenbühler, M., Schwöbel, S., Achimova, A., Gumbsch, C.,
Otte, S., and Kiebel, S.: Contextualizing predictive minds, Neuroscience
&amp; Biobehavioral Reviews, 168, 105948,
<a href="https://doi.org/10.1016/j.neubiorev.2024.105948" target="_blank">https://doi.org/10.1016/j.neubiorev.2024.105948</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Camporese and Girotto(2022)</label><mixed-citation>
      
Camporese, M. and Girotto, M.: Recent advances and opportunities in data
assimilation for physics-based hydrological modeling, Frontiers in Water, 4,
948832, <a href="https://doi.org/10.3389/frwa.2022.948832" target="_blank">https://doi.org/10.3389/frwa.2022.948832</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Chen et al.(2022)Chen, Zwart, and Jia</label><mixed-citation>
      
Chen, S., Zwart, J. A., and Jia, X.: Physics-Guided Graph Meta Learning for
Predicting Water Temperature and Streamflow in Stream Networks, in: KDD
22: The 28th ACM SIGKDD Conference on Knowledge Discovery
and Data Mining, KDD, 22,  2752–2761, ISBN 9781450393850,
<a href="https://doi.org/10.1145/3534678.3539115" target="_blank">https://doi.org/10.1145/3534678.3539115</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Cho et al.(2014)Cho, van Merrienboer, Bahdanau, and Bengio</label><mixed-citation>
      
Cho, K., van Merrienboer, B., Bahdanau, D., and Bengio, Y.: On the Properties
of Neural Machine Translation: Encoder–Decoder Approaches, in:
Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in
Statistical Translation, <a href="https://doi.org/10.3115/v1/w14-4012" target="_blank">https://doi.org/10.3115/v1/w14-4012</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Copernicus Climate Change Service(2023)</label><mixed-citation>
      
Copernicus Climate Change Service, Climate Data Store: ERA5 hourly data on single levels from 1940 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS), <a href="https://doi.org/10.24381/cds.adbb2d47" target="_blank">https://doi.org/10.24381/cds.adbb2d47</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>EU-DEM(2016)</label><mixed-citation>
      
EU-DEM: EU-DEM v1.1, Dataset,
<a href="https://land.copernicus.eu/imagery-in-situ/eu-dem/eu-dem-v1.1" target="_blank"/> (last access: 6 September 2023),   2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Gauch et al.(2021)Gauch, Kratzert, Klotz, Nearing, Lin, and
Hochreiter</label><mixed-citation>
      
Gauch, M., Kratzert, F., Klotz, D., Nearing, G., Lin, J., and Hochreiter, S.: Rainfall–runoff prediction at multiple timescales with a single Long Short-Term Memory network, Hydrol. Earth Syst. Sci., 25, 2045–2062, <a href="https://doi.org/10.5194/hess-25-2045-2021" target="_blank">https://doi.org/10.5194/hess-25-2045-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Gigi et al.(2019)Gigi, Elidan, Hassidim, Matias, Moshe, Nevo, Shalev,
and Wiesel</label><mixed-citation>
      
Gigi, Y., Elidan, G., Hassidim, A., Matias, Y., Moshe, Z., Nevo, S., Shalev,
G., and Wiesel, A.: Towards global remote discharge estimation: Using the few
to estimate the many, arXiv [preprint],
<a href="https://doi.org/10.48550/arXiv.1901.00786" target="_blank">https://doi.org/10.48550/arXiv.1901.00786</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Gillies et al.(2013)</label><mixed-citation>
      
Gillies, S. and others: Rasterio: geospatial raster I/O for Python
programmers, GitHub, <a href="https://github.com/rasterio/rasterio" target="_blank"/>  (last access: 21 June 2024), 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>GRDC(2024)</label><mixed-citation>
      
GRDC: Global Runoff Data Centre, <a href="https://grdc.bafg.de/" target="_blank"/> (last access: 22 October 2024),
2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Gupta et al.(2009)Gupta, Kling, Yilmaz, and Martinez</label><mixed-citation>
      
Gupta, H. V., Kling, H., Yilmaz, K. K., and Martinez, G. F.: Decomposition of
the mean squared error and NSE performance criteria: Implications for
improving hydrological modelling, Journal of Hydrology, 377, 80–91,
<a href="https://doi.org/10.1016/j.jhydrol.2009.08.003" target="_blank">https://doi.org/10.1016/j.jhydrol.2009.08.003</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Hendrycks and Gimpel(2016)</label><mixed-citation>
      
Hendrycks, D. and Gimpel, K.: Gaussian Error Linear Units (GELUs), arXiv
[preprint], <a href="https://doi.org/10.48550/arXiv.1606.08415" target="_blank">https://doi.org/10.48550/arXiv.1606.08415</a>,
2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Hochreiter and Schmidhuber(1997)</label><mixed-citation>
      
Hochreiter, S. and Schmidhuber, J.: Long Short-Term Memory, Neural Computation,
9, 1735–1780, <a href="https://doi.org/10.1162/neco.1997.9.8.1735" target="_blank">https://doi.org/10.1162/neco.1997.9.8.1735</a>, 1997.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Hoedt et al.(2021)Hoedt, Kratzert, Klotz, Halmich, Holzleitner,
Nearing, Hochreiter, and Klambauer</label><mixed-citation>
      
Hoedt, P.-J., Kratzert, F., Klotz, D., Halmich, C., Holzleitner, M., Nearing,
G., Hochreiter, S., and Klambauer, G.: MC-LSTM: Mass-Conserving LSTM,
Proceedings of Machine Learning Research, arXiv
[preprint],
<a href="https://doi.org/10.48550/arXiv.2101.05186" target="_blank">https://doi.org/10.48550/arXiv.2101.05186</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Höge et al.(2022)Höge, Scheidegger, Baity-Jesi, Albert, and
Fenicia</label><mixed-citation>
      
Höge, M., Scheidegger, A., Baity-Jesi, M., Albert, C., and Fenicia, F.: Improving hydrologic models for predictions and process understanding using neural ODEs, Hydrol. Earth Syst. Sci., 26, 5085–5102, <a href="https://doi.org/10.5194/hess-26-5085-2022" target="_blank">https://doi.org/10.5194/hess-26-5085-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Hrachowitz et al.(2013)Hrachowitz, Savenije, Blöschl, McDonnell,
Sivapalan, Pomeroy, Arheimer, Blume, Clark, Ehret, Fenicia, Freer, Gelfan,
Gupta, Hughes, Hut, Montanari, Pande, Tetzlaff, Troch, Uhlenbrook, Wagener,
Winsemius, Woods, Zehe, and Cudennec</label><mixed-citation>
      
Hrachowitz, M., Savenije, H., Blöschl, G., McDonnell, J., Sivapalan, M.,
Pomeroy, J., Arheimer, B., Blume, T., Clark, M., Ehret, U., Fenicia, F.,
Freer, J., Gelfan, A., Gupta, H., Hughes, D., Hut, R., Montanari, A., Pande,
S., Tetzlaff, D., Troch, P., Uhlenbrook, S., Wagener, T., Winsemius, H.,
Woods, R., Zehe, E., and Cudennec, C.: A decade of Predictions in Ungauged
Basins (PUB) – a review, Hydrological Sciences Journal, 58,
1198–1255, <a href="https://doi.org/10.1080/02626667.2013.803183" target="_blank">https://doi.org/10.1080/02626667.2013.803183</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Hunter et al.(2007)Hunter, Bates, Horritt, and
Wilson</label><mixed-citation>
      
Hunter, N. M., Bates, P. D., Horritt, M. S., and Wilson, M. D.: Simple
spatially-distributed models for predicting flood inundation: A review,
Geomorphology, 90, 208–225, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Imhoff et al.(2020)Imhoff, Van Verseveld, Van Osnabrugge, and
Weerts</label><mixed-citation>
      
Imhoff, R., Van Verseveld, W., Van Osnabrugge, B., and Weerts, A.: Scaling
point-scale (pedo) transfer functions to seamless large-domain parameter
estimates for high-resolution distributed hydrologic modeling: An example for
the Rhine River, Water Resources Research, 56, e2019WR026807, <a href="https://doi.org/10.1029/2019WR026807" target="_blank">https://doi.org/10.1029/2019WR026807</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Imhoff et al.(2022)Imhoff, Brauer, van Heeringen, Uijlenhoet, and
Weerts</label><mixed-citation>
      
Imhoff, R. O., Brauer, C. C., van Heeringen, K.-J., Uijlenhoet, R., and Weerts,
A. H.: Large-sample evaluation of radar rainfall nowcasting for flood early
warning, Water Resources Research, 58, e2021WR031591, <a href="https://doi.org/10.1029/2021WR031591" target="_blank">https://doi.org/10.1029/2021WR031591</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Karlbauer et al.(2019)Karlbauer, Otte, Lensch, Scholten, Wulfmeyer,
and Butz</label><mixed-citation>
      
Karlbauer, M., Otte, S., Lensch, H., Scholten, T., Wulfmeyer, V., and Butz,
M. V.: A distributed neural network architecture for robust non-linear
spatio-temporal prediction, arXiv [preprint],
<a href="https://doi.org/10.48550/arXiv.1912.11141" target="_blank">https://doi.org/10.48550/arXiv.1912.11141</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Klotz et al.(2022)Klotz, Kratzert, Gauch, Keefe Sampson,
Brandstetter, Klambauer, Hochreiter, and Nearing</label><mixed-citation>
      
Klotz, D., Kratzert, F., Gauch, M., Keefe Sampson, A., Brandstetter, J., Klambauer, G., Hochreiter, S., and Nearing, G.: Uncertainty estimation with deep learning for rainfall–runoff modeling, Hydrol. Earth Syst. Sci., 26, 1673–1693, <a href="https://doi.org/10.5194/hess-26-1673-2022" target="_blank">https://doi.org/10.5194/hess-26-1673-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Kratzert et al.(2018)Kratzert, Klotz, Brenner, Schulz, and
Herrnegger</label><mixed-citation>
      
Kratzert, F., Klotz, D., Brenner, C., Schulz, K., and Herrnegger, M.: Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks, Hydrol. Earth Syst. Sci., 22, 6005–6022, <a href="https://doi.org/10.5194/hess-22-6005-2018" target="_blank">https://doi.org/10.5194/hess-22-6005-2018</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Kratzert et al.(2019)Kratzert, Klotz, Shalev, Klambauer, Hochreiter,
and Nearing</label><mixed-citation>
      
Kratzert, F., Klotz, D., Shalev, G., Klambauer, G., Hochreiter, S., and Nearing, G.: Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets, Hydrol. Earth Syst. Sci., 23, 5089–5110, <a href="https://doi.org/10.5194/hess-23-5089-2019" target="_blank">https://doi.org/10.5194/hess-23-5089-2019</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Kratzert et al.(2021)Kratzert, Klotz, Gauch, Klingler, Nearing, and
Hochreiter</label><mixed-citation>
      
Kratzert, F., Klotz, D., Gauch, M., Klingler, C., Nearing, G., and Hochreiter,
S.: Large-scale river network modeling using Graph Neural Networks, in: EGU
General Assembly Conference Abstracts, EGU21-13375,
<a href="https://doi.org/10.5194/egusphere-egu21-13375" target="_blank">https://doi.org/10.5194/egusphere-egu21-13375</a>,
2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Li et al.(2022)Li, Zhang, and Krebs</label><mixed-citation>
      
Li, P., Zhang, J., and Krebs, P.: Prediction of flow based on a CNN-LSTM
combined deep learning approach, Water, 14, 993, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Liu and Gupta(2007)</label><mixed-citation>
      
Liu, Y. and Gupta, H. V.: Uncertainty in hydrologic modeling: Toward an
integrated data assimilation framework, Water Resources Research, 43, <a href="https://doi.org/10.1029/2006WR005756" target="_blank">https://doi.org/10.1029/2006WR005756</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Liu et al.(2012)Liu, Weerts, Clark, Hendricks Franssen, Kumar,
Moradkhani, Seo, Schwanenberg, Smith, Van Dijk, Van Velzen, He, Lee, Noh,
Rakovec, and Restrepo</label><mixed-citation>
      
Liu, Y., Weerts, A. H., Clark, M., Hendricks Franssen, H.-J., Kumar, S., Moradkhani, H., Seo, D.-J., Schwanenberg, D., Smith, P., van Dijk, A. I. J. M., van Velzen, N., He, M., Lee, H., Noh, S. J., Rakovec, O., and Restrepo, P.: Advancing data assimilation in operational hydrologic forecasting: progresses, challenges, and emerging opportunities, Hydrol. Earth Syst. Sci., 16, 3863–3887, <a href="https://doi.org/10.5194/hess-16-3863-2012" target="_blank">https://doi.org/10.5194/hess-16-3863-2012</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Liu et al.(2022)Liu, Mao, Wu, Feichtenhofer, Darrell, and
Xie</label><mixed-citation>
      
Liu, Z., Mao, H., Wu, C.-Y., Feichtenhofer, C., Darrell, T., and Xie, S.: A
ConvNet for the 2020s, in: 2022 IEEE/CVF Conference on Computer Vision and
Pattern Recognition (CVPR), <a href="https://doi.org/10.1109/cvpr52688.2022.01167" target="_blank">https://doi.org/10.1109/cvpr52688.2022.01167</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Longyang et al.(2024)Longyang, Choi, Tennant, Hill, Ashmead, Neilson,
Newell, McNamara, and Xu</label><mixed-citation>
      
Longyang, Q., Choi, S., Tennant, H., Hill, D., Ashmead, N., Neilson, B. T.,
Newell, D. L., McNamara, J. P., and Xu, T.: Explainable Spatially Distributed
Hydrologic Modeling of a Snow Dominated Mountainous Karst Watershed Using
Attention, Authorea Preprints, <a href="https://doi.org/10.22541/essoar.171536019.93198716/v1" target="_blank">https://doi.org/10.22541/essoar.171536019.93198716/v1</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Marçais and de Dreuzy(2017)</label><mixed-citation>
      
Marçais, J. and de Dreuzy, J.-R.: Prospective interest of deep learning
for hydrological inference, Groundwater, 55, 688–692, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Montzka et al.(2012)Montzka, Pauwels, Franssen, Han, and
Vereecken</label><mixed-citation>
      
Montzka, C., Pauwels, V. R., Franssen, H.-J. H., Han, X., and Vereecken, H.:
Multivariate and multiscale data assimilation in terrestrial systems: A
review, Sensors, 12, 16291–16333, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Moradkhani et al.(2005)Moradkhani, Hsu, Gupta, and
Sorooshian</label><mixed-citation>
      
Moradkhani, H., Hsu, K.-L., Gupta, H., and Sorooshian, S.: Uncertainty
assessment of hydrologic model states and parameters: Sequential data
assimilation using the particle filter, Water Resources Research, 41, W05012, <a href="https://doi.org/10.1029/2004WR003604" target="_blank">https://doi.org/10.1029/2004WR003604</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Moshe et al.(2020)Moshe, Metzger, Kratzert, Morin, Nevo, Elidan, and
Elyaniv</label><mixed-citation>
      
Moshe, Z., Metzger, A., Kratzert, F., Morin, E., Nevo, S., Elidan, G., and Elyaniv, R.: HydroNets: Leveraging River Network Structure and Deep Neural Networks for Hydrologic Modeling, EGU General Assembly 2020, Online, 4–8 May 2020, EGU2020-4135, <a href="https://doi.org/10.5194/egusphere-egu2020-4135" target="_blank">https://doi.org/10.5194/egusphere-egu2020-4135</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Muñoz-Carpena et al.(2023)Muñoz-Carpena, Carmona-Cabrero, Yu,
Fox, and Batelaan</label><mixed-citation>
      
Muñoz-Carpena, R., Carmona-Cabrero, A., Yu, Z., Fox, G., and Batelaan, O.:
Convergence of mechanistic modeling and artificial intelligence in hydrologic
science and engineering, PLOS Water, 2, e0000059, <a href="https://doi.org/10.1371/journal.pwat.0000212" target="_blank">https://doi.org/10.1371/journal.pwat.0000212</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Nash and Sutcliffe(1970)</label><mixed-citation>
      
Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual
models part I – A discussion of principles, Journal of Hydrology,
10, 282–290, 1970.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Nearing et al.(2020)Nearing, Kratzert, Sampson, Pelissier, Klotz,
Frame, Prieto, and Gupta</label><mixed-citation>
      
Nearing, G., Kratzert, F., Sampson, A. K., Pelissier, C., Klotz, D., Frame, J.,
Prieto, C., and Gupta, H.: What Role Does Hydrological Science Play in the
Age of Machine Learning?, Water Resources Research, 57, e2020WR028091,
<a href="https://doi.org/10.31223/osf.io/3sx6g" target="_blank">https://doi.org/10.31223/osf.io/3sx6g</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Oddo et al.(2024)Oddo, Bolten, Kumar, and Cleary</label><mixed-citation>
      
Oddo, P. C., Bolten, J. D., Kumar, S. V., and Cleary, B.: Deep Convolutional
LSTM for improved flash flood prediction, Frontiers in Water, 6, 1346104, <a href="https://doi.org/10.3389/frwa.2024.1346104" target="_blank">https://doi.org/10.3389/frwa.2024.1346104</a>,
2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Otte et al.(2020)Otte, Karlbauer, and Butz</label><mixed-citation>
      
Otte, S., Karlbauer, M., and Butz, M. V.: Active Tuning, arXiv [preprint],
<a href="https://doi.org/10.48550/arXiv.2010.03958" target="_blank">https://doi.org/10.48550/arXiv.2010.03958</a>,
2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Palmer et al.(2008)Palmer, Reidy Liermann, Nilsson, Flörke,
Alcamo, Lake, and Bond</label><mixed-citation>
      
Palmer, M. A., Reidy Liermann, C. A., Nilsson, C., Flörke, M., Alcamo, J.,
Lake, P. S., and Bond, N.: Climate change and the world's
river basins: anticipating management options, Frontiers in Ecology and the
Environment, 6, 81–89, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Paszke et al.(2019)Paszke, Gross, Massa, Lerer, Bradbury, Chanan,
Killeen, Lin, Gimelshein, Antiga, Desmaison, Kopf, Yang, DeVito, Raison,
Tejani, Chilamkurthy, Steiner, Fang, Bai, and Chintala</label><mixed-citation>
      
Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen,
T., Lin, Z., Gimelshein, N., Antiga, L., Desmaison, A., Kopf, A., Yang, E.,
DeVito, Z., Raison, M., Tejani, A., Chilamkurthy, S., Steiner, B., Fang, L.,
Bai, J., and Chintala, S.: PyTorch: An Imperative Style, High-Performance
Deep Learning Library, Advances in Neural Information Processing Systems,
12, <a href="https://doi.org/10.48550/arXiv.1912.01703" target="_blank">https://doi.org/10.48550/arXiv.1912.01703</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Pilon(2002)</label><mixed-citation>
      
Pilon, P. J.: Guidelines for reducing flood losses, Tech. rep., United Nations
International Strategy for Disaster Reduction (UNISDR), <a href="https://www.un.org/esa/sustdev/publications/flood_guidelines.pdf" target="_blank"/> (last access: 11 November 2025), 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Pokharel and Roy(2024)</label><mixed-citation>
      
Pokharel, S. and Roy, T.: A parsimonious setup for streamflow forecasting using
CNN-LSTM, Journal of Hydroinformatics, jh2024114, <a href="https://doi.org/10.2166/hydro.2024.114" target="_blank">https://doi.org/10.2166/hydro.2024.114</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>RADOLAN(2016)</label><mixed-citation>
      
RADOLAN: RADOLAN/RADVOR,
<a href="https://opendata.dwd.de/climate_environment/CDC/grids_germany/hourly/radolan/" target="_blank"/> (last access: 6 September 2023),
2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Rakovec et al.(2012)Rakovec, Weerts, Hazenberg, Torfs, and
Uijlenhoet</label><mixed-citation>
      
Rakovec, O., Weerts, A. H., Hazenberg, P., Torfs, P. J. J. F., and Uijlenhoet, R.: State updating of a distributed hydrological model with Ensemble Kalman Filtering: effects of updating frequency and observation network density on forecast accuracy, Hydrol. Earth Syst. Sci., 16, 3435–3449, <a href="https://doi.org/10.5194/hess-16-3435-2012" target="_blank">https://doi.org/10.5194/hess-16-3435-2012</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Samaniego et al.(2010)Samaniego, Kumar, and
Attinger</label><mixed-citation>
      
Samaniego, L., Kumar, R., and Attinger, S.: Multiscale parameter
regionalization of a grid-based hydrologic model at the mesoscale, Water
Resources Research, 46, W05523, <a href="https://doi.org/10.1029/2008WR007327" target="_blank">https://doi.org/10.1029/2008WR007327</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Scholz et al.(2024a)Scholz, Traub, Zarfl, Scholten, and
Butz</label><mixed-citation>
      
Scholz, F., Traub, M., Zarfl, C., Scholten, T., and Butz, M. V.: Fully
differentiable, fully distributed River Discharge Prediction: data sets, Zenodo [data set],
<a href="https://doi.org/10.5281/zenodo.13970575" target="_blank">https://doi.org/10.5281/zenodo.13970575</a>, 2024a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Scholz et al.(2024b)Scholz, Traub, Zarfl, Scholten, and
Butz</label><mixed-citation>
      
Scholz, F., Traub, M., Zarfl, C., Scholten, T., and Butz, M. V.: Fully
differentiable, fully distributed River Discharge Prediction: code, Zenodo [code],
<a href="https://doi.org/10.5281/zenodo.13992583" target="_blank">https://doi.org/10.5281/zenodo.13992583</a>, 2024b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>Shen(2018)</label><mixed-citation>
      
Shen, C.: A transdisciplinary review of deep learning research and its
relevance for water resources scientists, Water Resources Research, 54,
8558–8593, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Shen et al.(2023)Shen, Appling, Gentine, Bandai, Gupta, Tartakovsky,
Baity-Jesi, Fenicia, Kifer, Li, Liu, Ren, Zheng, Harman, Clark, Farthing,
Feng, Kumar, Aboelyazeed, Rahmani, Song, Beck, Bindas, Dwivedi, Fang,
Höge, Rackauckas, Mohanty, Roy, Xu, and Lawson</label><mixed-citation>
      
Shen, C., Appling, A. P., Gentine, P., Bandai, T., Gupta, H., Tartakovsky, A.,
Baity-Jesi, M., Fenicia, F., Kifer, D., Li, L., Liu, X., Ren, W., Zheng, Y.,
Harman, C. J., Clark, M., Farthing, M., Feng, D., Kumar, P., Aboelyazeed, D.,
Rahmani, F., Song, Y., Beck, H. E., Bindas, T., Dwivedi, D., Fang, K.,
Höge, M., Rackauckas, C., Mohanty, B., Roy, T., Xu, C., and Lawson, K.:
Differentiable modelling to unify machine learning and physical models for
geosciences, Nature Reviews Earth &amp; Environment, 4, 552–567,
<a href="https://doi.org/10.1038/s43017-023-00450-9" target="_blank">https://doi.org/10.1038/s43017-023-00450-9</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Shi et al.(2015)Shi, Chen, Wang, Yeung, Wong, and Woo</label><mixed-citation>
      
Shi, X., Chen, Z., Wang, H., Yeung, D.-Y., Wong, W.-k., and Woo, W.-c.:
Convolutional LSTM Network: A Machine Learning Approach for Precipitation
Nowcasting, Advances in neural information processing systems,
arXiv [preprint],
<a href="https://doi.org/10.48550/arXiv.1506.04214" target="_blank">https://doi.org/10.48550/arXiv.1506.04214</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Simonyan et al.(2013)Simonyan, Vedaldi, and
Zisserman</label><mixed-citation>
      
Simonyan, K., Vedaldi, A., and Zisserman, A.: Deep Inside Convolutional
Networks: Visualising Image Classification Models and Saliency Maps, arXiv [preprint],
<a href="https://doi.org/10.48550/arXiv.1312.6034" target="_blank">https://doi.org/10.48550/arXiv.1312.6034</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Sit et al.(2020)Sit, Demiray, Xiang, Ewing, Sermet, and
Demir</label><mixed-citation>
      
Sit, M., Demiray, B., Xiang, Z., Ewing, G., Sermet, Y., and Demir, I.: A
Comprehensive Review of Deep Learning Applications in Hydrology and Water
Resources, EarthArXiv [preprint], <a href="https://doi.org/10.31223/osf.io/xs36g" target="_blank">https://doi.org/10.31223/osf.io/xs36g</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Sit et al.(2021)Sit, Demiray, and Demir</label><mixed-citation>
      
Sit, M., Demiray, B., and Demir, I.: Short-term Hourly Streamflow Prediction
with Graph Convolutional GRU Networks, arXiv [preprint],
<a href="https://doi.org/10.48550/arXiv.2107.07039" target="_blank">https://doi.org/10.48550/arXiv.2107.07039</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>Sun et al.(2022)Sun, Jiang, Yang, Xie, and Chen</label><mixed-citation>
      
Sun, A. Y., Jiang, P., Yang, Z.-L., Xie, Y., and Chen, X.: A graph neural network (GNN) approach to basin-scale river network learning: the role of physics-based connectivity and data fusion, Hydrol. Earth Syst. Sci., 26, 5163–5184, <a href="https://doi.org/10.5194/hess-26-5163-2022" target="_blank">https://doi.org/10.5194/hess-26-5163-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>Traub et al.(2024)Traub, Becker, Sauter, Otte, and
Butz</label><mixed-citation>
      
Traub, M., Becker, F., Sauter, A., Otte, S., and Butz, M. V.: Loci-segmented:
improving scene segmentation learning, in: International Conference on
Artificial Neural Networks, 45–61, Springer, <a href="https://doi.org/10.1007/978-3-031-72338-4_4" target="_blank">https://doi.org/10.1007/978-3-031-72338-4_4</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>Tyson et al.(2023)Tyson, Longyang, Neilson, Zeng, and
Xu</label><mixed-citation>
      
Tyson, C., Longyang, Q., Neilson, B. T., Zeng, R., and Xu, T.: Effects of
meteorological forcing uncertainty on high-resolution snow modeling and
streamflow prediction in a mountainous karst watershed, Journal of Hydrology,
619, 129304, <a href="https://doi.org/10.1016/j.jhydrol.2023.129304" target="_blank">https://doi.org/10.1016/j.jhydrol.2023.129304</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>Ueda et al.(2024)Ueda, Tanouchi, Egusa, and Yoshihiro</label><mixed-citation>
      
Ueda, F., Tanouchi, H., Egusa, N., and Yoshihiro, T.: A Transfer Learning
Approach Based on Radar Rainfall for River Water-Level Prediction, Water, 16,
607, <a href="https://doi.org/10.3390/w16040607" target="_blank">https://doi.org/10.3390/w16040607</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>Ufrecht(2002)</label><mixed-citation>
      
Ufrecht, W.: Hydrogeologische Modelle – ein Leitfaden mit Fallbeispielen, Schriftenreihe der Deutschen Geologischen Gesellschaft, 24, Schweizerbart Science Publishers, Stuttgart, Germany, <a href="http://www.schweizerbart.de//publications/detail/artno/171902400/Schriftenreihe_der_Dt_Ges_f_Geowissen" target="_blank"/> (last access: 11 November 2025), 2002.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>Valeriano et al.(2010)Valeriano, Koike, Yang, and
Yang</label><mixed-citation>
      
Valeriano, O. C. S., Koike, T., Yang, K., and Yang, D.: Optimal dam operation
during flood season using a distributed hydrological model and a heuristic
algorithm, Journal of Hydrologic Engineering, 15, 580–586, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>Van Vliet et al.(2013)Van Vliet, Franssen, Yearsley, Ludwig,
Haddeland, Lettenmaier, and Kabat</label><mixed-citation>
      
Van Vliet, M. T., Franssen, W. H., Yearsley, J. R., Ludwig, F., Haddeland, I.,
Lettenmaier, D. P., and Kabat, P.: Global river discharge and water
temperature under climate change, Global Environmental Change, 23, 450–464,
2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>Wang et al.(2024)Wang, Jiang, Zheng, Han, Kumar, Rakovec, and
Li</label><mixed-citation>
      
Wang, C., Jiang, S., Zheng, Y., Han, F., Kumar, R., Rakovec, O., and Li, S.:
Distributed Hydrological Modeling With Physics-Encoded Deep Learning: A
General Framework and Its Application in the Amazon, Water Resources
Research, 60, e2023WR036170, <a href="https://doi.org/10.1029/2023WR036170" target="_blank">https://doi.org/10.1029/2023WR036170</a>,
2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>Wright(2019)</label><mixed-citation>
      
Wright, L.: Ranger – a synergistic optimizer, GitHub,
<a href="https://github.com/lessw2020/Ranger-Deep-Learning-Optimizer" target="_blank"/> (last access: 22 August 2024), 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>Xiang and Demir(2020)</label><mixed-citation>
      
Xiang, Z. and Demir, I.: Distributed long-term hourly streamflow predictions
using deep learning – A case study for State of Iowa, Environmental
Modelling &amp; Software, 131, 104761,
<a href="https://doi.org/10.1016/j.envsoft.2020.104761" target="_blank">https://doi.org/10.1016/j.envsoft.2020.104761</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>Xiang and Demir(2022)</label><mixed-citation>
      
Xiang, Z. and Demir, I.: Fully distributed rainfall-runoff modeling using
spatial-temporal graph neural network, EarthArXiv, <a href="https://doi.org/10.31223/X57P74" target="_blank">https://doi.org/10.31223/X57P74</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>Xu et al.(2022)Xu, Longyang, Tyson, Zeng, and Neilson</label><mixed-citation>
      
Xu, T., Longyang, Q., Tyson, C., Zeng, R., and Neilson, B. T.: Hybrid
physically based and deep learning modeling of a snow dominated, mountainous,
karst watershed, Water Resources Research, 58, e2021WR030993, <a href="https://doi.org/10.1029/2021WR030993" target="_blank">https://doi.org/10.1029/2021WR030993</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib73"><label>Yadan(2019)</label><mixed-citation>
      
Yadan, O.: Hydra – A framework for elegantly configuring complex applications,
GitHub, <a href="https://github.com/facebookresearch/hydra" target="_blank"/> (last access: 28 October 2024), 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib74"><label>Zhong et al.(2024)Zhong, Lei, Li, and Jiang</label><mixed-citation>
      
Zhong, L., Lei, H., Li, Z., and Jiang, S.: Advancing streamflow prediction in
data-scarce regions through vegetation-constrained distributed hybrid
ecohydrological models, Journal of Hydrology, 645, 132165,
<a href="https://doi.org/10.1016/j.jhydrol.2024.132165" target="_blank">https://doi.org/10.1016/j.jhydrol.2024.132165</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib75"><label>Zhu et al.(2023)Zhu, Wei, Zhang, Xu, and Qin</label><mixed-citation>
      
Zhu, S., Wei, J., Zhang, H., Xu, Y., and Qin, H.: Spatiotemporal deep learning
rainfall-runoff forecasting combined with remote sensing precipitation
products in large scale basins, Journal of Hydrology, 616, 128727, <a href="https://doi.org/10.1016/j.jhydrol.2022.128727" target="_blank">https://doi.org/10.1016/j.jhydrol.2022.128727</a>, 2023.

    </mixed-citation></ref-html>--></article>
