<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">HESS</journal-id><journal-title-group>
    <journal-title>Hydrology and Earth System Sciences</journal-title>
    <abbrev-journal-title abbrev-type="publisher">HESS</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Hydrol. Earth Syst. Sci.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1607-7938</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/hess-30-371-2026</article-id><title-group><article-title>Probabilistic hierarchical interpolation and interpretable neural network configurations for flood prediction</article-title><alt-title>Probabilistic neural network configurations for flood prediction</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Saberian</surname><given-names>Mostafa</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="yes" rid="aff2 aff3">
          <name><surname>Samadi</surname><given-names>Vidya</given-names></name>
          <email>samadi@clemson.edu</email>
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff4 aff5">
          <name><surname>Popescu</surname><given-names>Ioana</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>Glenn Department of Civil Engineering, Clemson University, Clemson, SC, USA</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Department of Agricultural Sciences, Clemson University, Clemson, SC, USA</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Artificial Intelligence Research Institute for Science and Engineering (AIRISE), Clemson University, Clemson, SC, USA</institution>
        </aff>
        <aff id="aff4"><label>4</label><institution>Department of Hydroinformatics and Socio-Technical Innovation, IHE Delft Institute for Water Education, Delft, the Netherlands</institution>
        </aff>
        <aff id="aff5"><label>5</label><institution>Faculty of Civil Engineering and Geosciences, Delft University of Technology, Delft, the Netherlands</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Vidya Samadi (samadi@clemson.edu)</corresp></author-notes><pub-date><day>26</day><month>January</month><year>2026</year></pub-date>
      
      <volume>30</volume>
      <issue>2</issue>
      <fpage>371</fpage><lpage>399</lpage>
      <history>
        <date date-type="received"><day>21</day><month>August</month><year>2024</year></date>
           <date date-type="rev-request"><day>7</day><month>October</month><year>2024</year></date>
           <date date-type="rev-recd"><day>28</day><month>December</month><year>2025</year></date>
           <date date-type="accepted"><day>7</day><month>January</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Mostafa Saberian et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026.html">This article is available from https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026.html</self-uri><self-uri xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026.pdf">The full text article is available as a PDF file from https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e129">The past few years have witnessed the rise of neural networks (NNs) applications for hydrological time series modeling. By virtue of their capabilities, NN models can achieve unprecedented levels of performance when learning how to solve increasingly complex rainfall-runoff processes via data, making them pivotal for the development of computational hydrologic tasks such as flood predictions. The NN models should, to be considered practical, provide a probabilistic understanding of the model mechanisms and predictions and hints on what could perturb the model. In this paper, we developed two NN models, i.e., Neural Hierarchical Interpolation for Time Series Forecasting (N-HiTS) and Network-Based Expansion Analysis for Interpretable Time Series Forecasting (N-BEATS) with a probabilistic multi-quantile objective and benchmarked them with long short-term memory (LSTM) for flood prediction across two headwater streams in Georgia and North Carolina, USA. To generate a probabilistic prediction, a Multi-Quantile Loss was used to assess the 95th percentile prediction uncertainty (95 PPU) of multiple flooding events. Extensive experiments demonstrated the advantages of hierarchical interpolation and interpretable architecture, where both N-HiTS and N-BEATS provided an average accuracy improvement of <inline-formula><mml:math id="M1" display="inline"><mml:mo>∼</mml:mo></mml:math></inline-formula> 5 % over the LSTM benchmarking model. On a variety of flooding events, both N-HiTS and N-BEATS demonstrated significant performance improvements over the LSTM benchmark and showcased their probabilistic predictions by specifying a likelihood objective.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>Division of Civil, Mechanical and Manufacturing Innovation</funding-source>
<award-id>CMMI 2125283</award-id>
<award-id>CBET 2429082</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

      
      </body>
    <back><notes notes-type="specialsection"><title>Key points</title>
    

      <p id="d2e146"><list list-type="bullet">
        
        <list-item>

      <p id="d2e153">N-HiTS and N-BEATS predictions reflect interpretability and hierarchical representations of data to reduce neural network complexities.</p>
        </list-item>
        <list-item>

      <p id="d2e159">Both N-HiTS and N-BEATS models outperformed the LSTM in mathematically defining uncertainty bands.</p>
        </list-item>
        <list-item>

      <p id="d2e165">Predicting the magnitude of the recession curve of flood hydrographs was particularly challenging for all models.</p>
        </list-item>
      </list></p>
  </notes>
<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e178">The past few years have witnessed a rapid surge in the neural networks (NN) applications in hydrology. As these opaque, data-driven models are increasingly employed for critical hydrological predictions, the hydrology community has placed growing emphasis on developing trustworthy and interpretable NN models. However, maintaining coherence while producing accurate predictions can be a challenging problem (Olivares et al., 2024). There is a general agreement on the importance of providing probabilistic NN prediction (Sadeghi Tabas and Samadi, 2022), especially in the case of flood prediction (Martinaitis et al., 2023).</p>
      <p id="d2e181">Flood occurrences have witnessed an alarming surge in frequency and severity globally. Jonkman (2005) studied a natural disaster database (Guha-Sapir and Below, 2002) and reported that over 27 years, more than 175 000 people died, and close to 2.2 billion were affected directly by floods worldwide. These numbers are likely an underestimation due to unreported events (Nevo et al., 2022). In addition, the United Nations Office for Disaster Risk Reduction reported that flooding has been the most frequent, widespread weather-related natural disaster since 1995, claiming over 600 000 lives, affecting around 4 billion people globally, and causing annual economic damage of more than 100 billion USD (UNISDR, 2015). This escalating trend has necessitated the need for better flood prediction and management strategies. Scholars have successfully implemented different flood models such as deterministic (e.g., Roelvink et al., 2009, Thompson and Frazier, 2014; Barnard et al., 2014; Erikson et al., 2018) and physically based flood models (e.g., Basso et al., 2016; Chen et al., 2016; Pourreza-Bilondi et al., 2017; Saksena et al., 2020; Refsgaard et al., 2022) in various environmental systems over the past several decades. These studies have heightened the need for precise flood prediction (Samadi et al., 2025), they have also unveiled limitations inherent in existing deterministic and physics-based models.</p>
      <p id="d2e184">While evidence suggests that both deterministic and physics-based approaches are meaningful and useful (Sukovich et al., 2014; Zafarmomen et al., 2024), their forecasts rest heavily on imprecise and subjective expert opinion; there is a challenge for setting robust evidence-based thresholds to issue flood warnings and alerts (Palmer, 2012). Moreover, many of these traditional flood models, particularly physically explicit models, rely too strongly on a particular choice of numerical approximation and describe multiple process parameterizations only within a fixed spatial architecture (e.g., Clark et al., 2015). Recent NN models have shown promising results across a large variety of flood modeling applications (e.g., Nevo et al., 2022; Pally and Samadi, 2022; Dasgupta et al., 2023; Zhang et al., 2023b; Zafarmomen and Samadi, 2025; Saberian et al., 2026) and encourage the use of such methodologies as core drivers for neural flood prediction (Windheuser et al., 2023).</p>
      <p id="d2e187">Earlier adaptations of these intelligent techniques showed promising for flood prediction (e.g., Hsu et al., 1995; Tiwari and Chatterjee, 2010). However, recent efforts have taken NN application to the next level, providing uncertainty assessment (Sadeghi Tabas and Samadi, 2022) and improvements over various spatio-temporal scales, regions, and processes (e.g., Kratzert et al., 2018; Park and Lee, 2024; Zhang et al., 2023a). Nevo et al. (2022) were the first scholars who employed long short-term memory (LSTM) for flood stage prediction and inundation mapping, achieving notable success during the 2021 monsoon season. Soon after, Russo et al. (2023) evaluated various NN models for predicting depth flood in urban systems, highlighting the potential of data-driven models for urban flood prediction. Similarly, Defontaine et al. (2023) emphasized the role of NN algorithms in enhancing the reliability of flood predictions, particularly in the context of limited data availability. Windheuser et al. (2023) studied flood gauge height forecasting using images and time series data for two gauging stations in Georgia, USA. They used multiple NN models such as Convolutional Neural Network (ConvNet/CNN) and LSTM to forecast floods in near real-time (up to 72 h).</p>
      <p id="d2e191">In a sequence, Wee et al. (2023) used Impact-Based Forecasting (IBF) to propose a Flood Impact-Based Forecasting system (FIBF) using flexible fuzzy inference techniques, aiding decision-makers in a timely response. Zou et al. (2023) proposed a Residual LSTM (ResLSTM) model to enhance and address flood prediction gradient issues. They integrated Deep Autoregressive Recurrent (DeepAR) with four recurrent neural networks (RNNs), including ResLSTM, LSTM, Gated Recurrent Unit (GRU), and Time Feedforward Connections Single Gate Recurrent Unit (TFC-SGRU). They showed that ResLSTM achieved superior accuracy. While these studies reported the superiority of NN models for flood modeling, they highlighted a number of challenges, notably (i) the limited capability of proposed NN models to capture the spatial variability and magnitudes of extreme data over time, (ii) the lack of a sophisticated mechanism to capture different flood magnitudes and synthesize the prediction, and (iii) inability of the NN models to process data in parallel and capture the relationships between all elements in a sequential manner.</p>
      <p id="d2e194">Recent advances in neural time series forecasting showed promising results that can be used to address the above challenges for flood prediction. Recent techniques include the adoption of the attention mechanism and Transformer-inspired approaches (Fan et al., 2019; Alaa and van der Schaar, 2019; Lim et al., 2021) along with attention-free architectures composed of deep stacks of fully connected layers (Oreshkin et al., 2020).</p>
      <p id="d2e197">All these approaches are relatively easy to scale up in terms of flood magnitudes (small to major flood predictions), compared to LSTM and have proven to be capable of capturing spatiotemporal dependencies (Challu et al., 2022). In addition, these architectures can capture input-output relationships implicitly while they tend to be more computationally efficient. Many state-of-the-art NN approaches for flood forecasting have been established based on LSTM. There are cell states in the LSTM networks that can be interpreted as storage capacity often used in flood generation schemes. In LSTM, the updating of internal cell states (or storages) is regulated through several gates: the first gate regulates the storage depletion, the second one regulates storage fluctuations, and the third gate regulates the storages outflow (Tabas and Samadi, 2022). The elaborate gated design of the LSTM partly solves the long-term dependency problem in flood time series prediction (Fang et al., 2020), although, the structure of LSTMs is designed in a sequential manner that cannot directly connect two nonadjacent portions (positions) of a time series.</p>
      <p id="d2e200">In this paper, we developed attention-free architecture, i.e. Neural Hierarchical Interpolation for Time Series Forecasting (N-HiTS; Challu et al., 2022) and Network-Based Expansion Analysis for Interpretable Time Series Forecasting (N-BEATS; Oreshkin et al., 2020) and benchmarked these models with LSTM for flood prediction. We developed fully connected N-BEATS and N-HiTS architectures using multi-rate data sampling, synthesizing the flood prediction outputs via multi-scale interpolation.</p>
      <p id="d2e203">We implemented all algorithms for flood prediction on two headwater streams i.e., the Lower Dog River, Georgia, and the Upper Dutchmans Creek, North Carolina, USA to ensure that the results are reliable and comparable. The results of N-BEATS and N-HiTS techniques were compared with the benchmarking LSTM to understand how these techniques can improve the representations of rainfall and runoff dispensing over a recurrence process. Notably, this study represents a pioneering effort, as to the best of our knowledge, this is the first instance in which the application of N-BEATS and N-HiTS algorithms in the field of flood prediction has been explored. The scope of this research will focus on: <list list-type="custom"><list-item><label>i.</label>
      <p id="d2e208">Flood prediction in a hierarchical fashion with interpretable outputs: We built N-BEATS and N-HiTS for flood prediction with a very deep stack of fully connected layers to implicitly capture input-output relationships with hierarchical interpolation capabilities. The predictions also involve programming the algorithms with decreasing complexity and aligning their time scale with the final output through multi-scale hierarchical interpolation and interpretable architecture. Predictions were aggregated in a hierarchical fashion that enabled the building of a very deep neural network with interpretable configurations.</p></list-item><list-item><label>ii.</label>
      <p id="d2e212">Uncertainty quantification of the models by employing probabilistic approaches: a Multi-Quantile Loss (MQL) was used to assess the 95th percentile prediction uncertainty (95 PPU) of multiple flooding events. MQL was integrated as the loss function to account for probabilistic prediction. MQL trains the model to produce probabilistic forecasts by predicting multiple quantiles of the distribution of future values.</p></list-item><list-item><label>iii.</label>
      <p id="d2e216">Exploring headwater stream response to flooding: Understanding the dynamic response of headwater streams to flooding is essential for managing downstream flood risks. Headwater streams constitute the uppermost sections of stream networks, usually comprising 60 % to 80 % of a catchment area. Given this substantial coverage and the tendency for precipitation to increase with elevation, headwater streams are responsible for generating and controlling the majority of runoff in downstream portions (MacDonald and Coe, 2007).</p></list-item></list> The remainder of this paper is structured as follows. Section 2 presents the case study and data, NN models, performance metrics, and sensitivity and uncertainty approaches. Section 3 focuses on the results of flood predictions including sensitivity and uncertainty assessment and computation efficiency. Finally, Sect. 4 concludes the paper.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Methodology</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Case Study and Data</title>
      <p id="d2e235">This research used two headwater gauging stations located at the Lower Dog River watershed, Georgia (GA; USGS02337410, Dog River gauging station), and the Upper Dutchmans Creek watershed, North Carolina (NC; USGS0214269560, Killian Creek gauging station). As depicted in Fig. 1, the Lower Dog River and the Upper Dutchmans Creek watersheds are in the west and north parts of two metropolitan cities, Atlanta and Charlotte. The Lower Dog River stream gauge is established southeast of Villa Rica in Carroll County, where the USGS has regularly monitored discharge data since 2007 in 15 min increments. The Lower Dog River is a stream with a length of 15.7 miles (25.3 km; obtained from the U.S. Geological Survey [USGS] National Hydrography Dataset high-resolution flowline data), an average elevation of 851.94 m, and the watershed area above this gauging station is 66.5 square miles (172 km<sup>2</sup>; obtained from the Georgia Department of Natural Resources). This watershed is covered by 15.2 % residential area, 14.6 % agricultural land, and <inline-formula><mml:math id="M3" display="inline"><mml:mo>∼</mml:mo></mml:math></inline-formula> 70 % forest (Munn et al., 2020).</p>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e256">The Lower Dog River and The Upper Dutchmans Creek watersheds are in GA and NC. The proximity of the watersheds to Atlanta and Charlotte (urban area) are also displayed on the map.</p></caption>
        <graphic xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026-f01.jpg"/>

      </fig>

      <p id="d2e265">Killian Creek gauging station at the Upper Dutchmans Creek watershed is established in Montgomery County, NC, where the USGS has regularly monitored discharge data since 1995 in 15 min increments. The Upper Dutchmans Creek is a stream with a length of 4.9 miles (7.9 km), an average elevation of 642.2 m (see Table 1), and the watershed area above this gauging station is 4 square miles (10.3 km<sup>2</sup>) with less than 3 % residential area and about 93 % forested land use (US EPA, 2024).</p>

<table-wrap id="T1" specific-use="star"><label>Table 1</label><caption><p id="d2e281">The Lower Dog River and Upper Dutchmans Creek's physical characteristics.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Watershed</oasis:entry>
         <oasis:entry colname="col2">USGS Station</oasis:entry>
         <oasis:entry colname="col3">Average Elevation</oasis:entry>
         <oasis:entry colname="col4">Stream Length</oasis:entry>
         <oasis:entry colname="col5">Watershed area</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">ID Number</oasis:entry>
         <oasis:entry colname="col3">(m)</oasis:entry>
         <oasis:entry colname="col4">(km)</oasis:entry>
         <oasis:entry colname="col5">(km<sup>2</sup>)</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Lower Dog River watershed, GA</oasis:entry>
         <oasis:entry colname="col2">USGS02337410</oasis:entry>
         <oasis:entry colname="col3">851.9</oasis:entry>
         <oasis:entry colname="col4">25.3</oasis:entry>
         <oasis:entry colname="col5">172</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Upper Dutchmans Creek watershed, NC</oasis:entry>
         <oasis:entry colname="col2">USGS0214269560</oasis:entry>
         <oasis:entry colname="col3">642.2</oasis:entry>
         <oasis:entry colname="col4">7.9</oasis:entry>
         <oasis:entry colname="col5">10.3</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e386">The Lower Dog River has experienced significant flooding in the last decades. For example, in September 2009, the creek, along with most of northern GA, experienced heavy rainfall (5 inches, equal to 94 mm). The Lower Dog River, overwhelmed by large amounts of overland flow from saturated ground in the watershed, experienced massive flooding in September 2009 (Gotvald, 2010). The river crested at 33.8 feet (10.3 m) with a peak discharge of 59 900 cfs (1700 m<sup>3</sup> s<sup>−1</sup>), nearly six times the 100-year flood level (McCallum and Gotvald, 2010). In addition, Dutchmans Creek experienced significant flooding in February 2020. According to local news (WCCB Charlotte's CW, 2020), the flood in Gaston County caused significant infrastructure damage and community disruption. Key impacts included the threatened collapse of the Dutchman's Creek bridge in Mt. Holly and the closure of Highway 7 in McAdenville, GA.</p>
      <p id="d2e410">To provide the meteorological forcing data, i.e., precipitation, temperature, and humidity, were extracted from the National Oceanic and Atmospheric Administration's (NOAA) Local Climatological Data (LCD). We used the NOAA precipitation, temperature, and humidity data of Atlanta Hartsfield Jackson International Airport and Charlotte Douglas Airport stations as an input for neural network algorithms. The data has been monitored since 1 January 1948, and 22 July 1941, with an hourly interval which was used as an input variable for constructing neural networks.</p>
      <p id="d2e413">To fill in the missing values in the data, we used the spline interpolation method. We applied this method to fill the gaps in time series data, although the missing values were insignificant (less than 1 %). In addition, we employed the Minimum Inter-Event Time (MIT) approach to precisely identify and separate individual storm events. The MIT-based event delineation is pivotal for accurately defining storm events. This method allowed us to isolate discrete rainfall episodes, aiding a comprehensive analysis of storm events. Moreover, it provided a basis for event-specific examination of flood responses, such as initial condition and cessation (loss), runoff generation, and runoff dynamics.</p>
      <p id="d2e416">The hourly rainfall dataset consists of distinct rainfall occurrences, some consecutive and others clustered with brief intervals of zero rainfall. As these zero intervals extend, we aim to categorize them into distinct events. It's worth noting that even within a single storm event, we often encounter short periods of no rainfall, known as intra-storm zero values. In the MIT method, we defined a storm event as a discrete rainfall episode surrounded by dry periods both preceding and following it, determined by an MIT (Asquith et al., 2005; Safaei-Moghadam et al., 2023).</p>
      <p id="d2e419">There are many ways to determine MIT value. One practical approximation is using serial autocorrelation between rainfall occurrences. MIT approach uses autocorrelation that measures the statistical dependency of rainfall data at one point in time with data at earlier, or lagged times within the time series. The lag time represents the gap between data points being correlated. When the lag time is zero, the autocorrelation coefficient is unity, indicating a one-to-one correlation. As the lag time increases, the statistical correlation diminishes, converging to a minimum value. This signifies the fact that rainfall events become progressively less statistically dependent or, in other words, temporally unrelated. To pinpoint the optimal MIT, we analyzed the autocorrelation coefficients for various lag times, observing the point at which the coefficient approaches zero. This lag time signifies the minimum interval of no rainfall, effectively delineating distinct rainfall events.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>NN Algorithms</title>
      <p id="d2e430">In this study, three distinct neural network (NN) architectures were developed to perform multi-horizon flood forecasting. Each NN was coupled with a MQL objective to generate probabilistic predictions and quantify predictive uncertainty. Throughout the manuscript, the term parameters are used exclusively to refer to the network's weights and biases for clarity and consistency.</p>
<sec id="Ch1.S2.SS2.SSS1">
  <label>2.2.1</label><title>LSTM</title>
      <p id="d2e440">LSTM is an RNN architecture widely used as a benchmark model for flood neural time series modeling. LSTM networks are capable of selectively learning order dependence in sequence prediction problems (Sadeghi Tabas and Samadi, 2022). These networks are powerful because they can capture the temporal features, especially the long-term dependencies (Hochreiter et al., 2001) and are independent of the length of the data sequences input, meaning that each sample is independent from another one.</p>
      <p id="d2e443">The memory cell state within LSTM plays a crucial role in capturing extended patterns in data, making it well-suited for dynamic time series modeling such as flood prediction. An LSTM cell uses the following functions to compute flood prediction.

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M8" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E1"><mml:mtd><mml:mtext>1</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="italic">σ</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>+</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi>B</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi>c</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E2"><mml:mtd><mml:mtext>2</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>f</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>f</mml:mi></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>+</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi>B</mml:mi><mml:mi>f</mml:mi></mml:msub><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi>c</mml:mi><mml:mi>f</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E3"><mml:mtd><mml:mtext>3</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi>o</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi mathvariant="normal">o</mml:mi></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>+</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi>B</mml:mi><mml:mi mathvariant="normal">o</mml:mi></mml:msub><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi>c</mml:mi><mml:mi mathvariant="normal">o</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E4"><mml:mtd><mml:mtext>4</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>m</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi>f</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>⊙</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>+</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>⊙</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">tanh</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>g</mml:mi></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>+</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi>B</mml:mi><mml:mi>g</mml:mi></mml:msub><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi>c</mml:mi><mml:mi>g</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E5"><mml:mtd><mml:mtext>5</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>h</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi>o</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>⊙</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">tanh</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>m</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          Where <inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> represent the input and the hidden state at time step <inline-formula><mml:math id="M11" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, respectively. <inline-formula><mml:math id="M12" display="inline"><mml:mo>⊙</mml:mo></mml:math></inline-formula> denotes element-wise multiplication, tanh stands for the hyperbolic tangent activation function, and <inline-formula><mml:math id="M13" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> represents the sigmoid activation function. <inline-formula><mml:math id="M14" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M15" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M16" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> are trainable weights and biases that undergo optimization during the training process. <inline-formula><mml:math id="M17" display="inline"><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are cell states at time step <inline-formula><mml:math id="M19" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> that are employed in the input processing for the next time step. <inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> represents the memory state responsible for preserving long-term information, while <inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> represents the memory state preserving short-term information. The LSTM cell consists of a forget gate <inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, an input gate <inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and an output gate <inline-formula><mml:math id="M24" display="inline"><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and has a cell state <inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. At every time step <inline-formula><mml:math id="M26" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, the cell gets the data point <inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> with the output of the previous cell <inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> (Windheuser et al., 2023). The forget gate then defines if the information is removed from the cell state, while the input gate evaluates if the information should be added to the cell state and the output gate specifies which information from the cell state can be used for the next cells.</p>
      <p id="d2e950">We used two LSTM layers with 128 cells in the first two hidden layers as encoder layers, which were then connected to two multilayer perceptron (MLP) layers with 128 neurons as decoder layers. The LSTM simulation was performed with these input layers along with the <italic>Adam</italic> optimizer (Kingma and Ba, 2017), tanh activation function, and a single lagged dependent-variable value to train with a learning rate of 0.001. The architecture of the proposed LSTM model is illustrated in Fig. 2.</p>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e959">The structure of LSTM programmed in this research. We used tanh and sigmoid as activation functions along with 2 layers of LSTM, 2 layers of MLP, and 128 cells in each layer.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026-f02.png"/>

        </fig>

</sec>
<sec id="Ch1.S2.SS2.SSS2">
  <label>2.2.2</label><title>N-BEATS</title>
      <p id="d2e976">N-BEATS is a deep learning architecture based on backward and forward residual links and the very deep stack of fully connected layers specifically designed for sequential data forecasting tasks (Oreshkin et al., 2020). This architecture has several desirable properties including interpretability. The N-BEATS architecture distinguishes itself from existing architecture in several ways. First, the algorithm approaches forecasting as a non-linear multivariate regression problem instead of a sequence-to-sequence challenge. Indeed, the core component of this architecture (as depicted in Fig. 3) is a fully connected non-linear regressor, which takes the historical data from a time series as input and generates multiple data points for the forecasting horizon. Second, most existing time series architectures are quite limited in depth, typically consisting of one to five LSTM layers. N-BEATS employs the residual principle to stack a substantial number of layers together, as illustrated in Fig. 3. In this configuration, the basic block not only predicts the next output but also assesses its contribution to decomposing the input, a concept that is referred to as “backcast” (see Oreshkin et al., 2020).</p>

      <fig id="F3" specific-use="star"><label>Figure 3</label><caption><p id="d2e981">The N-BEATS modeling structure, used in this research.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026-f03.png"/>

        </fig>

      <p id="d2e990">The basic building block in the architecture features a fork-like structure, as illustrated in Fig. 3 (bottom). The <inline-formula><mml:math id="M29" display="inline"><mml:mi>l</mml:mi></mml:math></inline-formula>th block (for the sake of brevity, the block index <inline-formula><mml:math id="M30" display="inline"><mml:mi>l</mml:mi></mml:math></inline-formula> is omitted from Fig. 3) takes its respective input, <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, and produces two output vectors: <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. In the initial block of the model, <inline-formula><mml:math id="M34" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> corresponds to the overall model input, which is a historical lookback window of a specific length, culminating with the most recent observed data point. For the subsequent blocks, <inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is derived from the residual outputs of the preceding blocks. Each block generates two distinct outputs: (1) <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>: This represents the forward forecast of the block, spanning a duration of <inline-formula><mml:math id="M37" display="inline"><mml:mi>H</mml:mi></mml:math></inline-formula> time units. (2) <inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>: This signifies the block's optimal estimation of <inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, which is referred to “backcast.” This estimation is made within the constraints of the functional space available to the block for approximating signals (Oreshkin et al., 2020).</p>
      <p id="d2e1116">Internally, the fundamental building block is composed of two elements. The initial element involves a fully connected network, which generates forward expansion coefficient predictors, <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, and a backward expansion coefficient predictor, <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>. The second element encompasses both backward basis layers, <inline-formula><mml:math id="M42" display="inline"><mml:mrow><mml:msubsup><mml:mi>g</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, and forward basis layers, <inline-formula><mml:math id="M43" display="inline"><mml:mrow><mml:msubsup><mml:mi>g</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>. These layers take the corresponding forward <inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>and backward <inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>expansion coefficients as input, conduct internal transformations using a set of basis functions, and ultimately yield the backcast, <inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, and the forecast outputs, <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, as previously described by Oreshkin et al. (2020). The following equations describe the first element:

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M48" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E6"><mml:mtd><mml:mtext>6</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="normal">FC</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="normal">FC</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="normal">FC</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="normal">FC</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msub><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E7"><mml:mtd><mml:mtext>7</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="normal">LINEAR</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="normal">LINEAR</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          The LINEAR layer, in essence, functions as a straightforward linear projection, meaning <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msubsup><mml:mi>W</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>. As for the fully connected (FC) layer, it takes on the role of a conventional FC layer, incorporating RELU non-linearity as an activation function.</p>
      <p id="d2e1490">The second element performs the mapping of expansion coefficients <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>and <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> to produce outputs using basis layers, resulting in <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>l</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mi>g</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>l</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mi>g</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. This process is defined by the following equation:

            <disp-formula id="Ch1.E8" content-type="numbered"><label>8</label><mml:math id="M54" display="block"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>l</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="normal">dim</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:munderover><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup><mml:msubsup><mml:mi mathvariant="bold-italic">v</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>l</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:mi mathvariant="normal">dim</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:munderover><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup><mml:msubsup><mml:mi mathvariant="bold-italic">v</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup></mml:mrow></mml:math></disp-formula>

          Within this context, <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">v</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">v</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> represent the basis vectors for forecasting and backcasting, respectively, while <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> corresponds to the <inline-formula><mml:math id="M58" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th element of <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e1770">The N-BEATS uses a novel hierarchical doubly residual architecture which is illustrated in Fig. 3 (top and middle). This framework incorporates two residual branches, one traversing the backcast predictions of each layer, while the other traverses the forecast branch of each layer. The following equation describes this process:

            <disp-formula id="Ch1.E9" content-type="numbered"><label>9</label><mml:math id="M60" display="block"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>l</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>l</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:munder><mml:mo movablelimits="false">∑</mml:mo><mml:mi>l</mml:mi></mml:munder><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></disp-formula>

          As mentioned earlier, in the specific scenario of the initial block, its input corresponds to the model-level input <inline-formula><mml:math id="M61" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>. In contrast, for all subsequent blocks, the backcast residual branch <inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> can be conceptualized as conducting a sequential analysis of the input signal. The preceding block eliminates the portion of the signal <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>l</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> that it can effectively approximate, thereby simplifying the prediction task for downstream blocks. Significantly, each block produces a partial forecast <inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> , which is initially aggregated at the stack level and subsequently at the overall network level, establishing a hierarchical decomposition. The ultimate forecast <inline-formula><mml:math id="M65" display="inline"><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> is the summation of all partial forecasts (Oreshkin et al., 2020).</p>
      <p id="d2e1897">The N-BEATS model has two primary configurations: generic and interpretable. These configurations determine how the model structures its blocks and how it processes time series data. In the generic configuration, the model uses a stack of generic blocks that are designed to be flexible and adaptable to various patterns in the time series data. Each generic block consists of fully connected layers with ReLU activation functions. The key characteristic of generic configuration is its flexibility. Since the blocks are not specialized for any specific pattern (like trend or seasonality), they can learn a wide range of patterns directly from the data (Oreshkin et al., 2020). In the interpretable configuration, the model architecture integrates distinct trend and seasonality components. This involves structuring the basis layers at the stack level specifically to model these elements, allowing the stack outputs to be more easily understood.</p>
      <p id="d2e1901">Trend Model: In this stack <inline-formula><mml:math id="M66" display="inline"><mml:mrow><mml:msubsup><mml:mi>g</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>l</mml:mi></mml:mrow><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M67" display="inline"><mml:mrow><mml:msubsup><mml:mi>g</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi></mml:mrow><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> are polynomials of a small degree <inline-formula><mml:math id="M68" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>, functions that vary slowly across the forecast window, to replicate monotonic or slowly varying nature of trends:

            <disp-formula id="Ch1.E10" content-type="numbered"><label>10</label><mml:math id="M69" display="block"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow><mml:mi>p</mml:mi></mml:munderover><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>i</mml:mi></mml:mrow><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup><mml:msup><mml:mi>t</mml:mi><mml:mi>i</mml:mi></mml:msup></mml:mrow></mml:math></disp-formula>

          The time vector <inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">t</mml:mi><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">…</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>H</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>H</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:msup><mml:mo>]</mml:mo><mml:mi>T</mml:mi></mml:msup><mml:mo>/</mml:mo><mml:mi>H</mml:mi></mml:mrow></mml:math></inline-formula> is specified on a discrete grid ranging from 0 to <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>H</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:mi>H</mml:mi></mml:mrow></mml:math></inline-formula>, projecting <inline-formula><mml:math id="M72" display="inline"><mml:mi>H</mml:mi></mml:math></inline-formula> steps into the future. Consequently, the trend forecast represented in matrix form is:

            <disp-formula id="Ch1.E11" content-type="numbered"><label>11</label><mml:math id="M73" display="block"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi></mml:mrow><mml:mi mathvariant="normal">tr</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mi mathvariant="bold">T</mml:mi><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi></mml:mrow><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:math></disp-formula>

          Where the polynomial coefficients, <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>l</mml:mi></mml:mrow><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, predicted by an FC network at layer <inline-formula><mml:math id="M75" display="inline"><mml:mi>l</mml:mi></mml:math></inline-formula> of stack <inline-formula><mml:math id="M76" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula>, are described by Eqs. (6) and (7). The matrix <inline-formula><mml:math id="M77" display="inline"><mml:mi mathvariant="bold">T</mml:mi></mml:math></inline-formula>, consisting of powers of <inline-formula><mml:math id="M78" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, is represented as <inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi>t</mml:mi><mml:mi>p</mml:mi></mml:msup><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>. When <inline-formula><mml:math id="M80" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> is small, such as 2 or 3, it compels <inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi></mml:mrow><mml:mi mathvariant="normal">tr</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> to emulate a trend (Oreshkin et al., 2020).</p>
      <p id="d2e2240">Seasonality model: In this stack <inline-formula><mml:math id="M82" display="inline"><mml:mrow><mml:msubsup><mml:mi>g</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi></mml:mrow><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:msubsup><mml:mi>g</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi></mml:mrow><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> are periodic functions, to capture the cyclical and recurring characteristics of seasonality, such that <inline-formula><mml:math id="M84" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="normal">Δ</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M85" display="inline"><mml:mi mathvariant="normal">Δ</mml:mi></mml:math></inline-formula> is the seasonality period. The Fourier series serves as a natural foundation for modeling periodic functions:

            <disp-formula id="Ch1.E12" content-type="numbered"><label>12</label><mml:math id="M86" display="block"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow><mml:mrow><mml:mstyle scriptlevel="+1"><mml:mfrac><mml:mi>H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:munderover><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>i</mml:mi></mml:mrow><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup><mml:mi>cos⁡</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">π</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>i</mml:mi><mml:mo>+</mml:mo><mml:mo>[</mml:mo><mml:mi>H</mml:mi><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>]</mml:mo></mml:mrow><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup><mml:mi>sin⁡</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">π</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>

          
          Consequently, the seasonality forecast is represented in the following matrix form:

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M87" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E13"><mml:mtd><mml:mtext>13</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msubsup><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi></mml:mrow><mml:mi mathvariant="normal">seas</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mi>S</mml:mi><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi></mml:mrow><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E14"><mml:mtd><mml:mtext>14</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="bold">S</mml:mi></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>cos⁡</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">π</mml:mi><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>cos⁡</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">π</mml:mi><mml:mfenced open="[" close="]"><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mi>H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfenced><mml:mi>t</mml:mi></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi>sin⁡</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">π</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>sin⁡</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">π</mml:mi><mml:mfenced close="]" open="["><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mi>H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfenced><mml:mi>t</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          Where the Fourier coefficients <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi></mml:mrow><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, that predicted by an FC network at layer <inline-formula><mml:math id="M89" display="inline"><mml:mi>l</mml:mi></mml:math></inline-formula> of stack <inline-formula><mml:math id="M90" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula>, are described by Eqs. (6) and (7). The matrix <inline-formula><mml:math id="M91" display="inline"><mml:mi mathvariant="bold">S</mml:mi></mml:math></inline-formula> represents sinusoidal waveforms. As a result, the forecast <inline-formula><mml:math id="M92" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>l</mml:mi></mml:mrow><mml:mi mathvariant="normal">seas</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> becomes a periodic function that imitates typical seasonal patterns (Oreshkin et al., 2020).</p>
</sec>
<sec id="Ch1.S2.SS2.SSS3">
  <label>2.2.3</label><title>N-HiTS</title>
      <p id="d2e2668">N-HiTS builds upon the N-BEATS architecture but with improved accuracy and computational efficiency for long-horizon forecasting. N-HiTS utilizes multi-rate sampling and multi-scale synthesis of forecasts, leading to a hierarchical forecast structure that lowers computational demands and improves prediction accuracy (Challu et al., 2022).</p>
      <p id="d2e2671">Like N-BEATS, N-HiTS employs local nonlinear mappings onto foundational functions within numerous blocks (illustrated in Fig. 4). Each block includes an MLP that generates backcast and forecast output coefficients. The backcast output refines the input data for the following blocks, and the forecast outputs are combined to generate the final prediction. Blocks are organized into stacks, with each stack dedicated to grasping specific data attributes using its own distinct set of functions. The network's input is a sequence of <inline-formula><mml:math id="M93" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula> lags (look-back period), with <inline-formula><mml:math id="M94" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> stacks, each containing <inline-formula><mml:math id="M95" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> blocks (Challu et al., 2022).</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e2697">The structure of N-HiTS model programmed in this study. The architecture includes several Stacks, each Stack includes several Block, where each block consists of a MaxPool layer and a multi-layer which learns to produce coefficients for the backcast and forecast outputs of its basis.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026-f04.png"/>

        </fig>

      <p id="d2e2707">In each block, a MaxPool layer with varying kernel sizes (<inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:msub><mml:mi>k</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) is employed at the input, enabling the block to focus on specific input components of different scales. Larger kernel sizes emphasize the analysis of larger-scale, low-frequency data, aiding in improving long-term forecasting accuracy. This approach, known as multi-rate signal sampling, alters the effective input signal sampling rate for each block's MLP (Challu et al., 2022).</p>
      <p id="d2e2721">Additionally, multi-rate processing has several advantages. It reduces memory usage, computational demands, and the number of learnable parameters, and helps prevent overfitting, while preserving the original receptive field. The following operation is applicable to the input <inline-formula><mml:math id="M97" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mi>L</mml:mi><mml:mo>:</mml:mo><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> of each block, with the first block (<inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:mi>l</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>) using the network-wide input, where <inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mi>L</mml:mi><mml:mo>:</mml:mo><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>≡</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mi>L</mml:mi><mml:mo>:</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>.

            <disp-formula id="Ch1.E15" content-type="numbered"><label>15</label><mml:math id="M100" display="block"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mi>L</mml:mi><mml:mo>:</mml:mo><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="normal">MaxPool</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mi>L</mml:mi><mml:mo>:</mml:mo><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi>k</mml:mi><mml:mi>l</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>

          In many multi-horizon forecasting models, the number of neural network predictions matches the horizon's dimensionality, denoted as <inline-formula><mml:math id="M101" display="inline"><mml:mi>H</mml:mi></mml:math></inline-formula>. For instance, in N-BEATS, the number of predictions <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:mfenced close="|" open="|"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>H</mml:mi></mml:mrow></mml:math></inline-formula>. This results in a significant increase in computational demands and an unnecessary surge in model complexity as the horizon <inline-formula><mml:math id="M103" display="inline"><mml:mi>H</mml:mi></mml:math></inline-formula> becomes larger (Challu et al., 2022).</p>
      <p id="d2e2899">To address these challenges, N-HiTS proposes the use of temporal interpolation. This model manages the parameter counts per unit of output time (<inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:mfenced open="|" close="|"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mfenced close="⌉" open="⌈"><mml:mrow><mml:msub><mml:mi>r</mml:mi><mml:mi>l</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>H</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>) by defining the dimensionality of the interpolation coefficients with respect to the expressiveness ratio <inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:msub><mml:mi>r</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. To revert to the original sampling rate and predict all horizon points, this model employs temporal interpolation through the function <inline-formula><mml:math id="M106" display="inline"><mml:mi>g</mml:mi></mml:math></inline-formula>:

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M107" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E16"><mml:mtd><mml:mtext>16</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>g</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">f</mml:mi></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:mspace width="1em" linebreak="nobreak"/><mml:mo>∀</mml:mo><mml:mi mathvariant="italic">τ</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>∈</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo mathvariant="italic">{</mml:mo><mml:mi>t</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>+</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>t</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>+</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>H</mml:mi><mml:mo mathvariant="italic">}</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E17"><mml:mtd><mml:mtext>17</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">̃</mml:mo></mml:mover><mml:mrow><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>,</mml:mo><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>=</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>g</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>l</mml:mi><mml:mi mathvariant="normal">b</mml:mi></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="1em"/><mml:mo>∀</mml:mo><mml:mi mathvariant="italic">τ</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>∈</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo mathvariant="italic">{</mml:mo><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>L</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>t</mml:mi><mml:mo mathvariant="italic">}</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E18"><mml:mtd><mml:mtext>18</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi>g</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="italic">θ</mml:mi></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="italic">θ</mml:mi><mml:mfenced open="[" close="]"><mml:mrow><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mfenced close=")" open="("><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi mathvariant="italic">θ</mml:mi><mml:mfenced close="]" open="["><mml:mrow><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:mfenced><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="italic">θ</mml:mi><mml:mfenced open="[" close="]"><mml:mrow><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mfenced><mml:mo>(</mml:mo><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E19"><mml:mtd><mml:mtext>19</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>=</mml:mo><mml:mi>arg⁡</mml:mi><mml:munder><mml:mo movablelimits="false">min⁡</mml:mo><mml:mrow><mml:mi>t</mml:mi><mml:mo>∈</mml:mo><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>:</mml:mo><mml:mi>t</mml:mi><mml:mo>≤</mml:mo><mml:mi mathvariant="italic">τ</mml:mi></mml:mrow></mml:munder><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>-</mml:mo><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="1em"/><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi>t</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:msub><mml:mi>r</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          The hierarchical interpolation approach involves distributing expressiveness ratios over blocks, integrated with multi-rate sampling. Blocks closer to the input employ more aggressive interpolation, generating lower granularity signals. These blocks specialize in analyzing more aggressively subsampled signals. The final hierarchical prediction, <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>H</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, is constructed by combining outputs from all blocks, creating interpolations at various time-scale hierarchy levels. This approach maintains a structured hierarchy of interpolation granularity, with each block focusing on its own input and output scales (Challu et al., 2022).</p>
      <p id="d2e3295">To manage a diverse set of frequency bands while maintaining control over the number of parameters, exponentially increasing expressiveness ratios are recommended. As an alternative, each stack can be dedicated to modeling various recognizable cycles within the time series (e.g., weekly, or daily) employing matching <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:msub><mml:mi>r</mml:mi><mml:mi>l</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. Ultimately, the residual obtained from backcasting in the preceding hierarchy level is subtracted from the input of the subsequent level, intensifying the next-level block's attention on signals outside the previously addressed band (Challu et al., 2022).

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M110" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E20"><mml:mtd><mml:mtext>20</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>H</mml:mi></mml:mrow></mml:msub><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>L</mml:mi></mml:munderover><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>H</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E21"><mml:mtd><mml:mtext>21</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mi>L</mml:mi><mml:mo>:</mml:mo><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>l</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>=</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mi>L</mml:mi><mml:mo>:</mml:mo><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mi>L</mml:mi><mml:mo>:</mml:mo><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula></p>
</sec>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Performance Metrics</title>
      <p id="d2e3473">To comprehensively evaluate the accuracy of flood predictions, we utilized a suite of metrics, including Nash-Sutcliffe Efficiency (NSE; Nash and Sutcliffe, 1970), persistent Nash-Sutcliffe Efficiency (persistent-NSE), Kling–Gupta efficiency (KGE; Gupta et al., 2009), Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Peak Flow Error (PFE), and Time to Peak Error (TPE; Evin et al., 2024; Lobligeois et al., 2014). These metrics collectively facilitate a rigorous assessment of the model's performance in reproducing the magnitude of observed peak flows and the shape of the hydrograph.</p>
      <p id="d2e3476">NSE measures the model's ability to explain the variance in observed data and assesses the goodness-of-fit by comparing the observed and simulated hydrographs. In hydrological studies, the NSE index is a widely accepted measure for evaluating the fitting quality of models (McCuen et al., 2006). It is calculated as:

          <disp-formula id="Ch1.E22" content-type="numbered"><label>22</label><mml:math id="M111" display="block"><mml:mrow><mml:mi mathvariant="normal">NSE</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi mathvariant="normal">o</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula>

        Where <inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> represents observed value at time <inline-formula><mml:math id="M113" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M114" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> represents simulated value at time <inline-formula><mml:math id="M115" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M116" display="inline"><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi mathvariant="normal">o</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:math></inline-formula> is the mean observed values and <inline-formula><mml:math id="M117" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> is the number of data points. An NSE value of 1 indicates a perfect match between the observed and modeled data, while lower values represent the degree of departure from a perfect fit.</p>
      <p id="d2e3638">As the models are designed to predict one hour ahead in one of the prediction horizons, the persistent-NSE is essential for evaluating their performance. The standard NSE measures the model's sum of squared errors relative to the sum of squared errors when the mean observation is used as the forecast value. In contrast, persistent-NSE uses the most recent observed data as the forecast value for comparison (Nevo et al., 2022). The persistent-NSE is calculated as:

          <disp-formula id="Ch1.E23" content-type="numbered"><label>23</label><mml:math id="M118" display="block"><mml:mrow><mml:mtext>persistent-NSE</mml:mtext><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula>

        Where <inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> represents the observed value at time <inline-formula><mml:math id="M120" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M121" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> represents the simulated value at time <inline-formula><mml:math id="M122" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>is the observed value at the last time step (<inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>) and <inline-formula><mml:math id="M125" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> is the number of data points.</p>
      <p id="d2e3823">The KGE is a widely used performance metric in hydrological modeling and combines multiple aspects of model performance, including correlation, variability bias, and mean bias. The KGE metric is calculated using the following equation:

          <disp-formula id="Ch1.E24" content-type="numbered"><label>24</label><mml:math id="M126" display="block"><mml:mrow><mml:mi mathvariant="normal">KGE</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msqrt><mml:mrow><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>+</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>+</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="italic">β</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt></mml:mrow></mml:math></disp-formula>

        Where <inline-formula><mml:math id="M127" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> represents Pearson correlation coefficient between observed <inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi mathvariant="normal">o</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and simulated <inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>s</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values. <inline-formula><mml:math id="M130" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> represents bias ratio, calculated as <inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mi mathvariant="normal">o</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula> where <inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mi mathvariant="normal">o</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are the means of simulated and observed data, respectively. <inline-formula><mml:math id="M134" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula> represents variability ratio, calculated as <inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:mi mathvariant="italic">β</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub><mml:mo>/</mml:mo><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">o</mml:mi></mml:msub><mml:mo>/</mml:mo><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mi mathvariant="normal">o</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula> where <inline-formula><mml:math id="M136" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">o</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are the standard deviations of simulated and observed data, respectively.</p>
      <p id="d2e4039">RMSE quantifies the average magnitude of errors between observed and modeled values, offering insights into the absolute goodness-of-fit, while MAE is a measure of the average absolute difference between the modeled values and the observed values and provides a measure of the average magnitude of errors. RMSE is calculated as:

          <disp-formula id="Ch1.E25" content-type="numbered"><label>25</label><mml:math id="M138" display="block"><mml:mrow><mml:mi mathvariant="normal">RMSE</mml:mi><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msqrt><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mstyle><mml:mspace linebreak="nobreak" width="0.125em"/><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mo>(</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt></mml:mrow></mml:math></disp-formula>

        and MAE is calculated as:

          <disp-formula id="Ch1.E26" content-type="numbered"><label>26</label><mml:math id="M139" display="block"><mml:mrow><mml:mi mathvariant="normal">MAE</mml:mi><mml:mo>=</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mstyle><mml:mspace linebreak="nobreak" width="0.125em"/><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mfenced close="|" open="|"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></disp-formula>

        Where <inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> represents observed value at time <inline-formula><mml:math id="M141" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> represents simulated value at time <inline-formula><mml:math id="M143" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M144" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> is the number of data points. RMSE and MAE provide information about the magnitude of modeling errors, with smaller values indicating a better model fit.</p>
      <p id="d2e4205">PFE quantifies the magnitude disparity between observed and modeled peak flow values. The PFE metric is defined as:

          <disp-formula id="Ch1.E27" content-type="numbered"><label>27</label><mml:math id="M145" display="block"><mml:mrow><mml:mi mathvariant="normal">PFE</mml:mi><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mfenced close="|" open="|"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula>

        Where <inline-formula><mml:math id="M146" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> represents the observed peak flow value, and <inline-formula><mml:math id="M147" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> signifies the simulated peak flow value. The PFE metric, expressed as a dimensionless value, provides a quantitative measure of the relative error in predicting peak flow magnitudes concerning the observed values. A smaller PFE denotes more accurate modeling of peak flow magnitudes, with a value of zero indicating a perfect match.</p>
      <p id="d2e4285">TPE assesses the temporal alignment of peak flows in the observed and modeled hydrographs. The TPE metric is computed as:

          <disp-formula id="Ch1.E28" content-type="numbered"><label>28</label><mml:math id="M148" display="block"><mml:mrow><mml:mi mathvariant="normal">TPE</mml:mi><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mfenced open="|" close="|"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></disp-formula>

        Where <inline-formula><mml:math id="M149" display="inline"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> signifies the time at which the peak flow occurs in the observed hydrograph, and <inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> represents the time at which the peak flow occurs in the simulated hydrograph. TPE that is measured in units of time (hours), provides insight into the precision of peak flow timing. Smaller TPE values indicate a superior alignment between the observed and modeled peak flow timing, while larger TPE values indicate discrepancies in the temporal occurrence of peak flows.</p>
      <p id="d2e4352">The utilization of these five metrics, PFE, persistent-NSE, TPE, NSE, and RMSE, collectively provides a robust and multifaceted assessment of flood prediction performance. This approach ensures that both the magnitude and timing of peak flows, as well as the overall hydrograph shape, are accurately calibrated and validated.</p>
</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Sensitivity and Uncertainty Analysis</title>
      <p id="d2e4363">When implementing NN models, it's crucial to understand how each input feature affects the model's performance or outputs. To achieve this, we systematically excluded each input feature from the model one by one (the Leave-One-Out method). For each exclusion, we retrained the model without that specific input feature and then tested its performance against a test dataset. This method helps in understanding which input features are most critical to the model's performance and which ones have a lesser impact. It also allows us to identify any input features that may be redundant or have little effect on the overall outcome, thus potentially simplifying the model without sacrificing accuracy.</p>
      <p id="d2e4366">In this study, we utilized probabilistic approaches to quantify the uncertainty in flood prediction. This method is rooted in statistical techniques employed for the estimation of unknown probability distributions, with a foundation in observed data. More specifically, we leveraged the Maximum Likelihood Estimation (MLE) approach, which entails the determination of MQL objective values that optimize the likelihood function. The likelihood function quantifies the probability of MQL objective taking values, given the observed realizations.</p>
      <p id="d2e4369">We incorporated the MQL as a probabilistic error metric into algorithmic architecture. MQL performs an evaluation by computing the average loss for a predefined set of quantiles. This computation is grounded in the absolute disparities between predicted quantiles and their corresponding observed values. By considering multiple quantile levels, MQL provides a comprehensive assessment of the model's ability to capture the distribution of the target variable, rather than focusing solely on point estimates.</p>
      <p id="d2e4372">The MQL metric also aligns closely with the Continuous Ranked Probability Score (CRPS), a standard tool for evaluating predictive distributions. CRPS measures the difference between the predicted cumulative distribution function and the observed values by integrating over all possible quantiles. The computation of CRPS involves a numerical integration technique that discretizes quantiles and applies a left Riemann approximation for CRPS integral computation. This process culminates in the averaging of these computations over uniformly spaced quantiles, providing a robust evaluation of the predictive distribution <inline-formula><mml:math id="M151" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>F</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.

              <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M152" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E29"><mml:mtd><mml:mtext>29</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi mathvariant="normal">MQL</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi mathvariant="italic">τ</mml:mi></mml:msub><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mfenced close="]" open="["><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="italic">τ</mml:mi><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:msubsup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi mathvariant="italic">τ</mml:mi><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mstyle><mml:mspace width="0.125em" linebreak="nobreak"/><mml:munder><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:munder><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">QL</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>(</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mi mathvariant="italic">τ</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="italic">τ</mml:mi><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E30"><mml:mtd><mml:mtext>30</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi mathvariant="normal">CRPS</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi mathvariant="italic">τ</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mover accent="true"><mml:mi>F</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="italic">τ</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:munderover><mml:mo movablelimits="false">∫</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mn mathvariant="normal">1</mml:mn></mml:munderover><mml:mi mathvariant="normal">QL</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi mathvariant="italic">τ</mml:mi></mml:msub><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="italic">τ</mml:mi><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mi mathvariant="normal">d</mml:mi><mml:mi>q</mml:mi></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E31"><mml:mtd><mml:mtext>31</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="normal">QL</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi mathvariant="italic">τ</mml:mi></mml:msub><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi mathvariant="italic">τ</mml:mi><mml:mi>q</mml:mi></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>H</mml:mi></mml:mfrac></mml:mstyle><mml:mspace linebreak="nobreak" width="0.125em"/><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>=</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>H</mml:mi></mml:mrow></mml:munderover><mml:mo>(</mml:mo><mml:mfenced close=")" open="("><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>q</mml:mi></mml:mrow></mml:mfenced><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi mathvariant="italic">τ</mml:mi><mml:mi>q</mml:mi></mml:msubsup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>-</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi>Q</mml:mi><mml:mi mathvariant="italic">τ</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mi mathvariant="italic">τ</mml:mi></mml:msub><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="italic">τ</mml:mi><mml:mi>q</mml:mi></mml:msubsup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

        Where <inline-formula><mml:math id="M153" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi mathvariant="italic">τ</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> represents observed value at time <inline-formula><mml:math id="M154" display="inline"><mml:mi mathvariant="italic">τ</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M155" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi mathvariant="italic">τ</mml:mi><mml:mi>q</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> represents simulated value at time <inline-formula><mml:math id="M156" display="inline"><mml:mi mathvariant="italic">τ</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M157" display="inline"><mml:mi>q</mml:mi></mml:math></inline-formula> is the slope of the quantile loss, and <inline-formula><mml:math id="M158" display="inline"><mml:mi>H</mml:mi></mml:math></inline-formula> is the horizon of forecasting (Fig. 5).</p>

      <fig id="F5"><label>Figure 5</label><caption><p id="d2e4771">The MQL function which shows loss values for different values of <inline-formula><mml:math id="M159" display="inline"><mml:mi>q</mml:mi></mml:math></inline-formula> when the true value is <inline-formula><mml:math id="M160" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi mathvariant="italic">τ</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.</p></caption>
        <graphic xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026-f05.png"/>

      </fig>

      <p id="d2e4798">Implementation-wise, let <inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:mi>D</mml:mi><mml:mo>=</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo><mml:msubsup><mml:mo mathvariant="italic">}</mml:mo><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> denote training pairs, where <inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the past 24 h discharge context and <inline-formula><mml:math id="M163" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> the discharge <inline-formula><mml:math id="M164" display="inline"><mml:mi>h</mml:mi></mml:math></inline-formula> hours ahead. For a fixed horizon <inline-formula><mml:math id="M165" display="inline"><mml:mi>h</mml:mi></mml:math></inline-formula> and quantile levels <inline-formula><mml:math id="M166" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">τ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:msubsup><mml:mo mathvariant="italic">}</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, each model <inline-formula><mml:math id="M167" display="inline"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi mathvariant="italic">θ</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> outputs the vector of conditional quantiles:

          <disp-formula id="Ch1.E32" content-type="numbered"><label>32</label><mml:math id="M168" display="block"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mi mathvariant="italic">θ</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mi mathvariant="italic">τ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi mathvariant="italic">τ</mml:mi><mml:mi>K</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>∈</mml:mo><mml:msup><mml:mi>R</mml:mi><mml:mi>K</mml:mi></mml:msup></mml:mrow></mml:math></disp-formula>

        Parameters <inline-formula><mml:math id="M169" display="inline"><mml:mi mathvariant="italic">θ</mml:mi></mml:math></inline-formula> are learned by minimizing the multi-quantile (pinball) loss:

          <disp-formula id="Ch1.E33" content-type="numbered"><label>33</label><mml:math id="M170" display="block"><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="script">L</mml:mi><mml:mfenced open="(" close=")"><mml:mi mathvariant="italic">θ</mml:mi></mml:mfenced><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi mathvariant="normal">NK</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover><mml:mspace width="0.125em" linebreak="nobreak"/><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:munderover><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mrow><mml:msub><mml:mi mathvariant="italic">τ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:msub><mml:mi mathvariant="italic">τ</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="italic">τ</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>u</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mo movablelimits="false">max⁡</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="italic">τ</mml:mi><mml:mi>u</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mi>u</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>-</mml:mo><mml:msub><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mi>u</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:msub><mml:mo>)</mml:mo><mml:mi>u</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

        Because <inline-formula><mml:math id="M171" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="italic">τ</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is convex and piecewise linear, its (sub)gradient with respect to <inline-formula><mml:math id="M172" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>h</mml:mi></mml:mrow><mml:mi mathvariant="italic">τ</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> is:

          <disp-formula id="Ch1.E34" content-type="numbered"><label>34</label><mml:math id="M173" display="block"><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="italic">τ</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="italic">τ</mml:mi></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi mathvariant="italic">τ</mml:mi></mml:msup></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>=</mml:mo><mml:mfenced open="{" close=""><mml:mtable class="array" columnalign="left left"><mml:mtr><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mfenced close=")" open="("><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="italic">τ</mml:mi></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi>y</mml:mi><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="italic">τ</mml:mi></mml:msup><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi>y</mml:mi><mml:mo>-</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi mathvariant="italic">τ</mml:mi></mml:msup><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:math></disp-formula>

        enabling backpropagation (Adam) without any sampling. Thus, each quantile <inline-formula><mml:math id="M174" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>h</mml:mi></mml:mrow><mml:mi mathvariant="italic">τ</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> is a direct network output learned to satisfy the quantile condition under <inline-formula><mml:math id="M175" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="italic">τ</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. Uncertainty intervals are formed from these quantile predictions; for a 95 % band we use <inline-formula><mml:math id="M176" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>h</mml:mi></mml:mrow><mml:mn mathvariant="normal">0.025</mml:mn></mml:msubsup><mml:msubsup><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>h</mml:mi></mml:mrow><mml:mn mathvariant="normal">0.975</mml:mn></mml:msubsup><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>. The resulting bands quantify the uncertainty conditional on <inline-formula><mml:math id="M177" display="inline"><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e5408">Incorporating MQL as a central metric in our study underscores its suitability for probabilistic forecasting, particularly in the context of uncertainty quantification. Unlike traditional error metrics that focus on point predictions, MQL captures both central tendencies and variability by penalizing errors symmetrically across quantiles. This property ensures balanced and reliable assessments of the predictive distribution, ultimately enhancing the robustness and interpretability of flood prediction models.</p>
      <p id="d2e5411">Furthermore, we employed two key indices, the <inline-formula><mml:math id="M178" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor and the <inline-formula><mml:math id="M179" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor, to rigorously assess the quality of uncertainty performance in our hydrological modeling. These metrics are instrumental in quantifying the extent to which the model's predictions encompass the observed data, thereby providing valuable insights into the model's predictive accuracy and reliability.</p>
      <p id="d2e5428">The <inline-formula><mml:math id="M180" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor, or percentage of data within 95 PPU, is the first index used in this assessment. The <inline-formula><mml:math id="M181" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor quantifies the percentage of observed data that falls within the 95 PPU, providing a measure of the model's predictive accuracy. The <inline-formula><mml:math id="M182" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor can theoretically vary from 0 % to a maximum of 100 %. A <inline-formula><mml:math id="M183" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor of 100 % signifies a perfect alignment between the model's predictions and the observed data within the uncertainty band. In contrast, a lower <inline-formula><mml:math id="M184" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor indicates a reduced ability of the model to predict data within the specified uncertainty range.

          <disp-formula id="Ch1.E35" content-type="numbered"><label>35</label><mml:math id="M185" display="block"><mml:mrow><mml:mi>P</mml:mi><mml:mtext>-Factor</mml:mtext><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mtext>Observations braketed by 95 PPU</mml:mtext><mml:mtext>Number of observations</mml:mtext></mml:mfrac></mml:mstyle><mml:mo>×</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></disp-formula>

        The <inline-formula><mml:math id="M186" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor can be computed by dividing the average width of the uncertainty band by the standard deviation of the measured variable. The <inline-formula><mml:math id="M187" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor, with a minimum possible value of zero, provides a measure of the spread of uncertainty relative to the variability of the observed data. Theoretically, the <inline-formula><mml:math id="M188" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor spans from 0 to infinity, and a value of zero implies that the model's predictions precisely match the measured data, with the uncertainty band being very narrow in relation to the variability of the observed data.

          <disp-formula id="Ch1.E36" content-type="numbered"><label>36</label><mml:math id="M189" display="block"><mml:mrow><mml:mi>R</mml:mi><mml:mtext>-Factor</mml:mtext><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mtext>Average width of 95 PPU band</mml:mtext><mml:mtext>Standard deviation of measured variables</mml:mtext></mml:mfrac></mml:mstyle><mml:mo>×</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></disp-formula>

        In practice, the quality of the model is assessed by considering the 95 % prediction band with the highest <inline-formula><mml:math id="M190" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor and the lowest <inline-formula><mml:math id="M191" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor. This specific band encompasses most observed records, signifying the model's ability to provide accurate and reliable predictions while effectively quantifying uncertainty. A simulation with a <inline-formula><mml:math id="M192" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor of 1 and an <inline-formula><mml:math id="M193" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor of 0 signifies an ideal scenario where the model precisely matches the measured data within the uncertainty band (Abbaspour et al., 2007).</p>
      <p id="d2e5565">Figure 6 shows the workflow of programming N-BEATS, N-HiTS, and LSTM for flood prediction. As illustrated, the initial step involved cleaning and preparing the input data, which was then used to feed the models. The workflow for each model and their output generation processes are depicted in Fig. 6. We segmented the storm events using the MIT approach, as previously described. Following this, we conducted a sensitivity analysis using the Leave-One-Out method and performed uncertainty analysis using the MLE approach to construct the 95 PPU band. This rigorous methodology ensures a robust evaluation of model performance under varying conditions and highlights the models' predictive reliability and resilience. We employed the “NeuralForecast” Python package to develop the N-BEATS, N-HiTS, and LSTM models. This package provides a diverse array of NN models with an emphasis on usability and robustness.</p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e5571">The workflow of N-BEATS, N-HiTS, and LSTM implementation. The upper section of the figure illustrates multiple steps from data preprocessing to model evaluation. The lower section provides a detailed view of the workflow and implementation for each model, highlighting the specific processes and methodologies employed in generating the outputs. Backpropagation Through Time (BPTT) trains LSTM by unrolling the model through time, computing gradients for each time step, and updating weights based on temporal dependencies.</p></caption>
        <graphic xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026-f06.png"/>

      </fig>

</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results and Discussion</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Independent Storms Delineation</title>
      <p id="d2e5596">MIT's contextual delineation of storm events laid the groundwork for in-depth evaluation of rainfall events, enabling isolation and separation of rainfall events that led to significant flooding events. The nuanced outcomes of the MIT assessment contributed significantly to the understanding of rainfall variability and distribution as the dominant contributor to flood generation.</p>
      <p id="d2e5599">During modeling implementation, the initial imperative was the precise distinction of storm events within the precipitation time series data of each case study. Our findings demonstrate that on average a dry period of 7 h serves as the optimal MIT time for both of our case studies. This outcome signifies that when a dry interval of more than 7 h transpires between two successive rainfall events, these subsequent rainfalls should be considered two distinct storm events. This determination underlines the temporal threshold necessary for distinguishing between individual meteorological phenomena in two case studies.</p>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Hyperparameter Optimization</title>
      <p id="d2e5611">In the context of hyperparameter optimization, we systematically considered and tuned various hyperparameters for the N-HiTS, N-BEATS, and LSTM. We searched for learning rates on a log-uniform grid between <inline-formula><mml:math id="M194" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>  and <inline-formula><mml:math id="M195" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, batch sizes <inline-formula><mml:math id="M196" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">16</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">32</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">64</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, input size <inline-formula><mml:math id="M197" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">6</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">12</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">24</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> h. For the LSTM, recurrent layers <inline-formula><mml:math id="M198" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, hidden units per layer <inline-formula><mml:math id="M199" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">64</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">128</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">256</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, activation <inline-formula><mml:math id="M200" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="normal">tanh</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">ReLU</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, decoder MLP depth <inline-formula><mml:math id="M201" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, and decoder MLP width <inline-formula><mml:math id="M202" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">64</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">128</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">256</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> were varied during the simulation run. For N-HiTS, stacks <inline-formula><mml:math id="M203" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, blocks per stack <inline-formula><mml:math id="M204" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">5</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, block MLP width <inline-formula><mml:math id="M205" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">64</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">128</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">256</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, and block MLP depth <inline-formula><mml:math id="M206" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> were explored. For N-BEATS, we searched stacks <inline-formula><mml:math id="M207" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, blocks per stack <inline-formula><mml:math id="M208" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">5</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, block MLP width <inline-formula><mml:math id="M209" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">64</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">128</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">256</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, and block MLP depth <inline-formula><mml:math id="M210" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>; the interpretable (trend/seasonality) basis was kept fixed. Following extensive exploration and fine-tuning of these hyperparameters, the optimal configurations were identified (see Table 2). For the N-HiTS model, the most favorable outcomes were achieved with the following hyperparameter settings: 2000 epochs, “identity” for scaler type, a learning rate of 0.001, a batch size of 32, input size of 24 h, “identity” for stack type, 512 units for hidden layers of each stack, step size of 1, MQLoss as loss function, and “ReLU” for the activation function. As shown in Table 2, the N-HiTS model demonstrated superior performance with 4 stacks, containing 2 blocks each, and corresponding coefficients of 48, 24, 12, and 1, showcasing the significance of these settings for flood prediction.</p>

<table-wrap id="T2" specific-use="star"><label>Table 2</label><caption><p id="d2e5963">Optimized values for the hyperparameters.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="justify" colwidth="3cm"/>
     <oasis:colspec colnum="3" colname="col3" align="justify" colwidth="3cm"/>
     <oasis:colspec colnum="4" colname="col4" align="justify" colwidth="3cm"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Hyperparameter</oasis:entry>
         <oasis:entry colname="col2">N-HiTS</oasis:entry>
         <oasis:entry colname="col3">N-BEATS</oasis:entry>
         <oasis:entry colname="col4">LSTM</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Epoch</oasis:entry>
         <oasis:entry colname="col2">2000</oasis:entry>
         <oasis:entry colname="col3">2000</oasis:entry>
         <oasis:entry colname="col4">5000</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Scaler type</oasis:entry>
         <oasis:entry colname="col2">identity</oasis:entry>
         <oasis:entry colname="col3">identity</oasis:entry>
         <oasis:entry colname="col4">standard</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Learning rate</oasis:entry>
         <oasis:entry colname="col2">0.001</oasis:entry>
         <oasis:entry colname="col3">0.001</oasis:entry>
         <oasis:entry colname="col4">0.001</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Batch size</oasis:entry>
         <oasis:entry colname="col2">32</oasis:entry>
         <oasis:entry colname="col3">32</oasis:entry>
         <oasis:entry colname="col4">32</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Input size</oasis:entry>
         <oasis:entry colname="col2">24 h</oasis:entry>
         <oasis:entry colname="col3">24 h</oasis:entry>
         <oasis:entry colname="col4">24 h</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Stack type</oasis:entry>
         <oasis:entry colname="col2">Seasonality, trend,identity, identity</oasis:entry>
         <oasis:entry colname="col3">Seasonality, trend,identity</oasis:entry>
         <oasis:entry colname="col4"><sup>*</sup></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Number of units in each hidden layer</oasis:entry>
         <oasis:entry colname="col2">512</oasis:entry>
         <oasis:entry colname="col3">512</oasis:entry>
         <oasis:entry colname="col4">128</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Loss function</oasis:entry>
         <oasis:entry colname="col2">MQLoss</oasis:entry>
         <oasis:entry colname="col3">MQLoss</oasis:entry>
         <oasis:entry colname="col4">MQLoss</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Activation function</oasis:entry>
         <oasis:entry colname="col2">ReLU</oasis:entry>
         <oasis:entry colname="col3">ReLU</oasis:entry>
         <oasis:entry colname="col4">tanh</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Number of stacks</oasis:entry>
         <oasis:entry colname="col2">4</oasis:entry>
         <oasis:entry colname="col3">3</oasis:entry>
         <oasis:entry colname="col4"><sup>*</sup></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Number of blocks in each stack</oasis:entry>
         <oasis:entry colname="col2">2</oasis:entry>
         <oasis:entry colname="col3">2</oasis:entry>
         <oasis:entry colname="col4"><sup>*</sup></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Stacks' coefficients</oasis:entry>
         <oasis:entry colname="col2">48,24,12,1</oasis:entry>
         <oasis:entry colname="col3"><sup>*</sup></oasis:entry>
         <oasis:entry colname="col4"><sup>*</sup></oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p id="d2e5966"><sup>*</sup> Not applicable.</p></table-wrap-foot></table-wrap>

      <p id="d2e6233">This hyperparameter optimization was also conducted for the N-BEATS model. In this model, we considered 2000 epochs, 3 stacks with 2 blocks, “identity” for scaler type, a learning rate of 0.001, a batch size of 32, input size of 24 h, “identity” for stack type, 512 units for hidden layers of each stack, step size of 1, MQLoss as loss function, and “ReLU” for the activation function.</p>
      <p id="d2e6237">Moreover, the LSTM as a benchmark model yielded its best results with 5000 epochs, an input size of 24 h, “identity” as the scaler type, a learning rate of 0.001, a batch size of 32, and “tanh” as the activation function. Furthermore, LSTM's hidden state was most effective with two layers containing 128 units, and the MLP decoder thrived with two layers encompassing 128 units. These meticulously optimized hyperparameter settings represent the culmination of efforts to ensure that each model operates at its peak potential, facilitating accurate flood prediction.</p>
      <p id="d2e6240">In Table 2, “epoch” refers to the number of training steps, and “scaler type” indicates the type of scaler used for normalizing temporal inputs. The “learning rate” specifies the step size at each iteration while optimizing the model, and the “batch size” represents the number of samples processed in one forward and backward pass. The “loss function” quantifies the difference between the predicted outputs and the actual target values, while the “activation function” determines whether a neuron should be activated. The “stacks' coefficients” in the N-HITS model control the frequency specialization for each stack, enabling effective handling of different frequency components in the time series data.</p>
      <p id="d2e6243">Another hyperparameter for all three models is input size, which is a variable that determines the maximum sequence length for truncated backpropagation during training and the number of autoregressive inputs (lags) that the models considered for prediction. Essentially, input size represents the length of the historical series data used as input to the model. This variable offers flexibility in the models, allowing them to learn from a defined window of past observations, which can range from the entire historical dataset to a subset, tailored to the specific requirements of the prediction task. In the context of flood prediction, determining the appropriate input size is crucial to adequately capture the meteorological data preceding the flood event. To address this, we calculated the time of concentration (TC) of the watershed system and set the input size to exceed this duration. According to the Natural Resources Conservation Service (NRCS, 2010), for typical natural watershed conditions, the TC can be calculated from lag time, the time between peak rainfall and peak discharge, using the formula: <inline-formula><mml:math id="M217" display="inline"><mml:mrow><mml:mtext>Lag time</mml:mtext><mml:mo>=</mml:mo><mml:mi mathvariant="normal">TC</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>×</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">0.6</mml:mn></mml:mrow></mml:math></inline-formula> (NRCS, 2010). Specifically, the average TC in the Lower Dog River watershed and Upper Dutchmans Creek watershed was found to be 19 and 22 h, respectively. As these represent the average TC for our case studies, we selected the 24 h for input data, slightly longer than the average TC, ensuring sufficient coverage of relevant meteorological data preceding all flood events.</p>
</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Flood Prediction and Performance Assessment</title>
      <p id="d2e6272">In this study, we conducted a comprehensive performance evaluation of N-HiTS, N-BEATS, and benchmarked these models with LSTM, utilizing two case studies: the Lower Dog River and the Upper Dutchmans Creek watersheds. Within these case studies, we trained and validated the models separately for each watershed across a diverse set of storm events from 1 October 2007 to 1 October 2022 (15 years) in the Lower Dog River and from 21 December 1994 to 1 October 2022 (27 years) in the Upper Dutchmans Creek. The decision to train separate models for each catchment was made to account for the unique hydrological characteristics and local features specific to each watershed. By training models individually, we aimed to optimize performance by tailoring each model to the distinct rainfall-runoff relationship inherent in each catchment. All algorithms were tested using unseen flooding events that occurred between 14 December 2022 and 28 March 2023. Our targets were event-focused, where operational value focuses on performance during rising limbs, peaks, and recessions. Evaluating over the entire continuous hydrograph (testing period) can dilute or even mask differences. For this reason, we prioritized an event-centric assessment as the primary evaluation approach rather than full-period metrics. In the Dog River gauging station, two winter storms, i.e., 3 to 5 January 2023 (Event 1) and 17 to 18 February 2023 (Event 2), as well as a spring flood event that occurred during 26 to 28 March 2023 (Event 3) were selected for testing. Additionally, three winter flooding events, i.e., 14 to 16 December 2022 (Event 4), 25 and 26 January 2023 (Event 5), and 11 to 13 February 2023 (Event 6), were chosen to test the algorithms across the Killian Creek gauging station in the Upper Dutchmans Creek. The rainfall events corresponding to these flooding events were delineated using the MIT technique discussed in Sect. 3.1.</p>
      <p id="d2e6275">Our results for the Lower Dog River case study explicitly demonstrated the accuracy of both N-HiTS and N-BEATS in generating the winter and spring flood hydrographs compared to the LSTM model across all selected storm events. Although, N-HiTS prediction slightly outperformed N-BEATS during winter prediction (3 to 5 January 2023). In this event, N-HiTS outperformed N-BEATS with a difference of 11.6 % in MAE and 20 % in RMSE. The N-HiTS slight outperformance (see Tables 3 and 4) is attributed to its unique structure that allows the model to discern and capture intricate patterns within the data. Specifically, N-HiTS predicted flooding events hierarchically using blocks specialized in different rainfall frequencies based on controlled signal projections, through expressiveness ratios, and interpolation of each block. The coefficients are then used to synthesize backcast through <inline-formula><mml:math id="M218" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mi>t</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mi>L</mml:mi><mml:mo>:</mml:mo><mml:mi>t</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula> and forecast (<inline-formula><mml:math id="M219" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>:</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi>H</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:math></inline-formula>) outputs of the block as a flood value. The coefficients were locally determined along the horizon, allowing N-HiTS to reconstruct nonstationary signals over time.</p>

<table-wrap id="T3" specific-use="star"><label>Table 3</label><caption><p id="d2e6334">The performance metrics for the Lower Dog River flood predictions with 1 h prediction horizon.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Model</oasis:entry>
         <oasis:entry colname="col2">Performance Metric</oasis:entry>
         <oasis:entry colname="col3">Event 1</oasis:entry>
         <oasis:entry colname="col4">Event 2</oasis:entry>
         <oasis:entry colname="col5">Event 3</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">N-HiTS</oasis:entry>
         <oasis:entry colname="col2">NSE</oasis:entry>
         <oasis:entry colname="col3">0.995</oasis:entry>
         <oasis:entry colname="col4">0.991</oasis:entry>
         <oasis:entry colname="col5">0.992</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Persistent-NSE</oasis:entry>
         <oasis:entry colname="col3">0.947</oasis:entry>
         <oasis:entry colname="col4">0.931</oasis:entry>
         <oasis:entry colname="col5">0.948</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">KGE</oasis:entry>
         <oasis:entry colname="col3">0.977</oasis:entry>
         <oasis:entry colname="col4">0.989</oasis:entry>
         <oasis:entry colname="col5">0.976</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">123.2</oasis:entry>
         <oasis:entry colname="col4">27.6</oasis:entry>
         <oasis:entry colname="col5">68.5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE</oasis:entry>
         <oasis:entry colname="col3">64.1</oasis:entry>
         <oasis:entry colname="col4">12.0</oasis:entry>
         <oasis:entry colname="col5">37.8</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">PFE</oasis:entry>
         <oasis:entry colname="col3">0.018</oasis:entry>
         <oasis:entry colname="col4">0.051</oasis:entry>
         <oasis:entry colname="col5">0.015</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">TPE (hours)</oasis:entry>
         <oasis:entry colname="col3">0</oasis:entry>
         <oasis:entry colname="col4">1</oasis:entry>
         <oasis:entry colname="col5">0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M220" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">96.9 %</oasis:entry>
         <oasis:entry colname="col4">100 %</oasis:entry>
         <oasis:entry colname="col5">93.5 %</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M221" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">0.27</oasis:entry>
         <oasis:entry colname="col4">0.40</oasis:entry>
         <oasis:entry colname="col5">0.33</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">N-BEATS</oasis:entry>
         <oasis:entry colname="col2">NSE</oasis:entry>
         <oasis:entry colname="col3">0.991</oasis:entry>
         <oasis:entry colname="col4">0.989</oasis:entry>
         <oasis:entry colname="col5">0.993</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Persistent-NSE</oasis:entry>
         <oasis:entry colname="col3">0.917</oasis:entry>
         <oasis:entry colname="col4">0.916</oasis:entry>
         <oasis:entry colname="col5">0.956</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">KGE</oasis:entry>
         <oasis:entry colname="col3">0.984</oasis:entry>
         <oasis:entry colname="col4">0.984</oasis:entry>
         <oasis:entry colname="col5">0.98</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">154.1</oasis:entry>
         <oasis:entry colname="col4">30.5</oasis:entry>
         <oasis:entry colname="col5">62.5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE</oasis:entry>
         <oasis:entry colname="col3">72.6</oasis:entry>
         <oasis:entry colname="col4">13.6</oasis:entry>
         <oasis:entry colname="col5">35.9</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">PFE</oasis:entry>
         <oasis:entry colname="col3">0.0005</oasis:entry>
         <oasis:entry colname="col4">0.031</oasis:entry>
         <oasis:entry colname="col5">0.0002</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">TPE (hours)</oasis:entry>
         <oasis:entry colname="col3">0</oasis:entry>
         <oasis:entry colname="col4">1</oasis:entry>
         <oasis:entry colname="col5">0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M222" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">87.8 %</oasis:entry>
         <oasis:entry colname="col4">100 %</oasis:entry>
         <oasis:entry colname="col5">90.3 %</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M223" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">0.17</oasis:entry>
         <oasis:entry colname="col4">0.23</oasis:entry>
         <oasis:entry colname="col5">0.24</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LSTM</oasis:entry>
         <oasis:entry colname="col2">NSE</oasis:entry>
         <oasis:entry colname="col3">0.756</oasis:entry>
         <oasis:entry colname="col4">0.983</oasis:entry>
         <oasis:entry colname="col5">0.988</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Persistent-NSE</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M224" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>1.44</oasis:entry>
         <oasis:entry colname="col4">0.871</oasis:entry>
         <oasis:entry colname="col5">0.929</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">KGE</oasis:entry>
         <oasis:entry colname="col3">0.765</oasis:entry>
         <oasis:entry colname="col4">0.978</oasis:entry>
         <oasis:entry colname="col5">0.971</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">841.1</oasis:entry>
         <oasis:entry colname="col4">37.9</oasis:entry>
         <oasis:entry colname="col5">79.5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE</oasis:entry>
         <oasis:entry colname="col3">369.4</oasis:entry>
         <oasis:entry colname="col4">18.6</oasis:entry>
         <oasis:entry colname="col5">42</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">PFE</oasis:entry>
         <oasis:entry colname="col3">0.258</oasis:entry>
         <oasis:entry colname="col4">0.036</oasis:entry>
         <oasis:entry colname="col5">0.016</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">TPE (hours)</oasis:entry>
         <oasis:entry colname="col3">1</oasis:entry>
         <oasis:entry colname="col4">0</oasis:entry>
         <oasis:entry colname="col5">0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M225" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">81.8 %</oasis:entry>
         <oasis:entry colname="col4">93.1 %</oasis:entry>
         <oasis:entry colname="col5">96.7 %</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M226" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">0.37</oasis:entry>
         <oasis:entry colname="col4">0.51</oasis:entry>
         <oasis:entry colname="col5">0.6</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

<table-wrap id="T4" specific-use="star"><label>Table 4</label><caption><p id="d2e6889">The performance metrics for the Killian Creek flood predictions with 1 h prediction horizon.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Model</oasis:entry>
         <oasis:entry colname="col2">Performance Metric</oasis:entry>
         <oasis:entry colname="col3">Event 4</oasis:entry>
         <oasis:entry colname="col4">Event 5</oasis:entry>
         <oasis:entry colname="col5">Event 6</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">N-HiTS</oasis:entry>
         <oasis:entry colname="col2">NSE</oasis:entry>
         <oasis:entry colname="col3">0.991</oasis:entry>
         <oasis:entry colname="col4">0.971</oasis:entry>
         <oasis:entry colname="col5">0.991</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Persistent-NSE</oasis:entry>
         <oasis:entry colname="col3">0.885</oasis:entry>
         <oasis:entry colname="col4">0.806</oasis:entry>
         <oasis:entry colname="col5">0.844</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">KGE</oasis:entry>
         <oasis:entry colname="col3">0.982</oasis:entry>
         <oasis:entry colname="col4">0.967</oasis:entry>
         <oasis:entry colname="col5">0.991</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">28.8</oasis:entry>
         <oasis:entry colname="col4">46.0</oasis:entry>
         <oasis:entry colname="col5">19.0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE</oasis:entry>
         <oasis:entry colname="col3">17.9</oasis:entry>
         <oasis:entry colname="col4">23.8</oasis:entry>
         <oasis:entry colname="col5">11.5</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">PFE</oasis:entry>
         <oasis:entry colname="col3">0.017</oasis:entry>
         <oasis:entry colname="col4">0.008</oasis:entry>
         <oasis:entry colname="col5">0.020</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">TPE (hours)</oasis:entry>
         <oasis:entry colname="col3">0</oasis:entry>
         <oasis:entry colname="col4">0</oasis:entry>
         <oasis:entry colname="col5">0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M227" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">92.6 %</oasis:entry>
         <oasis:entry colname="col4">90.9 %</oasis:entry>
         <oasis:entry colname="col5">100 %</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M228" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">0.39</oasis:entry>
         <oasis:entry colname="col4">0.48</oasis:entry>
         <oasis:entry colname="col5">0.45</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">N-BEATS</oasis:entry>
         <oasis:entry colname="col2">NSE</oasis:entry>
         <oasis:entry colname="col3">0.992</oasis:entry>
         <oasis:entry colname="col4">0.973</oasis:entry>
         <oasis:entry colname="col5">0.989</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Persistent-NSE</oasis:entry>
         <oasis:entry colname="col3">0.908</oasis:entry>
         <oasis:entry colname="col4">0.821</oasis:entry>
         <oasis:entry colname="col5">0.823</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">KGE</oasis:entry>
         <oasis:entry colname="col3">0.972</oasis:entry>
         <oasis:entry colname="col4">0.951</oasis:entry>
         <oasis:entry colname="col5">0.973</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">25.7</oasis:entry>
         <oasis:entry colname="col4">44.2</oasis:entry>
         <oasis:entry colname="col5">20.2</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE</oasis:entry>
         <oasis:entry colname="col3">18.3</oasis:entry>
         <oasis:entry colname="col4">25.9</oasis:entry>
         <oasis:entry colname="col5">14.0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">PFE</oasis:entry>
         <oasis:entry colname="col3">0.006</oasis:entry>
         <oasis:entry colname="col4">0.008</oasis:entry>
         <oasis:entry colname="col5">0.019</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">TPE (hours)</oasis:entry>
         <oasis:entry colname="col3">0</oasis:entry>
         <oasis:entry colname="col4">0</oasis:entry>
         <oasis:entry colname="col5">0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M229" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">96.3 %</oasis:entry>
         <oasis:entry colname="col4">86.3 %</oasis:entry>
         <oasis:entry colname="col5">96.9 %</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M230" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">0.43</oasis:entry>
         <oasis:entry colname="col4">0.53</oasis:entry>
         <oasis:entry colname="col5">0.43</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LSTM</oasis:entry>
         <oasis:entry colname="col2">NSE</oasis:entry>
         <oasis:entry colname="col3">0.952</oasis:entry>
         <oasis:entry colname="col4">0.892</oasis:entry>
         <oasis:entry colname="col5">0.935</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Persistent-NSE</oasis:entry>
         <oasis:entry colname="col3">0.4</oasis:entry>
         <oasis:entry colname="col4">0.27</oasis:entry>
         <oasis:entry colname="col5">0.087</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">KGE</oasis:entry>
         <oasis:entry colname="col3">0.92</oasis:entry>
         <oasis:entry colname="col4">0.899</oasis:entry>
         <oasis:entry colname="col5">0.901</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">65.7</oasis:entry>
         <oasis:entry colname="col4">89.2</oasis:entry>
         <oasis:entry colname="col5">50.3</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE</oasis:entry>
         <oasis:entry colname="col3">41.1</oasis:entry>
         <oasis:entry colname="col4">45</oasis:entry>
         <oasis:entry colname="col5">35.9</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">PFE</oasis:entry>
         <oasis:entry colname="col3">0.031</oasis:entry>
         <oasis:entry colname="col4">0.058</oasis:entry>
         <oasis:entry colname="col5">0.098</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">TPE (hours)</oasis:entry>
         <oasis:entry colname="col3">1</oasis:entry>
         <oasis:entry colname="col4">0</oasis:entry>
         <oasis:entry colname="col5">0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M231" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">70.4 %</oasis:entry>
         <oasis:entry colname="col4">72.73 %</oasis:entry>
         <oasis:entry colname="col5">81.82 %</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M232" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">0.66</oasis:entry>
         <oasis:entry colname="col4">0.7</oasis:entry>
         <oasis:entry colname="col5">0.65</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e7434">While the N-HiTS emerged as the most accurate in predicting flood hydrograph among the three models, its performance was somehow comparable with N-BEATS. The N-BEATS model exhibited good performance in two case studies. It consistently provided competitive results, demonstrating its capacity to effectively handle diverse storm events and deliver reliable predictions. N-BEATS has a generic and interpretable architecture depending on the blocks it uses. Interpretable configuration sequentially projects the signal into polynomials and harmonic basis to learn trend and seasonality components while generic configuration substitutes the polynomial and harmonic basis for identity basis and larger network's depth. In this study, we used interpretable architecture, as it regularizes its predictions through projections into harmonic and trend basis that is well-suited for flood prediction tasks. Using interpretable architecture, flood prediction was aggregated in a hierarchical fashion. This enabled the building of a very deep neural network with interpretable flood prediction outputs.</p>
      <p id="d2e7437">It is essential to underscore that, despite its strong performance, the N-BEATS model did not surpass the N-HiTS model in terms of NSE, Persistent-NSE, MAE, and RMSE for the Lower Dog River case study. Although both models showed almost the same KGE values. Notably, the N-BEATS model showcased superior results based on the PFE metric, signifying its exceptional capability in accurately predicting flood peaks. However, both N-HiTS and N-BEATS models overestimated the flood peak rate of Event 2 for the Lower Dog River watershed. This event, which occurred from 17 to 18 February 2023, was flashy, short, and intense proceeded by a prior small rainfall event (from 12 until 13 February) that minimized the rate of infiltration. This flash flood event caused by excessive rainfall in a short period of time (<inline-formula><mml:math id="M233" display="inline"><mml:mrow><mml:mi mathvariant="italic">&lt;</mml:mi><mml:mn mathvariant="normal">8</mml:mn></mml:mrow></mml:math></inline-formula> h) was challenging to predict for N-BEATS and N-HiTS models. In addition, predicting the magnitude of changes in the recession curve of the third event seems to be a challenge for both models. The specific part of the flood hydrograph after the precipitation event, where flood diminishes during a rainless is dominated by the release of runoff from shallow aquifer systems or natural storages. It seems both models showed a slight deficiency in capturing this portion of the hydrograph when the rainfall amount decreases over time in the Dog River gauging station.</p>
      <p id="d2e7450">Conversely, in the Killian Creek gauging station, the N-BEATS model almost emerged as the top performer in predicting the flood hydrograph based on NSE, Persistent-NSE, RMSE, and PFE performance metrics (see Tables 3 and 4). KGE values remained almost the same for both models. In addition, both N-BEATS and N-HiTS slightly overpredicted time to peak values for Event 5. This reflects the fact that when rainfall varies randomly around zero, it provides less to no information for the algorithms to learn the fluctuations and patterns in time series data. Both N-HiTS and N-BEATS provided comparable results for all events predicted in this study. N-HiTS builds upon N-BEATS by adding a MaxPool layer at each block. Each block consists of an MLP layer that learns how to produce coefficients for the backcast and forecast outputs. This subsamples the time series and allows each stack to focus on either short-term or long-term effects, depending on the pooling kernel size. Then, the partial predictions of each stack are combined using hierarchical interpolation. This ability enhances N-HiTS capabilities to produce drastically improved, interpretable, and computationally efficient long-horizon flood predictions.</p>
      <p id="d2e7453">In contrast, the performance of LSTM as a benchmark model lagged behind both N-HiTS and N-BEATS models for all events across two case studies. Despite its extensive applications in various hydrology domains, the LSTM model exhibited comparatively lower accuracy when tasked with predicting flood responses during different storm events. Focusing on NSE, Persistent-NSE. KGE, MAE, RMSE, and PFE metrics, it is noteworthy that all three models, across both case studies, consistently succeeded in capturing peak flow rates at the appropriate timing. All models demonstrated commendable results with respect to the TPE metric. In most scenarios, TPE revealed a value of 0, signifying that the models accurately pinpointed the peak flow rate precisely at the expected time. In some instances, TPE reached a value of 1, showing a deviation of one hour in predicting the peak flow time. This deviation is deemed acceptable, particularly considering the utilization of short, intense rainfall for our analysis.</p>
      <p id="d2e7456">Our investigation into the performance of the three distinct forecasting models yielded compelling results pertaining to their ability to generate 95 PPU, as quantified by the <inline-formula><mml:math id="M234" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor and <inline-formula><mml:math id="M235" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor. These factors serve as critical indicators for assessing the reliability and precision of the uncertainty bands produced by the MLE. Our findings demonstrated that the N-HiTS and N-BEATS models outperformed the LSTM model in mathematically defining uncertainty bands, in terms of <inline-formula><mml:math id="M236" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor metric. The <inline-formula><mml:math id="M237" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor, a crucial metric for evaluating the average width of the uncertainty band, consistently favored the N-HiTS and N-BEATS models over their counterparts. This finding was consistent across a diverse range of storm events. In addition, coupling MLE with the N-HiTS and N-BEATS models demonstrated superior performance in generating 95 PPU when assessed through the <inline-formula><mml:math id="M238" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor metric. The <inline-formula><mml:math id="M239" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor represents another vital aspect of uncertainty quantification, focusing on the precision of the uncertainty bands.</p>
      <p id="d2e7503">Figures 7 and 8 present graphical depictions of the predicted flood with 1 h prediction horizon and uncertainty assessment for each model as well as Flow Duration Curve (FDC) across two gauging stations.  As illustrated, the uncertainty bands skillfully bracketed most of the observational data, reflecting the fact that MLE was successful in reducing errors in flood prediction. FDC analysis also revealed that N-HiTS and N-BEATS models skillfully predicted the flood hydrograph, however, both models were particularly successful in predicting moderate to high flood events (1800–6000 and <inline-formula><mml:math id="M240" display="inline"><mml:mrow><mml:mi mathvariant="italic">&gt;</mml:mi><mml:mn mathvariant="normal">6000</mml:mn></mml:mrow></mml:math></inline-formula> cfs). In the FDC plots, the <inline-formula><mml:math id="M241" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>-axis denotes the exceedance probability, expressed as a percentage, while the <inline-formula><mml:math id="M242" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>-axis signifies flood in cubic feet per second. Notably, these plots reveal distinctive patterns in the performance of the N-HiTS, N-BEATS, and LSTM models.</p>

      <fig id="F7" specific-use="star"><label>Figure 7</label><caption><p id="d2e7532">95 PPU band and FDC plots of N-HiTS, N-BEATS, and LSTM models with 1 h prediction horizon for the three selected flooding events in the Lower Dog River gauging station.</p></caption>
        <graphic xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026-f07.png"/>

      </fig>

      <fig id="F8" specific-use="star"><label>Figure 8</label><caption><p id="d2e7543">95 PPU band and FDC plots of N-HiTS, N-BEATS, and LSTM models with 1 h prediction horizon for the three selected flooding events in the Killian Creek gauging station.</p></caption>
        <graphic xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026-f08.png"/>

      </fig>

      <p id="d2e7552">Within the lower exceedance probability range, particularly around the peak flow, the N-HiTS and N-BEATS models demonstrated a clear superiority over the LSTM model, closely aligning with the observed data. This observed trend is consistent when examining the corresponding hydrographs. Across all events, the flood hydrographs generated by N-HiTS and N-BEATS exhibited a closer resemblance to the observed data, particularly in the vicinity of the peak timing and rate, compared to the hydrographs produced by the LSTM model. These findings underscore the enhanced predictive accuracy and reliability of the N-HiTS and N-BEATS models, particularly in predicting moderate to high flood events as well as critical hydrograph features such as peak flow rate and timing. The alignment of model-generated FDCs and hydrographs with observed data in the proximity of peak flow further establishes the efficiency of N-HiTS and N-BEATS in accurately reproducing the dynamics of flood generation mechanisms across two headwater streams.</p>
      <p id="d2e7555">To evaluate robustness across lead times, we extended the analysis to 3 and 6 h prediction horizons. The results are presented in Figs. 9–12, and Tables 5 and 6. As expected, NSE and KGE decreased while the absolute errors increased with horizon for all models; however, N-HiTS and N-BEATS continued to outperform LSTM across both stations and events. At Killian Creek station, both N-HiTS and N-BEATS preserved their lead, yielding higher NSE and lower MAE/RMSE than LSTM, while at the Lower Dog River, N-BEATS remained slightly superior on the same metrics. KGE values stayed comparable between the two feed-forward models, and peak-focused metrics (PFE and TPE) indicated that both still captured peak magnitude and timing reliably, compared to LSTM. Uncertainty bands widened with horizon as expected, but the likelihood-based 95 PPU for N-HiTS and N-BEATS maintained tighter <inline-formula><mml:math id="M243" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factors and competitive <inline-formula><mml:math id="M244" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factors relative to LSTM, especially around moderate-to-high flows. Flow-duration diagnostics at multi-hour leads reinforced these findings, showing closer alignment of N-HiTS and N-BEATS to observations in the upper tail. Overall, the multi-horizon results corroborate the 1 h horizon results: N-HiTS and N-BEATS deliver more accurate and reliable flood forecasts than LSTM, and their relative strengths persist at 3 and 6 h ahead. For completeness, we also evaluated 12 and 24 h lead times. During these horizons, all models' performances declined sharply (NSE <inline-formula><mml:math id="M245" display="inline"><mml:mrow><mml:mi mathvariant="italic">&lt;</mml:mi><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula> across sites and events), so we restrict detailed reporting to 1–6 h where performance remains operationally meaningful.</p>

      <fig id="F9" specific-use="star"><label>Figure 9</label><caption><p id="d2e7585">95 PPU band and FDC plots of N-HiTS, N-BEATS, and LSTM models with 3 h prediction horizon for the three selected flooding events in the Lower Dog River gauging station.</p></caption>
        <graphic xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026-f09.png"/>

      </fig>

      <fig id="F10" specific-use="star"><label>Figure 10</label><caption><p id="d2e7596">95 PPU band and FDC plots of N-HiTS, N-BEATS, and LSTM models with 6 h prediction horizon for the three selected flooding events in the Lower Dog River gauging station.</p></caption>
        <graphic xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026-f10.png"/>

      </fig>

      <fig id="F11" specific-use="star"><label>Figure 11</label><caption><p id="d2e7607">95 PPU band and FDC plots of N-HiTS, N-BEATS, and LSTM models with 3 h prediction horizon for the three selected flooding events in the Killian Creek gauging station.</p></caption>
        <graphic xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026-f11.png"/>

      </fig>

      <fig id="F12" specific-use="star"><label>Figure 12</label><caption><p id="d2e7618">95 PPU band and FDC plots of N-HiTS, N-BEATS, and LSTM models with 6 h prediction horizon for the three selected flooding events in the Killian Creek gauging station.</p></caption>
        <graphic xlink:href="https://hess.copernicus.org/articles/30/371/2026/hess-30-371-2026-f12.png"/>

      </fig>

<table-wrap id="T5" specific-use="star"><label>Table 5</label><caption><p id="d2e7631">The performance metrics of the models with 3 h prediction horizon.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="8">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Model</oasis:entry>
         <oasis:entry colname="col2">Performance Metric</oasis:entry>
         <oasis:entry colname="col3">Event 1</oasis:entry>
         <oasis:entry colname="col4">Event 2</oasis:entry>
         <oasis:entry colname="col5">Event 3</oasis:entry>
         <oasis:entry colname="col6">Event 4</oasis:entry>
         <oasis:entry colname="col7">Event 5</oasis:entry>
         <oasis:entry colname="col8">Event 6</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">N-HiTS</oasis:entry>
         <oasis:entry colname="col2">NSE</oasis:entry>
         <oasis:entry colname="col3">0.91</oasis:entry>
         <oasis:entry colname="col4">0.86</oasis:entry>
         <oasis:entry colname="col5">0.58</oasis:entry>
         <oasis:entry colname="col6">0.83</oasis:entry>
         <oasis:entry colname="col7">0.81</oasis:entry>
         <oasis:entry colname="col8">0.89</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">KGE</oasis:entry>
         <oasis:entry colname="col3">0.92</oasis:entry>
         <oasis:entry colname="col4">0.92</oasis:entry>
         <oasis:entry colname="col5">0.74</oasis:entry>
         <oasis:entry colname="col6">0.85</oasis:entry>
         <oasis:entry colname="col7">0.85</oasis:entry>
         <oasis:entry colname="col8">0.88</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">506</oasis:entry>
         <oasis:entry colname="col4">107</oasis:entry>
         <oasis:entry colname="col5">485</oasis:entry>
         <oasis:entry colname="col6">122</oasis:entry>
         <oasis:entry colname="col7">119</oasis:entry>
         <oasis:entry colname="col8">65</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE</oasis:entry>
         <oasis:entry colname="col3">293</oasis:entry>
         <oasis:entry colname="col4">58</oasis:entry>
         <oasis:entry colname="col5">209</oasis:entry>
         <oasis:entry colname="col6">71</oasis:entry>
         <oasis:entry colname="col7">65</oasis:entry>
         <oasis:entry colname="col8">42</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">PFE</oasis:entry>
         <oasis:entry colname="col3">0.03</oasis:entry>
         <oasis:entry colname="col4">0.02</oasis:entry>
         <oasis:entry colname="col5">0.08</oasis:entry>
         <oasis:entry colname="col6">0.1</oasis:entry>
         <oasis:entry colname="col7">0.07</oasis:entry>
         <oasis:entry colname="col8">0.05</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">TPE (hours)</oasis:entry>
         <oasis:entry colname="col3">0</oasis:entry>
         <oasis:entry colname="col4">0</oasis:entry>
         <oasis:entry colname="col5">0</oasis:entry>
         <oasis:entry colname="col6">0</oasis:entry>
         <oasis:entry colname="col7">0</oasis:entry>
         <oasis:entry colname="col8">0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M246" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">97 %</oasis:entry>
         <oasis:entry colname="col4">100 %</oasis:entry>
         <oasis:entry colname="col5">93.5 %</oasis:entry>
         <oasis:entry colname="col6">85 %</oasis:entry>
         <oasis:entry colname="col7">72 %</oasis:entry>
         <oasis:entry colname="col8">88 %</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M247" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">0.8</oasis:entry>
         <oasis:entry colname="col4">1.3</oasis:entry>
         <oasis:entry colname="col5">0.75</oasis:entry>
         <oasis:entry colname="col6">0.99</oasis:entry>
         <oasis:entry colname="col7">0.92</oasis:entry>
         <oasis:entry colname="col8">1.14</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">N-BEATS</oasis:entry>
         <oasis:entry colname="col2">NSE</oasis:entry>
         <oasis:entry colname="col3">0.92</oasis:entry>
         <oasis:entry colname="col4">0.88</oasis:entry>
         <oasis:entry colname="col5">0.56</oasis:entry>
         <oasis:entry colname="col6">0.82</oasis:entry>
         <oasis:entry colname="col7">0.82</oasis:entry>
         <oasis:entry colname="col8">0.89</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">KGE</oasis:entry>
         <oasis:entry colname="col3">0.91</oasis:entry>
         <oasis:entry colname="col4">0.91</oasis:entry>
         <oasis:entry colname="col5">0.72</oasis:entry>
         <oasis:entry colname="col6">0.83</oasis:entry>
         <oasis:entry colname="col7">0.84</oasis:entry>
         <oasis:entry colname="col8">0.87</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">481</oasis:entry>
         <oasis:entry colname="col4">101</oasis:entry>
         <oasis:entry colname="col5">498</oasis:entry>
         <oasis:entry colname="col6">124</oasis:entry>
         <oasis:entry colname="col7">115</oasis:entry>
         <oasis:entry colname="col8">63</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE</oasis:entry>
         <oasis:entry colname="col3">241</oasis:entry>
         <oasis:entry colname="col4">48</oasis:entry>
         <oasis:entry colname="col5">207</oasis:entry>
         <oasis:entry colname="col6">67</oasis:entry>
         <oasis:entry colname="col7">58</oasis:entry>
         <oasis:entry colname="col8">33</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">PFE</oasis:entry>
         <oasis:entry colname="col3">0.04</oasis:entry>
         <oasis:entry colname="col4">0.02</oasis:entry>
         <oasis:entry colname="col5">0.12</oasis:entry>
         <oasis:entry colname="col6">0.006</oasis:entry>
         <oasis:entry colname="col7">0.02</oasis:entry>
         <oasis:entry colname="col8">0.002</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">TPE (hours)</oasis:entry>
         <oasis:entry colname="col3">1</oasis:entry>
         <oasis:entry colname="col4">0</oasis:entry>
         <oasis:entry colname="col5">2</oasis:entry>
         <oasis:entry colname="col6">0</oasis:entry>
         <oasis:entry colname="col7">0</oasis:entry>
         <oasis:entry colname="col8">0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M248" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">90.9 %</oasis:entry>
         <oasis:entry colname="col4">93 %</oasis:entry>
         <oasis:entry colname="col5">90.3 %</oasis:entry>
         <oasis:entry colname="col6">92 %</oasis:entry>
         <oasis:entry colname="col7">68 %</oasis:entry>
         <oasis:entry colname="col8">94 %</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M249" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">0.7</oasis:entry>
         <oasis:entry colname="col4">1.2</oasis:entry>
         <oasis:entry colname="col5">0.74</oasis:entry>
         <oasis:entry colname="col6">0.78</oasis:entry>
         <oasis:entry colname="col7">1.1</oasis:entry>
         <oasis:entry colname="col8">0.87</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LSTM</oasis:entry>
         <oasis:entry colname="col2">NSE</oasis:entry>
         <oasis:entry colname="col3">0.7</oasis:entry>
         <oasis:entry colname="col4">0.77</oasis:entry>
         <oasis:entry colname="col5">0.42</oasis:entry>
         <oasis:entry colname="col6">0.82</oasis:entry>
         <oasis:entry colname="col7">0.51</oasis:entry>
         <oasis:entry colname="col8">0.55</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">KGE</oasis:entry>
         <oasis:entry colname="col3">0.765</oasis:entry>
         <oasis:entry colname="col4">0.87</oasis:entry>
         <oasis:entry colname="col5">0.65</oasis:entry>
         <oasis:entry colname="col6">0.79</oasis:entry>
         <oasis:entry colname="col7">0.64</oasis:entry>
         <oasis:entry colname="col8">0.69</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">928</oasis:entry>
         <oasis:entry colname="col4">139</oasis:entry>
         <oasis:entry colname="col5">575</oasis:entry>
         <oasis:entry colname="col6">125</oasis:entry>
         <oasis:entry colname="col7">190</oasis:entry>
         <oasis:entry colname="col8">133</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE</oasis:entry>
         <oasis:entry colname="col3">487</oasis:entry>
         <oasis:entry colname="col4">80</oasis:entry>
         <oasis:entry colname="col5">296</oasis:entry>
         <oasis:entry colname="col6">85</oasis:entry>
         <oasis:entry colname="col7">118</oasis:entry>
         <oasis:entry colname="col8">87</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">PFE</oasis:entry>
         <oasis:entry colname="col3">0.12</oasis:entry>
         <oasis:entry colname="col4">0.03</oasis:entry>
         <oasis:entry colname="col5">0.16</oasis:entry>
         <oasis:entry colname="col6">0.16</oasis:entry>
         <oasis:entry colname="col7">0.44</oasis:entry>
         <oasis:entry colname="col8">0.08</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">TPE (hours)</oasis:entry>
         <oasis:entry colname="col3">2</oasis:entry>
         <oasis:entry colname="col4">1</oasis:entry>
         <oasis:entry colname="col5">2</oasis:entry>
         <oasis:entry colname="col6">2</oasis:entry>
         <oasis:entry colname="col7">1</oasis:entry>
         <oasis:entry colname="col8">2</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M250" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">75.8 %</oasis:entry>
         <oasis:entry colname="col4">96 %</oasis:entry>
         <oasis:entry colname="col5">83.9 %</oasis:entry>
         <oasis:entry colname="col6">100 %</oasis:entry>
         <oasis:entry colname="col7">90 %</oasis:entry>
         <oasis:entry colname="col8">94 %</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M251" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">1.15</oasis:entry>
         <oasis:entry colname="col4">1.88</oasis:entry>
         <oasis:entry colname="col5">1.66</oasis:entry>
         <oasis:entry colname="col6">2.8</oasis:entry>
         <oasis:entry colname="col7">3.7</oasis:entry>
         <oasis:entry colname="col8">2.4</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

<table-wrap id="T6" specific-use="star"><label>Table 6</label><caption><p id="d2e8384">The performance metrics of the models with 6 h prediction horizon.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="8">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Model</oasis:entry>
         <oasis:entry colname="col2">Performance Metric</oasis:entry>
         <oasis:entry colname="col3">Event 1</oasis:entry>
         <oasis:entry colname="col4">Event 2</oasis:entry>
         <oasis:entry colname="col5">Event 3</oasis:entry>
         <oasis:entry colname="col6">Event 4</oasis:entry>
         <oasis:entry colname="col7">Event 5</oasis:entry>
         <oasis:entry colname="col8">Event 6</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">N-HiTS</oasis:entry>
         <oasis:entry colname="col2">NSE</oasis:entry>
         <oasis:entry colname="col3">0.82</oasis:entry>
         <oasis:entry colname="col4">0.58</oasis:entry>
         <oasis:entry colname="col5">0.51</oasis:entry>
         <oasis:entry colname="col6">0.6</oasis:entry>
         <oasis:entry colname="col7">0.7</oasis:entry>
         <oasis:entry colname="col8">0.52</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">KGE</oasis:entry>
         <oasis:entry colname="col3">0.76</oasis:entry>
         <oasis:entry colname="col4">0.68</oasis:entry>
         <oasis:entry colname="col5">0.67</oasis:entry>
         <oasis:entry colname="col6">0.74</oasis:entry>
         <oasis:entry colname="col7">0.78</oasis:entry>
         <oasis:entry colname="col8">0.67</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">708</oasis:entry>
         <oasis:entry colname="col4">189</oasis:entry>
         <oasis:entry colname="col5">525</oasis:entry>
         <oasis:entry colname="col6">188</oasis:entry>
         <oasis:entry colname="col7">147</oasis:entry>
         <oasis:entry colname="col8">137</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE</oasis:entry>
         <oasis:entry colname="col3">423</oasis:entry>
         <oasis:entry colname="col4">90</oasis:entry>
         <oasis:entry colname="col5">257</oasis:entry>
         <oasis:entry colname="col6">110</oasis:entry>
         <oasis:entry colname="col7">90</oasis:entry>
         <oasis:entry colname="col8">77</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">PFE</oasis:entry>
         <oasis:entry colname="col3">0.35</oasis:entry>
         <oasis:entry colname="col4">0.29</oasis:entry>
         <oasis:entry colname="col5">0.12</oasis:entry>
         <oasis:entry colname="col6">0.03</oasis:entry>
         <oasis:entry colname="col7">0.2</oasis:entry>
         <oasis:entry colname="col8">0.1</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">TPE (hours)</oasis:entry>
         <oasis:entry colname="col3">2</oasis:entry>
         <oasis:entry colname="col4">3</oasis:entry>
         <oasis:entry colname="col5">0</oasis:entry>
         <oasis:entry colname="col6">0</oasis:entry>
         <oasis:entry colname="col7">3</oasis:entry>
         <oasis:entry colname="col8">3</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M252" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">70 %</oasis:entry>
         <oasis:entry colname="col4">96 %</oasis:entry>
         <oasis:entry colname="col5">87 %</oasis:entry>
         <oasis:entry colname="col6">92 %</oasis:entry>
         <oasis:entry colname="col7">82 %</oasis:entry>
         <oasis:entry colname="col8">87 %</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M253" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">0.71</oasis:entry>
         <oasis:entry colname="col4">1.1</oasis:entry>
         <oasis:entry colname="col5">1.1</oasis:entry>
         <oasis:entry colname="col6">1.8</oasis:entry>
         <oasis:entry colname="col7">1.15</oasis:entry>
         <oasis:entry colname="col8">1.2</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">N-BEATS</oasis:entry>
         <oasis:entry colname="col2">NSE</oasis:entry>
         <oasis:entry colname="col3">0.94</oasis:entry>
         <oasis:entry colname="col4">0.85</oasis:entry>
         <oasis:entry colname="col5">0.59</oasis:entry>
         <oasis:entry colname="col6">0.33</oasis:entry>
         <oasis:entry colname="col7">0.82</oasis:entry>
         <oasis:entry colname="col8">0.59</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">KGE</oasis:entry>
         <oasis:entry colname="col3">0.83</oasis:entry>
         <oasis:entry colname="col4">0.82</oasis:entry>
         <oasis:entry colname="col5">0.73</oasis:entry>
         <oasis:entry colname="col6">0.55</oasis:entry>
         <oasis:entry colname="col7">0.79</oasis:entry>
         <oasis:entry colname="col8">0.67</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">386</oasis:entry>
         <oasis:entry colname="col4">112</oasis:entry>
         <oasis:entry colname="col5">481</oasis:entry>
         <oasis:entry colname="col6">244</oasis:entry>
         <oasis:entry colname="col7">115</oasis:entry>
         <oasis:entry colname="col8">126</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE</oasis:entry>
         <oasis:entry colname="col3">259</oasis:entry>
         <oasis:entry colname="col4">58</oasis:entry>
         <oasis:entry colname="col5">181</oasis:entry>
         <oasis:entry colname="col6">131</oasis:entry>
         <oasis:entry colname="col7">56</oasis:entry>
         <oasis:entry colname="col8">74</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">PFE</oasis:entry>
         <oasis:entry colname="col3">0.16</oasis:entry>
         <oasis:entry colname="col4">0.23</oasis:entry>
         <oasis:entry colname="col5">0.02</oasis:entry>
         <oasis:entry colname="col6">0.03</oasis:entry>
         <oasis:entry colname="col7">0.03</oasis:entry>
         <oasis:entry colname="col8">0.12</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">TPE (hours)</oasis:entry>
         <oasis:entry colname="col3">0</oasis:entry>
         <oasis:entry colname="col4">3</oasis:entry>
         <oasis:entry colname="col5">0</oasis:entry>
         <oasis:entry colname="col6">0</oasis:entry>
         <oasis:entry colname="col7">0</oasis:entry>
         <oasis:entry colname="col8">3</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M254" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">100 %</oasis:entry>
         <oasis:entry colname="col4">86 %</oasis:entry>
         <oasis:entry colname="col5">90.3 %</oasis:entry>
         <oasis:entry colname="col6">85 %</oasis:entry>
         <oasis:entry colname="col7">77 %</oasis:entry>
         <oasis:entry colname="col8">78 %</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M255" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">1.8</oasis:entry>
         <oasis:entry colname="col4">2.3</oasis:entry>
         <oasis:entry colname="col5">1.1</oasis:entry>
         <oasis:entry colname="col6">1.13</oasis:entry>
         <oasis:entry colname="col7">3.3</oasis:entry>
         <oasis:entry colname="col8">1.2</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LSTM</oasis:entry>
         <oasis:entry colname="col2">NSE</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M256" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.35</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M257" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.39</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M258" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.22</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M259" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.17</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M260" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.2</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M261" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.2</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">KGE</oasis:entry>
         <oasis:entry colname="col3">0.3</oasis:entry>
         <oasis:entry colname="col4">0.05</oasis:entry>
         <oasis:entry colname="col5">0.18</oasis:entry>
         <oasis:entry colname="col6">0.34</oasis:entry>
         <oasis:entry colname="col7">0.33</oasis:entry>
         <oasis:entry colname="col8">0.4</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3">1984</oasis:entry>
         <oasis:entry colname="col4">348</oasis:entry>
         <oasis:entry colname="col5">834</oasis:entry>
         <oasis:entry colname="col6">324</oasis:entry>
         <oasis:entry colname="col7">300</oasis:entry>
         <oasis:entry colname="col8">220</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">MAE</oasis:entry>
         <oasis:entry colname="col3">1304</oasis:entry>
         <oasis:entry colname="col4">192</oasis:entry>
         <oasis:entry colname="col5">468</oasis:entry>
         <oasis:entry colname="col6">234</oasis:entry>
         <oasis:entry colname="col7">201</oasis:entry>
         <oasis:entry colname="col8">174</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">PFE</oasis:entry>
         <oasis:entry colname="col3">0.24</oasis:entry>
         <oasis:entry colname="col4">0.36</oasis:entry>
         <oasis:entry colname="col5">0.42</oasis:entry>
         <oasis:entry colname="col6">0. 6</oasis:entry>
         <oasis:entry colname="col7">0.44</oasis:entry>
         <oasis:entry colname="col8">0.42</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">TPE (hours)</oasis:entry>
         <oasis:entry colname="col3">3</oasis:entry>
         <oasis:entry colname="col4">4</oasis:entry>
         <oasis:entry colname="col5">3</oasis:entry>
         <oasis:entry colname="col6">0</oasis:entry>
         <oasis:entry colname="col7">2</oasis:entry>
         <oasis:entry colname="col8">2</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M262" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">36 %</oasis:entry>
         <oasis:entry colname="col4">79 %</oasis:entry>
         <oasis:entry colname="col5">90.3 %</oasis:entry>
         <oasis:entry colname="col6">85 %</oasis:entry>
         <oasis:entry colname="col7">86 %</oasis:entry>
         <oasis:entry colname="col8">63 %</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M263" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor</oasis:entry>
         <oasis:entry colname="col3">1.8</oasis:entry>
         <oasis:entry colname="col4">1.9</oasis:entry>
         <oasis:entry colname="col5">2.16</oasis:entry>
         <oasis:entry colname="col6">1.6</oasis:entry>
         <oasis:entry colname="col7">3.7</oasis:entry>
         <oasis:entry colname="col8">1.6</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e9170">To probe cross-catchment generalizability, we trained a single “regional” model by pooling Lower Dog River and Killian Creek, preserving per-site temporal splits and fitting a global scaler only on the pooled training portion to avoid leakage; evaluation remained strictly per site. Relative to per-site training, pooled fitting produced a small accuracy drop for N-HiTS and N-BEATS (<inline-formula><mml:math id="M264" display="inline"><mml:mo lspace="0mm">∼</mml:mo></mml:math></inline-formula> 2 % to 3 %). LSTM showed mixed performance to pooling, it improved in some storm events but degraded in others, so that, when averaged across both stations and storm events, LSTM's regional performance was effectively unchanged relative to the per-site training. Despite that, the regional N-HiTS/N-BEATS matched the accuracy of the best per-site models within the variability observed across storm events and, importantly, consistently surpassed LSTM at both basins. Mechanistically, N-HiTS's multi-rate pooling and hierarchical interpolation, and N-BEATS's trend/seasonality basis projection, act as catchment-invariant feature extractors that support parameter sharing across stations.</p>
      <p id="d2e9180">In our investigation, we conducted an analysis to assess the impact of varying input sizes on the performance of the N-HiTS, as the best model. We implemented four different durations as input sizes to observe the corresponding differences in modeling performance. Notably, one of the key metrics affected by changes in input size was 95 PPU, which exhibited a general decrease with increasing input size. As detailed in Table 7, we observed a discernible trend in the <inline-formula><mml:math id="M265" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor of the N-HiTS model as the input size was increased. Specifically, there was a decline in the <inline-formula><mml:math id="M266" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor as the input size expanded. This trend underscores the influence of input size on model performance, particularly in terms of 95 PPU band and accuracy.</p>

<table-wrap id="T7"><label>Table 7</label><caption><p id="d2e9200">N-HiTS's <inline-formula><mml:math id="M267" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor results for three storm events in each case study, using 1, 6, 12, and 24 h input size in training.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Input Size</oasis:entry>
         <oasis:entry colname="col2">1 h</oasis:entry>
         <oasis:entry colname="col3">6 h</oasis:entry>
         <oasis:entry colname="col4">12 h</oasis:entry>
         <oasis:entry colname="col5">24 h</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Dog River, GA – Event 1</oasis:entry>
         <oasis:entry colname="col2">0.314</oasis:entry>
         <oasis:entry colname="col3">0.337</oasis:entry>
         <oasis:entry colname="col4">0.29</oasis:entry>
         <oasis:entry colname="col5">0.272</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Dog River, GA – Event 2</oasis:entry>
         <oasis:entry colname="col2">0.35</oasis:entry>
         <oasis:entry colname="col3">0.413</oasis:entry>
         <oasis:entry colname="col4">0.403</oasis:entry>
         <oasis:entry colname="col5">0.402</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Dog River, GA – Event 3</oasis:entry>
         <oasis:entry colname="col2">0.358</oasis:entry>
         <oasis:entry colname="col3">0.459</oasis:entry>
         <oasis:entry colname="col4">0.374</oasis:entry>
         <oasis:entry colname="col5">0.336</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Killian Creek, NC – Event 4</oasis:entry>
         <oasis:entry colname="col2">0.491</oasis:entry>
         <oasis:entry colname="col3">0.422</oasis:entry>
         <oasis:entry colname="col4">0.426</oasis:entry>
         <oasis:entry colname="col5">0.388</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Killian Creek, NC – Event 5</oasis:entry>
         <oasis:entry colname="col2">0.584</oasis:entry>
         <oasis:entry colname="col3">0.503</oasis:entry>
         <oasis:entry colname="col4">0.557</oasis:entry>
         <oasis:entry colname="col5">0.483</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Killian Creek, NC – Event 6</oasis:entry>
         <oasis:entry colname="col2">0.482</oasis:entry>
         <oasis:entry colname="col3">0.42</oasis:entry>
         <oasis:entry colname="col4">0.446</oasis:entry>
         <oasis:entry colname="col5">0.454</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e9360">Overall, uncertainty analysis revealed that coupling MLE with N-HiTS and N-BEATS models demonstrated superior performance in generating 95 PPU, effectively reducing errors in flood prediction. The MLE approach was more successful in reducing 95 PPU bands of N-HiTS and N-BEATS models compared to the LSTM, as indicated by the <inline-formula><mml:math id="M268" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor and <inline-formula><mml:math id="M269" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>-Factor. The N-BEATS model demonstrated a narrower uncertainty band (lower <inline-formula><mml:math id="M270" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor value), while the N-HiTS model provided higher precision. Furthermore, incorporating data with various sizes into the N-HiTS model led to a narrower 95 PPU and an improvement in the <inline-formula><mml:math id="M271" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>-Factor, highlighting the significance of input size in enhancing model accuracy and reducing uncertainty.</p>
</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><title>Sensitivity Analysis</title>
      <p id="d2e9399">In this study, we conducted a comprehensive sensitivity analysis of the N-HiTS, N-BEATS, and LSTM models to evaluate their responsiveness to meteorological variables, specifically precipitation, humidity, and temperature. The goal was to assess how the omission of input features impacts the overall modeling performance compared to their full-variable counterparts.</p>
      <p id="d2e9402">To execute this analysis, we systematically trained each model by excluding meteorological variables one or more at a time, subsequently evaluating their predictive performance using the entire testing dataset. The results of our analysis indicated that N-HiTS and N-BEATS models exhibited minimal sensitivity to meteorological variables, as evidenced by the negligible impact on their performance metric (i.e., NSE, Persistent-NSE, KGE, RMSE, and MAE) upon input feature exclusion.</p>
      <p id="d2e9405">Notably, as shown in Table 8, the performance of the N-HiTS model displayed a marginal deviation under variable omission, while the N-BEATS model exhibited consistent performance irrespective of the inclusion or exclusion of meteorological variables. The structure of this algorithm is based on backward and forward residual links for univariate time series point forecasting which does not take into account other input features in the prediction task.  These findings suggest that the predictive capabilities of N-HiTS and N-BEATS models predominantly rely on historical flood data. Both models demonstrated strong performance even without incorporating precipitation, temperature, or humidity data, underscoring their ability in flood prediction in the absence of specific meteorological inputs. This capability underscores the robustness of the N-HiTS and N-BEATS models, positioning them as viable tools and perhaps appropriate for real-time flood forecasting tasks where direct meteorological data may be limited or unavailable.</p>

<table-wrap id="T8" specific-use="star"><label>Table 8</label><caption><p id="d2e9412">Performance metrics' values for N-HiTS, N-BEATS, and LSTM models by excluding meteorological variables one or more at a time.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Model</oasis:entry>
         <oasis:entry colname="col2">Excluded Variables</oasis:entry>
         <oasis:entry colname="col3">NSE</oasis:entry>
         <oasis:entry colname="col4">Persistent-NSE</oasis:entry>
         <oasis:entry colname="col5">KGE</oasis:entry>
         <oasis:entry colname="col6">RMSE</oasis:entry>
         <oasis:entry colname="col7">MAE</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">N-HiTS</oasis:entry>
         <oasis:entry colname="col2">Using all variables</oasis:entry>
         <oasis:entry colname="col3">0.996</oasis:entry>
         <oasis:entry colname="col4">0.92</oasis:entry>
         <oasis:entry colname="col5">0.988</oasis:entry>
         <oasis:entry colname="col6">22.66</oasis:entry>
         <oasis:entry colname="col7">4.19</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Without Precipitation</oasis:entry>
         <oasis:entry colname="col3">0.993</oasis:entry>
         <oasis:entry colname="col4">0.91</oasis:entry>
         <oasis:entry colname="col5">0.97</oasis:entry>
         <oasis:entry colname="col6">23.28</oasis:entry>
         <oasis:entry colname="col7">4.31</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Without Humidity</oasis:entry>
         <oasis:entry colname="col3">0.995</oasis:entry>
         <oasis:entry colname="col4">0.914</oasis:entry>
         <oasis:entry colname="col5">0.976</oasis:entry>
         <oasis:entry colname="col6">22.87</oasis:entry>
         <oasis:entry colname="col7">4.22</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Without Temperature</oasis:entry>
         <oasis:entry colname="col3">0.995</oasis:entry>
         <oasis:entry colname="col4">0.921</oasis:entry>
         <oasis:entry colname="col5">0.985</oasis:entry>
         <oasis:entry colname="col6">22.43</oasis:entry>
         <oasis:entry colname="col7">4.14</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Discharge only prediction</oasis:entry>
         <oasis:entry colname="col3">0.993</oasis:entry>
         <oasis:entry colname="col4">0.911</oasis:entry>
         <oasis:entry colname="col5">0.972</oasis:entry>
         <oasis:entry colname="col6">23.21</oasis:entry>
         <oasis:entry colname="col7">4.29</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">N-BEATS</oasis:entry>
         <oasis:entry colname="col2">Using all variables</oasis:entry>
         <oasis:entry colname="col3">0.994</oasis:entry>
         <oasis:entry colname="col4">0.978</oasis:entry>
         <oasis:entry colname="col5">0.992</oasis:entry>
         <oasis:entry colname="col6">11.80</oasis:entry>
         <oasis:entry colname="col7">2.13</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Without Precipitation</oasis:entry>
         <oasis:entry colname="col3">0.994</oasis:entry>
         <oasis:entry colname="col4">0.978</oasis:entry>
         <oasis:entry colname="col5">0.991</oasis:entry>
         <oasis:entry colname="col6">11.86</oasis:entry>
         <oasis:entry colname="col7">2.17</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Without Humidity</oasis:entry>
         <oasis:entry colname="col3">0.994</oasis:entry>
         <oasis:entry colname="col4">0.978</oasis:entry>
         <oasis:entry colname="col5">0.991</oasis:entry>
         <oasis:entry colname="col6">11.81</oasis:entry>
         <oasis:entry colname="col7">2.16</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Without Temperature</oasis:entry>
         <oasis:entry colname="col3">0.994</oasis:entry>
         <oasis:entry colname="col4">0.978</oasis:entry>
         <oasis:entry colname="col5">0.991</oasis:entry>
         <oasis:entry colname="col6">11.82</oasis:entry>
         <oasis:entry colname="col7">2.16</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Discharge only prediction</oasis:entry>
         <oasis:entry colname="col3">0.994</oasis:entry>
         <oasis:entry colname="col4">0.978</oasis:entry>
         <oasis:entry colname="col5">0.991</oasis:entry>
         <oasis:entry colname="col6">11.96</oasis:entry>
         <oasis:entry colname="col7">2.17</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LSTM</oasis:entry>
         <oasis:entry colname="col2">Using all variables</oasis:entry>
         <oasis:entry colname="col3">0.992</oasis:entry>
         <oasis:entry colname="col4">0.865</oasis:entry>
         <oasis:entry colname="col5">0.926</oasis:entry>
         <oasis:entry colname="col6">29.52</oasis:entry>
         <oasis:entry colname="col7">8.15</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Without Precipitation</oasis:entry>
         <oasis:entry colname="col3">0.979</oasis:entry>
         <oasis:entry colname="col4">0.665</oasis:entry>
         <oasis:entry colname="col5">0.892</oasis:entry>
         <oasis:entry colname="col6">39.46</oasis:entry>
         <oasis:entry colname="col7">19.83</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Without Humidity</oasis:entry>
         <oasis:entry colname="col3">0.991</oasis:entry>
         <oasis:entry colname="col4">0.843</oasis:entry>
         <oasis:entry colname="col5">0.925</oasis:entry>
         <oasis:entry colname="col6">31.73</oasis:entry>
         <oasis:entry colname="col7">9.15</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Without Temperature</oasis:entry>
         <oasis:entry colname="col3">0.983</oasis:entry>
         <oasis:entry colname="col4">0.628</oasis:entry>
         <oasis:entry colname="col5">0.872</oasis:entry>
         <oasis:entry colname="col6">48.95</oasis:entry>
         <oasis:entry colname="col7">11.49</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Discharge only prediction</oasis:entry>
         <oasis:entry colname="col3">0.976</oasis:entry>
         <oasis:entry colname="col4">0.576</oasis:entry>
         <oasis:entry colname="col5">0.692</oasis:entry>
         <oasis:entry colname="col6">52.28</oasis:entry>
         <oasis:entry colname="col7">33.5</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S3.SS5">
  <label>3.5</label><title>Computational Efficiency</title>
      <p id="d2e9840">The computational efficiency of the N-HiTS, N-BEATS, and LSTM models, as well as a comparative analysis, is presented in Table 9. The study encompassed the entire process of training and predicting over the testing period, employing the optimized hyperparameters as previously described. Regarding the training time, it is noteworthy that the LSTM model exhibited the quickest performance. Specifically, LSTM demonstrated a training time that was 71 % faster than N-HiTS and 93 % faster than N-BEATS in the Lower Dog River watershed, while it was respectively,126 % and 118 % faster than N-HiTS and N-BEATS in the Upper Dutchmans Creek, over training dataset. This is because LSTM has simple architecture compared to the N-BEATS and N-HiTS and does not require multivariate features, hierarchical interpolation, and multi-rate data sampling. Perhaps, this outcome underscores the computational advantage of LSTM over other algorithms.</p>

<table-wrap id="T9" specific-use="star"><label>Table 9</label><caption><p id="d2e9846">Computational costs of N-HiTS, N-BEATS, and LSTM models in the Dog River and Killian Creek gauging stations.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right" colsep="1"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" namest="col2" nameend="col3" align="center" colsep="1">Training Time over Train Datasets (seconds)  </oasis:entry>
         <oasis:entry rowsep="1" namest="col4" nameend="col5" align="center">Predicting Time over Test Datasets (seconds) </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Model</oasis:entry>
         <oasis:entry colname="col2">Lower Dog River</oasis:entry>
         <oasis:entry colname="col3">Upper Dutchmans Creek</oasis:entry>
         <oasis:entry colname="col4">Lower Dog River</oasis:entry>
         <oasis:entry colname="col5">Upper Dutchmans Creek</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">N-HiTS</oasis:entry>
         <oasis:entry colname="col2">256.032</oasis:entry>
         <oasis:entry colname="col3">374.569</oasis:entry>
         <oasis:entry colname="col4">1533.029</oasis:entry>
         <oasis:entry colname="col5">1205.526</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">N-BEATS</oasis:entry>
         <oasis:entry colname="col2">288.511</oasis:entry>
         <oasis:entry colname="col3">361.599</oasis:entry>
         <oasis:entry colname="col4">2028.068</oasis:entry>
         <oasis:entry colname="col5">1482.305</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LSTM</oasis:entry>
         <oasis:entry colname="col2">149.173</oasis:entry>
         <oasis:entry colname="col3">165.827</oasis:entry>
         <oasis:entry colname="col4">2046.140</oasis:entry>
         <oasis:entry colname="col5">1792.444</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e9954">Conversely, during the testing period, the N-HiTS model emerged as the fastest and delivered the most efficient results in comparison to the other models. Notably, N-HiTS displayed a predicted time that was 33 % faster than LSTM and 32 % faster than N-BEATS. This finding highlights the computational efficiency of the N-HiTS model in the context of predicting processes. Our experiments unveiled an interesting contrast in the computational performance of these models. While LSTM excelled in terms of training time, it lagged behind when it came to the testing period.</p>
      <p id="d2e9959">In the grand scheme of computational efficiency, model accuracy, and uncertainty analysis results, it becomes evident that the superiority of the N-HiTS and N-BEATS models in terms of accuracy and uncertainty analysis holds paramount importance. This significance is accentuated by the critical nature of flood prediction, where precision and certainty are pivotal. Therefore, computational efficiency must be viewed in the context of the broader objectives, with the accuracy and reliability of flood predictions taking precedence in ensuring the safety and preparedness of the affected regions.</p>
</sec>
</sec>
<sec id="Ch1.S4" sec-type="conclusions">
  <label>4</label><title>Conclusions</title>
      <p id="d2e9972">This study examined multiple NN algorithms for flood prediction. We selected two headwater streams with minimal human impacts to understand how NN approaches can capture flood magnitude and timing for these natural systems. In conclusion, our study represents a pioneering effort in exploring and advancing the application of NN algorithms, specifically the N-HiTS and N-BEATS models, in the field of flood prediction. In our case studies, both N-HiTS and N-BEATS models achieved state-of-the-art results, outperforming LSTM as a benchmark model, particularly in one-hour prediction. While a one-hour lead time may seem brief, it is highly significant for accurate flash flood prediction particularly in an area with a proximity to metropolitan cities, where rapid response is critical. These benchmarking results are arguably a pivotal part of this research. However, the N-BEATS model slightly emerged as a powerful and interpretable tool for flood prediction in most selected events.</p>
      <p id="d2e9975">This study focused on short-lead, operational forecasting at gauged sites, using historical discharge to deliver robust, low-latency updates. While the evaluation is limited to two Southeastern U.S. basins, the architecture (e.g., N-HiTS) is flexible and can incorporate additional covariates and catchment attributes. Extending the approach to ungauged or other basins is feasible through multi-basin training and transfer learning or few-shot adaptation when even brief warm-up records are available. These extensions represent promising directions for future work to assess geographic transferability under the same operational assumptions.</p>
      <p id="d2e9978">In addition, the results of the experiments described above demonstrated that N-HiTS multi-rate input sampling and hierarchical interpolation along with N-BEATS interpretable configuration are effective in learning location-specific runoff generation behaviors. Both algorithms with an MLP-based deep neural architecture with backward and forward residual links can sequentially project the data signal into polynomials and harmonic basis needed to predict intense storm behaviors with varied magnitudes. The innovation in this study, besides benchmarking the LSTM model for headwater streams, was to tackle volatility and memory complexity challenges, by locally specializing flood sequential predictions into the data signal's frequencies with interpretability, and hierarchical interpolation and pooling. Both N-HiTS and N-BEATS models offered similar performance as compared with the LSTM but also offered a level of interpretability about how the model learns to differentiate aspects of complex watershed-specific behaviors via data. The interpretability of N-HiTS and N-BEATS arises directly from their model architecture.</p>
      <p id="d2e9981">In the interpretable N-BEATS framework, forecasts are decomposed into trend and seasonality stacks, each represented by explicit basis coefficients that reveal how different temporal patterns contribute to the prediction. Similarly, N-HiTS achieves interpretability by aggregating contributions across multiple distinct time scales, allowing insight into the temporal dynamics driving each forecast. N-HiTS aims to enhance the accuracy of long-term time-series forecasts through hierarchical interpolation and multi-scale data sampling, allowing it to focus on different data patterns, which prioritizes features essential to understand flood magnitudes. N-BEATS leverages interpretable configurations with trend and seasonality projections, enabling it to decompose time series data into intuitive components. N-BEATS interpretable architecture is recommended for scarce data settings (such as flooding event), as it regularizes its predictions through projections onto harmonic and trend basis.</p>
      <p id="d2e9985">These approaches improve model transparency by allowing understanding of how each part of the model contributes to the final prediction, particularly when applied to complex flood patterns. Both models also support multivariate series (and covariates) by flattening the model inputs to a 1-D series and reshaping the outputs to a tensor of appropriate dimensions. This approach provides flexibility to handle arbitrary numbers of features. Like LSTM, both N-HiTS and N-BEATS models support producing probabilistic predictions by specifying a likelihood objective. In terms of sensitivity analysis, both N-HiTS and N-BEATS maintain consistent performance even when trained without specific meteorological input.</p>
      <p id="d2e9988">Although, during some flashy floods, the models encountered challenges in capturing the peak flows and the dynamics of the recession curve, which is directly related to groundwater contribution to flood hydrograph, both models were technically insensitive to rainfall data as an input variable. This suggests the fact that both algorithms can learn patterns in discharge data without requiring meteorological input. This ability underscores these models' robustness in generating accurate predictions using historical flood data alone, making them valuable tools for flood prediction, especially in data-poor watersheds or even for real-time flood prediction when near real-time meteorological inputs are limited or unavailable. In terms of computational efficiency, both N-HiTS and N-BEATS are trained almost at the same pace; however, N-HiTS predicted the test data much quicker than N-BEATS. Unlike N-HiTS and N-BEATS, LSTM excelled in reducing training time due to its simplicity and limited number of parameters.</p>
      <p id="d2e9991">Moving forward, it is worth mentioning that predicting the magnitude of the recession curve of flood hydrographs was particularly challenging for all models. We argue that this is because the relation between base flow and time is particularly hard to calibrate due to ground-water effluent that is controlled by geological and physical conditions (vegetation, wetlands, and wet meadows) in headwater streams. In addition, the situations of runoff occurrence are diverse and have a high measurement variance with high frequency that can make it difficult for the algorithms to fully capture discrete representation learning on time series.</p>
      <p id="d2e9994">In future studies, it will be important to develop strategies to derive analogs to the interpretable configuration as well as multi-rate input sampling, hierarchical interpolation, and backcast residual connections that allow for the dynamic representation of flood times series data with different frequencies and nonlinearity. A dynamic representation of flood time series is, at least in principle, possible by generating additive predictions in different bands of the time-series signals, reducing memory footprint and compute time, and improving architecture parsimony and accuracy. This would allow the model to “learn” interpretability and hierarchical representations from raw data to reduce complexity as the information flows through the network.</p>
      <p id="d2e9997">While a single station provides valuable localized information, particularly for small, headwater streams where runoff closely follows immediate meteorological conditions, it may not capture the spatial heterogeneity of larger watersheds. In our study, the applied methods successfully captured runoff magnitude and dynamics in small basins for an operational setting. However, broader spatial coverage and distributed data would likely enhance model accuracy for larger regions. Consequently, our conclusions are specifically scoped to the selected basins and forecast horizons, and broader generalizations would require multi-region investigations in future work.</p>
      <p id="d2e10000">Finally, the performance of N-HiTS, N-BEATS, or other neural network architectures could be further enhanced with robust uncertainty quantification. Approaches such as Bayesian Model Averaging (BMA) with fixed or flexible priors (Samadi et al., 2020) or Markov Chain Monte Carlo (MCMC) optimization methods (Duane et al., 1987) could capture both aleatoric and epistemic uncertainties. We leave these strategies for future exploration in the context of neural flood time-series prediction.</p>
</sec><notes notes-type="dataavailability"><title>Data availability</title>

      <p id="d2e10006">The historical discharge data used in this study are from the USGS (<uri>https://waterdata.usgs.gov/nwis/uv/?referred_module=sw</uri>, last access: 15 Januray 2026), meteorological data from USDA (<uri>https://www.ncdc.noaa.gov/cdo-web/datatools/lcd</uri>, last access: 1 March 2024). We have uploaded the datasets and codes used in this research to Zenodo, accessible via <ext-link xlink:href="https://doi.org/10.5281/zenodo.13343364" ext-link-type="DOI">10.5281/zenodo.13343364</ext-link> (Saberian and Samadi, 2024). For modeling, we used the NeuralForecast package (Olivares et al., 2022), available at: <uri>https://github.com/Nixtla/neuralforecast</uri> (Olivares et al., 2022).</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e10024">MS: conceptualization, methodology, visualization, writing (original draft); VS: conceptualization, funding acquisition, methodology, supervision, visualization, writing (review and editing); IP: conceptualization, methodology, visualization, writing (review and editing).</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e10030">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e10036">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e10042">This research is supported by the US National Science Foundation Directorate of Engineering (Grant # CMMI 2125283; CBET 2429082). All opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF. The authors acknowledge and appreciate Thorsten Wagener (University of Potsdam, Germany) discussion and feedback on this manuscript. Clemson University (USA) is acknowledged for generous allotment of computing time on the Palmetto cluster.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e10047">This research is supported by the US National Science Foundation Directorate of Engineering (grant nos. CMMI 2125283; CBET 2429082).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e10053">This paper was edited by Yue-Ping Xu and reviewed by Peishi Jiang and two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><mixed-citation>Abbaspour, K. C., Yang, J., Maximov, I., Siber, R., Bogner, K., Mieleitner, J., Zobrist, J., and Srinivasan, R.: Modelling hydrology and water quality in the pre-alpine/alpine Thur watershed using SWAT, Journal of Hydrology, 333, 413–430, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2006.09.014" ext-link-type="DOI">10.1016/j.jhydrol.2006.09.014</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><mixed-citation> Alaa, A. M. and van der Schaar, M.: Attentive State-Space Modeling of Disease Progression, in: Advances in Neural Information Processing Systems, ISBN 9781713807933, 2019.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><mixed-citation>Asquith, W. H., Roussel, M. C., Thompson, D. B., Cleveland, T. G., and Fang, X.: Summary of dimensionless Texas hyetographs and distribution of storm depth developed for Texas Department of Transportation research project 0–4194, Texas Department of Transportation, <uri>https://library.ctr.utexas.edu/digitized/texasarchive/phase1/4194-4-txdot.pdf</uri> (last access: 15 January 2026), 2005.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><mixed-citation>Barnard, P. L., van Ormondt, M., Erikson, L. H., Eshleman, J., Hapke, C., Ruggiero, P., Adams, P. N., and Foxgrover, A. C.: Development of the Coastal Storm Modeling System (CoSMoS) for predicting the impact of storms on high-energy, active-margin coasts, Nat. Hazards, 74, 1095–1125, <ext-link xlink:href="https://doi.org/10.1007/s11069-014-1236-y" ext-link-type="DOI">10.1007/s11069-014-1236-y</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib5"><label>5</label><mixed-citation>Basso, S., Schirmer, M., and Botter, G.: A physically based analytical model of flood frequency curves, Geophysical Research Letters, 43, 9070–9076, <ext-link xlink:href="https://doi.org/10.1002/2016GL069915" ext-link-type="DOI">10.1002/2016GL069915</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><mixed-citation>Challu, C., Olivares, K. G., Oreshkin, B. N., Garza, F., Mergenthaler-Canseco, M., and Dubrawski, A.: N-HiTS: Neural Hierarchical Interpolation for Time Series Forecasting, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2201.12886" ext-link-type="DOI">10.48550/arXiv.2201.12886</ext-link>, 29 November 2022.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><mixed-citation>Chen, Y., Li, J., and Xu, H.: Improving flood forecasting capability of physically based distributed hydrological models by parameter optimization, Hydrol. Earth Syst. Sci., 20, 375–392, <ext-link xlink:href="https://doi.org/10.5194/hess-20-375-2016" ext-link-type="DOI">10.5194/hess-20-375-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><mixed-citation>Clark, M. P., Nijssen, B., Lundquist, J. D., Kavetski, D., Rupp, D. E., Woods, R. A., Freer, J. E., Gutmann, E. D., Wood, A. W., Brekke, L. D., Arnold, J. R., Gochis, D. J., and Rasmussen, R. M.: A unified approach for process-based hydrologic modeling: 1. Modeling concept, Water Resources Research, 51, 2498–2514, <ext-link xlink:href="https://doi.org/10.1002/2015WR017198" ext-link-type="DOI">10.1002/2015WR017198</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><mixed-citation>Dasgupta, A., Arnal, L., Emerton, R., Harrigan, S., Matthews, G., Muhammad, A., O'Regan, K., Pérez-Ciria, T., Valdez, E., van Osnabrugge, B., Werner, M., Buontempo, C., Cloke, H., Pappenberger, F., Pechlivanidis, I. G., Prudhomme, C., Ramos, M.-H., and Salamon, P.: Connecting hydrological modelling and forecasting from global to local scales: Perspectives from an international joint virtual workshop, Journal of Flood Risk Management, 18, e12880, <ext-link xlink:href="https://doi.org/10.1111/jfr3.12880" ext-link-type="DOI">10.1111/jfr3.12880</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><mixed-citation>Defontaine, T., Ricci, S., Lapeyre, C., Marchandise, A., and Pape, E. L.: Flood forecasting with Machine Learning in a scarce data layout, IOP Conf. Ser.: Earth Environ. Sci., 1136, 012020, <ext-link xlink:href="https://doi.org/10.1088/1755-1315/1136/1/012020" ext-link-type="DOI">10.1088/1755-1315/1136/1/012020</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><mixed-citation>Duane, S., Kennedy, A. D., Pendleton, B. J., and Roweth, D.: Hybrid Monte Carlo, Physics Letters B, 195, 216–222, <ext-link xlink:href="https://doi.org/10.1016/0370-2693(87)91197-X" ext-link-type="DOI">10.1016/0370-2693(87)91197-X</ext-link>, 1987.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><mixed-citation>Erikson, L. H., Espejo, A., Barnard, P. L., Serafin, K. A., Hegermiller, C. A., O'Neill, A., Ruggiero, P., Limber, P. W., and Mendez, F. J.: Identification of storm events and contiguous coastal sections for deterministic modeling of extreme coastal flood events in response to climate change, Coastal Engineering, 140, 316–330, <ext-link xlink:href="https://doi.org/10.1016/j.coastaleng.2018.08.003" ext-link-type="DOI">10.1016/j.coastaleng.2018.08.003</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><mixed-citation>Evin, G., Le Lay, M., Fouchier, C., Penot, D., Colleoni, F., Mas, A., Garambois, P.-A., and Laurantin, O.: Evaluation of hydrological models on small mountainous catchments: impact of the meteorological forcings, Hydrol. Earth Syst. Sci., 28, 261–281, <ext-link xlink:href="https://doi.org/10.5194/hess-28-261-2024" ext-link-type="DOI">10.5194/hess-28-261-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><mixed-citation>Fan, C., Zhang, Y., Pan, Y., Li, X., Zhang, C., Yuan, R., Wu, D., Wang, W., Pei, J., and Huang, H.: Multi-Horizon Time Series Forecasting with Temporal Attention Learning, in: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery &amp; Data Mining, New York, NY, USA, 2527–2535, <ext-link xlink:href="https://doi.org/10.1145/3292500.3330662" ext-link-type="DOI">10.1145/3292500.3330662</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><mixed-citation>Fang, K., Kifer, D., Lawson, K., and Shen, C.: Evaluating the Potential and Challenges of an Uncertainty Quantification Method for Long Short-Term Memory Models for Soil Moisture Predictions, Water Resources Research, 56, e2020WR028095, <ext-link xlink:href="https://doi.org/10.1029/2020WR028095" ext-link-type="DOI">10.1029/2020WR028095</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><mixed-citation>Gotvald, A. J.: Historic flooding in Georgia, 2009, U.S. Geological Survey Open-File Report 2010–1230, 19 pp., <uri>https://pubs.usgs.gov/of/2010/1230/</uri> (last access: 5 June 2024), 2010.</mixed-citation></ref>
      <ref id="bib1.bib17"><label>17</label><mixed-citation>Guha-Sapir, D. and Below, R.: Quality and accuracy of disaster data: a comparative analysis of 3 global data sets, Working paper prepared for the Disaster Management Facility, World Bank, CRED, Brussels, <uri>https://api.semanticscholar.org/CorpusID:132874120</uri> (last access: 5 June 2024), 2002.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><mixed-citation>Gupta, H. V., Kling, H., Yilmaz, K. K., and Martinez, G. F.: Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling, Journal of Hydrology, 377, 80–91, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2009.08.003" ext-link-type="DOI">10.1016/j.jhydrol.2009.08.003</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bib19"><label>19</label><mixed-citation>Hochreiter, S., Younger, A. S., and Conwell, P. R.: Learning to Learn Using Gradient Descent, in: Artificial Neural Networks – ICANN 2001, Berlin, Heidelberg, 87–94, <ext-link xlink:href="https://doi.org/10.1007/3-540-44668-0_13" ext-link-type="DOI">10.1007/3-540-44668-0_13</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bib20"><label>20</label><mixed-citation>Hsu, K., Gupta, H. V., and Sorooshian, S.: Artificial Neural Network Modeling of the Rainfall-Runoff Process, Water Resources Research, 31, 2517–2530, <ext-link xlink:href="https://doi.org/10.1029/95WR01955" ext-link-type="DOI">10.1029/95WR01955</ext-link>, 1995.</mixed-citation></ref>
      <ref id="bib1.bib21"><label>21</label><mixed-citation>Jonkman, S. N.: Global Perspectives on Loss of Human Life Caused by Floods, Nat. Hazards, 34, 151–175, <ext-link xlink:href="https://doi.org/10.1007/s11069-004-8891-3" ext-link-type="DOI">10.1007/s11069-004-8891-3</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bib22"><label>22</label><mixed-citation>Kingma, D. P. and Ba, J.: Adam: A Method for Stochastic Optimization, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1412.6980" ext-link-type="DOI">10.48550/arXiv.1412.6980</ext-link>, 29 January 2017.</mixed-citation></ref>
      <ref id="bib1.bib23"><label>23</label><mixed-citation>Kratzert, F., Klotz, D., Brenner, C., Schulz, K., and Herrnegger, M.: Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks, Hydrol. Earth Syst. Sci., 22, 6005–6022, <ext-link xlink:href="https://doi.org/10.5194/hess-22-6005-2018" ext-link-type="DOI">10.5194/hess-22-6005-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib24"><label>24</label><mixed-citation>Lim, B., Arık, S. Ö., Loeff, N., and Pfister, T.: Temporal Fusion Transformers for interpretable multi-horizon time series forecasting, International Journal of Forecasting, 37, 1748–1764, <ext-link xlink:href="https://doi.org/10.1016/j.ijforecast.2021.03.012" ext-link-type="DOI">10.1016/j.ijforecast.2021.03.012</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib25"><label>25</label><mixed-citation>Lobligeois, F., Andréassian, V., Perrin, C., Tabary, P., and Loumagne, C.: When does higher spatial resolution rainfall information improve streamflow simulation? An evaluation using 3620 flood events, Hydrol. Earth Syst. Sci., 18, 575–594, <ext-link xlink:href="https://doi.org/10.5194/hess-18-575-2014" ext-link-type="DOI">10.5194/hess-18-575-2014</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib26"><label>26</label><mixed-citation>MacDonald, L. H. and Coe, D.: Influence of Headwater Streams on Downstream Reaches in Forested Areas, Forest Science, 53, 148–168, <ext-link xlink:href="https://doi.org/10.1093/forestscience/53.2.148" ext-link-type="DOI">10.1093/forestscience/53.2.148</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bib27"><label>27</label><mixed-citation>Martinaitis, S. M., Wilson, K. A., Yussouf, N., Gourley, J. J., Vergara, H., Meyer, T. C., Heinselman, P. L., Gerard, A., Berry, K. L., Vergara, A., and Monroe, J.: A Path Toward Short-Term Probabilistic Flash Flood Prediction, <ext-link xlink:href="https://doi.org/10.1175/BAMS-D-22-0026.1" ext-link-type="DOI">10.1175/BAMS-D-22-0026.1</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib28"><label>28</label><mixed-citation>McCallum, B. E. and Gotvald, A. J.: Historic flooding in northern Georgia, September 16–22, 2009, U.S. Geological Survey Fact Sheet 2010–3061, 4 pp., <uri>https://pubs.usgs.gov/fs/2010/3061/</uri> (last access: 5 June 2024), 2010.</mixed-citation></ref>
      <ref id="bib1.bib29"><label>29</label><mixed-citation>McCuen, R. H., Knight, Z., and Cutter, A. G.: Evaluation of the Nash–Sutcliffe Efficiency Index, Journal of Hydrologic Engineering, 11, 597–602, <ext-link xlink:href="https://doi.org/10.1061/(ASCE)1084-0699(2006)11:6(597)" ext-link-type="DOI">10.1061/(ASCE)1084-0699(2006)11:6(597)</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bib30"><label>30</label><mixed-citation>Munn, M., Sheibley, R., Waite, I., and Meador, M.: Understanding the relationship between stream metabolism and biological assemblages, Freshwater Science, 39, 680–692, <ext-link xlink:href="https://doi.org/10.1086/711690" ext-link-type="DOI">10.1086/711690</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib31"><label>31</label><mixed-citation>Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual models part I – A discussion of principles, Journal of Hydrology, 10, 282–290, <ext-link xlink:href="https://doi.org/10.1016/0022-1694(70)90255-6" ext-link-type="DOI">10.1016/0022-1694(70)90255-6</ext-link>, 1970.</mixed-citation></ref>
      <ref id="bib1.bib32"><label>32</label><mixed-citation>Natural Resources Conservation Service (NRCS): National Engineering Handbook Part 630 Hydrology – Chapter 15: Time of Concentration, <uri>https://www.hydrocad.net/neh/630ch15.pdf</uri> (last access: 23 December 2025), 2010.</mixed-citation></ref>
      <ref id="bib1.bib33"><label>33</label><mixed-citation>Nevo, S., Morin, E., Gerzi Rosenthal, A., Metzger, A., Barshai, C., Weitzner, D., Voloshin, D., Kratzert, F., Elidan, G., Dror, G., Begelman, G., Nearing, G., Shalev, G., Noga, H., Shavitt, I., Yuklea, L., Royz, M., Giladi, N., Peled Levi, N., Reich, O., Gilon, O., Maor, R., Timnat, S., Shechter, T., Anisimov, V., Gigi, Y., Levin, Y., Moshe, Z., Ben-Haim, Z., Hassidim, A., and Matias, Y.: Flood forecasting with machine learning models in an operational framework, Hydrol. Earth Syst. Sci., 26, 4013–4032, <ext-link xlink:href="https://doi.org/10.5194/hess-26-4013-2022" ext-link-type="DOI">10.5194/hess-26-4013-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib34"><label>34</label><mixed-citation>Olivares, K. G., Challú, C., Garza, A., Mergenthaler Canseco, M., and Dubrawski, A.: NeuralForecast: User friendly state-of-the-art neural forecasting models, PyCon Salt Lake City, Utah, US, GitHub [code], <uri>https://github.com/Nixtla/neuralforecast</uri> (last access: 23 December 2025), 2022.</mixed-citation></ref>
      <ref id="bib1.bib35"><label>35</label><mixed-citation>Olivares, K. G., Meetei, O. N., Ma, R., Reddy, R., Cao, M., and Dicker, L.: Probabilistic hierarchical forecasting with deep Poisson mixtures, International Journal of Forecasting, 40, 470–489, <ext-link xlink:href="https://doi.org/10.1016/j.ijforecast.2023.04.007" ext-link-type="DOI">10.1016/j.ijforecast.2023.04.007</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib36"><label>36</label><mixed-citation>Oreshkin, B. N., Carpov, D., Chapados, N., and Bengio, Y.: N-BEATS: Neural basis expansion analysis for interpretable time series forecasting, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1905.10437" ext-link-type="DOI">10.48550/arXiv.1905.10437</ext-link>, 20 February 2020.</mixed-citation></ref>
      <ref id="bib1.bib37"><label>37</label><mixed-citation>Pally, R. J. and Samadi, V.: Application of image processing and convolutional neural networks for flood image classification and semantic segmentation, Environmental Modelling &amp; Software, 148, 105285, <ext-link xlink:href="https://doi.org/10.1016/j.envsoft.2021.105285" ext-link-type="DOI">10.1016/j.envsoft.2021.105285</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib38"><label>38</label><mixed-citation>Palmer, T. N.: Towards the probabilistic Earth-system simulator: a vision for the future of climate and weather prediction, Quarterly Journal of the Royal Meteorological Society, 138, 841–861, <ext-link xlink:href="https://doi.org/10.1002/qj.1923" ext-link-type="DOI">10.1002/qj.1923</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib39"><label>39</label><mixed-citation>Park, K. and Lee, E. H.: Urban flood vulnerability analysis and prediction based on the land use using Deep Neural Network, International Journal of Disaster Risk Reduction, 101, 104231, <ext-link xlink:href="https://doi.org/10.1016/j.ijdrr.2023.104231" ext-link-type="DOI">10.1016/j.ijdrr.2023.104231</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib40"><label>40</label><mixed-citation>Pourreza-Bilondi, M., Samadi, S. Z., Akhoond-Ali, A.-M., and Ghahraman, B.: Reliability of Semiarid Flash Flood Modeling Using Bayesian Framework, Journal of Hydrologic Engineering, 22, 05016039, <ext-link xlink:href="https://doi.org/10.1061/(ASCE)HE.1943-5584.0001482" ext-link-type="DOI">10.1061/(ASCE)HE.1943-5584.0001482</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib41"><label>41</label><mixed-citation>Refsgaard, J. C., Stisen, S., and Koch, J.: Hydrological process knowledge in catchment modelling – Lessons and perspectives from 60 years development, Hydrological Processes, 36, e14463, <ext-link xlink:href="https://doi.org/10.1002/hyp.14463" ext-link-type="DOI">10.1002/hyp.14463</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib42"><label>42</label><mixed-citation>Roelvink, D., Reniers, A., van Dongeren, A., van Thiel de Vries, J., McCall, R., and Lescinski, J.: Modelling storm impacts on beaches, dunes and barrier islands, Coastal Engineering, 56, 1133–1152, <ext-link xlink:href="https://doi.org/10.1016/j.coastaleng.2009.08.006" ext-link-type="DOI">10.1016/j.coastaleng.2009.08.006</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bib43"><label>43</label><mixed-citation>Russo, S., Perraudin, N., Stalder, S., Perez-Cruz, F., Leitao, J. P., Obozinski, G., and Wegner, J. D.: An evaluation of deep learning models for predicting water depth evolution in urban floods, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2302.10062" ext-link-type="DOI">10.48550/arXiv.2302.10062</ext-link>, 20 February 2023.</mixed-citation></ref>
      <ref id="bib1.bib44"><label>44</label><mixed-citation>Saberian, M., Zafarmomen, N., Neupane, A., Panthi, K., and Samadi, V.: HydroQuantum: A new quantum-driven Python package for hydrological simulation, Environmental Modelling &amp; Software, 195, 106736, <ext-link xlink:href="https://doi.org/10.1016/j.envsoft.2025.106736" ext-link-type="DOI">10.1016/j.envsoft.2025.106736</ext-link>, 2026.</mixed-citation></ref>
      <ref id="bib1.bib45"><label>45</label><mixed-citation>Safaei-Moghadam, A., Tarboton, D., and Minsker, B.: Estimating the likelihood of roadway pluvial flood based on crowdsourced traffic data and depression-based DEM analysis, Nat. Hazards Earth Syst. Sci., 23, 1–19, <ext-link xlink:href="https://doi.org/10.5194/nhess-23-1-2023" ext-link-type="DOI">10.5194/nhess-23-1-2023</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib46"><label>46</label><mixed-citation>Saksena, S., Dey, S., Merwade, V., and Singhofen, P. J.: A Computationally Efficient and Physically Based Approach for Urban Flood Modeling Using a Flexible Spatiotemporal Structure, Water Resources Research, 56, e2019WR025769, <ext-link xlink:href="https://doi.org/10.1029/2019WR025769" ext-link-type="DOI">10.1029/2019WR025769</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib47"><label>47</label><mixed-citation>Samadi, S., Pourreza-Bilondi, M., Wilson, C. A. M. E., and Hitchcock, D. B.: Bayesian Model Averaging With Fixed and Flexible Priors: Theory, Concepts, and Calibration Experiments for Rainfall-Runoff Modeling, Journal of Advances in Modeling Earth Systems, 12, e2019MS001924, <ext-link xlink:href="https://doi.org/10.1029/2019MS001924" ext-link-type="DOI">10.1029/2019MS001924</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib48"><label>48</label><mixed-citation>Samadi, V., Fowler, H. J., Lamond, J., Wagener, T., Brunner, M., Gourley, J., Moradkhani, H., Popescu, I., Wasko, C., Wright, D., Wu, H., Zhang, K., Arias, P. A., Duan, Q., Nazemi, A., van Oevelen, P. J., Prein, A. F., Roundy, J. K., Saberian, M., and Umutoni, L.: The Needs, Challenges, and Priorities for Advancing Global Flood Research, WIREs Water, 12, e70026, <ext-link xlink:href="https://doi.org/10.1002/wat2.70026" ext-link-type="DOI">10.1002/wat2.70026</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib49"><label>49</label><mixed-citation>WCCB Charlotte's CW: Widespread Flooding After Severe Storms, <uri>https://www.wccbcharlotte.com/2020/02/08/widespread-flooding-after-severe-storms/</uri> (last access: 5 June 2024), 2020.</mixed-citation></ref>
      <ref id="bib1.bib50"><label>50</label><mixed-citation>Saberian, M. and Samadi, V.: Probabilistic Hierarchical Interpolation and Interpretable Configuration for Flood Prediction, Zenodo [data set], <ext-link xlink:href="https://doi.org/10.5281/zenodo.13343364" ext-link-type="DOI">10.5281/zenodo.13343364</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib51"><label>51</label><mixed-citation>Sukovich, E. M., Ralph, F. M., Barthold, F. E., Reynolds, D. W., and Novak, D. R.: Extreme Quantitative Precipitation Forecast Performance at the Weather Prediction Center from 2001 to 2011, Weather and Forecasting, 29, 894–911, <ext-link xlink:href="https://doi.org/10.1175/WAF-D-13-00061.1" ext-link-type="DOI">10.1175/WAF-D-13-00061.1</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib52"><label>52</label><mixed-citation>Tabas, S. S. and Samadi, S.: Variational Bayesian dropout with a Gaussian prior for recurrent neural networks application in rainfall–runoff modeling, Environ. Res. Lett., 17, 065012, <ext-link xlink:href="https://doi.org/10.1088/1748-9326/ac7247" ext-link-type="DOI">10.1088/1748-9326/ac7247</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib53"><label>53</label><mixed-citation>Thompson, C. M. and Frazier, T. G.: Deterministic and probabilistic flood modeling for contemporary and future coastal and inland precipitation inundation, Applied Geography, 50, 1–14, <ext-link xlink:href="https://doi.org/10.1016/j.apgeog.2014.01.013" ext-link-type="DOI">10.1016/j.apgeog.2014.01.013</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib54"><label>54</label><mixed-citation>Tiwari, M. K. and Chatterjee, C.: Development of an accurate and reliable hourly flood forecasting model using wavelet-bootstrap-ANN (WBANN) hybrid approach, Journal of Hydrology, 394, 458–470, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2010.10.001" ext-link-type="DOI">10.1016/j.jhydrol.2010.10.001</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib55"><label>55</label><mixed-citation> UNISDR: Making development sustainable: The future of disaster risk management, Global Assessment Report on Disaster Risk Reduction, Geneva, Switzerland, United Nations Office for Disaster Risk Reduction (UNISDR), ISBN 9789211320428, 2015.</mixed-citation></ref>
      <ref id="bib1.bib56"><label>56</label><mixed-citation>US EPA: Watershed Report, Office of Water, <uri>https://watersgeo.epa.gov/watershedreport/?comid=9224629</uri>, last access: 5 June 2024.</mixed-citation></ref>
      <ref id="bib1.bib57"><label>57</label><mixed-citation>Wee, G., Chang, L.-C., Chang, F.-J., and Mat Amin, M. Z.: A flood Impact-Based forecasting system by fuzzy inference techniques, Journal of Hydrology, 625, 130117, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2023.130117" ext-link-type="DOI">10.1016/j.jhydrol.2023.130117</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib58"><label>58</label><mixed-citation>Windheuser, L., Karanjit, R., Pally, R., Samadi, S., and Hubig, N. C.: An End-To-End Flood Stage Prediction System Using Deep Neural Networks, Earth and Space Science, 10, e2022EA002385, <ext-link xlink:href="https://doi.org/10.1029/2022EA002385" ext-link-type="DOI">10.1029/2022EA002385</ext-link>, 2023. </mixed-citation></ref>
      <ref id="bib1.bib59"><label>59</label><mixed-citation>Zafarmomen, N. and Samadi, V.: Can large language models effectively reason about adverse weather conditions?, Environmental Modelling &amp; Software, 188, 106421, <ext-link xlink:href="https://doi.org/10.1016/j.envsoft.2025.106421" ext-link-type="DOI">10.1016/j.envsoft.2025.106421</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib60"><label>60</label><mixed-citation>Zafarmomen, N., Alizadeh, H., Bayat, M., Ehtiat, M., and Moradkhani, H.: Assimilation of Sentinel-Based Leaf Area Index for Modeling Surface-Ground Water Interactions in Irrigation Districts, Water Resources Research, 60, e2023WR036080, <ext-link xlink:href="https://doi.org/10.1029/2023WR036080" ext-link-type="DOI">10.1029/2023WR036080</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib61"><label>61</label><mixed-citation>Zhang, L., Qin, H., Mao, J., Cao, X., and Fu, G.: High temporal resolution urban flood prediction using attention-based LSTM models, Journal of Hydrology, 620, 129499, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2023.129499" ext-link-type="DOI">10.1016/j.jhydrol.2023.129499</ext-link>, 2023a.</mixed-citation></ref>
      <ref id="bib1.bib62"><label>62</label><mixed-citation>Zhang, Y., Pan, D., Griensven, J. V., Yang, S. X., and Gharabaghi, B.: Intelligent flood forecasting and warning: a survey, Intelligence &amp; Robotics, 3, 190–212, <ext-link xlink:href="https://doi.org/10.20517/ir.2023.12" ext-link-type="DOI">10.20517/ir.2023.12</ext-link>, 2023b.</mixed-citation></ref>
      <ref id="bib1.bib63"><label>63</label><mixed-citation>Zou, Y., Wang, J., Lei, P., and Li, Y.: A novel multi-step ahead forecasting model for flood based on time residual LSTM, Journal of Hydrology, 620, 129521, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2023.129521" ext-link-type="DOI">10.1016/j.jhydrol.2023.129521</ext-link>, 2023.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Probabilistic hierarchical interpolation and interpretable neural network configurations for flood prediction</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
      
Abbaspour, K. C., Yang, J., Maximov, I., Siber, R., Bogner, K., Mieleitner,
J., Zobrist, J., and Srinivasan, R.: Modelling hydrology and water quality
in the pre-alpine/alpine Thur watershed using SWAT, Journal of Hydrology,
333, 413–430, <a href="https://doi.org/10.1016/j.jhydrol.2006.09.014" target="_blank">https://doi.org/10.1016/j.jhydrol.2006.09.014</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
      
Alaa, A. M. and van der Schaar, M.: Attentive State-Space Modeling of
Disease Progression, in: Advances in Neural Information Processing Systems, ISBN 9781713807933,
2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
      
Asquith, W. H., Roussel, M. C., Thompson, D. B., Cleveland, T. G., and Fang,
X.: Summary of dimensionless Texas hyetographs and distribution of storm
depth developed for Texas Department of Transportation research project
0–4194, Texas Department of Transportation, <a href="https://library.ctr.utexas.edu/digitized/texasarchive/phase1/4194-4-txdot.pdf" target="_blank"/> (last access: 15 January 2026), 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
      
Barnard, P. L., van Ormondt, M., Erikson, L. H., Eshleman, J., Hapke, C.,
Ruggiero, P., Adams, P. N., and Foxgrover, A. C.: Development of the Coastal
Storm Modeling System (CoSMoS) for predicting the impact of storms on
high-energy, active-margin coasts, Nat. Hazards, 74, 1095–1125,
<a href="https://doi.org/10.1007/s11069-014-1236-y" target="_blank">https://doi.org/10.1007/s11069-014-1236-y</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
      
Basso, S., Schirmer, M., and Botter, G.: A physically based analytical model
of flood frequency curves, Geophysical Research Letters, 43, 9070–9076,
<a href="https://doi.org/10.1002/2016GL069915" target="_blank">https://doi.org/10.1002/2016GL069915</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
      
Challu, C., Olivares, K. G., Oreshkin, B. N., Garza, F.,
Mergenthaler-Canseco, M., and Dubrawski, A.: N-HiTS: Neural Hierarchical
Interpolation for Time Series Forecasting, arXiv [preprint],
<a href="https://doi.org/10.48550/arXiv.2201.12886" target="_blank">https://doi.org/10.48550/arXiv.2201.12886</a>, 29 November 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
      
Chen, Y., Li, J., and Xu, H.: Improving flood forecasting capability of physically based distributed hydrological models by parameter optimization, Hydrol. Earth Syst. Sci., 20, 375–392, <a href="https://doi.org/10.5194/hess-20-375-2016" target="_blank">https://doi.org/10.5194/hess-20-375-2016</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
      
Clark, M. P., Nijssen, B., Lundquist, J. D., Kavetski, D., Rupp, D. E.,
Woods, R. A., Freer, J. E., Gutmann, E. D., Wood, A. W., Brekke, L. D.,
Arnold, J. R., Gochis, D. J., and Rasmussen, R. M.: A unified approach for
process-based hydrologic modeling: 1. Modeling concept, Water Resources
Research, 51, 2498–2514, <a href="https://doi.org/10.1002/2015WR017198" target="_blank">https://doi.org/10.1002/2015WR017198</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
      
Dasgupta, A., Arnal, L., Emerton, R., Harrigan, S., Matthews, G., Muhammad,
A., O'Regan, K., Pérez-Ciria, T., Valdez, E., van Osnabrugge, B.,
Werner, M., Buontempo, C., Cloke, H., Pappenberger, F., Pechlivanidis, I.
G., Prudhomme, C., Ramos, M.-H., and Salamon, P.: Connecting hydrological
modelling and forecasting from global to local scales: Perspectives from an
international joint virtual workshop, Journal of Flood Risk Management, 18,
e12880, <a href="https://doi.org/10.1111/jfr3.12880" target="_blank">https://doi.org/10.1111/jfr3.12880</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
      
Defontaine, T., Ricci, S., Lapeyre, C., Marchandise, A., and Pape, E. L.:
Flood forecasting with Machine Learning in a scarce data layout, IOP Conf.
Ser.: Earth Environ. Sci., 1136, 012020,
<a href="https://doi.org/10.1088/1755-1315/1136/1/012020" target="_blank">https://doi.org/10.1088/1755-1315/1136/1/012020</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
      
Duane, S., Kennedy, A. D., Pendleton, B. J., and Roweth, D.: Hybrid Monte
Carlo, Physics Letters B, 195, 216–222,
<a href="https://doi.org/10.1016/0370-2693(87)91197-X" target="_blank">https://doi.org/10.1016/0370-2693(87)91197-X</a>, 1987.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
      
Erikson, L. H., Espejo, A., Barnard, P. L., Serafin, K. A., Hegermiller, C.
A., O'Neill, A., Ruggiero, P., Limber, P. W., and Mendez, F. J.:
Identification of storm events and contiguous coastal sections for
deterministic modeling of extreme coastal flood events in response to
climate change, Coastal Engineering, 140, 316–330,
<a href="https://doi.org/10.1016/j.coastaleng.2018.08.003" target="_blank">https://doi.org/10.1016/j.coastaleng.2018.08.003</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
      
Evin, G., Le Lay, M., Fouchier, C., Penot, D., Colleoni, F., Mas, A., Garambois, P.-A., and Laurantin, O.: Evaluation of hydrological models on small mountainous catchments: impact of the meteorological forcings, Hydrol. Earth Syst. Sci., 28, 261–281, <a href="https://doi.org/10.5194/hess-28-261-2024" target="_blank">https://doi.org/10.5194/hess-28-261-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
      
Fan, C., Zhang, Y., Pan, Y., Li, X., Zhang, C., Yuan, R., Wu, D., Wang, W.,
Pei, J., and Huang, H.: Multi-Horizon Time Series Forecasting with Temporal
Attention Learning, in: Proceedings of the 25th ACM SIGKDD International
Conference on Knowledge Discovery &amp; Data Mining, New York, NY, USA,
2527–2535, <a href="https://doi.org/10.1145/3292500.3330662" target="_blank">https://doi.org/10.1145/3292500.3330662</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
      
Fang, K., Kifer, D., Lawson, K., and Shen, C.: Evaluating the Potential and
Challenges of an Uncertainty Quantification Method for Long Short-Term
Memory Models for Soil Moisture Predictions, Water Resources Research, 56,
e2020WR028095, <a href="https://doi.org/10.1029/2020WR028095" target="_blank">https://doi.org/10.1029/2020WR028095</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
      
Gotvald, A. J.: Historic flooding in Georgia, 2009, U.S. Geological Survey
Open-File Report 2010–1230, 19 pp., <a href="https://pubs.usgs.gov/of/2010/1230/" target="_blank"/> (last access: 5 June 2024), 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
      
Guha-Sapir, D. and Below, R.: Quality and accuracy of disaster data:
a comparative analysis of 3 global data sets, Working paper prepared for
the Disaster Management Facility, World Bank, CRED, Brussels, <a href="https://api.semanticscholar.org/CorpusID:132874120" target="_blank"/> (last access: 5 June 2024), 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
      
Gupta, H. V., Kling, H., Yilmaz, K. K., and Martinez, G. F.: Decomposition
of the mean squared error and NSE performance criteria: Implications for
improving hydrological modelling, Journal of Hydrology, 377, 80–91,
<a href="https://doi.org/10.1016/j.jhydrol.2009.08.003" target="_blank">https://doi.org/10.1016/j.jhydrol.2009.08.003</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>19</label><mixed-citation>
      
Hochreiter, S., Younger, A. S., and Conwell, P. R.: Learning to Learn Using
Gradient Descent, in: Artificial Neural Networks – ICANN 2001, Berlin,
Heidelberg, 87–94, <a href="https://doi.org/10.1007/3-540-44668-0_13" target="_blank">https://doi.org/10.1007/3-540-44668-0_13</a>,
2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>20</label><mixed-citation>
      
Hsu, K., Gupta, H. V., and Sorooshian, S.: Artificial Neural Network
Modeling of the Rainfall-Runoff Process, Water Resources Research, 31,
2517–2530, <a href="https://doi.org/10.1029/95WR01955" target="_blank">https://doi.org/10.1029/95WR01955</a>, 1995.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>21</label><mixed-citation>
      
Jonkman, S. N.: Global Perspectives on Loss of Human Life Caused by Floods,
Nat. Hazards, 34, 151–175, <a href="https://doi.org/10.1007/s11069-004-8891-3" target="_blank">https://doi.org/10.1007/s11069-004-8891-3</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>22</label><mixed-citation>
      
Kingma, D. P. and Ba, J.: Adam: A Method for Stochastic Optimization, arXiv [preprint],
<a href="https://doi.org/10.48550/arXiv.1412.6980" target="_blank">https://doi.org/10.48550/arXiv.1412.6980</a>, 29 January 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>23</label><mixed-citation>
      
Kratzert, F., Klotz, D., Brenner, C., Schulz, K., and Herrnegger, M.: Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks, Hydrol. Earth Syst. Sci., 22, 6005–6022, <a href="https://doi.org/10.5194/hess-22-6005-2018" target="_blank">https://doi.org/10.5194/hess-22-6005-2018</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>24</label><mixed-citation>
      
Lim, B., Arık, S. Ö., Loeff, N., and Pfister, T.: Temporal Fusion
Transformers for interpretable multi-horizon time series forecasting,
International Journal of Forecasting, 37, 1748–1764,
<a href="https://doi.org/10.1016/j.ijforecast.2021.03.012" target="_blank">https://doi.org/10.1016/j.ijforecast.2021.03.012</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>25</label><mixed-citation>
      
Lobligeois, F., Andréassian, V., Perrin, C., Tabary, P., and Loumagne, C.: When does higher spatial resolution rainfall information improve streamflow simulation? An evaluation using 3620 flood events, Hydrol. Earth Syst. Sci., 18, 575–594, <a href="https://doi.org/10.5194/hess-18-575-2014" target="_blank">https://doi.org/10.5194/hess-18-575-2014</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>26</label><mixed-citation>
      
MacDonald, L. H. and Coe, D.: Influence of Headwater Streams on Downstream
Reaches in Forested Areas, Forest Science, 53, 148–168,
<a href="https://doi.org/10.1093/forestscience/53.2.148" target="_blank">https://doi.org/10.1093/forestscience/53.2.148</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>27</label><mixed-citation>
      
Martinaitis, S. M., Wilson, K. A., Yussouf, N., Gourley, J. J., Vergara, H.,
Meyer, T. C., Heinselman, P. L., Gerard, A., Berry, K. L., Vergara, A., and
Monroe, J.: A Path Toward Short-Term Probabilistic Flash Flood Prediction,
<a href="https://doi.org/10.1175/BAMS-D-22-0026.1" target="_blank">https://doi.org/10.1175/BAMS-D-22-0026.1</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>28</label><mixed-citation>
      
McCallum, B. E. and Gotvald, A. J.: Historic flooding in northern Georgia,
September 16–22, 2009, U.S. Geological Survey Fact Sheet 2010–3061, 4 pp., <a href="https://pubs.usgs.gov/fs/2010/3061/" target="_blank"/> (last access: 5 June 2024),
2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>29</label><mixed-citation>
      
McCuen, R. H., Knight, Z., and Cutter, A. G.: Evaluation of the
Nash–Sutcliffe Efficiency Index, Journal of Hydrologic Engineering, 11,
597–602, <a href="https://doi.org/10.1061/(ASCE)1084-0699(2006)11:6(597)" target="_blank">https://doi.org/10.1061/(ASCE)1084-0699(2006)11:6(597)</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>30</label><mixed-citation>
      
Munn, M., Sheibley, R., Waite, I., and Meador, M.: Understanding the
relationship between stream metabolism and biological assemblages,
Freshwater Science, 39, 680–692, <a href="https://doi.org/10.1086/711690" target="_blank">https://doi.org/10.1086/711690</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>31</label><mixed-citation>
      
Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual
models part I – A discussion of principles, Journal of Hydrology, 10,
282–290, <a href="https://doi.org/10.1016/0022-1694(70)90255-6" target="_blank">https://doi.org/10.1016/0022-1694(70)90255-6</a>, 1970.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>32</label><mixed-citation>
      
Natural Resources Conservation Service (NRCS): National Engineering Handbook Part
630 Hydrology – Chapter 15: Time of Concentration,
<a href="https://www.hydrocad.net/neh/630ch15.pdf" target="_blank"/> (last access: 23 December 2025), 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>33</label><mixed-citation>
      
Nevo, S., Morin, E., Gerzi Rosenthal, A., Metzger, A., Barshai, C., Weitzner, D., Voloshin, D., Kratzert, F., Elidan, G., Dror, G., Begelman, G., Nearing, G., Shalev, G., Noga, H., Shavitt, I., Yuklea, L., Royz, M., Giladi, N., Peled Levi, N., Reich, O., Gilon, O., Maor, R., Timnat, S., Shechter, T., Anisimov, V., Gigi, Y., Levin, Y., Moshe, Z., Ben-Haim, Z., Hassidim, A., and Matias, Y.: Flood forecasting with machine learning models in an operational framework, Hydrol. Earth Syst. Sci., 26, 4013–4032, <a href="https://doi.org/10.5194/hess-26-4013-2022" target="_blank">https://doi.org/10.5194/hess-26-4013-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>34</label><mixed-citation>
      
Olivares, K. G., Challú, C., Garza, A., Mergenthaler Canseco, M., and
Dubrawski, A.: NeuralForecast: User friendly state-of-the-art neural
forecasting models, PyCon Salt Lake City, Utah, US, GitHub [code],
<a href="https://github.com/Nixtla/neuralforecast" target="_blank"/> (last access: 23 December 2025), 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>35</label><mixed-citation>
      
Olivares, K. G., Meetei, O. N., Ma, R., Reddy, R., Cao, M., and Dicker, L.:
Probabilistic hierarchical forecasting with deep Poisson mixtures,
International Journal of Forecasting, 40, 470–489,
<a href="https://doi.org/10.1016/j.ijforecast.2023.04.007" target="_blank">https://doi.org/10.1016/j.ijforecast.2023.04.007</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>36</label><mixed-citation>
      
Oreshkin, B. N., Carpov, D., Chapados, N., and Bengio, Y.: N-BEATS: Neural
basis expansion analysis for interpretable time series forecasting, arXiv [preprint],
<a href="https://doi.org/10.48550/arXiv.1905.10437" target="_blank">https://doi.org/10.48550/arXiv.1905.10437</a>, 20 February 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>37</label><mixed-citation>
      
Pally, R. J. and Samadi, V.: Application of image processing and
convolutional neural networks for flood image classification and semantic
segmentation, Environmental Modelling &amp; Software, 148, 105285,
<a href="https://doi.org/10.1016/j.envsoft.2021.105285" target="_blank">https://doi.org/10.1016/j.envsoft.2021.105285</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>38</label><mixed-citation>
      
Palmer, T. N.: Towards the probabilistic Earth-system simulator: a vision
for the future of climate and weather prediction, Quarterly Journal of the
Royal Meteorological Society, 138, 841–861,
<a href="https://doi.org/10.1002/qj.1923" target="_blank">https://doi.org/10.1002/qj.1923</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>39</label><mixed-citation>
      
Park, K. and Lee, E. H.: Urban flood vulnerability analysis and prediction
based on the land use using Deep Neural Network, International Journal of
Disaster Risk Reduction, 101, 104231,
<a href="https://doi.org/10.1016/j.ijdrr.2023.104231" target="_blank">https://doi.org/10.1016/j.ijdrr.2023.104231</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>40</label><mixed-citation>
      
Pourreza-Bilondi, M., Samadi, S. Z., Akhoond-Ali, A.-M., and Ghahraman, B.:
Reliability of Semiarid Flash Flood Modeling Using Bayesian Framework,
Journal of Hydrologic Engineering, 22, 05016039,
<a href="https://doi.org/10.1061/(ASCE)HE.1943-5584.0001482" target="_blank">https://doi.org/10.1061/(ASCE)HE.1943-5584.0001482</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>41</label><mixed-citation>
      
Refsgaard, J. C., Stisen, S., and Koch, J.: Hydrological process knowledge
in catchment modelling – Lessons and perspectives from 60 years
development, Hydrological Processes, 36, e14463,
<a href="https://doi.org/10.1002/hyp.14463" target="_blank">https://doi.org/10.1002/hyp.14463</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>42</label><mixed-citation>
      
Roelvink, D., Reniers, A., van Dongeren, A., van Thiel de Vries, J., McCall,
R., and Lescinski, J.: Modelling storm impacts on beaches, dunes and barrier
islands, Coastal Engineering, 56, 1133–1152,
<a href="https://doi.org/10.1016/j.coastaleng.2009.08.006" target="_blank">https://doi.org/10.1016/j.coastaleng.2009.08.006</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>43</label><mixed-citation>
      
Russo, S., Perraudin, N., Stalder, S., Perez-Cruz, F., Leitao, J. P.,
Obozinski, G., and Wegner, J. D.: An evaluation of deep learning models for
predicting water depth evolution in urban floods, arXiv [preprint],
<a href="https://doi.org/10.48550/arXiv.2302.10062" target="_blank">https://doi.org/10.48550/arXiv.2302.10062</a>, 20 February 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>44</label><mixed-citation>
      
Saberian, M., Zafarmomen, N., Neupane, A., Panthi, K., and Samadi, V.:
HydroQuantum: A new quantum-driven Python package for hydrological
simulation, Environmental Modelling &amp; Software, 195, 106736,
<a href="https://doi.org/10.1016/j.envsoft.2025.106736" target="_blank">https://doi.org/10.1016/j.envsoft.2025.106736</a>, 2026.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>45</label><mixed-citation>
      
Safaei-Moghadam, A., Tarboton, D., and Minsker, B.: Estimating the likelihood of roadway pluvial flood based on crowdsourced traffic data and depression-based DEM analysis, Nat. Hazards Earth Syst. Sci., 23, 1–19, <a href="https://doi.org/10.5194/nhess-23-1-2023" target="_blank">https://doi.org/10.5194/nhess-23-1-2023</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>46</label><mixed-citation>
      
Saksena, S., Dey, S., Merwade, V., and Singhofen, P. J.: A Computationally
Efficient and Physically Based Approach for Urban Flood Modeling Using a
Flexible Spatiotemporal Structure, Water Resources Research, 56,
e2019WR025769, <a href="https://doi.org/10.1029/2019WR025769" target="_blank">https://doi.org/10.1029/2019WR025769</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>47</label><mixed-citation>
      
Samadi, S., Pourreza-Bilondi, M., Wilson, C. A. M. E., and Hitchcock, D. B.:
Bayesian Model Averaging With Fixed and Flexible Priors: Theory, Concepts,
and Calibration Experiments for Rainfall-Runoff Modeling, Journal of
Advances in Modeling Earth Systems, 12, e2019MS001924,
<a href="https://doi.org/10.1029/2019MS001924" target="_blank">https://doi.org/10.1029/2019MS001924</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>48</label><mixed-citation>
      
Samadi, V., Fowler, H. J., Lamond, J., Wagener, T., Brunner, M., Gourley,
J., Moradkhani, H., Popescu, I., Wasko, C., Wright, D., Wu, H., Zhang, K.,
Arias, P. A., Duan, Q., Nazemi, A., van Oevelen, P. J., Prein, A. F.,
Roundy, J. K., Saberian, M., and Umutoni, L.: The Needs, Challenges, and
Priorities for Advancing Global Flood Research, WIREs Water, 12, e70026,
<a href="https://doi.org/10.1002/wat2.70026" target="_blank">https://doi.org/10.1002/wat2.70026</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>49</label><mixed-citation>
      
WCCB Charlotte's CW: Widespread Flooding After Severe Storms,
<a href="https://www.wccbcharlotte.com/2020/02/08/widespread-flooding-after-severe-storms/" target="_blank"/> (last access: 5 June 2024),
2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>50</label><mixed-citation>
      
Saberian, M. and Samadi, V.: Probabilistic Hierarchical Interpolation and Interpretable Configuration for Flood Prediction, Zenodo [data set], <a href="https://doi.org/10.5281/zenodo.13343364" target="_blank">https://doi.org/10.5281/zenodo.13343364</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>51</label><mixed-citation>
      
Sukovich, E. M., Ralph, F. M., Barthold, F. E., Reynolds, D. W., and Novak,
D. R.: Extreme Quantitative Precipitation Forecast Performance at the
Weather Prediction Center from 2001 to 2011, Weather and Forecasting, 29,
894–911, <a href="https://doi.org/10.1175/WAF-D-13-00061.1" target="_blank">https://doi.org/10.1175/WAF-D-13-00061.1</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>52</label><mixed-citation>
      
Tabas, S. S. and Samadi, S.: Variational Bayesian dropout with a Gaussian
prior for recurrent neural networks application in rainfall–runoff
modeling, Environ. Res. Lett., 17, 065012,
<a href="https://doi.org/10.1088/1748-9326/ac7247" target="_blank">https://doi.org/10.1088/1748-9326/ac7247</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>53</label><mixed-citation>
      
Thompson, C. M. and Frazier, T. G.: Deterministic and probabilistic flood
modeling for contemporary and future coastal and inland precipitation
inundation, Applied Geography, 50, 1–14,
<a href="https://doi.org/10.1016/j.apgeog.2014.01.013" target="_blank">https://doi.org/10.1016/j.apgeog.2014.01.013</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>54</label><mixed-citation>
      
Tiwari, M. K. and Chatterjee, C.: Development of an accurate and reliable
hourly flood forecasting model using wavelet-bootstrap-ANN (WBANN) hybrid
approach, Journal of Hydrology, 394, 458–470,
<a href="https://doi.org/10.1016/j.jhydrol.2010.10.001" target="_blank">https://doi.org/10.1016/j.jhydrol.2010.10.001</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>55</label><mixed-citation>
      
UNISDR: Making development sustainable: The future of disaster risk
management, Global Assessment Report on Disaster Risk Reduction, Geneva,
Switzerland, United Nations Office for Disaster Risk Reduction (UNISDR), ISBN 9789211320428,
2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>56</label><mixed-citation>
      
US EPA: Watershed Report, Office of Water, <a href="https://watersgeo.epa.gov/watershedreport/?comid=9224629" target="_blank"/>, last access: 5 June 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>57</label><mixed-citation>
      
Wee, G., Chang, L.-C., Chang, F.-J., and Mat Amin, M. Z.: A flood
Impact-Based forecasting system by fuzzy inference techniques, Journal of
Hydrology, 625, 130117, <a href="https://doi.org/10.1016/j.jhydrol.2023.130117" target="_blank">https://doi.org/10.1016/j.jhydrol.2023.130117</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>58</label><mixed-citation>
      
Windheuser, L., Karanjit, R., Pally, R., Samadi, S., and Hubig, N. C.: An
End-To-End Flood Stage Prediction System Using Deep Neural Networks, Earth
and Space Science, 10, e2022EA002385, <a href="https://doi.org/10.1029/2022EA002385" target="_blank">https://doi.org/10.1029/2022EA002385</a>,
2023.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>59</label><mixed-citation>
      
Zafarmomen, N. and Samadi, V.: Can large language models effectively reason
about adverse weather conditions?, Environmental Modelling &amp; Software,
188, 106421, <a href="https://doi.org/10.1016/j.envsoft.2025.106421" target="_blank">https://doi.org/10.1016/j.envsoft.2025.106421</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>60</label><mixed-citation>
      
Zafarmomen, N., Alizadeh, H., Bayat, M., Ehtiat, M., and Moradkhani, H.:
Assimilation of Sentinel-Based Leaf Area Index for Modeling Surface-Ground
Water Interactions in Irrigation Districts, Water Resources Research, 60,
e2023WR036080, <a href="https://doi.org/10.1029/2023WR036080" target="_blank">https://doi.org/10.1029/2023WR036080</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>61</label><mixed-citation>
      
Zhang, L., Qin, H., Mao, J., Cao, X., and Fu, G.: High temporal resolution
urban flood prediction using attention-based LSTM models, Journal of
Hydrology, 620, 129499, <a href="https://doi.org/10.1016/j.jhydrol.2023.129499" target="_blank">https://doi.org/10.1016/j.jhydrol.2023.129499</a>,
2023a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>62</label><mixed-citation>
      
Zhang, Y., Pan, D., Griensven, J. V., Yang, S. X., and Gharabaghi, B.:
Intelligent flood forecasting and warning: a survey, Intelligence &amp; Robotics, 3, 190–212,
<a href="https://doi.org/10.20517/ir.2023.12" target="_blank">https://doi.org/10.20517/ir.2023.12</a>, 2023b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>63</label><mixed-citation>
      
Zou, Y., Wang, J., Lei, P., and Li, Y.: A novel multi-step ahead forecasting
model for flood based on time residual LSTM, Journal of Hydrology, 620,
129521, <a href="https://doi.org/10.1016/j.jhydrol.2023.129521" target="_blank">https://doi.org/10.1016/j.jhydrol.2023.129521</a>, 2023.

    </mixed-citation></ref-html>--></article>
