<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">HESS</journal-id><journal-title-group>
    <journal-title>Hydrology and Earth System Sciences</journal-title>
    <abbrev-journal-title abbrev-type="publisher">HESS</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Hydrol. Earth Syst. Sci.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1607-7938</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/hess-30-3145-2026</article-id><title-group><article-title>A multi-chain surrogate-assisted hybrid optimization framework for joint identification of groundwater contaminant sources and hydrogeological parameters</article-title><alt-title>A multi-chain surrogate-assisted hybrid optimization framework</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2 aff3">
          <name><surname>Wu</surname><given-names>Mengtian</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff4 aff5">
          <name><surname>Huang</surname><given-names>Xuan</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff6">
          <name><surname>Xu</surname><given-names>Pengcheng</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Chen</surname><given-names>Han</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Yang</surname><given-names>Xu</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Xu</surname><given-names>Jin</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="yes" rid="aff1 aff2 aff3">
          <name><surname>Duan</surname><given-names>Qingyun</given-names></name>
          <email>qyduan@hhu.edu.cn</email>
        <ext-link>https://orcid.org/0000-0001-9955-1512</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>National Key Laboratory of Water Disaster Prevention, Hohai University, Nanjing, China</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>College of Hydrology and Water Resources, Hohai University, Nanjing, China</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>China Meteorological Administration Hydro-Meteorology Key Laboratory, Hohai University, Nanjing, China</institution>
        </aff>
        <aff id="aff4"><label>4</label><institution>Nanjing Hydraulic Research Institute, Nanjing, China</institution>
        </aff>
        <aff id="aff5"><label>5</label><institution>Technology Innovation Center for Green Ecological Conservation and Restoration of Yangtze River Delta Rivers and Lakes, Ministry of Water Resources, Shanghai, China</institution>
        </aff>
        <aff id="aff6"><label>6</label><institution>Macau Environmental Research Institute, Faculty of Innovation Engineering, Macau University of Science and Technology, Macau, China</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Qingyun Duan (qyduan@hhu.edu.cn)</corresp></author-notes><pub-date><day>21</day><month>May</month><year>2026</year></pub-date>
      
      <volume>30</volume>
      <issue>10</issue>
      <fpage>3145</fpage><lpage>3163</lpage>
      <history>
        <date date-type="received"><day>9</day><month>December</month><year>2025</year></date>
           <date date-type="rev-request"><day>18</day><month>December</month><year>2025</year></date>
           <date date-type="rev-recd"><day>14</day><month>April</month><year>2026</year></date>
           <date date-type="accepted"><day>29</day><month>April</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Mengtian Wu et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026.html">This article is available from https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026.html</self-uri><self-uri xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026.pdf">The full text article is available as a PDF file from https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e169">Rapid and accurate identification of groundwater contaminant information and hydrogeological parameters is crucial for effective groundwater remediation and risk management. Within a simulation–optimization framework, this task is inherently posed as a mixed-variable optimization problem involving discrete parameters (e.g., source locations) and continuous ones (e.g., hydraulic heads, conductivities, and release fluxes). However, several challenges arise in this context. First, conventional optimization algorithms often exhibit slow convergence and unstable performance. Second, they typically require thousands of simulations to adequately explore the complex parameter space, resulting in prohibitive computational costs. To address these issues, this study develops a surrogate-assisted hybrid algorithm that integrates the Cooperative Search Algorithm (CSA) and Tabu Search (TS) within a synergistic multi-chain optimization framework, termed SA-CSA-TS. In each iteration, individual chains first perform independent CSA-based optimization to promote broad global exploration, after which they collaboratively refine source locations through a neighbourhood search guided by a shared tabu list. In addition, surrogate models equipped with a reconstruction strategy partially replace groundwater simulations, thereby substantially reducing the computational burden. Case studies reveal that the Radial Basis Function (RBF) outperforms other mainstream surrogate models in both accuracy and stability. Furthermore, comparative experiments confirm that the proposed SA-CSA-TS framework not only achieves higher solution accuracy but also significantly reduces computational demand, demonstrating strong potential for efficient groundwater contamination diagnosis.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>National Natural Science Foundation of China</funding-source>
<award-id>42101046</award-id>
<award-id>W2431029</award-id>
</award-group>
<award-group id="gs2">
<funding-source>National Key Research and Development Program of China</funding-source>
<award-id>2021YFC3201102</award-id>
</award-group>
<award-group id="gs3">
<funding-source>Ministry of Water Resources</funding-source>
<award-id>SKS-2022001</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e181">Groundwater contamination has become an increasingly critical issue, posing significant risks to environmental safety and public health (Gorelick and Zheng, 2015; Li et al., 2021a; Agbotui et al., 2025). Effective groundwater remediation requires rapid and accurate identification of contaminant source parameters (Bai and Tahmasebi, 2022; Mahar and Datta, 2001; Zhao et al., 2016). However, due to the invisibility of groundwater systems and sparse monitoring (Mirghani et al., 2009), source information cannot always be obtained directly. Instead, it must be inferred from observations, typically within a simulation–optimization (S–O) framework (Singh, 2015).</p>
      <p id="d2e184">Within the S–O framework, simulation models such as MODFLOW, MT3DMS, and FEFLOW are employed to describe the spatial and temporal evolution of contaminant plumes (Delshad et al., 1996; Harbaugh, 2005; Zheng and Wang, 1999). The quality of candidate parameter sets is evaluated through performance metrics (e.g., NSE and RMSE) that measure the discrepancy between simulated and observed data. Optimization algorithms then iteratively adjust these parameters to minimize the selected metrics, thereby identifying the most probable parameter values. Common algorithms, including Genetic Algorithm (GA) (Ayvaz and Elci, 2018; Singh and Datta, 2006), Particle Swarm Optimization (PSO) (Meenal and Eldho, 2012; Pan et al., 2023), and Simulated Annealing (Jha and Datta, 2013), have demonstrated considerable success in groundwater contamination source identification (GCSI) (Swetha et al., 2025). Consequently, the S–O framework incorporating these groundwater models and algorithms has been widely adopted in groundwater contamination studies (Guneshwor et al., 2018).</p>
      <p id="d2e188">Despite these advantages, the S–O framework still faces several challenges that hinder its accuracy and computational efficiency (Wu et al., 2022b).  For instance, real-world GCSI often requires identifying source locations, which inherently transforms the task into a mixed-variable problem (Li et al., 2023). Such problems involve the simultaneous estimation of both discrete parameters (e.g., source locations) and continuous parameters (e.g., time-dependent contaminant release rates) (Wang et al., 2024).  However, many existing optimization algorithms handle discrete variables through simple conversion techniques, such as binary encoding, grid-based discretization, or rounding schemes. These treatments can introduce approximation errors or impose artificial constraints, ultimately reducing solution quality. In addition, the mixed-variable structure produces highly complex, discontinuous, and multimodal objective landscapes. As a result, algorithms are more likely to converge prematurely to local optima (Chang et al., 2021).</p>
      <p id="d2e191">For these reasons, some studies have introduced new or hybrid algorithms.  For instance, Flying Foxes Optimization (FFO) has demonstrated superior search efficiency and accuracy in groundwater problems (Li et al., 2023).  Similarly, the hybrid GA-PSO algorithm (Wang et al., 2015) improves performance by combining the global exploration capabilities of GA with the fast convergence of PSO, while Li et al. (2021b) also propose a Hybrid Homotopy-Genetic Algorithm. However, most of these approaches adopt a simultaneous optimization strategy that treats source locations and release rates as equivalent variables. In practice, this assumption oversimplifies the physical reality of groundwater transport. Source locations typically exert a dominant influence because they determine the transport pathways and the geometry of the plume. In contrast, release rates and hydrogeological parameters mainly scale the concentration magnitudes. This sensitivity disparity creates a multimodal response surface, where multiple location combinations can reproduce sparse field observations with similar accuracy.  This characteristic significantly increases the risk of premature convergence and may lead to the misidentification of critical source information.</p>
      <p id="d2e195">The computational burden associated with GCSI cannot be ignored, as optimization algorithms often require thousands of simulations to adequately explore the parameter space (Razavi et al., 2012; Asher et al., 2015; Ouyang et al., 2017). This intensive demand severely limits practical applications, particularly for complex or large-scale groundwater simulation models (Song et al., 2019). In this context, surrogate modelling, as a data-driven technique, has become a widely adopted choice (Song et al., 2018). By approximating the behaviour of high-fidelity groundwater models, surrogate models can enable more efficient and feasible source identification. Common surrogate models include Kriging, Gaussian Process (Rasmussen and Williams, 2006), Support Vector Regression (Chang and Lin, 2011), Radial Basis Function (Broomhead and Lowe, 1988), and ensembles of these models (Xing et al., 2019; Yin and Tsai, 2020; Zhu et al., 2024). However, most existing studies still select surrogate models based primarily on empirical preference, and few have systematically evaluated or compared their performance and suitability for groundwater systems (Hou and Lu, 2018; Wu et al., 2022a; Luo et al., 2025). To address this gap, the present study conducts a comprehensive comparison of mainstream surrogate models and identifies the most effective one for GCSI.</p>
      <p id="d2e198">Overall, this study proposes a multi-chain surrogate-assisted hybrid optimization framework, termed SA-CSA-TS. The framework adopts a multi-chain structure operating across two distinct optimization stages. In the first stage, individual chains execute CSA-based optimization to enhance global exploration, with well-trained surrogate models replacing time-consuming groundwater simulations. In the second stage, chains collaboratively refine source locations through a neighbourhood search guided by a shared tabu list. This cooperative strategy enables efficient identification of source positions that control the contaminant plume distribution. To support the framework, several surrogate models are evaluated, and the RBF model is found to provide the most accurate approximation for groundwater applications. Case studies show that SA-CSA-TS can reduce computational cost by up to 85 %–88 % while achieving higher identification accuracy than conventional algorithms. These results demonstrate the efficiency and reliability of the proposed framework and offer valuable insights for groundwater contamination remediation.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Methodology</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Groundwater simulation</title>
      <p id="d2e216">For groundwater contamination source identification (GCSI), this study adopts a simulation–optimization framework (Mahar and Datta, 2001). There are various effective simulation techniques available for groundwater modelling. In this study, MODFLOW 6, including the Groundwater Flow and Groundwater Transport models (Hughes et al., 2017; Langevin et al., 2022), is adopted to simulate groundwater flow and pollutant transport, facilitated by the Python package FloPy (Bakker et al., 2016), which provides a convenient and flexible interface for model construction and execution. The governing partial differential equation for transient flow in a two-dimensional aquifer system can be given as follows:

            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M1" display="block"><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mo>∂</mml:mo><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mi>W</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>S</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula>

          where <inline-formula><mml:math id="M2" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> denotes the hydraulic conductivity, <inline-formula><mml:math id="M3" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>; <inline-formula><mml:math id="M4" display="inline"><mml:mi>h</mml:mi></mml:math></inline-formula> denotes the hydraulic head, <inline-formula><mml:math id="M5" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula>; <inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are the coordinates along the axis, <inline-formula><mml:math id="M8" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula>; <inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the specific storage of the porous material; and <inline-formula><mml:math id="M10" display="inline"><mml:mi>W</mml:mi></mml:math></inline-formula> is the volumetric flux per unit area.</p>
      <p id="d2e384">Solute transport can be described by the following advection-dispersion-reaction equation under known hydrogeological conditions:

            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M11" display="block"><mml:mtable class="aligned" rowspacing="0.2ex" columnspacing="1em" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mo>∂</mml:mo><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mfenced open="(" close=")"><mml:mrow><mml:mi mathvariant="italic">θ</mml:mi><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:msup><mml:mi>C</mml:mi><mml:mi>k</mml:mi></mml:msup></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mo>∂</mml:mo><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mfenced close=")" open="("><mml:mrow><mml:mi mathvariant="italic">θ</mml:mi><mml:msub><mml:mi>v</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msup><mml:mi>C</mml:mi><mml:mi>k</mml:mi></mml:msup></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mi>s</mml:mi></mml:msub><mml:msubsup><mml:mi>C</mml:mi><mml:mi>k</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msubsup><mml:mo>+</mml:mo><mml:mo movablelimits="false">∑</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mspace width="1em" linebreak="nobreak"/><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="italic">θ</mml:mi><mml:msup><mml:mi>C</mml:mi><mml:mi>k</mml:mi></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math id="M12" display="inline"><mml:mi mathvariant="italic">θ</mml:mi></mml:math></inline-formula> is effective porosity; <inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:msup><mml:mi>C</mml:mi><mml:mi>k</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> is the dissolved concentration of the species <inline-formula><mml:math id="M14" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M15" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mg</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">L</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>; <inline-formula><mml:math id="M16" display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is the dispersion coefficient tensor, <inline-formula><mml:math id="M17" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>; <inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:msub><mml:mi>v</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the linear pore water velocity, <inline-formula><mml:math id="M19" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>; <inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>s</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the volumetric flow rate per unit volume, representing sources or sinks; <inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:msubsup><mml:mi>C</mml:mi><mml:mi>k</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> is the source or sink concentration of species <inline-formula><mml:math id="M22" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M23" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">L</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>; <inline-formula><mml:math id="M24" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the chemical reaction term, <inline-formula><mml:math id="M25" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">L</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>UQPyL</title>
      <p id="d2e722">UQPyL (<uri>http://www.uq-pyl.com</uri>, last access: 15 May 2026) is a Python package developed by our team to support uncertainty quantification and optimization in computational modelling. The package integrates a comprehensive set of tools, including sampling techniques, surrogate modelling, parameter analysis methods, and global as well as hybrid optimization algorithms. Its modular and extensible design enables users to flexibly combine different components, facilitating rapid prototyping and testing of new algorithms. Moreover, UQPyL includes a default interface to couple external numerical simulators, making it suitable for computationally intensive applications such as groundwater modelling. In this study, UQPyL provides the foundation for implementing the proposed SA-CSA-TS algorithm, conducting surrogate-model comparison experiments, and ensuring a consistent environment for benchmarking different optimization methods.</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Overview of the proposed algorithm</title>
      <p id="d2e736">This study develops a surrogate-assisted hybrid optimization algorithm, SA-CSA-TS, built upon a multi-chain framework in which each chain iteratively performs a two-stage search. Global exploration is conducted using the Cooperative Search Algorithm (CSA), followed by local refinement using Tabu Search (TS). To reduce dependence on computationally expensive groundwater simulations, surrogate models with dynamic reconstruction are embedded into both stages. In addition, designed inter-chain communication enables the exchange of evaluated samples, enhancing data diversity and improving surrogate accuracy.</p>
      <p id="d2e739">Figure 1 illustrates the overall workflow. The process begins with initial sampling, and the groundwater model is used to evaluate these samples to initialize the chain archive <inline-formula><mml:math id="M26" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula>. After that, the algorithm enters the multi-chain optimization phase.</p>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e751">Overall framework of SA-CSA-TS.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f01.png"/>

        </fig>

      <p id="d2e761">During each iteration, surrogate models are at first constructed. The key feature is synergistic learning, where each chain builds its surrogates not only from its own history but also from the evaluated solutions shared by other chains (see the red arrows in Fig. 1). In the first stage, each chain independently performs CSA under the guidance of surrogates to explore the global search space. The best individual from each chain is then evaluated using the groundwater simulator and used to update <inline-formula><mml:math id="M27" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula>. Local refinement is performed in the second stage. Before activating TS, the surrogate models are reconstructed using all newly obtained evaluations. TS subsequently explores neighbourhood solutions through multiple-move operators, and cooperation among chains is realized via a shared tabu list, which prevents redundant searches and promotes effective diversification. Surrogates continue to pre-screen candidate solutions, and only the most promising candidate from each chain is evaluated with the groundwater model. This iterative process continues until the predefined maximum evaluations of groundwater model <inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:msub><mml:mtext>FE</mml:mtext><mml:mtext>max</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is reached.</p>
      <p id="d2e782"><inline-graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-g01.png"/></p>
      <p id="d2e790">In summary, SA-CSA-TS enhances GCSI efficiency through three integrated mechanisms. First, the multi-chain framework enables synergistic learning by sharing evaluated information across chains. Second, the sequential deployment of CSA and TS provides a strong balance between global exploration and local intensification. Finally, surrogate models with dynamic reconstruction reduce computational burden while preserving high-fidelity prediction accuracy to guide the search effectively.</p>
      <p id="d2e793">For clarity, the pseudocode of SA-CSA-TS is also provided in Algorithm 1.</p>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e798">Workflow of solution evaluation with simulation or surrogate models.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f02.png"/>

        </fig>

</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Surrogate modelling in SA-CSA-TS</title>
      <p id="d2e815">To alleviate the computational burden of repeated groundwater simulations, surrogate modelling is embedded into the proposed SA-CSA-TS framework. In GCSI, candidate parameters should be evaluated by the groundwater simulator to quantify the mismatch between simulated and observed concentrations at the monitoring wells (see the dashed line of Fig. 2). However, the entire optimization process typically requires thousands of forward simulations. To alleviate the computational demand, the SA-CSA-TS incorporates a surrogate modelling technique (see the solid line of Fig. 2). In this study, four commonly used surrogate models, namely Kriging (KRG), Gaussian Process (GP), Support Vector Regression (SVR), and Radial Basis Function (RBF), are considered as candidate approximators; detailed theoretical backgrounds of these models are available in Lophaven et al. (2002), Rasmussen and Williams (2006), Smola and Schölkopf (2004), and Buhmann (2003), respectively.  Although KRG and GP are theoretically closely related, both are considered in this study because differences in practical implementation and hyperparameter estimation may still lead to different predictive performance. All four models are implemented in UQPyL, and their predictive performance is compared in Sect. 4. Rather than treating surrogate modelling as an independent component, the present study embeds it directly into the optimization workflow so that surrogate predictions can guide both the global exploration and the local refinement stage of SA-CSA-TS.</p>
      <p id="d2e818">As illustrated in Fig. 2, a set of surrogate models is constructed to estimate the discrepancy (e.g., RMSE or <inline-formula><mml:math id="M29" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) between simulated and observed concentrations at each monitoring well. Therefore, the number of surrogate models equals the number of observation wells. During optimization, these surrogates substitute for repeated groundwater simulations and provide rapid approximations of the error. The predicted discrepancies across all wells are then aggregated, and their sum is adopted as the overall objective function, guiding the evaluation of candidate parameters and the subsequent optimization.</p>
</sec>
<sec id="Ch1.S2.SS5">
  <label>2.5</label><title>Global exploration via surrogate-assisted CSA</title>
      <p id="d2e841">In SA-CSA-TS, the first stage focuses on global exploration, where each chain independently executes the Cooperative Search Algorithm (CSA) with surrogate-based fitness evaluation. The CSA, proposed by Feng et al. (2021), is a population-based metaheuristic inspired by cooperative behaviours in social systems. Previous studies (Feng et al., 2022, 2024) have already shown its feasibility in related water-resources and hydrological applications, including cascade reservoir operation, discharge simulation, streamflow and flood forecasting. Here, CSA is adopted for GCSI because it emphasizes team communication, reflective learning and internal competition among individuals. These mechanisms are well suited to the high-dimensional, nonlinear, and potentially multimodal nature of the inverse problem, and are expected to identify promising regions for subsequent local refinement.</p>
      <p id="d2e844">In CSA, a population of candidate solutions <inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msubsup><mml:mo mathvariant="italic">}</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> is initially generated. During the optimization process, individuals improve their positions by learning from others within the population. For example, at iteration <inline-formula><mml:math id="M31" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, the update of the <inline-formula><mml:math id="M32" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th individual typically follows a team communication rule: 

            <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M33" display="block"><mml:mtable rowspacing="0.2ex" columnspacing="1em" class="aligned" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msubsup><mml:mi>u</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mi>A</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mi>B</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mi>C</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msubsup></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msubsup><mml:mi>A</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mi>log⁡</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:mfenced close=")" open="("><mml:mrow><mml:msubsup><mml:mi>g</mml:mi><mml:mtext>ind</mml:mtext><mml:mi>t</mml:mi></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msubsup><mml:mi>B</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>⋅</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:mo>(</mml:mo><mml:msup><mml:mtext>gm</mml:mtext><mml:mi>t</mml:mi></mml:msup><mml:mo>-</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msubsup><mml:mi>C</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msubsup><mml:mo>=</mml:mo><mml:mi mathvariant="italic">β</mml:mi><mml:mo>⋅</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:mfenced close=")" open="("><mml:mrow><mml:msup><mml:mtext>pm</mml:mtext><mml:mi>t</mml:mi></mml:msup><mml:mo>-</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math id="M34" display="inline"><mml:mrow><mml:msubsup><mml:mi>A</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:msubsup><mml:mi>B</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:msubsup><mml:mi>C</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> denote the knowledge components from the chairman, board of directors, and board of supervisors, respectively. <inline-formula><mml:math id="M37" display="inline"><mml:mrow><mml:msubsup><mml:mi>g</mml:mi><mml:mtext>ind</mml:mtext><mml:mi>t</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> is the indth global best individual at iteration <inline-formula><mml:math id="M38" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>. The <inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:msup><mml:mtext>gm</mml:mtext><mml:mi>t</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> represents the mean position of the top <inline-formula><mml:math id="M40" display="inline"><mml:mi>M</mml:mi></mml:math></inline-formula> global best individuals. The <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:msup><mml:mtext>pm</mml:mtext><mml:mi>t</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> is the mean position of the <inline-formula><mml:math id="M42" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th personal best individual. In addition to this team-communication update, CSA also employs reflective learning and internal competition to maintain population diversity and retain superior individuals; other detailed algorithmic formulations can be found in Feng et al. (2021).</p>
      <p id="d2e1205">In the proposed algorithm, CSA is embedded in a surrogate-assisted manner.  As illustrated in Fig. 3, the objective values of candidate solutions are predicted by the trained surrogate models instead of being repeatedly evaluated by the computationally expensive groundwater simulator. This substitution substantially improves the efficiency of the global exploration stage. The superior solutions generated by CSA are then used to update the current position of each chain (Line 08 in Algorithm 1). For comparison purposes, this surrogate-assisted CSA module is also implemented as a standalone benchmark algorithm, denoted SA-CSA, so that the specific contributions of the multi-chain architecture and the subsequent Tabu Search can be explicitly assessed against the complete SA-CSA-TS framework.</p>

      <fig id="F3"><label>Figure 3</label><caption><p id="d2e1211">Workflow of surrogate-assisted CSA.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f03.png"/>

        </fig>

</sec>
<sec id="Ch1.S2.SS6">
  <label>2.6</label><title>Local refinement via surrogate-assisted TS</title>
      <p id="d2e1228">Following the global exploration stage, SA-CSA-TS performs local refinement using Tabu Search (TS). TS is a local-search metaheuristic characterized by adaptive memory and strategic neighbourhood exploration. Its key feature is the tabu list, which records recently visited solutions or attributes and prevents their immediate reconsideration, thereby reducing cycling and encouraging exploration of new regions. To avoid excessive restriction, TS also incorporates an aspiration mechanism, under which a tabu status can be relaxed if the corresponding move leads to an improved solution. These characteristics make TS well suited for refining promising regions identified in the preceding global exploration stage. In the context of GCSI, TS is particularly useful for structured exploration of discrete source-location configurations, thereby helping the algorithm escape local traps and identify more competitive solutions.</p>
      <p id="d2e1231">Unlike the previous stage, where CSA operates independently in each chain, the Tabu Search (TS) stage is executed under a coordinated multi-chain framework. In this design, all chains share a common tabu list, which serves as a collective memory to prevent any chain from revisiting previously explored regions. Since the discrete variable corresponds to the index of a potential contamination-source area, the tabu list is defined over this finite set, and its maximum size is equal to the total number of candidate source areas. The corresponding search mechanism is illustrated in Fig. 4. Guided by the retrained surrogate model, each chain explores its neighbourhood to identify promising candidates. As shown, the search trajectories are strictly constrained by the shared history, enabling the algorithm to better navigate multi-modal landscapes. For example, moves that enter tabu-listed areas (highlighted by red arrows) are prohibited. After selecting the most promising solutions, the algorithm performs simulation-based evaluations and subsequently updates the shared tabu list, thereby allowing dynamic information exchange among all chains.</p>

      <fig id="F4"><label>Figure 4</label><caption><p id="d2e1236">Diagram of multi-chain Tabu Search.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f04.png"/>

        </fig>

      <p id="d2e1246">We describe the rule for updating the tabu list. Let <inline-formula><mml:math id="M43" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> denote the current solution and its objective value of the <inline-formula><mml:math id="M45" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th chain, respectively, and let <inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mtext>best</mml:mtext><mml:mi>i</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> represent the historical best objective value recorded by that chain. The update mechanism consists of the following three cases:</p>
      <p id="d2e1291">If <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>&gt;</mml:mo><mml:msubsup><mml:mi>f</mml:mi><mml:mtext>best</mml:mtext><mml:mi>i</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, the discrete component of <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, denoted <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">d</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, is added to the tabu list <inline-formula><mml:math id="M50" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>, preventing the algorithm from revisiting this configuration in subsequent iterations.</p>
      <p id="d2e1345">If <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>&lt;</mml:mo><mml:msubsup><mml:mi>f</mml:mi><mml:mtext>best</mml:mtext><mml:mi>i</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">d</mml:mi></mml:msubsup><mml:mo>∈</mml:mo><mml:mi>T</mml:mi></mml:mrow></mml:math></inline-formula>, the tabu status of <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">d</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> is removed, allowing the algorithm to reconsider this configuration since a better solution has been found.</p>
      <p id="d2e1398">If <inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>&lt;</mml:mo><mml:msubsup><mml:mi>f</mml:mi><mml:mtext>best</mml:mtext><mml:mi>i</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="normal">d</mml:mi></mml:msubsup><mml:mo>∉</mml:mo><mml:mi>T</mml:mi></mml:mrow></mml:math></inline-formula>, both the best solution <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mtext>best</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> and the best objective <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mtext>best</mml:mtext><mml:mi>i</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> are updated accordingly.</p>

      <fig id="F5"><label>Figure 5</label><caption><p id="d2e1464">Schematic diagram in Case 1.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f05.png"/>

        </fig>

</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Case studies</title>
      <p id="d2e1482">To comprehensively evaluate the performance of the proposed SA-CSA-TS algorithm, three case studies are conducted. Cases 1 and 2 are hypothetical scenarios designed to compare the effectiveness of different surrogate models and to enable an in-depth examination of the internal behaviour of SA-CSA-TS. Case 3 involves a field-informed practical scenario, suitable for examining the applicability and robustness of SA-CSA-TS under realistic conditions.</p>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Case 1</title>
      <p id="d2e1492">The study area is a two-dimensional, homogeneous confined aquifer (<inline-formula><mml:math id="M58" display="inline"><mml:mrow><mml:mn mathvariant="normal">800</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">1200</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>), as illustrated in Fig. 5. The left and right boundaries are assigned constant hydraulic heads, and the remaining boundaries are treated as no-flow. For simulation, the domain is discretized into a grid of <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:mn mathvariant="normal">16</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">24</mml:mn></mml:mrow></mml:math></inline-formula> cells, with a uniform cell size of 50 <inline-formula><mml:math id="M60" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula>. The basic hydrogeological parameters used in this case are summarized in Table 1.</p>

<table-wrap id="T1"><label>Table 1</label><caption><p id="d2e1538">Basic values and ranges of hydrogeological parameters in Case 1.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="2">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Name</oasis:entry>
         <oasis:entry colname="col2">Value or range</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Hydraulic conductivity, <inline-formula><mml:math id="M61" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M62" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">15.0–35.0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Porosity, <inline-formula><mml:math id="M63" display="inline"><mml:mi mathvariant="italic">θ</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">0.25</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Longitudinal dispersity, <inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi mathvariant="normal">L</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M65" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">40.0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Transverse dispersity, <inline-formula><mml:math id="M66" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi mathvariant="normal">T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M67" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">15.0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Saturated thickness, <inline-formula><mml:math id="M68" display="inline"><mml:mi>b</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M69" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">20.0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Hydraulic head of the left boundary, <inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M71" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">40.0–50.0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Hydraulic head of the right boundary, <inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M73" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">30.0–40.0</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e1744">The potential contamination source zone, also shown in Fig. 5, represents an industrial area with intensive activities, where contaminants may be intermittently released into the aquifer. Within this zone, one or more contamination sources may exist. To capture solute transport behaviour and provide data for the inverse analysis, seven monitoring wells are distributed across the study area (the triangle in Fig. 5).</p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e1750">Distribution of contaminant plume in the 5th and 10th SPs of Case 1.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f06.png"/>

        </fig>

      <p id="d2e1759">In Case 1, a single contaminant source is considered. The total simulation time is 40 months, divided into 20 stress periods (SPs), with the source releasing contaminants only during the first five SPs. The true source location and its release fluxes for these five SPs are listed in Table S1 in the Supplement.  The contaminant plume distributions at the 5th and 10th SPs are shown in Fig. 6.</p>
      <p id="d2e1762">For this case, the parameters to be identified include: <list list-type="custom"><list-item><label>(a)</label>
      <p id="d2e1767">Hydrogeological parameters: The hydraulic conductivity (<inline-formula><mml:math id="M74" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula>) and the boundary head (<inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>). Their ranges are listed in Table 1;</p></list-item><list-item><label>(b)</label>
      <p id="d2e1800">Source-related parameters: The source locations (SI and SJ, where SI denotes the grid index in the <inline-formula><mml:math id="M77" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>-direction and SJ denotes the grid index in the <inline-formula><mml:math id="M78" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>-direction, respectively) and their time-varying release fluxes (<inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M80" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> denotes the index of the source, <inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>; and <inline-formula><mml:math id="M82" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> denotes the index of the stress period, <inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> to 5), with the value of each flux bounded between 0 and 100 <inline-formula><mml:math id="M84" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">kg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>.</p></list-item></list></p>

      <fig id="F7" specific-use="star"><label>Figure 7</label><caption><p id="d2e1891">Distribution of contaminant plume in the 5th and 10th SPs of Case 2.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f07.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Case 2</title>
      <p id="d2e1908">Case 2 adopts the same hydrogeological setting and numerical configuration as Case 1, but involves a more complex contamination scenario. In this case, three independent contaminant sources are introduced within the potential source zone. Their true locations and time-varying release fluxes are summarized in Table S2. The contaminant plume distribution at the 5th and 10th SPs is illustrated in Fig. 7.</p>
      <p id="d2e1911">Compared with Case 1, Case 2 presents a significantly higher level of complexity for surrogate modelling and optimization. The number of discrete variables associated with source locations increases from 2 to 6, and the total number of unknown parameters rises from 10 to 24 due to the introduction of additional sources and their time-varying release fluxes.</p>

      <fig id="F8" specific-use="star"><label>Figure 8</label><caption><p id="d2e1916">Overview of the research region in Case 3.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f08.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Case 3</title>
      <p id="d2e1933">This case study is designed as a realistic numerical experiment based on the hydrogeological conditions of a mining area in Henan Province, China. The study area covers approximately <inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:mn mathvariant="normal">2.67</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>. According to exploration-stage geological archives and field investigations, the aquifer is conceptualized as a single-layer unconfined system composed mainly of weathered and fractured granite, with an average saturated thickness of about 30 m. The underlying fresh granite is considered impermeable and therefore forms the basal boundary of the model. The groundwater flow system is represented by a two-dimensional single-layer numerical model. In plan view, the model domain is discretized using a structured grid with a uniform cell size of <inline-formula><mml:math id="M86" display="inline"><mml:mrow><mml:mn mathvariant="normal">30</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">30</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, and the irregular outer boundary is represented by active and inactive cells, as shown in Fig. 8. The rivers along the western and eastern margins are treated as constant-head boundaries, whereas the northern and southern margins are specified as no-flow boundaries because they are bounded by relatively intact, low-permeability fresh granite. Groundwater recharge occurs primarily through vertical infiltration of precipitation and is represented using an average annual precipitation of 650 mm and a recharge coefficient of 0.12.  To capture spatial heterogeneity, the aquifer is divided into four hydraulic-conductivity zones based on the exploration-stage geological archives: Zone I corresponds to alluvial sand and gravel near the riverbanks, Zones II and III represent highly weathered and moderately weathered granite, respectively, and Zone IV represents a localized tectonic fracture zone. The main hydrogeological parameters adopted in the model are summarized in Table 2.</p>

      <fig id="F9"><label>Figure 9</label><caption><p id="d2e1978">Concentration dataset at monitoring wells in Case 3.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f09.png"/>

        </fig>

<table-wrap id="T2"><label>Table 2</label><caption><p id="d2e1990">Basic settings of Case 3.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="2">
     <oasis:colspec colnum="1" colname="col1" align="justify" colwidth="58mm"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1" align="left">Name</oasis:entry>
         <oasis:entry colname="col2">Value or range</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1" align="left">Hydraulic conductivity of Zone I, <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mi>I</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M88" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">15.0–35.0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1" align="left">Hydraulic conductivity of Zone II, <inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mtext>II</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M90" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">10.0–25.0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1" align="left">Hydraulic conductivity of Zone III, <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mtext>III</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M92" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">5.0–15.0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1" align="left">Hydraulic conductivity of Zone IV, <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mtext>IV</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M94" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">20.0–45.0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1" align="left">Porosity, <inline-formula><mml:math id="M95" display="inline"><mml:mi mathvariant="italic">θ</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">0.3</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1" align="left">Longitudinal dispersity, <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi mathvariant="normal">L</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M97" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:mn mathvariant="normal">40.0</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1" align="left">Transverse dispersity, <inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi mathvariant="normal">T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M100" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">11.0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1" align="left">Saturated thickness, <inline-formula><mml:math id="M101" display="inline"><mml:mi>b</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M102" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">30.0</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1" align="left">Effective recharge rate, <inline-formula><mml:math id="M103" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M104" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:mn mathvariant="normal">2.14</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1" align="left">Hydraulic head of the left boundary, <inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M107" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">97.4</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1" align="left">Hydraulic head of the right boundary, <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M109" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">83.1</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e2364">A potential contaminant source region is delineated, as highlighted in pink in Fig. 8. Field investigations identify three waste-ore deposits (<inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>S</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>S</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) within this region. These sources continuously release contaminants into the groundwater during the first five stress periods (out of a total of ten). Nine observation wells are distributed across the study area to monitor contaminant migration, and Fig. 9 illustrates the temporal concentration dataset used for the inverse analysis over the stress periods.</p>
      <p id="d2e2392">In summary, the parameters to be identified include: (a) Hydrogeological parameters: The hydraulic conductivity (<inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M114" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mn mathvariant="normal">4</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>); (b) Source locations (<inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:msub><mml:mtext>SI</mml:mtext><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:msub><mml:mtext>SJ</mml:mtext><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula>) and their release fluxes (<inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M120" display="inline"><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> to 5), with the value of each flux bounded between 0 and 100 <inline-formula><mml:math id="M121" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">kg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> Their reference values are listed in Table S3.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Comparison of surrogate models</title>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Experiment setup</title>
      <p id="d2e2563">This study employs four commonly used surrogate models to investigate their performance in predicting the discrepancy between observed and simulated data for a given set of solutions: (a) Kriging (KRG); (b) Gaussian Process (GP); (c) Support Vector Regression (SVR); (d) Radial Basis Function (RBF).</p>
      <p id="d2e2566">To ensure a fair comparison, all surrogate models are constructed using UQPyL on a computer equipped with 12th Gen Intel(R) Core (TM) i5-12490F CPU, and 32.0 GB of RAM. Motivated by the cost–benefit perspective of surrogate tuning discussed by Ahrari and Verstraete (2023), only selected influential hyperparameters are tuned in this study using grid-search, whereas the remaining hyperparameters are retained at their default values in UQPyL. The tuned hyperparameters and their search ranges are summarized in Table S4.</p>
      <p id="d2e2569">For sample generation, Latin Hypercube Sampling (LHS) is used in Cases 1–3 to produce a set of parameter samples, which are subsequently input into the groundwater models to obtain contaminant concentrations. For each sample, the RMSE between the simulated and observed concentrations at all monitoring wells is calculated. RMSE is selected here because it provides a steeper and more informative gradient, which is advantageous for optimization. The generated parameter sets and their corresponding RMSE values constitute the full input–output datasets.</p>
      <p id="d2e2572">To evaluate model performance, four training datasets, denoted as DS1–DS4 with sample sizes of 100, 200, 300, and 500, respectively, are constructed.  An independent set of 50 samples is generated for testing.</p>

<table-wrap id="T3" specific-use="star"><label>Table 3</label><caption><p id="d2e2579">Ensemble prediction performance of four surrogate models.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="6">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Case</oasis:entry>
         <oasis:entry colname="col2">Surrogate</oasis:entry>
         <oasis:entry rowsep="1" namest="col3" nameend="col6" align="center">Dataset (<inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>/</mml:mo><mml:mtext>RMSE</mml:mtext></mml:mrow></mml:math></inline-formula>) </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">DS1</oasis:entry>
         <oasis:entry colname="col4">DS2</oasis:entry>
         <oasis:entry colname="col5">DS3</oasis:entry>
         <oasis:entry colname="col6">DS4</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Case 1</oasis:entry>
         <oasis:entry colname="col2">KRG</oasis:entry>
         <oasis:entry colname="col3">0.73/18.77</oasis:entry>
         <oasis:entry colname="col4">0.80/16.16</oasis:entry>
         <oasis:entry colname="col5">0.87/13.03</oasis:entry>
         <oasis:entry colname="col6">0.89/11.98</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">GP</oasis:entry>
         <oasis:entry colname="col3">0.68/20.44</oasis:entry>
         <oasis:entry colname="col4">0.78/16.95</oasis:entry>
         <oasis:entry colname="col5">0.90/11.43</oasis:entry>
         <oasis:entry colname="col6">0.91/10.84</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">SVR</oasis:entry>
         <oasis:entry colname="col3">0.46/26.55</oasis:entry>
         <oasis:entry colname="col4">0.54/24.50</oasis:entry>
         <oasis:entry colname="col5">0.72/19.12</oasis:entry>
         <oasis:entry colname="col6">0.75/18.07</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RBF</oasis:entry>
         <oasis:entry colname="col3">0.81/15.75</oasis:entry>
         <oasis:entry colname="col4">0.88/12.52</oasis:entry>
         <oasis:entry colname="col5">0.95/8.14</oasis:entry>
         <oasis:entry colname="col6">0.95/7.97</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Case 2</oasis:entry>
         <oasis:entry colname="col2">KRG</oasis:entry>
         <oasis:entry colname="col3">0.60/22.91</oasis:entry>
         <oasis:entry colname="col4">0.71/19.38</oasis:entry>
         <oasis:entry colname="col5">0.83/15.03</oasis:entry>
         <oasis:entry colname="col6">0.83/14.76</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">GP</oasis:entry>
         <oasis:entry colname="col3">0.55/24.16</oasis:entry>
         <oasis:entry colname="col4">0.74/18.50</oasis:entry>
         <oasis:entry colname="col5">0.82/14.52</oasis:entry>
         <oasis:entry colname="col6">0.85/13.87</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">SVR</oasis:entry>
         <oasis:entry colname="col3">0.35/29.22</oasis:entry>
         <oasis:entry colname="col4">0.47/26.18</oasis:entry>
         <oasis:entry colname="col5">0.62/22.35</oasis:entry>
         <oasis:entry colname="col6">0.64/21.57</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RBF</oasis:entry>
         <oasis:entry colname="col3">0.71/19.39</oasis:entry>
         <oasis:entry colname="col4">0.85/14.07</oasis:entry>
         <oasis:entry colname="col5">0.91/10.93</oasis:entry>
         <oasis:entry colname="col6">0.91/10.71</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Case 3</oasis:entry>
         <oasis:entry colname="col2">KRG</oasis:entry>
         <oasis:entry colname="col3">0.53/24.84</oasis:entry>
         <oasis:entry colname="col4">0.68/20.37</oasis:entry>
         <oasis:entry colname="col5">0.77/17.42</oasis:entry>
         <oasis:entry colname="col6">0.79/16.49</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">GP</oasis:entry>
         <oasis:entry colname="col3">0.45/26.74</oasis:entry>
         <oasis:entry colname="col4">0.65/21.48</oasis:entry>
         <oasis:entry colname="col5">0.80/16.24</oasis:entry>
         <oasis:entry colname="col6">0.81/15.68</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">SVR</oasis:entry>
         <oasis:entry colname="col3">0.30/30.33</oasis:entry>
         <oasis:entry colname="col4">0.37/28.57</oasis:entry>
         <oasis:entry colname="col5">0.46/26.64</oasis:entry>
         <oasis:entry colname="col6">0.48/25.98</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RBF</oasis:entry>
         <oasis:entry colname="col3">0.68/20.38</oasis:entry>
         <oasis:entry colname="col4">0.83/15.03</oasis:entry>
         <oasis:entry colname="col5">0.88/12.58</oasis:entry>
         <oasis:entry colname="col6">0.90/11.35</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Evaluation of surrogate models</title>
      <p id="d2e2918">As described earlier, SA-CSA-TS constructs individual surrogate models for each monitoring well, and the corresponding outputs are summed to derive an ensemble objective value for optimization. To evaluate the effectiveness of this approach, we first examine the ensemble prediction performance of four surrogate models across Cases 1–3, based on the coefficient of determination (<inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) and Root Mean Square Error (RMSE). The results are summarized in Table 3.</p>
      <p id="d2e2932">Across all datasets (DS1–DS4) and all three cases, RBF clearly delivers the most stable and accurate ensemble predictions. KRG and GP achieve acceptable accuracy, whereas SVR consistently performs the weakest. All models benefit from increasing training data. In comparison, RBF demonstrates a superior sensitivity to data enrichment, aligning well with the iterative reconstruction strategy of SA-CSA-TS. In Case 3, the prediction task becomes significantly more challenging due to more complex hydrogeological conditions, leading to lower <inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values for all models.  However, RBF still maintains robust predictive capability.</p>

      <fig id="F10" specific-use="star"><label>Figure 10</label><caption><p id="d2e2948">Prediction performance of surrogate models under dataset DS3 for two test cases: <bold>(a)</bold> Case 2 and <bold>(b)</bold> Case 3.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f10.png"/>

        </fig>

      <p id="d2e2964">Based on the ensemble results, Cases 2 and 3 under dataset DS3 are selected for detailed surrogate evaluation at the individual monitoring wells. These two cases represent more challenging prediction scenarios. In addition, DS3 provides a sufficiently informative training set, yielding a clear performance improvement over DS2, while the additional gain from DS3 to DS4 is marginal. Figure 10a illustrates the prediction performance for Case 2 using DS3. Accuracy varies substantially across monitoring wells, primarily due to the spatial distribution of the contaminant plume. Wells 1, 2, and 5 are located within the main plume body, where steep and highly nonlinear concentration gradients dominate. Consequently, all surrogate models except RBF show marked reductions in <inline-formula><mml:math id="M125" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> at these locations. In contrast, Wells 6 and 7 lie far from the plume centre, where concentration gradients are smooth, enabling all models to reach their highest performance. A similar trend is observed in Case 3 (see Fig. 10b). Wells situated in high-gradient zones (e.g., Wells 1, 2, 3, and 5) pose greater challenges, leading to noticeable performance declines for SVR, GP, and KRG.  In contrast, RBF consistently maintains strong performance across all monitoring wells.</p>

      <fig id="F11" specific-use="star"><label>Figure 11</label><caption><p id="d2e2980">Sample-wise predicted versus true values of surrogate models for two representative cases: <bold>(a)</bold> Case 2, Well-1, and <bold>(b)</bold> Case 3, Well-5. In each subfigure, the four models are KRG, GP, RBF, and SVR.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f11.png"/>

        </fig>

      <p id="d2e2995">Figure 11a and b presents the sample-wise predicted values at representative locations: Well 1 for Case 2 and Well 5 for Case 3. In both scenarios, RBF achieves the highest <inline-formula><mml:math id="M126" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> and lowest RMSE, followed by KRG, GP, and SVR. In Case 3, SVR fails to capture the nonlinearity of contaminant concentrations, with its predictions collapsing into a narrow range. For optimization applications, high fidelity in the low-value region of the response is particularly important, as deviations in this domain can significantly affect the quality of the optimal solution.  RBF provides more stable and accurate predictions in these low-value zones, further reinforcing its reliability as a surrogate model for optimization.</p>
      <p id="d2e3009">In addition to prediction accuracy, the computational cost of training is a critical consideration for SA-CSA-TS, which involves iterative surrogate reconstruction. Theoretically, GP and KRG are computationally intensive with a complexity of <inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:mi>O</mml:mi><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>⋅</mml:mo><mml:msup><mml:mi>N</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M128" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> denotes the number of iterations required by the construction algorithm. In contrast, RBF and SVR offer higher computational efficiency, with complexities of <inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:mi>O</mml:mi><mml:mo>(</mml:mo><mml:msup><mml:mi>N</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:mi>O</mml:mi><mml:mo>(</mml:mo><mml:msup><mml:mi>N</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>)</mml:mo><mml:mo>∼</mml:mo><mml:mi>O</mml:mi><mml:mo>(</mml:mo><mml:msup><mml:mi>N</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, respectively. This theoretical advantage is further supported by empirical results obtained using UQPyL on dataset DS4. In terms of actual training time, GP and KRG require approximately 1 and 4 s, whereas RBF and SVR significantly reduce the cost to 0.22 and 0.01 s.</p>
      <p id="d2e3087">In summary, RBF overcomes the precision limitations of SVR while avoiding the computational inefficiencies associated with KRG and GP. It thus provides the best balance between accuracy and efficiency, making it the most suitable surrogate model for the proposed SA-CSA-TS framework.</p>
</sec>
</sec>
<sec id="Ch1.S5">
  <label>5</label><title>Optimization</title>
<sec id="Ch1.S5.SS1">
  <label>5.1</label><title>Experiment setup</title>
      <p id="d2e3107">This section aims to investigate the performance of SA-CSA-TS in GCSI. For comparison, three additional optimization algorithms are considered: Genetic Algorithm (GA), Cooperative Search Algorithm (CSA) and SA-CSA. GA is widely used as a benchmark, whereas CSA represents a state-of-the-art method in recent years. SA-CSA is included to isolate and assess the contributions of the multi-chain framework and the Tabu Search. All algorithms are implemented within UQPyL to ensure a consistent and fair computational environment.</p>
      <p id="d2e3110">For the standard evolutionary algorithms (GA and CSA), the maximum number of simulations (<inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:msub><mml:mtext>FE</mml:mtext><mml:mtext>max</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>) and the population size <inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are set to 20 000 and 100. For GA, the user-defined parameters <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:msub><mml:mi>p</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M134" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">η</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:msub><mml:mi>p</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M136" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">η</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are set to 1, 20, <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi>D</mml:mi></mml:mrow></mml:math></inline-formula>, 20, respectively, where <inline-formula><mml:math id="M138" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> denotes the dimensionality of the problem. For CSA, the parameters are set as <inline-formula><mml:math id="M139" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.10</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:mi mathvariant="italic">β</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.15</mml:mn></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:mi>M</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e3235">For surrogate-assisted algorithms (SA-CSA-TS and SA-CSA), the RBF model is employed. The <inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:msub><mml:mtext>FE</mml:mtext><mml:mtext>max</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is reduced to 2000, as surrogate models enable efficient optimization with substantially fewer exact evaluations. Based on the results summarized in Table 3, the number of initial samples <inline-formula><mml:math id="M143" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for surrogate construction is set to 300. For SA-CSA-TS, the number of chains is set to <inline-formula><mml:math id="M144" display="inline"><mml:mrow><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e3272">For Cases 1–3, the optimization problem is formulated as:

            <disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M145" display="block"><mml:mtable class="aligned" columnspacing="1em" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mtext>minimize:</mml:mtext><mml:mspace linebreak="nobreak" width="1em"/><mml:mi>f</mml:mi><mml:mo>=</mml:mo><mml:msubsup><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>M</mml:mi></mml:msubsup><mml:mfenced close=")" open="("><mml:msqrt><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>T</mml:mi></mml:msubsup><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi>S</mml:mi><mml:mi>m</mml:mi><mml:mi>t</mml:mi></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mi>O</mml:mi><mml:mi>m</mml:mi><mml:mi>t</mml:mi></mml:msubsup></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mi>T</mml:mi></mml:mfrac></mml:mstyle></mml:msqrt></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtext>subject to:</mml:mtext><mml:mspace width="1em" linebreak="nobreak"/><mml:mtext>LB</mml:mtext><mml:mo>≤</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mi>H</mml:mi><mml:mo>,</mml:mo><mml:mi>K</mml:mi><mml:mo>,</mml:mo><mml:mtext>SI</mml:mtext><mml:mo>,</mml:mo><mml:mtext>SJ</mml:mtext><mml:mo>,</mml:mo><mml:mtext>SP</mml:mtext><mml:mo mathvariant="italic">}</mml:mo><mml:mo>≤</mml:mo><mml:mtext>UB</mml:mtext></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math id="M146" display="inline"><mml:mrow><mml:msubsup><mml:mi>S</mml:mi><mml:mi>m</mml:mi><mml:mi>t</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M147" display="inline"><mml:mrow><mml:msubsup><mml:mi>O</mml:mi><mml:mi>m</mml:mi><mml:mi>t</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> represent the simulated and observed concentrations at the <inline-formula><mml:math id="M148" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula>th monitoring well in stress period <inline-formula><mml:math id="M149" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, respectively. LB and UB are the lower and upper bounds of parameters to be estimated.</p>

      <fig id="F12" specific-use="star"><label>Figure 12</label><caption><p id="d2e3430">Convergence curves of four algorithms for three cases: <bold>(a)</bold> Case 1, <bold>(b)</bold> Case 2, and <bold>(c)</bold> Case 3.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f12.png"/>

        </fig>

</sec>
<sec id="Ch1.S5.SS2">
  <label>5.2</label><title>Optimization Results</title>
<sec id="Ch1.S5.SS2.SSS1">
  <label>5.2.1</label><title>Case 1</title>
      <p id="d2e3463">Figure 12a presents the convergence curves of the four algorithms in Case 1.  SA-CSA-TS achieves the best objective value (0.35) within only 2000 simulation runs, outperforming CSA, GA, and SA-CSA. As listed in Table 4, while all algorithms achieve satisfactory calibration for hydrogeological parameters, SA-CSA-TS is the only algorithm that consistently identifies the true source location <inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">5</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">9</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and release fluxes. This discrepancy highlights that the primary bottleneck lies in the discrete source search, where the proposed two-stage framework with Tabu Search effectively prevents the search chains from becoming trapped in local basins. Moreover, relative to conventional optimization approaches, the surrogate-assisted framework significantly reduces computational cost while maintaining high-quality solutions.</p>

<table-wrap id="T4" specific-use="star"><label>Table 4</label><caption><p id="d2e3485">Optimization results of all algorithms in Case 1.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="11">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right" colsep="1"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:colspec colnum="9" colname="col9" align="right"/>
     <oasis:colspec colnum="10" colname="col10" align="right"/>
     <oasis:colspec colnum="11" colname="col11" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Algorithms</oasis:entry>
         <oasis:entry colname="col2">Location</oasis:entry>
         <oasis:entry rowsep="1" namest="col3" nameend="col5" align="center" colsep="1">Hydrogeological parameters </oasis:entry>
         <oasis:entry rowsep="1" namest="col6" nameend="col10" align="center">Release fluxes (<inline-formula><mml:math id="M151" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">kg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>) </oasis:entry>
         <oasis:entry colname="col11">Objective</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(SI, SJ)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M153" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M154" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M155" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M157" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9"><inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mn mathvariant="normal">4</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col10"><inline-formula><mml:math id="M159" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mn mathvariant="normal">5</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col11">value</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">SA-CSA-TS</oasis:entry>
         <oasis:entry colname="col2">(5, 9)</oasis:entry>
         <oasis:entry colname="col3">42.3</oasis:entry>
         <oasis:entry colname="col4">35.1</oasis:entry>
         <oasis:entry colname="col5">18.3</oasis:entry>
         <oasis:entry colname="col6">20.7</oasis:entry>
         <oasis:entry colname="col7">51.7</oasis:entry>
         <oasis:entry colname="col8">13.1</oasis:entry>
         <oasis:entry colname="col9">41.6</oasis:entry>
         <oasis:entry colname="col10">23.8</oasis:entry>
         <oasis:entry colname="col11">0.35</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(0.9 %)</oasis:entry>
         <oasis:entry colname="col4">(0.5 %)</oasis:entry>
         <oasis:entry colname="col5">(1.1 %)</oasis:entry>
         <oasis:entry colname="col6">(3.3 %)</oasis:entry>
         <oasis:entry colname="col7">(1.0 %)</oasis:entry>
         <oasis:entry colname="col8">(3.0 %)</oasis:entry>
         <oasis:entry colname="col9">(2.2 %)</oasis:entry>
         <oasis:entry colname="col10">(3.9 %)</oasis:entry>
         <oasis:entry colname="col11"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SA-CSA</oasis:entry>
         <oasis:entry colname="col2">(4, 10)</oasis:entry>
         <oasis:entry colname="col3">43.2</oasis:entry>
         <oasis:entry colname="col4">35.7</oasis:entry>
         <oasis:entry colname="col5">17.8</oasis:entry>
         <oasis:entry colname="col6">19.1</oasis:entry>
         <oasis:entry colname="col7">49.6</oasis:entry>
         <oasis:entry colname="col8">12.2</oasis:entry>
         <oasis:entry colname="col9">43.5</oasis:entry>
         <oasis:entry colname="col10">21.6</oasis:entry>
         <oasis:entry colname="col11">11.38</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(1.2 %)</oasis:entry>
         <oasis:entry colname="col4">(1.1 %)</oasis:entry>
         <oasis:entry colname="col5">(1.7 %)</oasis:entry>
         <oasis:entry colname="col6">(8.9 %)</oasis:entry>
         <oasis:entry colname="col7">(4.7 %)</oasis:entry>
         <oasis:entry colname="col8">(6.4 %)</oasis:entry>
         <oasis:entry colname="col9">(8.8 %)</oasis:entry>
         <oasis:entry colname="col10">(2.1 %)</oasis:entry>
         <oasis:entry colname="col11"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">GA</oasis:entry>
         <oasis:entry colname="col2">(6, 8)</oasis:entry>
         <oasis:entry colname="col3">42.4</oasis:entry>
         <oasis:entry colname="col4">34.7</oasis:entry>
         <oasis:entry colname="col5">18.5</oasis:entry>
         <oasis:entry colname="col6">19.6</oasis:entry>
         <oasis:entry colname="col7">50.4</oasis:entry>
         <oasis:entry colname="col8">11.7</oasis:entry>
         <oasis:entry colname="col9">36.7</oasis:entry>
         <oasis:entry colname="col10">19.0</oasis:entry>
         <oasis:entry colname="col11">10.37</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(0.7 %)</oasis:entry>
         <oasis:entry colname="col4">(1.7 %)</oasis:entry>
         <oasis:entry colname="col5">(1.7 %)</oasis:entry>
         <oasis:entry colname="col6">(6.8 %)</oasis:entry>
         <oasis:entry colname="col7">(3.0 %)</oasis:entry>
         <oasis:entry colname="col8">(9.8 %)</oasis:entry>
         <oasis:entry colname="col9">(8.2 %)</oasis:entry>
         <oasis:entry colname="col10">(13.6 %)</oasis:entry>
         <oasis:entry colname="col11"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">CSA</oasis:entry>
         <oasis:entry colname="col2">(3, 7)</oasis:entry>
         <oasis:entry colname="col3">43.7</oasis:entry>
         <oasis:entry colname="col4">35.9</oasis:entry>
         <oasis:entry colname="col5">18.2</oasis:entry>
         <oasis:entry colname="col6">19.9</oasis:entry>
         <oasis:entry colname="col7">52.3</oasis:entry>
         <oasis:entry colname="col8">12.5</oasis:entry>
         <oasis:entry colname="col9">42.3</oasis:entry>
         <oasis:entry colname="col10">20.9</oasis:entry>
         <oasis:entry colname="col11">9.19</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(2.3 %)</oasis:entry>
         <oasis:entry colname="col4">(1.7 %)</oasis:entry>
         <oasis:entry colname="col5">(0.6 %)</oasis:entry>
         <oasis:entry colname="col6">(5.3 %)</oasis:entry>
         <oasis:entry colname="col7">(0.7 %)</oasis:entry>
         <oasis:entry colname="col8">(3.5 %)</oasis:entry>
         <oasis:entry colname="col9">(5.8 %)</oasis:entry>
         <oasis:entry colname="col10">(5.1 %)</oasis:entry>
         <oasis:entry colname="col11"/>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S5.SS2.SSS2">
  <label>5.2.2</label><title>Case 2</title>
      <p id="d2e3980">Compared to Case 1, Case 2 involves three contaminant sources and therefore requires more parameters to be identified. Figure 12b presents the convergence behaviour of all algorithms. SA-CSA-TS achieves the best objective value (1.29), followed by CSA (18.23), SA-CSA (21.38) and GA (22.85). SA-CSA-TS also converges much more rapidly, stabilizing within the first 1500 simulation runs.</p>

      <fig id="F13" specific-use="star"><label>Figure 13</label><caption><p id="d2e3985">Radar chart comparing the optimal solutions obtained by four algorithms: <bold>(a)</bold> Case2 and <bold>(b)</bold> Case3.</p></caption>
            <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f13.png"/>

          </fig>

      <p id="d2e4000">Figure 13a compares the optimal solutions obtained by all algorithms. Higher radial values indicate more accurate estimates, with 100 % denoting a perfect match to the true values. SA-CSA-TS encloses the largest area in the radar chart, indicating the highest overall estimation accuracy. While all algorithms provide satisfactory estimates of hydrogeological parameters, only SA-CSA-TS correctly identifies the three contaminant source locations (highlighted in red in Fig. 13a). Other algorithms exhibit noticeable deviations. Moreover, these incorrect source locations are accompanied by inaccurate release rates, suggesting that location errors are compensated by adjustments to other parameters, leading the search into local optima.  Overall, with the assistance of surrogate models and Tabu Search, SA-CSA-TS demonstrates a strong ability to avoid such local traps and to accurately resolve the multi-source identification problem under this more complex scenario.</p>
</sec>
<sec id="Ch1.S5.SS2.SSS3">
  <label>5.2.3</label><title>Case 3</title>
      <p id="d2e4011">Case 3 presents the most challenging optimization landscape due to the increased number of parameters and scenario complexity. As illustrated in Fig. 12c, the surrogate-assisted algorithms maintain a distinct efficiency advantage. In particular, SA-CSA-TS rapidly converges to the best solution within 2000 simulations, whereas GA and CSA stagnate at significantly higher objective values.</p>
      <p id="d2e4014">Figure 13b details the identification accuracy for specific parameters.  Consistent with previous cases, all algorithms estimate the hydrogeological parameters (<inline-formula><mml:math id="M160" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>K</mml:mi><mml:mn mathvariant="normal">4</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) with acceptable accuracy. However, a sharp performance divergence is observed in the source-related parameters: only SA-CSA-TS maintains high accuracy for the location variables (SI, SJ), while other algorithms exhibit substantial deviations. This failure to pinpoint source locations explains the stagnation observed in the other methods. Overall, Case 3 confirms that surrogate models effectively reduce computational cost, and that the multi-chain framework is indispensable for ensuring robustness and avoiding local optima in practical problems.</p>

      <fig id="F14" specific-use="star"><label>Figure 14</label><caption><p id="d2e4037">Runtime breakdown of all algorithms across three cases.</p></caption>
            <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f14.png"/>

          </fig>

</sec>
</sec>
</sec>
<sec id="Ch1.S6">
  <label>6</label><title>Discussion</title>
<sec id="Ch1.S6.SS1">
  <label>6.1</label><title>Effects of surrogate models</title>
      <p id="d2e4063">Surrogate models are incorporated into SA-CSA-TS to alleviate the computational burden of high-fidelity simulations. Figure 14 breaks down the runtime of all algorithms across the three cases into simulation time (blue) and algorithm time (red). It is evident that the simulation cost overwhelmingly dominates the total runtime. Although surrogate-assisted methods introduce a slight overhead for model construction and updating, this cost is negligible compared to the time savings achieved by reducing high-fidelity evaluations. Specifically, in three case studies, SA-CSA-TS reduces the total runtime by approximately 85 %–88 %, compared to the GA and CSA. This result confirms that the efficiency advantage of the surrogate-assisted framework becomes increasingly pronounced as the problem complexity grows.</p>

      <fig id="F15" specific-use="star"><label>Figure 15</label><caption><p id="d2e4068">Evolution of the prediction accuracy of the RBF model on the validation set during the optimization process.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f15.png"/>

        </fig>

      <p id="d2e4077">Given the negligible overhead of surrogate modelling, the effect of the iterative reconstruction strategy is further examined. Figure 15 tracks the evolution of prediction accuracy (<inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) of surrogate models on a validation set during optimization. In Case 1, the accuracy remains high and stable. In contrast, Cases 2 and 3 exhibit noticeable fluctuations. These oscillations are not indicators of failure but rather reflect the algorithm's active exploration of underrepresented regions. Driven by the Tabu Search mechanism, the optimizer periodically escapes local basins and enters unexplored areas where the surrogate model initially has lower accuracy. However, the subsequent recovery of <inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values confirms that the surrogate model successfully adapts to these new regions. Crucially, this dynamic updating process prevents the convergence stagnation observed in the other algorithms, ensuring that the search remains robust even in complex landscapes.</p>
</sec>
<sec id="Ch1.S6.SS2">
  <label>6.2</label><title>Effects of the multi-chain framework</title>
      <p id="d2e4110">Groundwater contaminant source identification is an inherently multi-modal optimization problem, where inaccurate location estimates may easily trap algorithms in inferior local solutions. As observed in Fig. 12a–c, GA, CSA and SA-CSA frequently exhibited instability and stagnation. This failure is largely attributed to their reliance on a single search population or trajectory, which lacks the mechanism to escape local basins. In contrast, SA-CSA-TS successfully identified the source information in all three cases.  To understand this mechanism, we examine the behaviour of the proposed multi-chain framework.</p>

      <fig id="F16"><label>Figure 16</label><caption><p id="d2e4115">Search-frequency maps of candidate source locations obtained by the multi-chain framework in <bold>(a)</bold> Case 1 and <bold>(b)</bold> Case 2. The red bars indicate the true source locations.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f16.png"/>

        </fig>

      <p id="d2e4130">Figure 16 depicts the search-frequency maps of candidate source locations by SA-CSA-TS for Case 1 and Case 2. In both scenarios, the true source locations (marked by red bars) correspond to the highest visit frequencies (red circles), indicating that the majority of chains consistently converge toward the correct region. Notably, the surrounding cells also exhibit high visit frequencies. This phenomenon confirms the parameter-compensation effect, where spatial inaccuracies are temporarily balanced by adjustments in release fluxes or hydraulic conductivity. This “equifinality” trap explains why conventional algorithms often stagnate near, but not exactly at, the true source. Furthermore, Case 2 displays more dispersed secondary hotspots than Case 1, reflecting a more rugged landscape with stronger compensability. Despite this complexity, the proposed framework successfully concentrates the search effort on the true location, demonstrating robust global convergence. Further improvement may be achieved in future work by incorporating optimal monitoring well placement to provide stronger spatial constraints and further reduce the parameter-compensation effect.</p>

      <fig id="F17" specific-use="star"><label>Figure 17</label><caption><p id="d2e4136">Distribution of search trajectories across ten chains for source coordinates in Case 3. The fitted curves highlight the multi-modal nature of the search landscape.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f17.png"/>

        </fig>

      <p id="d2e4145">Figure 17 provides a deeper insight by analysing the distribution of search trajectories across ten independent chains in Case 3. The histograms for the source coordinates (SI and SJ) reveal a distinct multi-modal distribution, confirming the existence of multiple local optima. While the majority of chains converge to the primary peak (the true source), a few chains (e.g., Chains 1, 2, and 10) are entrapped in secondary peaks.  Therefore, if a single-chain method (like standard GA or CSA) is used, and it happens to follow the trajectory of Chain 1, the identification would fail entirely. However, the multi-chain framework mitigates this risk by exploring multiple basins simultaneously. This collective intelligence allows the algorithm to filter out local optima and stabilize estimates around the true global solution, effectively overcoming the equifinality and multi-modality challenges that hinder conventional single-population methods.</p>
</sec>
<sec id="Ch1.S6.SS3">
  <label>6.3</label><title>Robustness analysis</title>
      <p id="d2e4156">To evaluate the robustness of SA-CSA-TS under data uncertainty, additional experiments are conducted based on three case studies. Random Gaussian noise with varying levels (0.5 %, 1 %, and 2 %) is superimposed on the noise-free observation data, following the equation:

            <disp-formula id="Ch1.E5" content-type="numbered"><label>5</label><mml:math id="M163" display="block"><mml:mrow><mml:msubsup><mml:mi>C</mml:mi><mml:mtext>obs</mml:mtext><mml:mo>∗</mml:mo></mml:msubsup><mml:mo>=</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mtext>true</mml:mtext></mml:msub><mml:mo>⋅</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi mathvariant="italic">δ</mml:mi><mml:mo>⋅</mml:mo><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>

          where <inline-formula><mml:math id="M164" display="inline"><mml:mrow><mml:msubsup><mml:mi>C</mml:mi><mml:mtext>obs</mml:mtext><mml:mo>∗</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M165" display="inline"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mtext>true</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> denote the noisy and noise-free observations, respectively; <inline-formula><mml:math id="M166" display="inline"><mml:mi mathvariant="italic">δ</mml:mi></mml:math></inline-formula> denotes the noise level; and <inline-formula><mml:math id="M167" display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula> is a random number following the standard normal distribution <inline-formula><mml:math id="M168" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</p>

      <fig id="F18"><label>Figure 18</label><caption><p id="d2e4254">Comparison of average relative errors for three cases under different noise levels.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/3145/2026/hess-30-3145-2026-f18.png"/>

        </fig>

      <p id="d2e4263">Figure 18 illustrates the Average Relative Errors (ARE) for the three cases under these noise levels. A clear trend is observed where the identification error increases marginally with the noise intensity. Specifically, for Case 1, the ARE rises from 1.59 % (noise-free) to 3.09 % (2 % noise). For the more complex scenarios in Cases 2 and 3, the errors start at approximately 3.7 %–3.8 % and increase to roughly 4.5 % under the maximum noise level. Despite these increases, the average errors for all cases consistently remain below 5 %, indicating that the proposed method maintains high performance without significant degradation when observation data is subject to measurement noise.</p>
      <p id="d2e4267">Tables S5–S7  provide the specific identification results in three cases.  Notably, the discrete source locations match the true values exactly across all noise levels. As for continuous variables, the hydrogeological parameters show only slight fluctuations. In comparison, the source release parameters exhibit relatively larger variations. This phenomenon is largely attributed to the complementary effects between different stress periods or among multiple sources, where slight deviations in one parameter may compensate for another. Despite this, the overall errors remain within an acceptable range, confirming the robustness of SA-CSA-TS against data uncertainty.</p>
</sec>
<sec id="Ch1.S6.SS4">
  <label>6.4</label><title>Limitations</title>
      <p id="d2e4278">Despite the promising performance and robustness of SA-CSA-TS, some limitations should be further discussed. First, this study evaluates the proposed algorithm only using two-dimensional groundwater systems. The search strategy of SA-CSA-TS, however, is guided by the difference between simulated and observed responses at monitoring locations, rather than by the assumption tied to a specific groundwater model dimensionality. This gives the SA-CSA-TS potential for extension to three-dimensional groundwater models. Nevertheless, its applicability to three-dimensional flow and hydrodynamic dispersion systems has not yet been demonstrated in this study.  Furthermore, an increase in vertical resolution (depth layers) not only raises the computational cost of groundwater simulation but also poses a challenge to the predictive fidelity of the surrogate model in complex cases, which should be confirmed through further investigation. Second, while the robustness analysis demonstrated resilience against Gaussian noise, real-world field conditions often involve more complex uncertainties.  These include sparse monitoring networks, systematic measurement biases, and structural model errors. Therefore, future work should focus on testing SA-CSA-TS in three-dimensional systems and under combined uncertainty sources, in order to establish its robustness and reliability for complex groundwater inverse problems.</p>
</sec>
</sec>
<sec id="Ch1.S7" sec-type="conclusions">
  <label>7</label><title>Conclusions</title>
      <p id="d2e4290">This study proposes a multi-chain surrogate-assisted hybrid optimization algorithm, SA-CSA-TS, to address the challenges of prohibitive computational costs and multi-modal complexity in GCSI. The algorithm incorporates three key innovations. First, surrogate models are embedded to alleviate the computational burden, while continuous iterative updates ensure reliable optimization guidance. Second, a multi-chain synergistic learning framework enables the exchange of evaluated samples among chains, enhancing data diversity and preventing premature convergence caused by limited local information. Third, a two-stage sequential strategy is employed where CSA conducts global exploration and TS performs neighbourhood refinement guided by a shared tabu list, effectively balancing exploration and exploitation.</p>
      <p id="d2e4293">Through three illustrative case studies, the applicability of different surrogate models and the overall performance of the proposed algorithm were systematically investigated. Results indicate that the Radial Basis Function (RBF) offers the best balance of stability and accuracy, particularly excelling in fitting low-value regions, making it the optimal surrogate for this framework. Comparative experiments with four algorithms (SA-CSA-TS, GA, CSA, and SA-CSA) highlight the superior robustness and accuracy of the proposed framework. While the benchmark algorithms frequently stagnate in local optima due to the parameter-compensation effect, SA-CSA-TS successfully identifies the true contaminant source parameters by leveraging multi-chain cooperation to escape local entrapment. Furthermore, the algorithm achieves a computational cost reduction of approximately 85 %–88 % across the three cases, proving it to be both a precise and efficient tool for GCSI. Future work will focus on extending the framework to three-dimensional and more heterogeneous aquifer systems, with particular emphasis on assessing surrogate predictive fidelity and computational scalability in complex cases to support practical applications. In addition, the integration of surface–groundwater interactions and multi-source data (e.g., satellite-derived observations) will be explored to provide additional constraints for parameter identification.</p>
</sec>

      
      </body>
    <back><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d2e4300">The codes and case studies used in this work are available at <ext-link xlink:href="https://doi.org/10.5281/zenodo.17862863" ext-link-type="DOI">10.5281/zenodo.17862863</ext-link> (Wu, 2025) and maintained at the GitHub repository (<uri>https://github.com/smasky/SA-CSA-TS</uri>, last access: 15 May 2026). All numerical experiments are carried out using the UQPyL platform, which is available at <uri>http://www.uq-pyl.com</uri> (last access: 15 May 2026) (or <uri>https://github.com/smasky/UQPyL</uri>, last access: 15 May 2026).</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d2e4315">The supplement related to this article is available online at <inline-supplementary-material xlink:href="https://doi.org/10.5194/hess-30-3145-2026-supplement" xlink:title="pdf">https://doi.org/10.5194/hess-30-3145-2026-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e4324">MW: Methodology, Software, Writing – original draft, Writing – review and editing, Funding acquisition; XH: Methodology, Software; PX: Methodology, Software; XY: Software; HC: Methodology, Software; JX: Methodology; QD: Conceptualization, Methodology, Funding acquisition, Project administration.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e4330">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e4336">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e4343">This study was supported by the Jiangsu Provincial Science and Technology Basic Research Program Youth Fund Project (grant no. BK20241516), the National Natural Science Foundation of China (grant nos. 42101046 and W2431029), the National Key R&amp;D Program of China (grant no. 2021YFC3201102), the Key Scientific and Technological Project of the Ministry of Water Resources of the P.R.C. (grant no. SKS-2022001), and the Jiangsu Province Youth Science and Technology Talent Support Program (grant no. JSTJ-2025-046).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e4349">This paper was edited by Yonggen Zhang and reviewed by three anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><mixed-citation>Agbotui, P. Y., Firouzbehi, F., and Medici, G.: Review of effective porosity in sandstone aquifers: insights for representation of contaminant transport, Sustainability, 17, 6469, <ext-link xlink:href="https://doi.org/10.3390/su17146469" ext-link-type="DOI">10.3390/su17146469</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><mixed-citation>Ahrari, A. and Verstraete, D.: Online model tuning in surrogate-assisted optimization – an effective approach considering the cost-benefit tradeoff, Swarm Evol. Comput., 82, 101357, <ext-link xlink:href="https://doi.org/10.1016/j.swevo.2023.101357" ext-link-type="DOI">10.1016/j.swevo.2023.101357</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><mixed-citation>Asher, M. J., Croke, B. F. W., Jakeman, A. J., and Peeters, L. J. M.: A review of surrogate models and their application to groundwater modeling: Surrogates of Groundwater Models, Water Resour. Res., 51, 5957–5973, <ext-link xlink:href="https://doi.org/10.1002/2015wr016967" ext-link-type="DOI">10.1002/2015wr016967</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><mixed-citation>Ayvaz, M. T. and Elci, A.: Identification of the optimum groundwater quality monitoring network using a genetic algorithm based optimization approach, J. Hydrol., 563, 1078–1091, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2018.06.006" ext-link-type="DOI">10.1016/j.jhydrol.2018.06.006</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib5"><label>5</label><mixed-citation>Bai, T. and Tahmasebi, P.: Characterization of groundwater contamination: a transformer-based deep learning model, Adv. Water Resour., 164, 104217, <ext-link xlink:href="https://doi.org/10.1016/j.advwatres.2022.104217" ext-link-type="DOI">10.1016/j.advwatres.2022.104217</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><mixed-citation>Bakker, M., Post, V., Langevin, C. D., Hughes, J. D., White, J. T., Starn, J. J., and Fienen, M. N.: Scripting MODFLOW Model Development Using Python and FloPy, Groundwater, 54, 733–739, <ext-link xlink:href="https://doi.org/10.1111/gwat.12413" ext-link-type="DOI">10.1111/gwat.12413</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><mixed-citation> Broomhead, D. S. and Lowe, D.: Multivariable functional interpolation and adaptive networks, Complex Systems, 2, 321–355, 1988.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><mixed-citation>Buhmann, M. D.: Radial Basis Functions: Theory and Implementations, Cambridge University Press, <ext-link xlink:href="https://doi.org/10.1017/CBO9780511543241" ext-link-type="DOI">10.1017/CBO9780511543241</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><mixed-citation>Chang, C.-C. and Lin, C.-J.: LIBSVM: a library for support vector machines, ACM T. Intel. Syst. Tec., 2, 27:1–27:27, <ext-link xlink:href="https://doi.org/10.1145/1961189.1961199" ext-link-type="DOI">10.1145/1961189.1961199</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><mixed-citation>Chang, Z., Lu, W., and Wang, Z.: A differential evolutionary markov chain algorithm with ensemble smoother initial point selection for the identification of groundwater contaminant sources, J. Hydrol., 603, 126918, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2021.126918" ext-link-type="DOI">10.1016/j.jhydrol.2021.126918</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><mixed-citation>Delshad, M., Pope, G. A., and Sepehrnoori, K.: A compositional simulator for modeling surfactant enhanced aquifer remediation, 1 formulation, J. Contam. Hydrol., 23, 303–327, <ext-link xlink:href="https://doi.org/10.1016/0169-7722(95)00106-9" ext-link-type="DOI">10.1016/0169-7722(95)00106-9</ext-link>, 1996.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><mixed-citation>Feng, Z., Niu, W., and Liu, S.: Cooperation search algorithm: A novel metaheuristic evolutionary intelligence algorithm for numerical optimization and engineering optimization problems, Appl. Soft Comput., 98, 106734, <ext-link xlink:href="https://doi.org/10.1016/j.asoc.2020.106734" ext-link-type="DOI">10.1016/j.asoc.2020.106734</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><mixed-citation>Feng, Z., Shi, P., Yang, T., Niu, W., Zhou, J., and Cheng, C.: Parallel cooperation search algorithm and artificial intelligence method for streamflow time series forecasting, J. Hydrol., 606, 127434, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2022.127434" ext-link-type="DOI">10.1016/j.jhydrol.2022.127434</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><mixed-citation>Feng, Z., Zhang, L., Mo, L., Wang, Y., and Niu, W.: A multi-objective cooperation search algorithm for cascade reservoirs operation optimization considering power generation and ecological flows, Appl. Soft Comput., 150, 111085, <ext-link xlink:href="https://doi.org/10.1016/j.asoc.2023.111085" ext-link-type="DOI">10.1016/j.asoc.2023.111085</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><mixed-citation>Gorelick, S. M. and Zheng, C.: Global change and the groundwater management challenge, Water Resour. Res., 51, 3031–3051, <ext-link xlink:href="https://doi.org/10.1002/2014WR016825" ext-link-type="DOI">10.1002/2014WR016825</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><mixed-citation>Guneshwor, L., Eldho, T. I., and Kumar, A. V.: Identification of groundwater contamination sources using meshfree RPCM simulation and particle swarm optimization, Water Resour. Manag., 32, 1517–1538, <ext-link xlink:href="https://doi.org/10.1007/s11269-017-1885-1" ext-link-type="DOI">10.1007/s11269-017-1885-1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib17"><label>17</label><mixed-citation>Harbaugh, A. W.: MODFLOW-2005, the U.S. Geological Survey modular ground-water model – the Ground-Water Flow Process, U.S. Geological Survey Techniques and Methods, 6-A16, variously p., <ext-link xlink:href="https://doi.org/10.3133/tm6A16" ext-link-type="DOI">10.3133/tm6A16</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><mixed-citation>Hou, Z. and Lu, W.: Comparative study of surrogate models for groundwater contamination source identification at DNAPL-contaminated sites, Hydrogeol. J., 26, 923–932, <ext-link xlink:href="https://doi.org/10.1007/s10040-017-1690-1" ext-link-type="DOI">10.1007/s10040-017-1690-1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib19"><label>19</label><mixed-citation>Hughes, J. D., Langevin, C. D., and Banta, E. R.: Documentation for the MODFLOW 6 framework, Techniques and Methods 6-A57, US Geological Survey, Reston, VA, <ext-link xlink:href="https://doi.org/10.3133/tm6A57" ext-link-type="DOI">10.3133/tm6A57</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib20"><label>20</label><mixed-citation>Jha, M. and Datta, B.: Three-dimensional groundwater contamination source identification using adaptive simulated annealing, J. Hydrol. Eng., 18, 307–317, <ext-link xlink:href="https://doi.org/10.1061/(ASCE)HE.1943-5584.0000624" ext-link-type="DOI">10.1061/(ASCE)HE.1943-5584.0000624</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib21"><label>21</label><mixed-citation>Langevin, C. D., Provost, A. M., Panday, S., and Hughes, J. D.: Documentation for the MODFLOW 6 Groundwater Transport Model, Book 6, Chap. A61, US Geological Survey Techniques and Methods, 56 pp., <ext-link xlink:href="https://doi.org/10.3133/tm6A61" ext-link-type="DOI">10.3133/tm6A61</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib22"><label>22</label><mixed-citation>Li, P., Karunanidhi, D., Subramani, T., and Srinivasamoorthy, K.: Sources and consequences of groundwater contamination, Arch. Environ. Con. Tox., 80, 1–10, <ext-link xlink:href="https://doi.org/10.1007/s00244-020-00805-z" ext-link-type="DOI">10.1007/s00244-020-00805-z</ext-link>, 2021a.</mixed-citation></ref>
      <ref id="bib1.bib23"><label>23</label><mixed-citation>Li, J., Lu, W., and Fan, Y.: Groundwater pollution sources identification based on hybrid homotopy-genetic algorithm and simulation optimization, Environ. Eng. Sci., 38, 777–788, <ext-link xlink:href="https://doi.org/10.1089/ees.2020.0117" ext-link-type="DOI">10.1089/ees.2020.0117</ext-link>, 2021b.</mixed-citation></ref>
      <ref id="bib1.bib24"><label>24</label><mixed-citation>Li, Y., Lu, W., Pan, Z., Wang, Z., and Dong, G.: Simultaneous identification of groundwater contaminant source and hydraulic parameters based on multilayer perceptron and flying foxes optimization, Environ. Sci. Pollut. R., 30, 78933–78947, <ext-link xlink:href="https://doi.org/10.1007/s11356-023-27574-1" ext-link-type="DOI">10.1007/s11356-023-27574-1</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib25"><label>25</label><mixed-citation>Lophaven, S. N., Nielsen, H. B., and Søndergaard, J.: DACE – A Matlab Kriging Toolbox, Version 2.0, Technical Report IMM-TR-2002-12, Informatics and Mathematical Modelling, Technical University of Denmark, DTU, 34 pp., <uri>https://www2.imm.dtu.dk/pubdb/pubs/3213-full.html</uri> (last access: 15 May 2026), 2002.</mixed-citation></ref>
      <ref id="bib1.bib26"><label>26</label><mixed-citation>Luo, C., Wang, X., Xu, Y. J., Jia, S., Liu, Z., Mao, B., Lv, Q., Ji, X., Rong, Y., and Dai, Y.: Synergistic identification of hydrogeological parameters and pollution source information for groundwater point and areal source contamination based on machine learning surrogate–artificial hummingbird algorithm, Hydrol. Earth Syst. Sci., 29, 5719–5736, <ext-link xlink:href="https://doi.org/10.5194/hess-29-5719-2025" ext-link-type="DOI">10.5194/hess-29-5719-2025</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib27"><label>27</label><mixed-citation>Mahar, P. S. and Datta, B.: Optimal identification of ground-water pollution sources and parameter estimation, J. Water Res. Plan. Man., 127, 20–29, <ext-link xlink:href="https://doi.org/10.1061/(ASCE)0733-9496(2001)127:1(20)" ext-link-type="DOI">10.1061/(ASCE)0733-9496(2001)127:1(20)</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bib28"><label>28</label><mixed-citation>Meenal, M. and Eldho, T. I.: Simulation–optimization model for groundwater contamination remediation using meshfree point collocation method and particle swarm optimization, Sadhana-Acad. P. Eng. S., 37, 351–369, <ext-link xlink:href="https://doi.org/10.1007/s12046-012-0086-0" ext-link-type="DOI">10.1007/s12046-012-0086-0</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib29"><label>29</label><mixed-citation>Mirghani, B. Y., Mahinthakumar, K. G., Tryby, M. E., Ranjithan, R. S., and Zechman, E. M.: A parallel evolutionary strategy based simulation–optimization approach for solving groundwater source identification problems, Adv. Water Resour., 32, 1373–1385, <ext-link xlink:href="https://doi.org/10.1016/j.advwatres.2009.06.001" ext-link-type="DOI">10.1016/j.advwatres.2009.06.001</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bib30"><label>30</label><mixed-citation>Ouyang, Q., Lu, W., Miao, T., Deng, W., Jiang, C., and Luo, J.: Application of ensemble surrogates and adaptive sequential sampling to optimal groundwater remediation design at DNAPLs-contaminated sites, J. Contam. Hydrol., 207, 31–38, <ext-link xlink:href="https://doi.org/10.1016/j.jconhyd.2017.10.007" ext-link-type="DOI">10.1016/j.jconhyd.2017.10.007</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib31"><label>31</label><mixed-citation>Pan, Z., Lu, W., Wang, H., and Bai, Y.: Groundwater contaminant source identification based on an ensemble learning search framework associated with an auto xgboost surrogate, Environ. Modell. Softw., 159, 105588, <ext-link xlink:href="https://doi.org/10.1016/j.envsoft.2022.105588" ext-link-type="DOI">10.1016/j.envsoft.2022.105588</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib32"><label>32</label><mixed-citation>Rasmussen, C. E. and Williams, C. K. I.: Gaussian Processes for Machine Learning, The MIT Press, Cambridge, MA, USA, 272 pp., ISBN: 978-0-262-18253-9, <ext-link xlink:href="https://doi.org/10.7551/mitpress/3206.001.0001" ext-link-type="DOI">10.7551/mitpress/3206.001.0001</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bib33"><label>33</label><mixed-citation>Razavi, S., Tolson, B. A., and Burn, D. H.: Review of surrogate modeling in water resources, Water Resour. Res., 48, W07401, <ext-link xlink:href="https://doi.org/10.1029/2011wr011527" ext-link-type="DOI">10.1029/2011wr011527</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib34"><label>34</label><mixed-citation>Singh, A.: Review: computer-based models for managing the water-resource problems of irrigated agriculture, Hydrogeol. J., 23, 1217–1227, <ext-link xlink:href="https://doi.org/10.1007/s10040-015-1270-1" ext-link-type="DOI">10.1007/s10040-015-1270-1</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib35"><label>35</label><mixed-citation>Singh, R. M. and Datta, B.: Identification of groundwater pollution sources using GA-based linked simulation optimization model, J. Hydrol. Eng., 11, 101–109, <ext-link xlink:href="https://doi.org/10.1061/(ASCE)1084-0699(2006)11:2(101)" ext-link-type="DOI">10.1061/(ASCE)1084-0699(2006)11:2(101)</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bib36"><label>36</label><mixed-citation>Smola, A. J. and Schölkopf, B.: A tutorial on support vector regression, Stat. Comput., 14, 199–222, <ext-link xlink:href="https://doi.org/10.1023/B:STCO.0000035301.49549.88" ext-link-type="DOI">10.1023/B:STCO.0000035301.49549.88</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bib37"><label>37</label><mixed-citation>Song, J., Yang, Y., Wu, J. F., Wu, J. C., Sun, X., and Lin, J.: Adaptive surrogate model based multiobjective optimization for coastal aquifer management, J. Hydrol., 561, 98–111, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2018.03.063" ext-link-type="DOI">10.1016/j.jhydrol.2018.03.063</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib38"><label>38</label><mixed-citation>Song, J., Yang, Y., Chen, G., Sun, X., Lin, J., Wu, J. F., and Wu, J. C.: Surrogate assisted multi-objective robust optimization for groundwater monitoring network design, J. Hydrol., 577, 123994, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2019.123994" ext-link-type="DOI">10.1016/j.jhydrol.2019.123994</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib39"><label>39</label><mixed-citation>Swetha, K., Eldho, T. I., Singh, L. G., and Kumar, A. V.: Groundwater contaminant source identification using swarm intelligence-based simulation optimization models, Environ. Sci. Pollut. R., 32, 1626–1639, <ext-link xlink:href="https://doi.org/10.1007/s11356-024-35850-x" ext-link-type="DOI">10.1007/s11356-024-35850-x</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib40"><label>40</label><mixed-citation>Wang, J. L., Lin, Y. H., and Lin, M. D.: Application of heuristic algorithms on groundwater pumping source identification problems, in: 2015 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), IEEE, New York, 858–862, <ext-link xlink:href="https://doi.org/10.1109/IEEM.2015.7385770" ext-link-type="DOI">10.1109/IEEM.2015.7385770</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib41"><label>41</label><mixed-citation>Wang, Z., Lu, W., Chang, Z., and Zhang, T.: Joint identification of groundwater pollution source information, model parameters, and boundary conditions based on a novel ES-MDA with a wheel battle strategy, J. Hydrol., 636, 131320, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2024.131320" ext-link-type="DOI">10.1016/j.jhydrol.2024.131320</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib42"><label>42</label><mixed-citation>Wu, M.: SA-CSA-TS: A multi-chain surrogate-assisted hybrid optimization algorithm combining CSA and TS, Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/zenodo.17862863" ext-link-type="DOI">10.5281/zenodo.17862863</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib43"><label>43</label><mixed-citation>Wu, M., Wang, L., Xu, J., Hu, P., and Xu, P.: Adaptive surrogate-assisted multi-objective evolutionary algorithm using an efficient infill technique, Swarm Evol. Comput., 75, 101170, <ext-link xlink:href="https://doi.org/10.1016/j.swevo.2022.101170" ext-link-type="DOI">10.1016/j.swevo.2022.101170</ext-link>, 2022a.</mixed-citation></ref>
      <ref id="bib1.bib44"><label>44</label><mixed-citation>Wu, M., Wang, L., Xu, J., Wang, Z., Hu, P., and Tang, H.: Multiobjective ensemble surrogate-based optimization algorithm for groundwater optimization designs, J. Hydrol., 612, 128159, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2022.128159" ext-link-type="DOI">10.1016/j.jhydrol.2022.128159</ext-link>, 2022b.</mixed-citation></ref>
      <ref id="bib1.bib45"><label>45</label><mixed-citation>Xing, Z., Qu, R., Zhao, Y., Fu, Q., Ji, Y., and Lu, W.: Identifying the release history of a groundwater contaminant source based on an ensemble surrogate model, J. Hydrol., 572, 501–516, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2019.03.020" ext-link-type="DOI">10.1016/j.jhydrol.2019.03.020</ext-link>, 2019. </mixed-citation></ref>
      <ref id="bib1.bib46"><label>46</label><mixed-citation>Yin, J. and Tsai, F. T.-C.: Bayesian set pair analysis and machine learning based ensemble surrogates for optimal multi-aquifer system remediation design, J. Hydrol., 580, 124280, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2019.124280" ext-link-type="DOI">10.1016/j.jhydrol.2019.124280</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib47"><label>47</label><mixed-citation>Zhao, Y., Lu, W., and Xiao, C.: A kriging surrogate model coupled in simulation–optimization approach for identifying release history of groundwater sources, J. Contam. Hydrol., 185, 51–60, <ext-link xlink:href="https://doi.org/10.1016/j.jconhyd.2016.01.004" ext-link-type="DOI">10.1016/j.jconhyd.2016.01.004</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib48"><label>48</label><mixed-citation>Zheng, C. and Wang, P. P.: MT3DMS: A modular three-dimensional multispecies transport model for simulation of advection, dispersion, and chemical reactions of contaminants in groundwater systems; documentation and user’s guide, Contract Report SERDP-99-1, U.S. Army Engineer Research and Development Center, U.S. Army Corps of Engineers, Vicksburg, MS, USA, <uri>https://sav.el.erdc.dren.mil/elpubs/pdf/crserdp99-1.pdf</uri> (last access: 15 May 2026), 1999.</mixed-citation></ref>
      <ref id="bib1.bib49"><label>49</label><mixed-citation>Zhu, L., Lu, W., Luo, C., Xu, Y., and Wang, Z.: An ensemble optimizer with a stacking ensemble surrogate model for identification of groundwater contamination source, J. Contam. Hydrol., 267, 104437, <ext-link xlink:href="https://doi.org/10.1016/j.jconhyd.2024.104437" ext-link-type="DOI">10.1016/j.jconhyd.2024.104437</ext-link>, 2024.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>A multi-chain surrogate-assisted hybrid optimization framework for joint identification of groundwater contaminant sources and hydrogeological parameters</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
       Agbotui, P. Y., Firouzbehi, F., and Medici, G.: Review of effective porosity in sandstone aquifers: insights
for representation of contaminant transport, Sustainability, 17, 6469, <a href="https://doi.org/10.3390/su17146469" target="_blank">https://doi.org/10.3390/su17146469</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
       Ahrari, A. and Verstraete, D.: Online model tuning in surrogate-assisted optimization – an effective
approach considering the cost-benefit tradeoff, Swarm Evol. Comput., 82, 101357, <a href="https://doi.org/10.1016/j.swevo.2023.101357" target="_blank">https://doi.org/10.1016/j.swevo.2023.101357</a>,
2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
       Asher, M. J., Croke, B. F. W., Jakeman, A. J., and Peeters, L. J. M.: A review of surrogate models and their
application to groundwater modeling: Surrogates of Groundwater Models, Water Resour. Res., 51, 5957–5973,
<a href="https://doi.org/10.1002/2015wr016967" target="_blank">https://doi.org/10.1002/2015wr016967</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
       Ayvaz, M. T. and Elci, A.: Identification of the optimum groundwater quality monitoring network using a
genetic algorithm based optimization approach, J. Hydrol., 563, 1078–1091, <a href="https://doi.org/10.1016/j.jhydrol.2018.06.006" target="_blank">https://doi.org/10.1016/j.jhydrol.2018.06.006</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
       Bai, T. and Tahmasebi, P.: Characterization of groundwater contamination: a transformer-based deep learning
model, Adv. Water Resour., 164, 104217, <a href="https://doi.org/10.1016/j.advwatres.2022.104217" target="_blank">https://doi.org/10.1016/j.advwatres.2022.104217</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
       Bakker, M., Post, V., Langevin, C. D., Hughes, J. D., White, J. T., Starn, J. J., and Fienen, M. N.:
Scripting MODFLOW Model Development Using Python and FloPy, Groundwater, 54, 733–739, <a href="https://doi.org/10.1111/gwat.12413" target="_blank">https://doi.org/10.1111/gwat.12413</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
       Broomhead, D. S. and Lowe, D.: Multivariable functional interpolation and adaptive networks, Complex Systems, 2, 321–355, 1988.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
       Buhmann, M. D.: Radial Basis Functions: Theory and Implementations, Cambridge University Press,
<a href="https://doi.org/10.1017/CBO9780511543241" target="_blank">https://doi.org/10.1017/CBO9780511543241</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
       Chang, C.-C. and Lin, C.-J.: LIBSVM: a library for support vector machines, ACM T. Intel. Syst. Tec., 2, 27:1–27:27, <a href="https://doi.org/10.1145/1961189.1961199" target="_blank">https://doi.org/10.1145/1961189.1961199</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
       Chang, Z., Lu, W., and Wang, Z.: A differential evolutionary markov chain algorithm with ensemble smoother
initial point selection for the identification of groundwater contaminant sources, J. Hydrol., 603, 126918,
<a href="https://doi.org/10.1016/j.jhydrol.2021.126918" target="_blank">https://doi.org/10.1016/j.jhydrol.2021.126918</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
       Delshad, M., Pope, G. A., and Sepehrnoori, K.: A compositional simulator for modeling surfactant enhanced
aquifer remediation, 1 formulation, J. Contam. Hydrol., 23, 303–327, <a href="https://doi.org/10.1016/0169-7722(95)00106-9" target="_blank">https://doi.org/10.1016/0169-7722(95)00106-9</a>, 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
       Feng, Z., Niu, W., and Liu, S.: Cooperation search algorithm: A novel metaheuristic evolutionary
intelligence algorithm for numerical optimization and engineering optimization problems, Appl. Soft Comput., 98,
106734, <a href="https://doi.org/10.1016/j.asoc.2020.106734" target="_blank">https://doi.org/10.1016/j.asoc.2020.106734</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
       Feng, Z., Shi, P., Yang, T., Niu, W., Zhou, J., and Cheng, C.: Parallel cooperation search algorithm and
artificial intelligence method for streamflow time series forecasting, J. Hydrol., 606, 127434,
<a href="https://doi.org/10.1016/j.jhydrol.2022.127434" target="_blank">https://doi.org/10.1016/j.jhydrol.2022.127434</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
       Feng, Z., Zhang, L., Mo, L., Wang, Y., and Niu, W.: A multi-objective cooperation search algorithm for
cascade reservoirs operation optimization considering power generation and ecological flows, Appl. Soft Comput., 150,
111085, <a href="https://doi.org/10.1016/j.asoc.2023.111085" target="_blank">https://doi.org/10.1016/j.asoc.2023.111085</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
       Gorelick, S. M. and Zheng, C.: Global change and the groundwater management challenge, Water Resour. Res.,
51, 3031–3051, <a href="https://doi.org/10.1002/2014WR016825" target="_blank">https://doi.org/10.1002/2014WR016825</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
       Guneshwor, L., Eldho, T. I., and Kumar, A. V.: Identification of groundwater contamination sources using
meshfree RPCM simulation and particle swarm optimization, Water Resour. Manag., 32, 1517–1538,
<a href="https://doi.org/10.1007/s11269-017-1885-1" target="_blank">https://doi.org/10.1007/s11269-017-1885-1</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
       Harbaugh, A. W.: MODFLOW-2005, the U.S. Geological Survey modular ground-water model – the Ground-Water Flow Process, U.S. Geological Survey Techniques and Methods, 6-A16, variously p., <a href="https://doi.org/10.3133/tm6A16" target="_blank">https://doi.org/10.3133/tm6A16</a>,
2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
       Hou, Z. and Lu, W.: Comparative study of surrogate models for groundwater contamination source
identification at DNAPL-contaminated sites, Hydrogeol. J., 26, 923–932, <a href="https://doi.org/10.1007/s10040-017-1690-1" target="_blank">https://doi.org/10.1007/s10040-017-1690-1</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>19</label><mixed-citation>
       Hughes, J. D., Langevin, C. D., and Banta, E. R.: Documentation for the MODFLOW 6 framework, Techniques and
Methods 6-A57, US Geological Survey, Reston, VA, <a href="https://doi.org/10.3133/tm6A57" target="_blank">https://doi.org/10.3133/tm6A57</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>20</label><mixed-citation>
       Jha, M. and Datta, B.: Three-dimensional groundwater contamination source identification using adaptive
simulated annealing, J. Hydrol. Eng., 18, 307–317, <a href="https://doi.org/10.1061/(ASCE)HE.1943-5584.0000624" target="_blank">https://doi.org/10.1061/(ASCE)HE.1943-5584.0000624</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>21</label><mixed-citation>
       Langevin, C. D., Provost, A. M., Panday, S., and Hughes, J. D.: Documentation for the MODFLOW 6 Groundwater
Transport Model, Book 6, Chap. A61, US Geological Survey Techniques and Methods, 56 pp., <a href="https://doi.org/10.3133/tm6A61" target="_blank">https://doi.org/10.3133/tm6A61</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>22</label><mixed-citation>
       Li, P., Karunanidhi, D., Subramani, T., and Srinivasamoorthy, K.: Sources and consequences of groundwater
contamination, Arch. Environ. Con. Tox., 80, 1–10, <a href="https://doi.org/10.1007/s00244-020-00805-z" target="_blank">https://doi.org/10.1007/s00244-020-00805-z</a>, 2021a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>23</label><mixed-citation>
       Li, J., Lu, W., and Fan, Y.: Groundwater pollution sources identification based on hybrid homotopy-genetic
algorithm and simulation optimization, Environ. Eng. Sci., 38, 777–788, <a href="https://doi.org/10.1089/ees.2020.0117" target="_blank">https://doi.org/10.1089/ees.2020.0117</a>, 2021b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>24</label><mixed-citation>
       Li, Y., Lu, W., Pan, Z., Wang, Z., and Dong, G.: Simultaneous identification of groundwater contaminant
source and hydraulic parameters based on multilayer perceptron and flying foxes optimization,
Environ. Sci. Pollut. R., 30, 78933–78947, <a href="https://doi.org/10.1007/s11356-023-27574-1" target="_blank">https://doi.org/10.1007/s11356-023-27574-1</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>25</label><mixed-citation>
       Lophaven, S. N., Nielsen, H. B., and Søndergaard, J.: DACE – A Matlab Kriging Toolbox, Version 2.0, Technical Report IMM-TR-2002-12, Informatics and Mathematical Modelling, Technical University of Denmark, DTU, 34 pp., <a href="https://www2.imm.dtu.dk/pubdb/pubs/3213-full.html" target="_blank"/> (last access: 15 May 2026), 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>26</label><mixed-citation>
       Luo, C., Wang, X., Xu, Y. J., Jia, S., Liu, Z., Mao, B., Lv, Q., Ji, X., Rong, Y., and Dai, Y.: Synergistic
identification of hydrogeological parameters and pollution source information for groundwater point and areal source
contamination based on machine learning surrogate–artificial hummingbird algorithm, Hydrol. Earth Syst. Sci., 29,
5719–5736, <a href="https://doi.org/10.5194/hess-29-5719-2025" target="_blank">https://doi.org/10.5194/hess-29-5719-2025</a>, 2025. 
    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>27</label><mixed-citation>
       Mahar, P. S. and Datta, B.: Optimal identification of ground-water pollution sources and parameter
estimation, J. Water Res. Plan. Man., 127, 20–29, <a href="https://doi.org/10.1061/(ASCE)0733-9496(2001)127:1(20)" target="_blank">https://doi.org/10.1061/(ASCE)0733-9496(2001)127:1(20)</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>28</label><mixed-citation>
       Meenal, M. and Eldho, T. I.: Simulation–optimization model for groundwater contamination remediation using
meshfree point collocation method and particle swarm optimization, Sadhana-Acad. P. Eng. S., 37, 351–369,
<a href="https://doi.org/10.1007/s12046-012-0086-0" target="_blank">https://doi.org/10.1007/s12046-012-0086-0</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>29</label><mixed-citation>
       Mirghani, B. Y., Mahinthakumar, K. G., Tryby, M. E., Ranjithan, R. S., and Zechman, E. M.: A parallel
evolutionary strategy based simulation–optimization approach for solving groundwater source identification problems,
Adv. Water Resour., 32, 1373–1385, <a href="https://doi.org/10.1016/j.advwatres.2009.06.001" target="_blank">https://doi.org/10.1016/j.advwatres.2009.06.001</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>30</label><mixed-citation>
       Ouyang, Q., Lu, W., Miao, T., Deng, W., Jiang, C., and Luo, J.: Application of ensemble surrogates and
adaptive sequential sampling to optimal groundwater remediation design at DNAPLs-contaminated
sites, J. Contam. Hydrol., 207, 31–38, <a href="https://doi.org/10.1016/j.jconhyd.2017.10.007" target="_blank">https://doi.org/10.1016/j.jconhyd.2017.10.007</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>31</label><mixed-citation>
       Pan, Z., Lu, W., Wang, H., and Bai, Y.: Groundwater contaminant source identification based on an ensemble
learning search framework associated with an auto xgboost surrogate, Environ. Modell. Softw., 159, 105588,
<a href="https://doi.org/10.1016/j.envsoft.2022.105588" target="_blank">https://doi.org/10.1016/j.envsoft.2022.105588</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>32</label><mixed-citation>
       Rasmussen, C. E. and Williams, C. K. I.: Gaussian Processes for Machine Learning, The MIT Press, Cambridge, MA, USA, 272 pp., ISBN: 978-0-262-18253-9, <a href="https://doi.org/10.7551/mitpress/3206.001.0001" target="_blank">https://doi.org/10.7551/mitpress/3206.001.0001</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>33</label><mixed-citation>
       Razavi, S., Tolson, B. A., and Burn, D. H.: Review of surrogate modeling in water resources, Water
Resour. Res., 48, W07401, <a href="https://doi.org/10.1029/2011wr011527" target="_blank">https://doi.org/10.1029/2011wr011527</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>34</label><mixed-citation>
       Singh, A.: Review: computer-based models for managing the water-resource problems of irrigated agriculture,
Hydrogeol. J., 23, 1217–1227, <a href="https://doi.org/10.1007/s10040-015-1270-1" target="_blank">https://doi.org/10.1007/s10040-015-1270-1</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>35</label><mixed-citation>
       Singh, R. M. and Datta, B.: Identification of groundwater pollution sources using GA-based linked
simulation optimization model, J. Hydrol. Eng., 11, 101–109, <a href="https://doi.org/10.1061/(ASCE)1084-0699(2006)11:2(101)" target="_blank">https://doi.org/10.1061/(ASCE)1084-0699(2006)11:2(101)</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>36</label><mixed-citation>
       Smola, A. J. and Schölkopf, B.: A tutorial on support vector regression, Stat. Comput., 14, 199–222,
<a href="https://doi.org/10.1023/B:STCO.0000035301.49549.88" target="_blank">https://doi.org/10.1023/B:STCO.0000035301.49549.88</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>37</label><mixed-citation>
       Song, J., Yang, Y., Wu, J. F., Wu, J. C., Sun, X., and Lin, J.: Adaptive surrogate model based
multiobjective optimization for coastal aquifer management, J. Hydrol., 561, 98–111,
<a href="https://doi.org/10.1016/j.jhydrol.2018.03.063" target="_blank">https://doi.org/10.1016/j.jhydrol.2018.03.063</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>38</label><mixed-citation>
       Song, J., Yang, Y., Chen, G., Sun, X., Lin, J., Wu, J. F., and Wu, J. C.: Surrogate assisted
multi-objective robust optimization for groundwater monitoring network design, J. Hydrol., 577, 123994,
<a href="https://doi.org/10.1016/j.jhydrol.2019.123994" target="_blank">https://doi.org/10.1016/j.jhydrol.2019.123994</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>39</label><mixed-citation>
       Swetha, K., Eldho, T. I., Singh, L. G., and Kumar, A. V.: Groundwater contaminant source identification
using swarm intelligence-based simulation optimization models, Environ. Sci. Pollut. R., 32, 1626–1639,
<a href="https://doi.org/10.1007/s11356-024-35850-x" target="_blank">https://doi.org/10.1007/s11356-024-35850-x</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>40</label><mixed-citation>
       Wang, J. L., Lin, Y. H., and Lin, M. D.: Application of heuristic algorithms on groundwater pumping source identification problems, in: 2015 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), IEEE, New York, 858–862, <a href="https://doi.org/10.1109/IEEM.2015.7385770" target="_blank">https://doi.org/10.1109/IEEM.2015.7385770</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>41</label><mixed-citation>
       Wang, Z., Lu, W., Chang, Z., and Zhang, T.: Joint identification of groundwater pollution source
information, model parameters, and boundary conditions based on a novel ES-MDA with a wheel battle
strategy, J. Hydrol., 636, 131320, <a href="https://doi.org/10.1016/j.jhydrol.2024.131320" target="_blank">https://doi.org/10.1016/j.jhydrol.2024.131320</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>42</label><mixed-citation>
       Wu, M.: SA-CSA-TS: A multi-chain surrogate-assisted hybrid optimization algorithm combining CSA and TS,
Zenodo [code], <a href="https://doi.org/10.5281/zenodo.17862863" target="_blank">https://doi.org/10.5281/zenodo.17862863</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>43</label><mixed-citation>
       Wu, M., Wang, L., Xu, J., Hu, P., and Xu, P.: Adaptive surrogate-assisted multi-objective evolutionary
algorithm using an efficient infill technique, Swarm Evol. Comput., 75, 101170, <a href="https://doi.org/10.1016/j.swevo.2022.101170" target="_blank">https://doi.org/10.1016/j.swevo.2022.101170</a>,
2022a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>44</label><mixed-citation>
       Wu, M., Wang, L., Xu, J., Wang, Z., Hu, P., and Tang, H.: Multiobjective ensemble surrogate-based
optimization algorithm for groundwater optimization designs, J. Hydrol., 612, 128159,
<a href="https://doi.org/10.1016/j.jhydrol.2022.128159" target="_blank">https://doi.org/10.1016/j.jhydrol.2022.128159</a>, 2022b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>45</label><mixed-citation>
       Xing, Z., Qu, R., Zhao, Y., Fu, Q., Ji, Y., and Lu, W.: Identifying the release history of a groundwater
contaminant source based on an ensemble surrogate model, J. Hydrol., 572, 501–516,
<a href="https://doi.org/10.1016/j.jhydrol.2019.03.020" target="_blank">https://doi.org/10.1016/j.jhydrol.2019.03.020</a>, 2019.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>46</label><mixed-citation>
       Yin, J. and Tsai, F. T.-C.: Bayesian set pair analysis and machine learning based ensemble surrogates for
optimal multi-aquifer system remediation design, J. Hydrol., 580, 124280, <a href="https://doi.org/10.1016/j.jhydrol.2019.124280" target="_blank">https://doi.org/10.1016/j.jhydrol.2019.124280</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>47</label><mixed-citation>
       Zhao, Y., Lu, W., and Xiao, C.: A kriging surrogate model coupled in simulation–optimization approach for
identifying release history of groundwater sources, J. Contam. Hydrol., 185, 51–60,
<a href="https://doi.org/10.1016/j.jconhyd.2016.01.004" target="_blank">https://doi.org/10.1016/j.jconhyd.2016.01.004</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>48</label><mixed-citation>
       Zheng, C. and Wang, P. P.: MT3DMS: A modular three-dimensional multispecies transport model for simulation of advection, dispersion, and chemical reactions of contaminants in groundwater systems; documentation and user’s guide, Contract Report SERDP-99-1, U.S. Army Engineer Research and Development Center, U.S. Army Corps of Engineers, Vicksburg, MS, USA, <a href="https://sav.el.erdc.dren.mil/elpubs/pdf/crserdp99-1.pdf" target="_blank"/> (last access: 15 May 2026), 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>49</label><mixed-citation>
       Zhu, L., Lu, W., Luo, C., Xu, Y., and Wang, Z.: An ensemble optimizer with a stacking ensemble surrogate
model for identification of groundwater contamination source, J. Contam. Hydrol., 267, 104437,
<a href="https://doi.org/10.1016/j.jconhyd.2024.104437" target="_blank">https://doi.org/10.1016/j.jconhyd.2024.104437</a>, 2024.

    </mixed-citation></ref-html>--></article>
