<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">HESS</journal-id><journal-title-group>
    <journal-title>Hydrology and Earth System Sciences</journal-title>
    <abbrev-journal-title abbrev-type="publisher">HESS</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Hydrol. Earth Syst. Sci.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1607-7938</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/hess-30-2417-2026</article-id><title-group><article-title>Triple collocation validates CONUS-wide evapotranspiration inferred from atmospheric conditions</article-title><alt-title>Validated surface flux equilibrium ET across CONUS</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>McCormick</surname><given-names>Erica L.</given-names></name>
          <email>ericamcc@stanford.edu</email>
        <ext-link>https://orcid.org/0000-0002-7160-398X</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2">
          <name><surname>Sanders</surname><given-names>Lillian E.</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3 aff4">
          <name><surname>McColl</surname><given-names>Kaighin A.</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Konings</surname><given-names>Alexandra G.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-2810-1722</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>Department of Earth System Science, Stanford University, Stanford, CA, USA</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Department of Computer Science, Stanford University, Stanford, CA, USA</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Department of Earth and Planetary Sciences, Harvard University, Cambridge, MA, USA</institution>
        </aff>
        <aff id="aff4"><label>4</label><institution>School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Erica L. McCormick (ericamcc@stanford.edu)</corresp></author-notes><pub-date><day>28</day><month>April</month><year>2026</year></pub-date>
      
      <volume>30</volume>
      <issue>8</issue>
      <fpage>2417</fpage><lpage>2432</lpage>
      <history>
        <date date-type="received"><day>28</day><month>August</month><year>2025</year></date>
           <date date-type="rev-request"><day>18</day><month>September</month><year>2025</year></date>
           <date date-type="rev-recd"><day>14</day><month>March</month><year>2026</year></date>
           <date date-type="accepted"><day>10</day><month>April</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Erica L. McCormick et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://hess.copernicus.org/articles/30/2417/2026/hess-30-2417-2026.html">This article is available from https://hess.copernicus.org/articles/30/2417/2026/hess-30-2417-2026.html</self-uri><self-uri xlink:href="https://hess.copernicus.org/articles/30/2417/2026/hess-30-2417-2026.pdf">The full text article is available as a PDF file from https://hess.copernicus.org/articles/30/2417/2026/hess-30-2417-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e128">Large-scale estimation of evapotranspiration (ET) remains challenging because no direct remote sensing estimates of ET exist and because most data-driven estimation approaches require assumptions about the impact of moisture conditions and biogeography on ET. The surface flux equilibrium (SFE) approach offers an alternative, deriving ET directly from atmospheric temperature and humidity under the assumption that conditions in the atmospheric boundary layer reflect ET's land boundary condition. We present a 4 km resolution, continental United States-wide, daily ET dataset spanning from 1979 to 2025 using the SFE method. The Bowen ratio is first calculated using the SFE method solely based on temperature and specific humidity estimates from gridMET and then converted to ET using net radiation and ground heat fluxes from ERA5-Land. We evaluate its performance using extended triple collocation to estimate the standard deviation of the random error and the correlation coefficient of SFE ET compared to true ET, as well as those of three widely used alternative ET datasets: GLEAM, FluxCom, and ERA5-Land. Despite its extreme simplicity, SFE ET achieves performance comparable to or exceeding the other datasets across large portions of CONUS, particularly in the Western U.S., while requiring no information about land surface, vegetation, or soil properties and no assumptions about ET's response to environmental and climate drivers. Our results support the use of SFE as a scalable, observation-driven method for estimating ET.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>National Science Foundation</funding-source>
<award-id>1942133</award-id>
<award-id>Graduate Research Fellowship Program</award-id>
</award-group>
<award-group id="gs2">
<funding-source>Alfred P. Sloan Foundation</funding-source>
<award-id>11974</award-id>
</award-group>
<award-group id="gs3">
<funding-source>Directorate for Geosciences</funding-source>
<award-id>AGS-2129576</award-id>
<award-id>AGS-2441565</award-id>
</award-group>
<award-group id="gs4">
<funding-source>Alfred P. Sloan Foundation</funding-source>
<award-id>FG-2023-19963</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e140">Evapotranspiration (ET) dominates the terrestrial water cycle (Friedlingstein et al., 2019; Good et al., 2015), controls the partitioning of radiation into latent and sensible heat (McColl and Rigden, 2020), and plays a key role in driving the hydrologic cycle by returning water to the atmosphere (Oki and Kanae, 2006). ET therefore has downstream feedbacks on temperature (Teuling et al., 2010), precipitation, and vegetation productivity (Green et al., 2017) in addition to directly impacting the carbon cycles through the trade-off between photosynthesis and transpiration (Yang et al., 2023). However, estimation of ET via remote sensing remains a significant challenge with implications for understanding of vegetation response to drought, fire risk, and the accounting of freshwater resources.</p>
      <p id="d2e143">One challenge for ET remote sensing is that, unlike some surface properties such as temperature, we are unable to directly sense the flux of water or latent heat associated with ET electromagnetically. Therefore, ET products must leverage modelling approaches – either physical, hybrid, or machine learning – constrained by the data that is observable via remote sensing. These modelling approaches for ET often assume – implicitly or explicitly – the response of evaporation and transpiration to environmental drivers, such as drought or variations in land cover.</p>
      <p id="d2e146">Alternatively, surface flux equilibrium (SFE) is a data-driven method for estimating ET directly from atmospheric conditions without relying on soil or vegetation parameterization. The concept of surface flux equilibrium was first proposed by McColl et al. (2019) and states that, under many circumstances, the atmosphere and land surface are coupled so that changes in surface fluxes (including ET) are reflected in atmospheric temperature and humidity. This approach has several advantages over other ET estimation methods. It requires no information about vegetation, soil, or subsurface properties. It also makes no assumptions about root-zone moisture status or vegetation response to water availability. This means it is well suited for hydrological research attempting to interrogate the relationship between ET and water availability or between ET and vegetation cover (or other biogeographic drivers). Additionally, SFE includes no tunable parameters and can be computed easily using only three inputs – air temperature, humidity, and net radiation – each of which is readily available at global scales (McColl and Rigden, 2020).</p>
      <p id="d2e149">However, more complex ET estimation methods would be expected to outperform SFE in many settings due to its extreme simplicity and lack of adjustable parameters. Nevertheless, previous SFE implementation and validation efforts indicate that SFE performance is comparable – or even better than – other ET estimation methods at the point- and watershed- scale (Chen et al., 2021; McColl and Rigden, 2020; Thakur et al., 2025). For example, SFE ET has been found to be within the range of in situ measurement uncertainty at a selection of inland eddy covariance towers, an upper limit on the performance of any ET estimate (McColl and Rigden, 2020). Thakur et al. (2025) also calculated SFE ET at inland eddy covariance sites across the continental United States (CONUS) using tower-based temperature, humidity, and net radiation. They found that SFE ET outperformed remotely sensed ET from MODIS (Mu et al., 2011) as well as from three ET algorithms using data from the ECOsystem Spaceborne Thermal Radiometer Experiment on Space Station (ECOSTRESS): the Simplified Surface Energy Balance (Savoca et al., 2013) SSEBop, (Savoca et al., 2013), the atmosphere-land exchange inverse disaggregation algorithm (DisALEXI) and the Priestley-Taylor Jet Propulsion Laboratory model (PT-JPL, Fisher et al., 2020).</p>
      <p id="d2e153">Thakur et al. (2025) further investigated the impact of input data on SFE performance by calculating SFE ET using three scenarios: only eddy covariance data, by using the North American Land Data Assimilation System (NLDAS, Xia et al., 2012) for temperature and humidity and the Clouds and the Earth's Radiant Energy System instrument (CERES, Doelling et al., 2013) for net radiation, and by finally using NLDAS for temperature and humidity and MODIS for net radiation. All three SFE ET implementations compared favorably to tower-based ET with <inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> of 0.70, 0.68, and 0.67 for the tower-based SFE, CERES-based SFE, and MODIS-based SFE, respectively. This suggests that the emergent simplicity of ET that SFE takes advantage of is robust to choices of input data, at least at the scale of eddy covariance towers.</p>
      <p id="d2e167">The only gridded estimates of SFE ET are reported by Chen et al. (2021), who calculated monthly ET at 0.125° across CONUS using net radiation from CERES and 2 m temperature and humidity from North American Regional Reanalysis (NARR, Mesinger et al., 2006). They compared SFE ET to estimates from the Coupled Model Intercomparison Project phase 6 (CMIP6, Eyring et al., 2016) and to water balance-based ET estimates available at large catchments across CONUS. The error in the water balance-based estimates provides a minimum possible error, below which ET estimation approaches cannot be distinguished due to errors in the underlying reference data. They found that SFE ET errors are comparable to the error of the catchment water balances and that SFE outperforms the reanalysis (NARR) and most CMIP6 91 models.</p>
      <p id="d2e170">However, even this sole gridded implementation of SFE – while promising – is unable to provide a thorough evaluation of the SFE approach because the comparison datasets each have their own unquantified uncertainties. Therefore, disagreement between SFE and CMIP6 cannot be attributed to either dataset because their errors cannot be distinguished. One solution to this is the statistical evaluation approach of triple collocation. Using triple collocation and its 7 updated counterpart, extended triple collocation (McColl et al., 2014), it is possible to compare three datasets with co-located measurements and estimate two important performance metrics: (1) the variability in the random error of each dataset and (2) the correlation between the measured value and the underlying “true” variable. Both performance metrics can be calculated without reference to this unknowable “true” variable, in this case ET, and without assuming the error of any of the three comparison datasets.</p>
      <p id="d2e173">Triple collocation – sometimes also referred to as the “three-cornered hat” approach – has been widely used in evaluating datasets where a “truth” or reference dataset is unavailable, for example in the evaluation of datasets for soil moisture (Draper et al., 2013; Gruber et al., 2016; Scipal et al., 2008), ocean winds (Caires and Sterl, 2003), precipitation (Alemohammad et al., 2015, Burnett et al., 2020), sensible heat and carbon fluxes (Alemohammad et al., 2017), ET (Khan et al., 2018), near-surface air temperature and specific humidity (Sun et al., 2021), and terrestrial water storage (Ferreira et al., 2016). It can also be used to estimate the coupling of multiple variables, for example latent heat and soil moisture (Crow et al., 2015). Given three datasets with observations of the same state variable, each with their own non-correlated random errors, comparison of the three datasets via triple collocation enables calculation of each dataset's random error variance (Stoffelen, 1998).</p>
      <p id="d2e176">Here, we accomplish two steps in advancing the estimation of ET. First, we release the first publicly available, gridded dataset of daily SFE ET. We calculate this dataset at 4 km resolution across the continental United States (CONUS) using gridMET for 2 m temperature and humidity and net radiation from ERA5-Land. Second, we compare our gridded estimates of SFE ET to three other remotely sensed ET estimates: Global Land Evaporation Amsterdam Model Version 4 (GLEAM, Miralles et al., 2011), FluxCom (Jung et al., 2019), and ERA5-Land (Muñoz-Sabater et al., 2021). In addition to comparing the spatial pattern and variance of all datasets, we further use the statistical method of extended triple collocation following McColl et al. (2014) to calculate the error statistics of each dataset, despite lacking observations of “true” ET (Gruber et al., 2016; McColl et al., 2014; Stoffelen, 1998).</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Methods</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Calculating ET from atmospheric conditions assuming surface flux equilibrium</title>
      <p id="d2e194">We calculate daily ET after McColl et al. (2019) by assuming that the near-surface atmosphere is in a state of “surface flux equilibrium” where atmospheric conditions at the boundary layer reflect the recent fluxes of latent (<inline-formula><mml:math id="M2" display="inline"><mml:mrow><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>E</mml:mi></mml:mrow></mml:math></inline-formula>) and sensible (<inline-formula><mml:math id="M3" display="inline"><mml:mi>H</mml:mi></mml:math></inline-formula>) heat on the Earth's surface. If this is the case, then increasing ET (i.e. increasing latent heat) will correspond with diminished sensible heat and result in both atmospheric cooling and increased humidity. The ratio of sensible and latent heat fluxes – known as the Bowen ratio (<inline-formula><mml:math id="M4" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>) – can therefore be approximated by temperature and humidity at the boundary layer, so long as atmospheric conditions reflect the integrated signal of fluxes on the Earth's surface.</p>
      <p id="d2e221">We use 2 m air temperature (<inline-formula><mml:math id="M5" display="inline"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mi mathvariant="normal">a</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) and relative humidity (<inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">a</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) from gridMET (Abatzoglou, 2013) to estimate the Bowen ratio, where <inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi mathvariant="normal">v</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">461.5</mml:mn></mml:mrow></mml:math></inline-formula> (J kg<sup>−1</sup> K<sup>−1</sup>) is the gas constant for water vapor, <inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1005</mml:mn></mml:mrow></mml:math></inline-formula> (J kg<sup>−1</sup> K<sup>−1</sup>) is the specific heat capacity of air at constant pressure, and <inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:mi mathvariant="italic">λ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2.56</mml:mn></mml:mrow></mml:math></inline-formula> 139 <inline-formula><mml:math id="M14" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 106 (J kg<sup>−1</sup>) is the latent heat of vaporization of water (Eq. 1).

            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M16" display="block"><mml:mrow><mml:mi>B</mml:mi><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mi>H</mml:mi><mml:mrow><mml:mi>L</mml:mi><mml:mi>E</mml:mi></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>≈</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi mathvariant="normal">v</mml:mi></mml:msub><mml:msub><mml:mi>c</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:msubsup><mml:mi>T</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:msup><mml:mi mathvariant="italic">λ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:msub><mml:mi>q</mml:mi><mml:mi mathvariant="normal">a</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e410">We choose gridMET because it downscales output from the North American Land Data Assimilation System (NLDAS) with PRISM. This incorporation of statistically interpolated station data at a fine resolution helps gridMET achieve a high correlation with in situ stations, particularly for the variable of temperature, while maintaining a relatively fine spatial resolution of 4 km across CONUS (Abatzoglou, 2013). Net radiation (<inline-formula><mml:math id="M17" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi mathvariant="normal">n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) allows conversion from the Bowen ratio to ET (Eq. 2). We use <inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi mathvariant="normal">n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> from ERA5-Land (Muñoz-Sabater et al., 2021) because of its high agreement with in situ measurements across CONUS (Yin et al., 2023). However, we note that error in these input datasets will propagate to error in the resulting ET estimates.</p>
      <p id="d2e435">Although the ground heat flux (<inline-formula><mml:math id="M19" display="inline"><mml:mi>G</mml:mi></mml:math></inline-formula>) can vary from 10 % to as much as 50 % of <inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi mathvariant="normal">n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> depending on ground cover (Clothier et al., 1986, Santanello and Friedl, 2003), here we assume a constant <inline-formula><mml:math id="M21" display="inline"><mml:mi>G</mml:mi></mml:math></inline-formula> of 10 %. Additionally, we do not evaluate SFE ET on any days with negative <inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi mathvariant="normal">n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> because doing so would result in a negative ET estimate, which is not physical.

            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M23" display="block"><mml:mrow><mml:mi mathvariant="italic">λ</mml:mi><mml:mi mathvariant="normal">ET</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi>B</mml:mi><mml:msup><mml:mo>)</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:msub><mml:mi>R</mml:mi><mml:mi mathvariant="normal">n</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mi>G</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula></p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Triple collocation error estimation</title>
      <p id="d2e525">Triple collocation assumes a linear error model for each dataset, where the observed value for a given dataset (<inline-formula><mml:math id="M24" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>) is assumed to be a linear function of the “true” ET (<inline-formula><mml:math id="M25" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>) obscured by a constant additive bias (<inline-formula><mml:math id="M26" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>), a constant multiplicative bias (<inline-formula><mml:math id="M27" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula>) and a time-varying additive random error with zero mean (<inline-formula><mml:math id="M28" display="inline"><mml:mi mathvariant="italic">ϵ</mml:mi></mml:math></inline-formula>) (Eq. 3). While a linear error model likely does not fully capture the error structure of the actual ET dataset errors, it has been successfully used to evaluate ET datasets using triple collocation in other regions (Kahn et al., 2018; He et al., 2023).

            <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M29" display="block"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>+</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mi>T</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mo>+</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e604">In addition to assuming a linear error model for each dataset, triple collocation further assumes that the errors of each dataset are stationary and uncorrelated both with each other and with the unknown truth (Gruber et al., 2016; McColl et al., 2014).</p>
      <p id="d2e607">With these assumptions, the variance of each dataset (<inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">11</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">22</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">33</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) represents the sensitivity of the dataset to variations in the true signal (via the product of <inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M34" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) plus the variance of the random error (<inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mi>i</mml:mi></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>) (Eq. 4).

            <disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M36" display="block"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>i</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mo>=</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msubsup><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>T</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mi>i</mml:mi></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e735">Covariance between pairs of datasets (e.g. <inline-formula><mml:math id="M37" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">12</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">13</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">23</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) likewise provides information about each dataset's sensitivity to the true unknown ET via <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. (Eq. 5).

            <disp-formula id="Ch1.E5" content-type="numbered"><label>5</label><mml:math id="M42" display="block"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>T</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e840">The <inline-formula><mml:math id="M43" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> terms cancel out for the ratio of each dataset covariance pair, resulting in six equations and six unknowns. These can be solved to calculate the standard deviation of the random error of each dataset, <inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (Eq. 6).

            <disp-formula id="Ch1.E6" content-type="numbered"><label>6</label><mml:math id="M46" display="block"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mfenced close="]" open="["><mml:mtable class="array" columnalign="left"><mml:mtr><mml:mtd><mml:msqrt><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">11</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">12</mml:mn></mml:msub><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">13</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">23</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mstyle></mml:mrow></mml:msqrt></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:msqrt><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">22</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">12</mml:mn></mml:msub><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">23</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">13</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mstyle></mml:mrow></mml:msqrt></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:msqrt><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">33</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">13</mml:mn></mml:msub><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">23</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">12</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mstyle></mml:mrow></mml:msqrt></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e985">The absolute values of <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> cannot be separated from the absolute value of <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. However, many studies assume <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> for one dataset – effectively choosing it as a reference dataset which has no multiplicative bias – and calculate <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for the other two datasets <italic>relative</italic> to the actual unknown multiplicative bias of the reference dataset. In this study, however, we do not separate <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e1062">Extended triple collocation further allows the calculation of the correlation between each dataset and the unknown truth, <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, while requiring no additional information (McColl et al., 2014); Eq. 7).

            <disp-formula id="Ch1.E7" content-type="numbered"><label>7</label><mml:math id="M54" display="block"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mi>T</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>=</mml:mo><mml:mfenced open="[" close="]"><mml:mtable class="array" columnalign="left"><mml:mtr><mml:mtd><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">12</mml:mn></mml:msub><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">13</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">11</mml:mn></mml:msub><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">23</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mstyle></mml:mtd></mml:mtr><mml:mtr><mml:mtd/></mml:mtr><mml:mtr><mml:mtd><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">12</mml:mn></mml:msub><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">23</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">22</mml:mn></mml:msub><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">13</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mstyle></mml:mtd></mml:mtr><mml:mtr><mml:mtd/></mml:mtr><mml:mtr><mml:mtd><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">13</mml:mn></mml:msub><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">23</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">33</mml:mn></mml:msub><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">12</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mstyle></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e1177">Triple collocation requires several assumptions, all of which are likely to be at least partially violated (e.g., Yilmaz and Crow, 2014). However, these assumptions are not unique to triple collocation. Gruber et al. (2016) showed that more common validation strategies implicitly require the same assumptions. For example, if we were to instead estimate the correlation coefficient and root-mean-squared error (RMSE) between SFE ET and another reference ET product, we would be implicitly making the same assumptions.</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Comparison ET datasets</title>
      <p id="d2e1188">We compare SFE ET to ET from FluxCom, GLEAM version 4, and ERA5-Land. We compare all ET datasets over the years 1980 to 2016, which represents the maximum overlap in temporal coverage between all four datasets. Additionally, we resample each dataset to match the native resolution of FluxCom at 0.5°. We match the FluxCom resolution because it is the coarsest. We choose to compare SFE to these particular three ET datasets not just because they are commonly used, but also to minimize violation of the triple collocation assumptions, particularly the assumption of independent errors between datasets. This is commonly achieved by using datasets that differ in their input data sources and modeling frameworks (Gruber et al., 2016; McColl et al., 2014). We also remove the seasonal cycle from each dataset by subtracting the 30 d rolling average from each day (Chen et al., 2018; Draper et al., 2013; Miralles et al., 2010). This ensures that differences in the seasonality and timing of ET do not impact the triple collocation analysis and has been shown to improve error estimation with triple collocation for ET datasets specifically (He et al., 2023). Performing triple collocation on the anomaly should also reduce violation of the assumption that the ET error structure is linear. This is because the low-frequency (e.g. seasonal) ET signals which are removed are expected to have a different non-linearity than the high-frequency signals isolated by the anomaly (Miralles et al., 2010; Su et al., 2014).</p>
      <p id="d2e1191">After removing the seasonal cycle, we choose only the months of March through October for the triple collocation analysis. This is because negative daily net radiation occurs for some pixels during the winter months, prohibiting the calculation for SFE. Because the number of days with negative net radiation varies for each pixel, we eliminate all winter months for all datasets to ensure a consistent number of data for each dataset and pixel.</p>
      <p id="d2e1194">Finally, we use extended triple collocation to calculate the standard deviation of the random error and the correlation coefficient of each dataset (see Sect. 2.2 above). Because we have four comparison datasets and triple collocation requires just three, we are able to repeat our estimates of each dataset's error statistics once for each possible “triplet” (i.e. combination) of three datasets. Convergence of the error estimates regardless of the triplet chosen increases our confidence in the robustness of the triple collocation assumptions and therefore in our calculated values (Draper et al., 2013; He et al., 2023). In addition to performing triple collocation, we also compare the four datasets via a general analysis of the variance and spatial patterns of ET.</p>
      <p id="d2e1197">The FluxCom dataset we choose for our triple collocation analysis uses machine learning to upscale eddy covariance measurements from flux towers based on satellite and meteorological inputs. FluxCom provides an ensemble of latent heat estimates trained using different meteorological datasets. In order to have the longest data record with daily resolution, here we use the single FluxCom ensemble member trained with the CRUNCEPv6 reanalysis product (Wei et al., 2014), as opposed to the mean of all possible FluxCom ensemble members. However, the different model setups (each with a different weather model) were previously found to have similar performance (Jung et al., 2019). In addition to the climate data from CRUNCEP, FluxCom uses radiation data from CERES (Doelling et al., 2013), precipitation from the Global Precipitation Climatology Project (GPCP, Huffman et al., 2001), and temperature, land cover, and other reflectance indicators from MODIS. The FluxCom model is run per plant functional type and then combined into a single estimate by weighting each plant functional type's fractional areal coverage of the pixel (Jung et al., 2019).</p>
      <p id="d2e1201">GLEAM estimates ET by using remote sensing and reanalysis data to force a hybrid model which includes modules for canopy interception, potential evapotranspiration, soil water content, and vegetation response to evaporative stress. Although FluxCom and GLEAM have some remote sensing inputs in common, for example radiation from CERES and vegetation information from MODIS, Gleam Version 4 takes a hybrid modelling approach and does not rely fully on machine learning like FluxCom. Specifically, GLEAM version 4 primarily uses physical modelling modules with only a single module – for evaporative stress – using a deep neural network trained using in situ data from eddy covariance towers and sap flow measurements (Koppa et al., 2022; Martens et al., 2017; Miralles et al., 2025). This is in contrast to GLEAM version 3, which estimates evaporative stress empirically as a function of soil moisture and vegetation optical depth – both from microwave remote sensing inputs. Additionally, GLEAM Version 4 calculates ET using Penman's equation (as opposed to Priestley-Taylor, used in Version 3) and also updates the multi-layer water balance model so that vegetation access to groundwater can be represented. However, in GLEAM Version 4, plant rooting depths are static for each land cover within the groundwater scheme and there is still a prescribed multiplicative stress function to determine how vegetation responds to soil moisture stress. GLEAM is the only dataset in our comparison set which partitions ET between evaporation, transpiration, and interception. We use the variable referring to the total evaporation (<inline-formula><mml:math id="M55" display="inline"><mml:mi>E</mml:mi></mml:math></inline-formula>) to best match the other ET estimates.</p>
      <p id="d2e1211">Finally, ERA5-Land uses the near-surface atmospheric reanalysis from ERA5, which assimilates observations from a range of satellites and in situ observation networks for many variables including land surface temperature, precipitation, wind speed, and soil moisture (Hersbach et al., 2020). ERA5-Land then takes the atmospheric states from ERA5 and re-runs the land surface model component at a finer resolution (9 km) offline (Muñoz-Sabater et al., 2021). This allows for additional and refined land surface parameterizations and corrections. Unlike FluxCom and GLEAM, ERA5-Land has no machine learning components. For our analysis, we sum the hourly latent heat flux output of ERA5-Land to daily totals and then resample bilinearly to match the coarser 0.5° FluxCom grid. Finally, both ERA5-Land and FluxCom report latent heat flux in units of energy per unit area, which we convert to ET (mm d<sup>−1</sup>) by dividing by the latent heat of vaporization (<inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:mi mathvariant="italic">λ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2.56</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">106</mml:mn></mml:mrow></mml:math></inline-formula> J kg<sup>−1</sup>).</p>
</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Comparing performance across biogeographical factors</title>
      <p id="d2e1262">We compare the resulting <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> estimates from triple collocation across a variety of biogeographical factors – specifically climate, elevation, land cover type, and the distance to the coast – to better understand under what conditions SFE ET performs well and how its performance across biogeography compares to that of the other ET estimates.</p>
      <p id="d2e1287">We calculate the mean annual precipitation at each pixel using monthly precipitation (<inline-formula><mml:math id="M61" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula>) from 1991 to 2020 from TerraClimate (Abatzoglou, 2013).We use elevation from MERIT Hydro (Version 1.0.1., Yamazaki et al., 2019). For land cover, we use the National Land Cover Database (NLCD) land cover map from 2021 (Dewitz, 2024). We consider the land cover types of forest (combining deciduous, evergreen, and mixed forests), shrub, grassland, wetland (combining woody and herbaceous wetlands), and agricultural (cultivated crops).</p>
      <p id="d2e1297">We further analyze the performance of each dataset by each pixel's distance from the coast because the assumptions of SFE are likely to be violated near the ocean (McColl et al., 2019). This is because in coastal regions, ocean moisture and temperature are expected to be a strong control on land surface fluxes. We calculate the distance of each pixel centroid from the nearest coast using the TIGER/Line Coastline National Shapefile (United States Census Bureau, 2019). We also exclude pixels from all analyses if their centroid overlaps with the ten largest water bodies in CONUS (ArcGIS Data and Maps, 2023).</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Surface flux equilibrium ET across CONUS from 1979 to 2025</title>
      <p id="d2e1316">Here, we publicly release a dataset of daily SFE ET from 1979 to 2025 at 4 km resolution across CONUS (see Data Availability section). The spatial mean (shown in Fig. 1a) follows expected patterns across CONUS – with an aridity driven gradient from West to East and a radiation driven gradient from North to South in the Eastern US. This spatial pattern exists regardless of the choice of parameter for the ground heat flux (<inline-formula><mml:math id="M62" display="inline"><mml:mi>G</mml:mi></mml:math></inline-formula>), although the magnitude of mean annual ET is altered (Fig. S1). The temporal variability in daily ET calculated using the SFE approach is consistent with the comparison datasets (Fig. S2). However, SFE has a larger standard deviation across much of CONUS – particularly the Western US – than FluxCom and GLEAM. Across several sample pixels, chosen as heavily vegetated examples spanning multiple regions, the seasonal cycle of mean annual ET is likewise comparable across all four ET estimates, although the timing of maximum summer ET each year varies between datasets (Fig. 1b–g).</p>
      <p id="d2e1326">The magnitude of mean annual continental SFE ET (Fig. 2) and the pattern of interannual variability which matches SFE the best is that of GLEAM (<inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.55</mml:mn></mml:mrow></mml:math></inline-formula>). The two datasets with the overall closest match in ET interannual variability, however, are FluxCom and ERA5-Land (<inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.71</mml:mn></mml:mrow></mml:math></inline-formula>). All correlation coefficients are shown in Table S1. Although SFE and FluxCom each have intermediate magnitudes of mean continental ET relative to GLEAM and ERA5, both datasets – and FluxCom in particular – also have the lowest interannual variability magnitude (8 mm yr<sup>−1</sup> standard deviation for FluxCom and 10.5 mm yr<sup>−1</sup> for SFE, compared to 22 and 28 mm yr<sup>−1</sup> for ERA5-Land and GLEAM, respectively). Across the entire average record, the mean annual ET from SFE (538 mm yr<sup>−1</sup>) is just below GLEAM (552 mm yr<sup>−1</sup>), with ERA5-Land having the highest mean annual ET (645 mm yr<sup>−1</sup>). The mean annual ET across CONUS is shown in Table S2.</p>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e1428">Mean annual SFE ET across CONUS from 1979 to 2025. Points show timeseries for example pixels for SFE (green), ERA5-Land (blue), GLEAM (purple) and FluxCom (pink).</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/2417/2026/hess-30-2417-2026-f01.png"/>

        </fig>

      <fig id="F2"><label>Figure 2</label><caption><p id="d2e1440">Interannual variability in mean annual ET across CONUS from 1979 through the record length of each dataset.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/2417/2026/hess-30-2417-2026-f02.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>SFE is the only dataset that performs well in terms of both the standard deviation of the random error and the correlation coefficient</title>
      <p id="d2e1457">SFE performance during non-winter months as estimated by triple collocation is comparable – and even exceeds – the performance of the comparison datasets across much of CONUS, despite its extreme simplicity, lack of tunable parameters, and relatively small number of assumptions (Fig. 3). SFE, FluxCom, and GLEAM show a strong divide in performance between the Western and Eastern US. SFE and FluxCom both have the lowest <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and highest <inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in the Western US compared to the Eastern US. In contrast, GLEAM has lower <inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:mrow></mml:math></inline-formula> in the Western US, but higher <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in the Eastern US. ERA5-Land shows more heterogeneity in performance across space – especially compared to SFE and FluxCom – and has no clear performance gradient between the Western and Eastern US.</p>

      <fig id="F3"><label>Figure 3</label><caption><p id="d2e1505">The standard deviation of the random error, <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (left) and correlation coefficient to the truth, <inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (right) for each dataset averaged across all triplet combinations. Increasingly light colors are better performance. White pixels have no valid data for any triplet.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/2417/2026/hess-30-2417-2026-f03.png"/>

        </fig>

      <p id="d2e1536">Despite its simplicity, SFE is the best or second-best dataset according to both <inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> across more than half of CONUS (Fig. 4). SFE has the lowest or second lowest <inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and highest or second highest <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> across 46.1 % and 77.9 % of pixels across CONUS, respectively (Fig. 4, Table 1), mostly in the Western US.</p>
      <p id="d2e1584">SFE's high performance with regards to both <inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M82" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is unique among the comparison datasets. Other than SFE, the datasets with the best <inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M84" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, respectively, have the lowest performance for the complementary metric. For example, FluxCom has the lowest <inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> across the majority of CONUS, but it also has the lowest <inline-formula><mml:math id="M86" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (Fig. 4). The opposite is true for ERA5, which is the highest performing dataset according to <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> across much of CONUS but frequently has the worst performance according to <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, particularly in the US Southwest. SFE is the only dataset which consistently has high performance according to both metrics.</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e1678">Summary of relative performance of all four datasets. The dataset with highest performance for the standard deviation of the random error, <inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(a)</bold> and the correlation coefficient with “true” ET, <inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(b)</bold> for each pixel. The worst performing datasets for <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(c)</bold> and <inline-formula><mml:math id="M92" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(d)</bold>. The relative ranking of SFE for <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(e)</bold> and <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <bold>(f)</bold>. The total number of pixels (and relative percent of pixels) of each color are shown in Table S1. Pixels with centroids within 4 km of the border have been removed.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/2417/2026/hess-30-2417-2026-f04.png"/>

        </fig>

<table-wrap id="T1"><label>Table 1</label><caption><p id="d2e1776">(Top) The number of pixels where each dataset has the best performance according to the standard deviation of the random error, <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, and the correlation coefficient to the truth, <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. (Bottom) The number of pixels by SFE ET ranking.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="left" colsep="1"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col5" align="center">Best dataset </oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry namest="col2" nameend="col3" align="center">By <inline-formula><mml:math id="M97" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry namest="col4" nameend="col5" align="center">By <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Pixels</oasis:entry>
         <oasis:entry colname="col3">Percent</oasis:entry>
         <oasis:entry colname="col4">Pixels</oasis:entry>
         <oasis:entry colname="col5">Percent</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SFE</oasis:entry>
         <oasis:entry colname="col2">164</oasis:entry>
         <oasis:entry colname="col3">(5.4 %)</oasis:entry>
         <oasis:entry colname="col4">115</oasis:entry>
         <oasis:entry colname="col5">(3.8 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">GLEAM</oasis:entry>
         <oasis:entry colname="col2">17</oasis:entry>
         <oasis:entry colname="col3">(0.6 %)</oasis:entry>
         <oasis:entry colname="col4">159</oasis:entry>
         <oasis:entry colname="col5">(5.2 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">FLUXCOM</oasis:entry>
         <oasis:entry colname="col2">2537</oasis:entry>
         <oasis:entry colname="col3">(83.7 %)</oasis:entry>
         <oasis:entry colname="col4">33</oasis:entry>
         <oasis:entry colname="col5">(1.1 %)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">ERA5-Land</oasis:entry>
         <oasis:entry colname="col2">314</oasis:entry>
         <oasis:entry colname="col3">(10.4 %)</oasis:entry>
         <oasis:entry colname="col4">2725</oasis:entry>
         <oasis:entry colname="col5">(89.9 %)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col5" align="center">Ranking of SFE </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry namest="col2" nameend="col3" align="center">By <inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry namest="col4" nameend="col5" align="center">By <inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Pixels</oasis:entry>
         <oasis:entry colname="col3">Percent</oasis:entry>
         <oasis:entry colname="col4">Pixels</oasis:entry>
         <oasis:entry colname="col5">Percent</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1st</oasis:entry>
         <oasis:entry colname="col2">111</oasis:entry>
         <oasis:entry colname="col3">(3.7 %)</oasis:entry>
         <oasis:entry colname="col4">156</oasis:entry>
         <oasis:entry colname="col5">(5.1 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2nd</oasis:entry>
         <oasis:entry colname="col2">1286</oasis:entry>
         <oasis:entry colname="col3">(42.4 %)</oasis:entry>
         <oasis:entry colname="col4">2206</oasis:entry>
         <oasis:entry colname="col5">(72.8 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3rd</oasis:entry>
         <oasis:entry colname="col2">1397</oasis:entry>
         <oasis:entry colname="col3">(46.1 %)</oasis:entry>
         <oasis:entry colname="col4">646</oasis:entry>
         <oasis:entry colname="col5">(21.3 %)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4th</oasis:entry>
         <oasis:entry colname="col2">238</oasis:entry>
         <oasis:entry colname="col3">(7.8 %)</oasis:entry>
         <oasis:entry colname="col4">24</oasis:entry>
         <oasis:entry colname="col5">(0.8 %)</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e2077">We note that the estimates of <inline-formula><mml:math id="M101" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are consistent between triplets, indicating <inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> estimates are robust to the choice of comparison datasets (Fig. 5). Individual <inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> maps for each dataset and triplet combination are shown in Figs. S3 and S4 and differences between each triplet combination are shown in Figs. S5 and S6. However, not all pixels have valid results for each triplet combination, which occurs when either <inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is negative for one or more of the datasets or if any <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are greater than one. Figure 6 shows the total number of triplets which are valid for each pixel. The triplets with the most invalid pixels are those where FluxCom and ERA5-Land are both included. Invalid pixels are also more common in the Eastern US rather than the Western US. Even in the East, however, SFE – our main estimate of interest – still has at least one valid triplet in 96 % of pixels and at least two valid triplets in 88 % of pixels. SFE has three valid triplets – the maximum possible number for our four dataset analysis – in 56 % of pixels. The triple collocation results are also relatively insensitive to the choice of the ground heat flux (<inline-formula><mml:math id="M109" display="inline"><mml:mi>G</mml:mi></mml:math></inline-formula>) parameter used in the calculation of SFE, although increases in <inline-formula><mml:math id="M110" display="inline"><mml:mi>G</mml:mi></mml:math></inline-formula> necessarily reduce ET estimates, and therefore also reduce <inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (Fig. S7). To the extent that uncertainty in <inline-formula><mml:math id="M112" display="inline"><mml:mi>G</mml:mi></mml:math></inline-formula> causes errors in the SFE ET estimate, it will also cause errors in estimates from other ET products, which must make similar assumptions or approximations for <inline-formula><mml:math id="M113" display="inline"><mml:mi>G</mml:mi></mml:math></inline-formula>.</p>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e2212">(left) The coefficient of variation of <inline-formula><mml:math id="M114" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for each dataset across all possible triplet combinations with valid data. White pixels have no valid data for any triplet. (right) The standard deviation of <inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for each dataset across all possible triplet combinations with valid data. White pixels have no valid data for any triplet and black pixels have only one triplet combination with valid data.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/2417/2026/hess-30-2417-2026-f05.png"/>

        </fig>

      <fig id="F6"><label>Figure 6</label><caption><p id="d2e2245">The total number of triple collocation estimates – one from each possible combination of datasets – that are averaged for each pixel and dataset combination. Pixels with no valid triple collocation results for any triplet are shown in white. The maximum number of valid triplets is three.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/2417/2026/hess-30-2417-2026-f06.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Performance across biogeographical factors</title>
      <p id="d2e2262">Comparing the trends of <inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (Fig. 7) and <inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (Fig. 8) across mean annual precipitation, elevation, landcover, and the distance to large water bodies shows that SFE performance is not more sensitive to any of these biogeographical factors than the comparison datasets. Even when comparing SFE performance with coastal proximity – a factor where we expect to see performance degradation due to the violation of SFE assumptions (McColl and Rigden, 2020) – the coastal proximity penalty of SFE is comparable to that of ERA5-Land. Indeed, ERA5-Land shows the sharpest decrease in performance within 20 km of the coast out of any of the datasets, however both SFE and ERA5-Land continue to show improved performance even up to 120 km inland. Neither GLEAM nor FluxCom have a strong relationship between coastal proximity and performance.</p>
      <p id="d2e2287">Likely due to its correlation with coastal proximity, SFE also has decreased performance at lower elevations with respect to both evaluation metrics. FluxCom and GLEAM likewise show their highest <inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> at low elevations relative to higher elevations, with FluxCom <inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> peaking around 500 m a.s.l. and GLEAM <inline-formula><mml:math id="M120" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> around 1000 m a.s.l. All three datasets continue to have decreased <inline-formula><mml:math id="M121" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:mrow></mml:math></inline-formula> as elevation increases. The relationship between elevation and <inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is relatively flat for SFE and FluxCom in the intermediate elevations, with the lowest <inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> at the extreme low and high elevations. GLEAM and ERA5, however, have continuously decreasing <inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> with increasing elevation, and the lowest <inline-formula><mml:math id="M125" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> at elevations exceeding 2000 m a.s.l.</p>

      <fig id="F7" specific-use="star"><label>Figure 7</label><caption><p id="d2e2380">The standard deviation of the random error, <inline-formula><mml:math id="M126" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, for each ET dataset across mean annual precipitation, the distance to large water bodies, elevation, and land cover. The number of pixels in each category per ET dataset is shown below boxes.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/2417/2026/hess-30-2417-2026-f07.png"/>

        </fig>

      <p id="d2e2401">The <inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for SFE, GLEAM, and FluxCom is lowest at the driest and wettest pixels and highest at pixels with intermediate precipitation. However, the <inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for GLEAM peaks at the 500–750 mm yr<sup>−1</sup> bin whereas FluxCom and SFE have the highest <inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> at slightly wetter locations, receiving between 1000–1250 mm yr<sup>−1</sup>. ERA5-Land, on the other hand, has a weaker relationship between mean annual precipitation and <inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. ERA5-Land has the opposite pattern than the other datasets and shows the highest <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> at the driest and wettest pixels with lower <inline-formula><mml:math id="M134" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> at intermediate aridity. The relationship between mean annual precipitation and <inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> follows that of mean annual precipitation and <inline-formula><mml:math id="M136" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in general, however <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> does not increase at the wettest pixels to the same degree as for the <inline-formula><mml:math id="M138" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. For example, SFE has continually decreasing <inline-formula><mml:math id="M139" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> as mean annual precipitation increases with only a minimal increase in performance at the pixels with <inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:mi mathvariant="italic">&gt;</mml:mi><mml:mn mathvariant="normal">1500</mml:mn></mml:mrow></mml:math></inline-formula> mm yr<sup>−1</sup> of precipitation.</p>

      <fig id="F8" specific-use="star"><label>Figure 8</label><caption><p id="d2e2575">The correlation coefficient, <inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, for each ET dataset across mean annual precipitation, the distance to large water bodies, elevation, and land cover. The number of pixels in each category per ET dataset is shown below boxes.</p></caption>
          <graphic xlink:href="https://hess.copernicus.org/articles/30/2417/2026/hess-30-2417-2026-f08.png"/>

        </fig>

      <p id="d2e2595">The performance variability across land cover is not consistent between any of the datasets. ERA5-Land has the lowest <inline-formula><mml:math id="M143" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:mrow></mml:math></inline-formula> and highest <inline-formula><mml:math id="M144" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in agricultural pixels, GLEAM in forest pixels, and FluxCom in shrubland pixels. The SFE <inline-formula><mml:math id="M145" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is similar across all land cover types but SFE <inline-formula><mml:math id="M146" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is highest in wetlands, followed by forest and agricultural pixels. Forested pixels also have a greater spread in <inline-formula><mml:math id="M147" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for FluxCom and SFE compared to the other land cover types. SFE <inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is lowest in shrublands, followed by grasslands. FluxCom <inline-formula><mml:math id="M149" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is likewise lowest for grassland and shrublands, which is the opposite of ERA5-Land, with the highest <inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in grasslands and shrublands.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Discussion</title>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Which ET estimate is most accurate?</title>
      <p id="d2e2702">While triple collocation reveals that SFE is rarely the highest performing dataset for the non-winter months evaluated here, it is the second-best performing dataset across much of CONUS for both <inline-formula><mml:math id="M151" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (Fig. 4e, f). In addition, we find that datasets which outperform SFE only exhibit better performance for one – not both – of either <inline-formula><mml:math id="M153" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. That SFE performs well – although not the best – for both metrics suggests its usefulness for a variety of applications, particularly those where it is not clear a priori whether having high <inline-formula><mml:math id="M155" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> or low <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is most useful. Furthermore, SFE may be a particularly good choice for studies interested in the response of ET to water limitations. Unlike the explicitly assumed dependence of ET on hydrologic conditions in ERA5-Land or the implicitly assumed dependence of GLEAM and FluxCom (which is limited by the constraints of the machine learning structure and input data), SFE contains no a priori assumptions about the effect of water stress on ET, aside from any impact of these assumptions embedded in the interpolated temperature or humidity data used as an input to SFE calculation (such as for the gridMET data used here). Our release, alongside this manuscript, of a daily, 4 km resolution CONUS-wide dataset of SFE-based ET spanning 1979 to 2025 should facilitate future applications of SFE for scientific analyses. Additionally, there is no reason to believe that SFE should not perform similarly at the global scale, particularly outside of regions with substantial influence from ocean dynamics.</p>
      <p id="d2e2772">SFE is generally the second-best dataset regardless of metric, while alternative datasets with low random noise also have low correlation with the truth and vice versa. For example, across the four datasets tested, FluxCom has the lowest (most desirable) <inline-formula><mml:math id="M157" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> across the majority of CONUS pixels (Fig. 4a). However, it also has the lowest (least desirable) <inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> more often than any other datasets (Fig. 4d). ERA5-Land shows the converse relationship, with the highest (most desirable) <inline-formula><mml:math id="M159" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in almost all pixels compared to all other datasets, but poorer relative performance with regard to <inline-formula><mml:math id="M160" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:mrow></mml:math></inline-formula> (Fig. 4b, c). How is this possible? To understand why, note that the triple collocation error model implies that,

            <disp-formula id="Ch1.E8" content-type="numbered"><label>8</label><mml:math id="M161" display="block"><mml:mrow><mml:msubsup><mml:mi>R</mml:mi><mml:mrow><mml:mi>T</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>T</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>T</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula>

          as shown in McColl et al. (2014). For a dataset to exhibit both the lowest <inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and lowest <inline-formula><mml:math id="M163" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> requires that <inline-formula><mml:math id="M164" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula> is also sufficiently small (<inline-formula><mml:math id="M165" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the same for each dataset and does not impact the ranking). An extreme example would be a dataset that simply set ET to a fixed climatological value and exhibited no temporal variability, for which <inline-formula><mml:math id="M166" display="inline"><mml:mrow><mml:mi mathvariant="italic">β</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M167" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, even when <inline-formula><mml:math id="M168" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is small. At the other extreme, for a dataset to exhibit both highest <inline-formula><mml:math id="M169" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and highest <inline-formula><mml:math id="M170" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> requires <inline-formula><mml:math id="M171" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula> to be sufficiently large. In the limit of <inline-formula><mml:math id="M172" display="inline"><mml:mrow><mml:mi mathvariant="italic">β</mml:mi><mml:mo>→</mml:mo><mml:mi mathvariant="normal">∞</mml:mi></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M173" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, even when <inline-formula><mml:math id="M174" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is large. The relative importance of choosing a dataset with a low <inline-formula><mml:math id="M175" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, a high <inline-formula><mml:math id="M176" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, or a low bias (which is not assessed here), depends on the application for which the ET dataset will be used (Entekhabi et al., 2010).</p>
      <p id="d2e3054">Beyond choosing a single dataset for a particular application, it is also possible to average multiple ET estimates into a single dataset weighted by each dataset's performance. While not often practical for large-scale use, He et al. (2023) used triple collocation to estimate an “optimal” ET product over China by weighting each dataset by the performance of the triple collocation results in order to minimize <inline-formula><mml:math id="M177" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.”. Burnett et al. (2020) also used this approach to generate a new rainfall product for the Congo River Basin. Such an approach was also proposed as a possible way forward by the WAter Cycle Multi- mission Observation Strategy (WACMOS) project, with the specific suggestion that ET datasets could be combined on a per-biome scale, if some datasets are known to perform better or worse under specific conditions (Miralles et al., 2016). However, this approach has the disadvantage of obscuring the individual problems with each dataset, especially if the datasets have different systematic errors or biases which are not accounted for by the random error variance and correlation coefficient metrics available through triple collocation analysis.. It may also perturb the larger-scale spatial patterns of ET. Given that the validity of the assumptions behind triple collocation are not fully known, any such effort would benefit from additional corroboration of the estimated uncertainties.</p>
</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Do spatial patterns in SFE performance match our expectation?</title>
      <p id="d2e3076">We find that the performance of SFE is not more sensitive to biogeographical gradients than that of other datasets, suggesting that the simplicity of SFE does not exacerbate performance issues for specific climate, vegetation, or topographical environments. This is particularly surprising given the previously hypothesized limitation of SFE in coastal regions, where atmospheric conditions strongly depend on the influence of the ocean as well as on recent land fluxes (McColl and Rigden, 2020). However, the SFE method has not previously been applied within 250 km of the coast, let alone had its errors characterized in these regions. Therefore, the actual performance of SFE in coastal regions has previously remained unknown.</p>
      <p id="d2e3079">While our statistical analysis (Figs. 7, 8) shows the expected increase in SFE <inline-formula><mml:math id="M178" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and reduction in <inline-formula><mml:math id="M179" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> near the coast, particularly within the first four pixels (<inline-formula><mml:math id="M180" display="inline"><mml:mo lspace="0mm">∼</mml:mo></mml:math></inline-formula> 20 km), this behavior is also true for ERA5-Land, which has even more severe performance decreases near the coast than SFE. This is despite the improved simulation of land surface temperature and surface energy fluxes in ERA5-Land compared ERA5 for coastal regions, which has been mainly attributed to ERA5-Land's finer spatial resolution (Martens et al., 2020; Muñoz-Sabater et al., 2021). However, ERA5-Land performance is not uniformly degraded for all coastal areas (Fig. 3). Instead, coastal areas in the North show higher <inline-formula><mml:math id="M181" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M182" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> compared to coastal areas in the Southwest and Southeast. This might suggest that the statistically lower performance of ERA5 Land with coastal proximity in general is due to cross correlation with other climatic factors. Despite the decreased performance of SFE and ERA5-Land near the coast, however, the absolute magnitude of <inline-formula><mml:math id="M183" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M184" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for both datasets is still comparable to those of the other datasets throughout the range of coastal proximities, particularly for <inline-formula><mml:math id="M185" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. Therefore, coastal proximity may not necessarily limit the usefulness of SFE near coasts. Future SFE implementation and evaluation studies should further investigate these limitations and not exclude areas within 250 km of the coast a priori.</p>
      <p id="d2e3167">SFE has the highest <inline-formula><mml:math id="M186" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> at low elevations, as does GLEAM and FluxCom. Spatially, however, topographical gradients (such as around the Rocky Mountains) are not apparent on maps of <inline-formula><mml:math id="M187" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for any of the datasets (Fig. 3), although several smaller mountain ranges (e.g. the Sierra Nevada in California and the upper Appalachian Mountains) do show lower performance for the <inline-formula><mml:math id="M188" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> of SFE and FluxCom. This lack of coherence between the elevation trends and spatial patterns could indicate cross correlation between elevation and other factors impacting performance, which require further investigation.</p>
      <p id="d2e3203">The most obvious spatial trend in dataset performance is the gradient of performance between the Eastern and Western US. SFE and FluxCom have lower <inline-formula><mml:math id="M189" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in the Western US than in the East, despite the Western US being well-known as a region where ET estimation is difficult. One possible explanation for our results is that ET amounts are lower in the West, where vegetation cover is in general lower and aridity higher, such that the overall magnitudes of <inline-formula><mml:math id="M190" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are also lower. This would also explain the lack of systematic difference in FluxCom and SFE <inline-formula><mml:math id="M191" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in the East vs the West. Another explanation might be that SFE and FluxCom both have the highest performance (for both low <inline-formula><mml:math id="M192" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and high <inline-formula><mml:math id="M193" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) in shrublands and grassland land cover types, both of which are often found in the Western US (Dewitz, 2024). This finding is in contrast to Zhu et al. (2024), who found that daily and monthly SFE had the lowest correlation and highest root mean squared error at the eight towers in shrublands, relative to towers in other land covers.</p>
</sec>
<sec id="Ch1.S4.SS3">
  <label>4.3</label><title>The benefits and limitations of triple collocation</title>
      <p id="d2e3269">Triple collocation makes several assumptions, including that the random errors between the datasets are independent, that the random errors are stationary across time, and that the random errors can be described linearly. The assumptions of triple collocation are also implicitly made by more standard validation analyses such as comparison via RMSE (Gruber et al., 2016). However, these assumptions are expected to be violated to some degree, regardless of how carefully comparison datasets are chosen. One reason for this is that most ET models contain at least some overlapping input data, for example the commonly used MODIS reflectance products for vegetation, such as leaf area index, are used as inputs to FLUXCOM, ERA5-Land, and GLEAM (ECMWF, 2018; Jung et al., 2019; Miralles et al., 2025). Any overlap in model input data reduces the likelihood that the resulting ET estimates will have independent errors. Triple collocation may also fail or wrongly estimate dataset errors if random error magnitudes vary in time or are not well described linearly. Therefore, it is not uncommon for triple collocation studies to have invalid pixel results (e.g. He et al., 2023). Some triple collocation studies also choose to pre-filter pixels to ensure high correlation coefficient between the raw datasets (Gruber et al., 2016; McColl et al., 2014), which also leads to pixels where triple collocation results are missing.</p>
      <p id="d2e3272">One way to increase the confidence in an application of triple collocation is to repeat the analysis for multiple triplets, as performed here. Violations in the triple collocation assumptions would lead to differences in the estimated error statistic for a given dataset depending on which datasets are used for comparison (He et al., 2023; McColl et al., 2014). We found that invalid triple collocation results were more prevalent when FluxCom and ERA5-Land were compared within the same triplet, regardless of the third dataset. This suggests that the assumption of independent errors may be worse between these two datasets, despite their seemingly larger input difference than GLEAM and FluxCom, for example, which both incorporate machine learning. Nevertheless, the overall high agreement between different triple collocation estimates for the other triplets – and the lack of coherent spatial pattern in error variability across triplets (Fig. 5) – strongly increases our confidence that our overall error estimates are robust.</p>
      <p id="d2e3275">One limitation of triple collocation is that it cannot provide information about multiplicative dataset biases (<inline-formula><mml:math id="M194" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula>) beyond estimating relative biases with reference to one member of each triplet which is assumed to have no bias (Gruber et al., 2016; McColl et al., 2014). However, previous work suggests that SFE may have issues with bias particularly along aridity gradients. For example, Chen et al. (2021) and Zhu et al. (2024) both found that SFE ET had higher bias in arid conditions and tended to underestimate ET in wet conditions. This same pattern was also observed for comparisons of in situ SFE to eddy covariance data (McColl and Rigden, 2020; Thakur et al., 2025). While we do not consider bias because triple collocation only allows for its calculation relative to a comparison dataset, we do see that SFE <inline-formula><mml:math id="M195" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is highest at the driest and wettest pixels compared to pixels with intermediate mean annual precipitation. SFE <inline-formula><mml:math id="M196" display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mi>T</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, on the other hand, shows only a weak but slightly decreasing relationship with increasing mean annual precipitation. Further in situ validation of SFE in the wettest and driest ecosystems would be beneficial. However the problem of ET overestimation in arid conditions – when surface evaporation is high in general – is not unique to SFE (McColl and Rigden, 2020; Miralles et al., 2016; Salvucci and Gentine, 2013). Despite the assumptions and limitations of triple collocation, the method's ability to quantify error statistics relative to true ET without needing an error-free dataset of ET remains a substantial and unique benefit.</p>
</sec>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Conclusions</title>
      <p id="d2e3317">SFE allows for observational, data-driven estimates of ET with no tunable parameters or land surface information required. That SFE estimates ET from atmospheric conditions alone has several advantages: It can be calculated at a variety of scales and geographic domains and it provides an opportunity to test hypotheses about vegetation response to environmental drivers without assuming that response a priori in the creation of the ET estimate itself. The lack of parameterization for SFE eases issues of circularity constraining research into essential outstanding challenges in ecohydrology, such as the response of ET to drought (Zhao et al., 2022) and the inference of subsurface water storage from changes in vegetation behavior (Dralle et al., 2021; Feldman et al., 2023; Stocker et al., 2023). Based on triple collocation – and despite its simplicity – SFE exhibits comparable performance to the more complex ET estimates from GLEAM, FluxCom, and ERA5-Land.</p>
</sec>

      
      </body>
    <back><notes notes-type="codeavailability"><title>Code availability</title>

      <p id="d2e3324">Code is available on GitHub at <uri>https://github.com/erica-mccormick/sfe_et_and_triple_collocation</uri> (last access: 21 April 2026; DOI: <ext-link xlink:href="https://doi.org/10.5281/zenodo.17903676" ext-link-type="DOI">10.5281/zenodo.17903676</ext-link>, McCormick et al., 2026).</p>
  </notes><notes notes-type="dataavailability"><title>Data availability</title>

      <p id="d2e3336">All of the data used to estimate SFE ET as well as the comparison ET datasets are publicly available online. Daily 4 km estimates of SFE ET across CONUS from 1979 to 2025 are available on Zenodo at <ext-link xlink:href="https://doi.org/10.5281/zenodo.17903676" ext-link-type="DOI">10.5281/zenodo.17903676</ext-link> (McCormick et al., 2026).</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d2e3342">The supplement related to this article is available online at <inline-supplementary-material xlink:href="https://doi.org/10.5194/hess-30-2417-2026-supplement" xlink:title="pdf">https://doi.org/10.5194/hess-30-2417-2026-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e3351">ELM: conceptualization, data curation, formal analysis, investigation, methodology, software, visualization, writing – original draft, writing – review and editing. LES: data curation, formal analysis, investigation, visualization, writing – review and editing. KAM: conceptualization, methodology, writing – review and editing. AGK: conceptualization, funding acquisition, investigation, methodology, project administration, resources, supervision, visualization, writing – original draft, writing – review and editing.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e3357">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e3363">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e3369">ELM was supported by the Stanford University Diversifying Academia Recruiting Excellence Doctoral Fellowship and by the NSF GRFP.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e3374">This research has been supported by the National Science Foundation (grant nos. 1942133 and Graduate Research Fellowship Program), the Alfred P. Sloan Foundation (grant no. 11974), the Directorate for Geosciences (grant nos. AGS-2129576 and AGS-2441565), and the Alfred P. Sloan Foundation (grant no. FG-2023-19963). AGK was also supported by the NASA SMAP Science Team.</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e3380">This paper was edited by Patricia Saco and reviewed by Alexander Gruber and one anonymous referee.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><mixed-citation>Abatzoglou, J. T.: Development of gridded surface meteorological data for ecological applications and modelling, Int. J. Climatol., 33, 121–131, <ext-link xlink:href="https://doi.org/10.1002/joc.3413" ext-link-type="DOI">10.1002/joc.3413</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><mixed-citation>Alemohammad, S. H., McColl, K. A., Konings, A. G., Entekhabi, D., and Stoffelen, A.: Characterization of precipitation product errors across the United States using multiplicative triple collocation, Hydrol. Earth Syst. Sci., 19, 3489–3503, <ext-link xlink:href="https://doi.org/10.5194/hess-19-3489-2015" ext-link-type="DOI">10.5194/hess-19-3489-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><mixed-citation>Alemohammad, S. H., Fang, B., Konings, A. G., Aires, F., Green, J. K., Kolassa, J., Miralles, D., Prigent, C., and Gentine, P.: Water, Energy, and Carbon with Artificial Neural Networks (WECANN): a statistically based estimate of global surface turbulent fluxes and gross primary productivity using solar-induced fluorescence, Biogeosciences, 14, 4101–4124, <ext-link xlink:href="https://doi.org/10.5194/bg-14-4101-2017" ext-link-type="DOI">10.5194/bg-14-4101-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><mixed-citation>ArcGIS Data and Maps: USA Detailed Water Bodies, <uri>https://hub.arcgis.com/datasets/esri::usa-detailed-water-bodies/explore</uri> (last access: 1 August 2025), 2023.</mixed-citation></ref>
      <ref id="bib1.bib5"><label>5</label><mixed-citation>Burnett, M. W., Quetin, G. R., and Konings, A. G.: Data-driven estimates of evapotranspiration and its controls in the Congo Basin, Hydrol. Earth Syst. Sci., 24, 4189–4211, <ext-link xlink:href="https://doi.org/10.5194/hess-24-4189-2020" ext-link-type="DOI">10.5194/hess-24-4189-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><mixed-citation>Caires, S. and Sterl, A.: Validation of ocean wind and wave data using triple collocation, J. Geophys. Res.-Oceans, 108, <ext-link xlink:href="https://doi.org/10.1029/2002JC001491" ext-link-type="DOI">10.1029/2002JC001491</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><mixed-citation>Chen, F., Crow, W. T., Bindlish, R., Colliander, A., Burgin, M. S., Asanuma, J., and Aida, K.: Global-scale evaluation of SMAP, SMOS and ASCAT soil moisture products using triple collocation, Remote Sens. Environ., 214, 1–13, <ext-link xlink:href="https://doi.org/10.1016/j.rse.2018.05.008" ext-link-type="DOI">10.1016/j.rse.2018.05.008</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><mixed-citation>Chen, S., McColl, K. A., Berg, A., and Huang, Y.: Surface Flux Equilibrium Estimates of Evapotranspiration at Large Spatial Scales, J. Hydrometeorol., 22, 765–779, <ext-link xlink:href="https://doi.org/10.1175/JHM-D-20-0204.1" ext-link-type="DOI">10.1175/JHM-D-20-0204.1</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><mixed-citation>Clothier, B. E., Clawson, K. L., Pinter Jr., P. J., Moran, M. S., Reginato, R. J., and Jackson, R. D.: Estimation of soil heat flux from net radiation during the growth of alfalfa, Agr. Forest Meteorol., 37, 319–329, 1986.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><mixed-citation>Crow, W. T., Lei, F., Hain, C., Anderson, M. C., Scott, R. L., Billesbach, D., and Arkebauer, T.: Robust estimates of soil moisture and latent heat flux coupling strength obtained from triple collocation: Estimation of Land Coupling Strength, Geophys. Res. Lett., 42, 8415–8423, <ext-link xlink:href="https://doi.org/10.1002/2015GL065929" ext-link-type="DOI">10.1002/2015GL065929</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><mixed-citation>Dewitz: National Land Cover Database (NLCD) 2019 Products (ver. 3.0, February 2024), USGS [data set], <ext-link xlink:href="https://doi.org/10.5066/P9KZCM54" ext-link-type="DOI">10.5066/P9KZCM54</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><mixed-citation>Doelling, D. R., Loeb, N. G., Keyes, D. F., Nordeen, M. L., Morstad, D., Nguyen, C., Wielicki, B. A., Young, D. F., and Sun, M.: Geostationary Enhanced Temporal Interpolation for CERES Flux Products, J. Atmospheric Ocean. Technol., 30, 1072–1090, <ext-link xlink:href="https://doi.org/10.1175/JTECH-D-12-00136.1" ext-link-type="DOI">10.1175/JTECH-D-12-00136.1</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><mixed-citation>Dralle, D. N., Hahm, W. J., Chadwick, K. D., McCormick, E., and Rempe, D. M.: Technical note: Accounting for snow in the estimation of root zone water storage capacity from precipitation and evapotranspiration fluxes, Hydrol. Earth Syst. Sci., 25, 2861–2867, <ext-link xlink:href="https://doi.org/10.5194/hess-25-2861-2021" ext-link-type="DOI">10.5194/hess-25-2861-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><mixed-citation>Draper, C., Reichle, R., De Jeu, R., Naeimi, V., Parinussa, R., and Wagner, W.: Estimating root mean square errors in remotely sensed soil moisture over continental scale domains, Remote Sens. Environ., 137, 288–298, <ext-link xlink:href="https://doi.org/10.1016/j.rse.2013.06.013" ext-link-type="DOI">10.1016/j.rse.2013.06.013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><mixed-citation>ECMWF: IFS Documentation CY45R1 – Part IV: Physical processes, <ext-link xlink:href="https://doi.org/10.21957/4WHWO8JW0" ext-link-type="DOI">10.21957/4WHWO8JW0</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><mixed-citation>Entekhabi, D., Reichle, R. H., Koster, R. D., and Crow, W. T.: Performance Metrics for Soil Moisture Retrievals and Application Requirements, J. Hydrometeorol., 11, 832–840, <ext-link xlink:href="https://doi.org/10.1175/2010JHM1223.1" ext-link-type="DOI">10.1175/2010JHM1223.1</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib17"><label>17</label><mixed-citation>Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer, R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geosci. Model Dev., 9, 1937–1958, <ext-link xlink:href="https://doi.org/10.5194/gmd-9-1937-2016" ext-link-type="DOI">10.5194/gmd-9-1937-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><mixed-citation>Feldman, A. F., Short Gianotti, D. J., Dong, J., Akbar, R., Crow, W. T., McColl, K. A., Konings, A. G., Nippert, J. B., Tumber-Dávila, S. J., Holbrook, N. M., Rockwell, F. E., Scott, R. L., Reichle, R. H., Chatterjee, A., Joiner, J., Poulter, B., and Entekhabi, D.: Remotely Sensed Soil Moisture Can Capture Dynamics Relevant to Plant Water Uptake, Water Resour. Res., 59, e2022WR033814, <ext-link xlink:href="https://doi.org/10.1029/2022WR033814" ext-link-type="DOI">10.1029/2022WR033814</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib19"><label>19</label><mixed-citation>Ferreira, V. G., Montecino, H. D. C., Yakubu, C. I., and Heck, B.: Uncertainties of the Gravity Recovery and Climate Experiment time-variable gravity-field solutions based on three-cornered hat method, J. Appl. Remote Sens., 10, 015015, <ext-link xlink:href="https://doi.org/10.1117/1.JRS.10.015015" ext-link-type="DOI">10.1117/1.JRS.10.015015</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib20"><label>20</label><mixed-citation>Fisher, J. B., Lee, B., Purdy, A. J., Halverson, G. H., Dohlen, M. B., Cawse-Nicholson, K., Wang, A., Anderson, R. G., Aragon, B., Arain, M. A., Baldocchi, D. D., Baker, J. M., Barral, H., Bernacchi, C. J., Bernhofer, C., Biraud, S. C., Bohrer, G., Brunsell, N., Cappelaere, B., Castro- Contreras, S., Chun, J., Conrad, B. J., Cremonese, E., Demarty, J., Desai, A. R., De Ligne, A., Foltýnová, L., Goulden, M. L., Griffis, T. J., Grünwald, T., Johnson, M. S., Kang, M., Kelbe, D., Kowalska, N., Lim, J., Maïnassara, I., McCabe, M. F., Missik, J. E. C., Mohanty, B. P., Moore, C. E., Morillas, L., Morrison, R., Munger, J. W., Posse, G., Richardson, A. D., Russell, E. S., Ryu, Y., Sanchez-Azofeifa, A., Schmidt, M., Schwartz, E., Sharp, I., Šigut, L., Tang, Y., Hulley, G., Anderson, M., Hain, C., French, A., Wood, E., and Hook, S.: ECOSTRESS: NASA's Next Generation Mission to Measure Evapotranspiration From the International Space Station, Water Resour. Res., 56, e2019WR026058, <ext-link xlink:href="https://doi.org/10.1029/2019WR026058" ext-link-type="DOI">10.1029/2019WR026058</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib21"><label>21</label><mixed-citation>Friedlingstein, P., Jones, M. W., O'Sullivan, M., Andrew, R. M., Hauck, J., Peters, G. P., Peters, W., Pongratz, J., Sitch, S., Le Quéré, C., Bakker, D. C. E., Canadell, J. G., Ciais, P., Jackson, R. B., Anthoni, P., Barbero, L., Bastos, A., Bastrikov, V., Becker, M., Bopp, L., Buitenhuis, E., Chandra, N., Chevallier, F., Chini, L. P., Currie, K. I., Feely, R. A., Gehlen, M., Gilfillan, D., Gkritzalis, T., Goll, D. S., Gruber, N., Gutekunst, S., Harris, I., Haverd, V., Houghton, R. A., Hurtt, G., Ilyina, T., Jain, A. K., Joetzjer, E., Kaplan, J. O., Kato, E., Klein Goldewijk, K., Korsbakken, J. I., Landschützer, P., Lauvset, S. K., Lefèvre, N., Lenton, A., Lienert, S., Lombardozzi, D., Marland, G., McGuire, P. C., Melton, J. R., Metzl, N., Munro, D. R., Nabel, J. E. M. S., Nakaoka, S.-I., Neill, C., Omar, A. M., Ono, T., Peregon, A., Pierrot, D., Poulter, B., Rehder, G., Resplandy, L., Robertson, E., Rödenbeck, C., Séférian, R., Schwinger, J., Smith, N., Tans, P. P., Tian, H., Tilbrook, B., Tubiello, F. N., van der Werf, G. R., Wiltshire, A. J., and Zaehle, S.: Global Carbon Budget 2019, Earth Syst. Sci. Data, 11, 1783–1838, <ext-link xlink:href="https://doi.org/10.5194/essd-11-1783-2019" ext-link-type="DOI">10.5194/essd-11-1783-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib22"><label>22</label><mixed-citation>Good, S. P., Noone, D., and Bowen, G.: Hydrologic connectivity constrains partitioning of global terrestrial water fluxes, Science, 349, 175–177, <ext-link xlink:href="https://doi.org/10.1126/science.aaa5931" ext-link-type="DOI">10.1126/science.aaa5931</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib23"><label>23</label><mixed-citation>Green, J. K., Konings, A. G., Alemohammad, S. H., Berry, J., Entekhabi, D., Kolassa, J., Lee, J.-E., and Gentine, P.: Regionally strong feedbacks between the atmosphere and terrestrial biosphere, Nat. Geosci., 10, 410–414, <ext-link xlink:href="https://doi.org/10.1038/ngeo2957" ext-link-type="DOI">10.1038/ngeo2957</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib24"><label>24</label><mixed-citation>Gruber, A., Su, C.-H., Zwieback, S., Crow, W., Dorigo, W., and Wagner, W.: Recent advances in (soil moisture) triple collocation analysis, Int. J. Appl. Earth Obs. Geoinformation, 45, 200–211, <ext-link xlink:href="https://doi.org/10.1016/j.jag.2015.09.002" ext-link-type="DOI">10.1016/j.jag.2015.09.002</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib25"><label>25</label><mixed-citation>He, Y., Wang, C., Hu, J., Mao, H., Duan, Z., Qu, C., Li, R., Wang, M., and Song, X.: Discovering Optimal Triplets for Assessing the Uncertainties of Satellite-Derived Evapotranspiration Products, Remote Sens., 15, 3215, <ext-link xlink:href="https://doi.org/10.3390/rs15133215" ext-link-type="DOI">10.3390/rs15133215</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib26"><label>26</label><mixed-citation>Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., De Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, <ext-link xlink:href="https://doi.org/10.1002/qj.3803" ext-link-type="DOI">10.1002/qj.3803</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib27"><label>27</label><mixed-citation>Huffman, G. J., Adler, R. F., Morrissey, M. M., Bolvin, D. T., Curtis, S., Joyce, R., McGavock, B., and Susskind, J.: Global Precipitation at One-Degree Daily Resolution from Multisatellite Observations, J. Hydrometeorol., 2, 36–50, <ext-link xlink:href="https://doi.org/10.1175/1525-7541(2001)002&lt;0036:GPAODD&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1525-7541(2001)002&lt;0036:GPAODD&gt;2.0.CO;2</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bib28"><label>28</label><mixed-citation>Jung, M., Koirala, S., Weber, U., Ichii, K., Gans, F., Camps-Valls, G., Papale, D., Schwalm, C., Tramontana, G., and Reichstein, M.: The FLUXCOM ensemble of global land-atmosphere energy fluxes, Sci. Data, 6, 74, <ext-link xlink:href="https://doi.org/10.1038/s41597-019-0076-8" ext-link-type="DOI">10.1038/s41597-019-0076-8</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib29"><label>29</label><mixed-citation>Khan, M. S., Waqas Liaqat, U., Baik, J., and Choi, M.: Stand-alone uncertainty characterization of GLEAM, GLDAS and MOD16 evapotranspiration products using an extended triple collocation approach, Agr. Forest Meteorol., 252, 256–268, <ext-link xlink:href="https://doi.org/10.1016/j.agrformet.2018.01.022" ext-link-type="DOI">10.1016/j.agrformet.2018.01.022</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib30"><label>30</label><mixed-citation>Koppa, A., Rains, D., Hulsman, P., Poyatos, R., and Miralles, D. G.: A deep learning-based hybrid model of global terrestrial evaporation, Nat. Commun., 13, 1912, <ext-link xlink:href="https://doi.org/10.1038/s41467-022-29543-7" ext-link-type="DOI">10.1038/s41467-022-29543-7</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib31"><label>31</label><mixed-citation>Martens, B., Miralles, D. G., Lievens, H., van der Schalie, R., de Jeu, R. A. M., Fernández-Prieto, D., Beck, H. E., Dorigo, W. A., and Verhoest, N. E. C.: GLEAM v3: satellite-based land evaporation and root-zone soil moisture, Geosci. Model Dev., 10, 1903–1925, <ext-link xlink:href="https://doi.org/10.5194/gmd-10-1903-2017" ext-link-type="DOI">10.5194/gmd-10-1903-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib32"><label>32</label><mixed-citation>Martens, B., Schumacher, D. L., Wouters, H., Muñoz-Sabater, J., Verhoest, N. E. C., and Miralles, D. G.: Evaluating the land-surface energy partitioning in ERA5, Geosci. Model Dev., 13, 4159–4181, <ext-link xlink:href="https://doi.org/10.5194/gmd-13-4159-2020" ext-link-type="DOI">10.5194/gmd-13-4159-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib33"><label>33</label><mixed-citation>McColl, K. A. and Rigden, A. J.: Emergent Simplicity of Continental Evapotranspiration, Geophys. Res. Lett., 47, <ext-link xlink:href="https://doi.org/10.1029/2020GL087101" ext-link-type="DOI">10.1029/2020GL087101</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib34"><label>34</label><mixed-citation>McColl, K. A., Vogelzang, J., Konings, A. G., Entekhabi, D., Piles, M., and Stoffelen, A.: Extended triple collocation: Estimating errors and correlation coefficients with respect to an unknown target, Geophys. Res. Lett., 41, 6229–6236, <ext-link xlink:href="https://doi.org/10.1002/2014GL061322" ext-link-type="DOI">10.1002/2014GL061322</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib35"><label>35</label><mixed-citation>McColl, K. A., Salvucci, G. D., and Gentine, P.: Surface Flux Equilibrium Theory Explains an Empirical Estimate of Water-Limited Daily Evapotranspiration, J. Adv. Model. Earth Syst., 11, 2036–2049, <ext-link xlink:href="https://doi.org/10.1029/2019MS001685" ext-link-type="DOI">10.1029/2019MS001685</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib36"><label>36</label><mixed-citation>McCormick, E. L., Sanders, L., McColl, K., and Konings, A.: Daily surface flux equilibrium (SFE) ET across CONUS (v1.0.0), Zenodo [code and data set], <ext-link xlink:href="https://doi.org/10.5281/zenodo.17903676" ext-link-type="DOI">10.5281/zenodo.17903676</ext-link>, 2026.</mixed-citation></ref>
      <ref id="bib1.bib37"><label>37</label><mixed-citation>Mesinger, F., DiMego, G., Kalnay, E., Mitchell, K., Shafran, P. C., Ebisuzaki, W., Jović, D., Woollen, J., Rogers, E., Berbery, E. H., Ek, M. B., Fan, Y., Grumbine, R., Higgins, W., Li, H., Lin, Y., Manikin, G., Parrish, D., and Shi, W.: North American Regional Reanalysis, B. Am. Meteorol. Soc., 87, 343–360, <ext-link xlink:href="https://doi.org/10.1175/BAMS-87-3-343" ext-link-type="DOI">10.1175/BAMS-87-3-343</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bib38"><label>38</label><mixed-citation>Miralles, D. G., Crow, W. T., and Cosh, M. H.: Estimating Spatial Sampling Errors in Coarse-Scale Soil Moisture Estimates Derived from Point-Scale Observations, J. Hydrometeorol., 11, 1423–1429, <ext-link xlink:href="https://doi.org/10.1175/2010JHM1285.1" ext-link-type="DOI">10.1175/2010JHM1285.1</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib39"><label>39</label><mixed-citation>Miralles, D. G., Holmes, T. R. H., De Jeu, R. A. M., Gash, J. H., Meesters, A. G. C. A., and Dolman, A. J.: Global land-surface evaporation estimated from satellite-based observations, Hydrol. Earth Syst. Sci., 15, 453–469, <ext-link xlink:href="https://doi.org/10.5194/hess-15-453-2011" ext-link-type="DOI">10.5194/hess-15-453-2011</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib40"><label>40</label><mixed-citation>Miralles, D. G., Jiménez, C., Jung, M., Michel, D., Ershadi, A., McCabe, M. F., Hirschi, M., Martens, B., Dolman, A. J., Fisher, J. B., Mu, Q., Seneviratne, S. I., Wood, E. F., and Fernández-Prieto, D.: The WACMOS-ET project – Part 2: Evaluation of global terrestrial evaporation data sets, Hydrol. Earth Syst. Sci., 20, 823–842, <ext-link xlink:href="https://doi.org/10.5194/hess-20-823-2016" ext-link-type="DOI">10.5194/hess-20-823-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib41"><label>41</label><mixed-citation>Miralles, D. G., Bonte, O., Koppa, A., Baez-Villanueva, O. M., Tronquo, E., Zhong, F., Beck, H. E., Hulsman, P., Dorigo, W., Verhoest, N. E. C., and Haghdoost, S.: GLEAM4: global land evaporation and soil moisture dataset at 0.1° resolution from 1980 to near present, Sci. Data, 12, 416, <ext-link xlink:href="https://doi.org/10.1038/s41597-025-04610-y" ext-link-type="DOI">10.1038/s41597-025-04610-y</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib42"><label>42</label><mixed-citation>Mu, Q., Zhao, M., and Running, S. W.: Improvements to a MODIS global terrestrial evapotranspiration algorithm, Remote Sens. Environ., 115, 1781–1800, <ext-link xlink:href="https://doi.org/10.1016/j.rse.2011.02.019" ext-link-type="DOI">10.1016/j.rse.2011.02.019</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib43"><label>43</label><mixed-citation>Muñoz-Sabater, J., Dutra, E., Agustí-Panareda, A., Albergel, C., Arduini, G., Balsamo, G., Boussetta, S., Choulga, M., Harrigan, S., Hersbach, H., Martens, B., Miralles, D. G., Piles, M., Rodríguez-Fernández, N. J., Zsoter, E., Buontempo, C., and Thépaut, J.-N.: ERA5-Land: a state-of-the-art global reanalysis dataset for land applications, Earth Syst. Sci. Data, 13, 4349–4383, <ext-link xlink:href="https://doi.org/10.5194/essd-13-4349-2021" ext-link-type="DOI">10.5194/essd-13-4349-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib44"><label>44</label><mixed-citation>Oki, T. and Kanae, S.: Global Hydrological Cycles and World Water Resources, Freshw. Resour., 313, <ext-link xlink:href="https://doi.org/10.1126/science.1128845" ext-link-type="DOI">10.1126/science.1128845</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bib45"><label>45</label><mixed-citation>Salvucci, G. D. and Gentine, P.: Emergent relation between surface vapor conductance and relative humidity profiles yields evaporation rates from weather data, Proc. Natl. Acad. Sci. USA, <ext-link xlink:href="https://doi.org/10.1073/pnas.1215844110" ext-link-type="DOI">10.1073/pnas.1215844110</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib46"><label>46</label><mixed-citation>Santanello Jr., J. A. and Friedl, M. A.: Diurnal covariation in soil heat flux and net radiation, J. Appl. Meteorol., 42, 851–862, 2003.</mixed-citation></ref>
      <ref id="bib1.bib47"><label>47</label><mixed-citation>Savoca, M. E., Senay, G. B., Maupin, M. A., Kenny, J. F., and Perry, C. A.: Actual evapotranspiration modeling using the operational Simplified Surface Energy Balance (SSEBop) approach, Reston, VA, <ext-link xlink:href="https://doi.org/10.3133/sir20135126" ext-link-type="DOI">10.3133/sir20135126</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib48"><label>48</label><mixed-citation>Scipal, K., Holmes, T., De Jeu, R., Naeimi, V., and Wagner, W.: A possible solution for the problem of estimating the error structure of global soil moisture data sets, Geophys. Res. Lett., 35, <ext-link xlink:href="https://doi.org/10.1029/2008gl035599" ext-link-type="DOI">10.1029/2008gl035599</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bib49"><label>49</label><mixed-citation>Stocker, B. D., Tumber-Dávila, S. J., Konings, A. G., Anderson, M. C., Hain, C., and Jackson, R. B.: Global patterns of water storage in the rooting zones of vegetation, Nat. Geosci., 16, 250–256, <ext-link xlink:href="https://doi.org/10.1038/s41561-023-01125-2" ext-link-type="DOI">10.1038/s41561-023-01125-2</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib50"><label>50</label><mixed-citation>Stoffelen, A.: Toward the true near-surface wind speed: Error modeling and calibration using triple collocation, J. Geophys. Res. Oceans, 103, 7755–7766, <ext-link xlink:href="https://doi.org/10.1029/97JC03180" ext-link-type="DOI">10.1029/97JC03180</ext-link>, 1998.</mixed-citation></ref>
      <ref id="bib1.bib51"><label>51</label><mixed-citation>Su, C. H., Ryu, D., Crow, W. T., and Western, A. W.: Beyond triple collocation: Applications to soil moisture monitoring, J. Geophys. Res.-Atmos., 119, 6419–6439, 2014.</mixed-citation></ref>
      <ref id="bib1.bib52"><label>52</label><mixed-citation>Sun, J., McColl, K. A., Wang, Y., Rigden, A. J., Lu, H., Yang, K., Li, Y., and Santanello, J. A.: Global evaluation of terrestrial near-surface air temperature and specific humidity retrievals from the Atmospheric Infrared Sounder (AIRS), Remote Sens. Environ., 252, 112146, <ext-link xlink:href="https://doi.org/10.1016/j.rse.2020.112146" ext-link-type="DOI">10.1016/j.rse.2020.112146</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib53"><label>53</label><mixed-citation>Teuling, A. J., Seneviratne, S. I., Stöckli, R., Reichstein, M., Moors, E., Ciais, P., Luyssaert, S., Van Den Hurk, B., Ammann, C., Bernhofer, C., Dellwik, E., Gianelle, D., Gielen, B., Grünwald, T., Klumpp, K., Montagnani, L., Moureaux, C., Sottocornola, M., and Wohlfahrt, G.: Contrasting response of European forest and grassland energy exchange to heatwaves, Nat. Geosci., 3, 722–727, <ext-link xlink:href="https://doi.org/10.1038/ngeo950" ext-link-type="DOI">10.1038/ngeo950</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib54"><label>54</label><mixed-citation>Thakur, H., Raghav, P., Kumar, M., and Wolkeba, F.: Surface Flux Equilibrium Theory-Derived Evapotranspiration Estimate Outperforms ECOSTRESS, MODIS, and SSEBop Products, Geophys. Res. Lett., 52, e2025GL114822, <ext-link xlink:href="https://doi.org/10.1029/2025GL114822" ext-link-type="DOI">10.1029/2025GL114822</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib55"><label>55</label><mixed-citation>United States Census Bureau: “tl_2023_us_coastline”, TIGER/Line Shapefiles, Nation, U.S., Coastline, 2023, <uri>https://catalog.data.gov/dataset/tiger-line-shapefile-2023-nation-u-s-coastline</uri> (last access: 1 August 2025), 2019.</mixed-citation></ref>
      <ref id="bib1.bib56"><label>56</label><mixed-citation>Wei, Y., Liu, S., Huntzinger, D. N., Michalak, A. M., Viovy, N., Post, W. M., Schwalm, C. R., Schaefer, K., Jacobson, A. R., Lu, C., Tian, H., Ricciuto, D. M., Cook, R. B., Mao, J., and Shi, X.: The North American Carbon Program Multi-scale Synthesis and Terrestrial Model Intercomparison Project – Part 2: Environmental driver data, Geosci. Model Dev., 7, 2875–2893, <ext-link xlink:href="https://doi.org/10.5194/gmd-7-2875-2014" ext-link-type="DOI">10.5194/gmd-7-2875-2014</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib57"><label>57</label><mixed-citation>Xia, Y., Mitchell, K., Ek, M., Sheffield, J., Cosgrove, B., Wood, E., Luo, L., Alonge, C., Wei, H., Meng, J., Livneh, B., Lettenmaier, D., Koren, V., Duan, Q., Mo, K., Fan, Y., and Mocko, D.: Continental-scale water and energy flux analysis and validation for the North American Land Data Assimilation System project phase 2 (NLDAS-2): 1. Intercomparison and application of model products, J. Geophys. Res.-Atmos., 117, 2011JD016048, <ext-link xlink:href="https://doi.org/10.1029/2011JD016048" ext-link-type="DOI">10.1029/2011JD016048</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib58"><label>58</label><mixed-citation>Yamazaki, D., Ikeshima, D., Sosa, J., Bates, P. D., Allen, G. H., and Pavelsky, T. M.: MERIT Hydro: A High-Resolution Global Hydrography Map Based on Latest Topography Dataset, Water Resour. Res., 55, 5053–5073, <ext-link xlink:href="https://doi.org/10.1029/2019WR024873" ext-link-type="DOI">10.1029/2019WR024873</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib59"><label>59</label><mixed-citation>Yang, Y., Roderick, M. L., Guo, H., Miralles, D. G., Zhang, L., Fatichi, S., Luo, X., Zhang, Y., McVicar, T. R., Tu, Z., Keenan, T. F., Fisher, J. B., Gan, R., Zhang, X., Piao, S., Zhang, B., and Yang, D.: Evapotranspiration on a greening Earth, Nat. Rev. Earth Environ., 4, 626–641, <ext-link xlink:href="https://doi.org/10.1038/s43017-023-00464-3" ext-link-type="DOI">10.1038/s43017-023-00464-3</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib60"><label>60</label><mixed-citation>Yilmaz, M. T. and Crow, W. T.: Evaluation of Assumptions in Soil Moisture Triple Collocation Analysis, J. Hydrometeorol., 15, 1293–1302, <ext-link xlink:href="https://doi.org/10.1175/JHM-D-13-0158.1" ext-link-type="DOI">10.1175/JHM-D-13-0158.1</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib61"><label>61</label><mixed-citation>Yin, X., Jiang, B., Liang, S., Li, S., Zhao, X., Wang, Q., Xu, J., Han, J., Liang, H., Zhang, X., Liu, Q., Yao, Y., Jia, K., and Xie, X.: Significant discrepancies of land surface daily net radiation among ten remotely sensed and reanalysis products, Int. J. Digit. Earth, 16, 3725–3752, <ext-link xlink:href="https://doi.org/10.1080/17538947.2023.2253211" ext-link-type="DOI">10.1080/17538947.2023.2253211</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib62"><label>62</label><mixed-citation>Zhao, M., A, G., Liu, Y., and Konings, A. G.: Evapotranspiration frequently increases during droughts, Nat. Clim. Change, 12, 1024–1030, <ext-link xlink:href="https://doi.org/10.1038/s41558-022-01505-3" ext-link-type="DOI">10.1038/s41558-022-01505-3</ext-link>, 807 2022. </mixed-citation></ref>
      <ref id="bib1.bib63"><label>63</label><mixed-citation>Zhu, W., Yu, X., Wei, J., and Lv, A.: Surface flux equilibrium estimates of evaporative fraction and evapotranspiration at global scale: Accuracy evaluation and performance comparison, Agric. Water Manag., 291, 108609, <ext-link xlink:href="https://doi.org/10.1016/j.agwat.2023.108609" ext-link-type="DOI">10.1016/j.agwat.2023.108609</ext-link>, 2024.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Triple collocation validates CONUS-wide evapotranspiration inferred from atmospheric conditions</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
      
Abatzoglou, J. T.: Development of gridded surface meteorological data for
ecological applications and modelling, Int. J. Climatol., 33, 121–131,
<a href="https://doi.org/10.1002/joc.3413" target="_blank">https://doi.org/10.1002/joc.3413</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
      Alemohammad, S. H., McColl, K. A., Konings, A. G., Entekhabi, D., and Stoffelen, A.: Characterization of precipitation product errors across the United States using multiplicative triple collocation, Hydrol. Earth Syst. Sci., 19, 3489–3503, <a href="https://doi.org/10.5194/hess-19-3489-2015" target="_blank">https://doi.org/10.5194/hess-19-3489-2015</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
      Alemohammad, S. H., Fang, B., Konings, A. G., Aires, F., Green, J. K., Kolassa, J., Miralles, D., Prigent, C., and Gentine, P.: Water, Energy, and Carbon with Artificial Neural Networks (WECANN): a statistically based estimate of global surface turbulent fluxes and gross primary productivity using solar-induced fluorescence, Biogeosciences, 14, 4101–4124, <a href="https://doi.org/10.5194/bg-14-4101-2017" target="_blank">https://doi.org/10.5194/bg-14-4101-2017</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
      ArcGIS Data and Maps: USA Detailed Water Bodies, <a href="https://hub.arcgis.com/datasets/esri::usa-detailed-water-bodies/explore" target="_blank"/> (last access: 1 August 2025), 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
      Burnett, M. W., Quetin, G. R., and Konings, A. G.: Data-driven estimates of evapotranspiration and its controls in the Congo Basin, Hydrol. Earth Syst. Sci., 24, 4189–4211, <a href="https://doi.org/10.5194/hess-24-4189-2020" target="_blank">https://doi.org/10.5194/hess-24-4189-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
      Caires, S. and Sterl, A.: Validation of ocean wind and wave data using triple collocation, J. Geophys. Res.-Oceans, 108, <a href="https://doi.org/10.1029/2002JC001491" target="_blank">https://doi.org/10.1029/2002JC001491</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
      Chen, F., Crow, W. T., Bindlish, R., Colliander, A., Burgin, M. S., Asanuma,
J., and Aida, K.: Global-scale evaluation of SMAP, SMOS and ASCAT soil
moisture products using triple collocation, Remote Sens. Environ., 214,
1–13, <a href="https://doi.org/10.1016/j.rse.2018.05.008" target="_blank">https://doi.org/10.1016/j.rse.2018.05.008</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
      Chen, S., McColl, K. A., Berg, A., and Huang, Y.: Surface Flux Equilibrium
Estimates of Evapotranspiration at Large Spatial Scales, J. Hydrometeorol.,
22, 765–779, <a href="https://doi.org/10.1175/JHM-D-20-0204.1" target="_blank">https://doi.org/10.1175/JHM-D-20-0204.1</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
      Clothier, B. E., Clawson, K. L., Pinter Jr., P. J., Moran, M. S., Reginato,
R. J., and Jackson, R. D.: Estimation of soil heat flux from net
radiation during the growth of alfalfa, Agr. Forest Meteorol.,
37, 319–329, 1986.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
      Crow, W. T., Lei, F., Hain, C., Anderson, M. C., Scott, R. L., Billesbach,
D., and Arkebauer, T.: Robust estimates of soil moisture and latent heat
flux coupling strength obtained from triple collocation: Estimation of Land
Coupling Strength, Geophys. Res. Lett., 42, 8415–8423,
<a href="https://doi.org/10.1002/2015GL065929" target="_blank">https://doi.org/10.1002/2015GL065929</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
      Dewitz: National Land Cover Database (NLCD) 2019 Products (ver. 3.0,
February 2024), USGS [data set], <a href="https://doi.org/10.5066/P9KZCM54" target="_blank">https://doi.org/10.5066/P9KZCM54</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
      Doelling, D. R., Loeb, N. G., Keyes, D. F., Nordeen, M. L., Morstad, D.,
Nguyen, C., Wielicki, B. A., Young, D. F., and Sun, M.: Geostationary
Enhanced Temporal Interpolation for CERES Flux Products, J. Atmospheric
Ocean. Technol., 30, 1072–1090, <a href="https://doi.org/10.1175/JTECH-D-12-00136.1" target="_blank">https://doi.org/10.1175/JTECH-D-12-00136.1</a>,
2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
      Dralle, D. N., Hahm, W. J., Chadwick, K. D., McCormick, E., and Rempe, D. M.: Technical note: Accounting for snow in the estimation of root zone water storage capacity from precipitation and evapotranspiration fluxes, Hydrol. Earth Syst. Sci., 25, 2861–2867, <a href="https://doi.org/10.5194/hess-25-2861-2021" target="_blank">https://doi.org/10.5194/hess-25-2861-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
      Draper, C., Reichle, R., De Jeu, R., Naeimi, V., Parinussa, R., and Wagner,
W.: Estimating root mean square errors in remotely sensed soil moisture over
continental scale domains, Remote Sens. Environ., 137, 288–298,
<a href="https://doi.org/10.1016/j.rse.2013.06.013" target="_blank">https://doi.org/10.1016/j.rse.2013.06.013</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
      ECMWF: IFS Documentation CY45R1 – Part IV: Physical processes,
<a href="https://doi.org/10.21957/4WHWO8JW0" target="_blank">https://doi.org/10.21957/4WHWO8JW0</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
      Entekhabi, D., Reichle, R. H., Koster, R. D., and Crow, W. T.: Performance
Metrics for Soil Moisture Retrievals and Application Requirements, J.
Hydrometeorol., 11, 832–840, <a href="https://doi.org/10.1175/2010JHM1223.1" target="_blank">https://doi.org/10.1175/2010JHM1223.1</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
      Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer, R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geosci. Model Dev., 9, 1937–1958, <a href="https://doi.org/10.5194/gmd-9-1937-2016" target="_blank">https://doi.org/10.5194/gmd-9-1937-2016</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
      Feldman, A. F., Short Gianotti, D. J., Dong, J., Akbar, R., Crow, W. T.,
McColl, K. A., Konings, A. G., Nippert, J. B., Tumber-Dávila, S. J.,
Holbrook, N. M., Rockwell, F. E., Scott, R. L., Reichle, R. H., Chatterjee,
A., Joiner, J., Poulter, B., and Entekhabi, D.: Remotely Sensed Soil
Moisture Can Capture Dynamics Relevant to Plant Water Uptake, Water Resour.
Res., 59, e2022WR033814, <a href="https://doi.org/10.1029/2022WR033814" target="_blank">https://doi.org/10.1029/2022WR033814</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>19</label><mixed-citation>
      Ferreira, V. G., Montecino, H. D. C., Yakubu, C. I., and Heck, B.:
Uncertainties of the Gravity Recovery and Climate Experiment time-variable
gravity-field solutions based on three-cornered hat method, J. Appl. Remote
Sens., 10, 015015, <a href="https://doi.org/10.1117/1.JRS.10.015015" target="_blank">https://doi.org/10.1117/1.JRS.10.015015</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>20</label><mixed-citation>
      Fisher, J. B., Lee, B., Purdy, A. J., Halverson, G. H., Dohlen, M. B.,
Cawse-Nicholson, K., Wang, A., Anderson, R. G., Aragon, B., Arain, M. A.,
Baldocchi, D. D., Baker, J. M., Barral, H., Bernacchi, C. J., Bernhofer, C.,
Biraud, S. C., Bohrer, G., Brunsell, N., Cappelaere, B., Castro- Contreras,
S., Chun, J., Conrad, B. J., Cremonese, E., Demarty, J., Desai, A. R., De
Ligne, A., Foltýnová, L., Goulden, M. L., Griffis, T. J.,
Grünwald, T., Johnson, M. S., Kang, M., Kelbe, D., Kowalska, N., Lim,
J., Maïnassara, I., McCabe, M. F., Missik, J. E. C., Mohanty, B. P.,
Moore, C. E., Morillas, L., Morrison, R., Munger, J. W., Posse, G.,
Richardson, A. D., Russell, E. S., Ryu, Y., Sanchez-Azofeifa, A., Schmidt,
M., Schwartz, E., Sharp, I., Šigut, L., Tang, Y., Hulley, G., Anderson,
M., Hain, C., French, A., Wood, E., and Hook, S.: ECOSTRESS: NASA's Next
Generation Mission to Measure Evapotranspiration From the International
Space Station, Water Resour. Res., 56, e2019WR026058,
<a href="https://doi.org/10.1029/2019WR026058" target="_blank">https://doi.org/10.1029/2019WR026058</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>21</label><mixed-citation>
      Friedlingstein, P., Jones, M. W., O'Sullivan, M., Andrew, R. M., Hauck, J., Peters, G. P., Peters, W., Pongratz, J., Sitch, S., Le Quéré, C., Bakker, D. C. E., Canadell, J. G., Ciais, P., Jackson, R. B., Anthoni, P., Barbero, L., Bastos, A., Bastrikov, V., Becker, M., Bopp, L., Buitenhuis, E., Chandra, N., Chevallier, F., Chini, L. P., Currie, K. I., Feely, R. A., Gehlen, M., Gilfillan, D., Gkritzalis, T., Goll, D. S., Gruber, N., Gutekunst, S., Harris, I., Haverd, V., Houghton, R. A., Hurtt, G., Ilyina, T., Jain, A. K., Joetzjer, E., Kaplan, J. O., Kato, E., Klein Goldewijk, K., Korsbakken, J. I., Landschützer, P., Lauvset, S. K., Lefèvre, N., Lenton, A., Lienert, S., Lombardozzi, D., Marland, G., McGuire, P. C., Melton, J. R., Metzl, N., Munro, D. R., Nabel, J. E. M. S., Nakaoka, S.-I., Neill, C., Omar, A. M., Ono, T., Peregon, A., Pierrot, D., Poulter, B., Rehder, G., Resplandy, L., Robertson, E., Rödenbeck, C., Séférian, R., Schwinger, J., Smith, N., Tans, P. P., Tian, H., Tilbrook, B., Tubiello, F. N., van der Werf, G. R., Wiltshire, A. J., and Zaehle, S.: Global Carbon Budget 2019, Earth Syst. Sci. Data, 11, 1783–1838, <a href="https://doi.org/10.5194/essd-11-1783-2019" target="_blank">https://doi.org/10.5194/essd-11-1783-2019</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>22</label><mixed-citation>
      Good, S. P., Noone, D., and Bowen, G.: Hydrologic connectivity constrains
partitioning of global terrestrial water fluxes, Science, 349, 175–177,
<a href="https://doi.org/10.1126/science.aaa5931" target="_blank">https://doi.org/10.1126/science.aaa5931</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>23</label><mixed-citation>
      Green, J. K., Konings, A. G., Alemohammad, S. H., Berry, J., Entekhabi, D.,
Kolassa, J., Lee, J.-E., and Gentine, P.: Regionally strong feedbacks
between the atmosphere and terrestrial biosphere, Nat. Geosci., 10,
410–414, <a href="https://doi.org/10.1038/ngeo2957" target="_blank">https://doi.org/10.1038/ngeo2957</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>24</label><mixed-citation>
      Gruber, A., Su, C.-H., Zwieback, S., Crow, W., Dorigo, W., and Wagner, W.:
Recent advances in (soil moisture) triple collocation analysis, Int. J.
Appl. Earth Obs. Geoinformation, 45, 200–211,
<a href="https://doi.org/10.1016/j.jag.2015.09.002" target="_blank">https://doi.org/10.1016/j.jag.2015.09.002</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>25</label><mixed-citation>
      He, Y., Wang, C., Hu, J., Mao, H., Duan, Z., Qu, C., Li, R., Wang, M., and
Song, X.: Discovering Optimal Triplets for Assessing the Uncertainties of
Satellite-Derived Evapotranspiration Products, Remote Sens., 15, 3215,
<a href="https://doi.org/10.3390/rs15133215" target="_blank">https://doi.org/10.3390/rs15133215</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>26</label><mixed-citation>
      Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A.,
Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D.,
Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P.,
Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D.,
Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer,
A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková,
M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., De Rosnay,
P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.: The ERA5
global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049,
<a href="https://doi.org/10.1002/qj.3803" target="_blank">https://doi.org/10.1002/qj.3803</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>27</label><mixed-citation>
      Huffman, G. J., Adler, R. F., Morrissey, M. M., Bolvin, D. T., Curtis, S., Joyce, R., McGavock, B., and Susskind, J.: Global Precipitation at One-Degree Daily Resolution from Multisatellite Observations, J. Hydrometeorol., 2, 36–50, <a href="https://doi.org/10.1175/1525-7541(2001)002&lt;0036:GPAODD&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1525-7541(2001)002&lt;0036:GPAODD&gt;2.0.CO;2</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>28</label><mixed-citation>
      Jung, M., Koirala, S., Weber, U., Ichii, K., Gans, F., Camps-Valls, G.,
Papale, D., Schwalm, C., Tramontana, G., and Reichstein, M.: The FLUXCOM
ensemble of global land-atmosphere energy fluxes, Sci. Data, 6, 74,
<a href="https://doi.org/10.1038/s41597-019-0076-8" target="_blank">https://doi.org/10.1038/s41597-019-0076-8</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>29</label><mixed-citation>
      Khan, M. S., Waqas Liaqat, U., Baik, J., and Choi, M.: Stand-alone uncertainty characterization of GLEAM, GLDAS and MOD16 evapotranspiration products using an extended triple collocation approach, Agr. Forest Meteorol., 252, 256–268, <a href="https://doi.org/10.1016/j.agrformet.2018.01.022" target="_blank">https://doi.org/10.1016/j.agrformet.2018.01.022</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>30</label><mixed-citation>
      Koppa, A., Rains, D., Hulsman, P., Poyatos, R., and Miralles, D. G.: A deep
learning-based hybrid model of global terrestrial evaporation, Nat. Commun.,
13, 1912, <a href="https://doi.org/10.1038/s41467-022-29543-7" target="_blank">https://doi.org/10.1038/s41467-022-29543-7</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>31</label><mixed-citation>
      Martens, B., Miralles, D. G., Lievens, H., van der Schalie, R., de Jeu, R. A. M., Fernández-Prieto, D., Beck, H. E., Dorigo, W. A., and Verhoest, N. E. C.: GLEAM v3: satellite-based land evaporation and root-zone soil moisture, Geosci. Model Dev., 10, 1903–1925, <a href="https://doi.org/10.5194/gmd-10-1903-2017" target="_blank">https://doi.org/10.5194/gmd-10-1903-2017</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>32</label><mixed-citation>
      Martens, B., Schumacher, D. L., Wouters, H., Muñoz-Sabater, J., Verhoest, N. E. C., and Miralles, D. G.: Evaluating the land-surface energy partitioning in ERA5, Geosci. Model Dev., 13, 4159–4181, <a href="https://doi.org/10.5194/gmd-13-4159-2020" target="_blank">https://doi.org/10.5194/gmd-13-4159-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>33</label><mixed-citation>
      McColl, K. A. and Rigden, A. J.: Emergent Simplicity of Continental
Evapotranspiration, Geophys. Res. Lett., 47,
<a href="https://doi.org/10.1029/2020GL087101" target="_blank">https://doi.org/10.1029/2020GL087101</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>34</label><mixed-citation>
      McColl, K. A., Vogelzang, J., Konings, A. G., Entekhabi, D., Piles, M., and
Stoffelen, A.: Extended triple collocation: Estimating errors and
correlation coefficients with respect to an unknown target, Geophys. Res.
Lett., 41, 6229–6236, <a href="https://doi.org/10.1002/2014GL061322" target="_blank">https://doi.org/10.1002/2014GL061322</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>35</label><mixed-citation>
      McColl, K. A., Salvucci, G. D., and Gentine, P.: Surface Flux Equilibrium
Theory Explains an Empirical Estimate of Water-Limited Daily
Evapotranspiration, J. Adv. Model. Earth Syst., 11, 2036–2049,
<a href="https://doi.org/10.1029/2019MS001685" target="_blank">https://doi.org/10.1029/2019MS001685</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>36</label><mixed-citation>
      McCormick, E. L., Sanders, L., McColl, K., and Konings, A.: Daily surface flux equilibrium (SFE) ET across CONUS (v1.0.0), Zenodo [code and data set], <a href="https://doi.org/10.5281/zenodo.17903676" target="_blank">https://doi.org/10.5281/zenodo.17903676</a>, 2026.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>37</label><mixed-citation>
      Mesinger, F., DiMego, G., Kalnay, E., Mitchell, K., Shafran, P. C.,
Ebisuzaki, W., Jović, D., Woollen, J., Rogers, E., Berbery, E. H., Ek,
M. B., Fan, Y., Grumbine, R., Higgins, W., Li, H., Lin, Y., Manikin, G.,
Parrish, D., and Shi, W.: North American Regional Reanalysis, B. Am.
Meteorol. Soc., 87, 343–360, <a href="https://doi.org/10.1175/BAMS-87-3-343" target="_blank">https://doi.org/10.1175/BAMS-87-3-343</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>38</label><mixed-citation>
      Miralles, D. G., Crow, W. T., and Cosh, M. H.: Estimating Spatial Sampling
Errors in Coarse-Scale Soil Moisture Estimates Derived from Point-Scale
Observations, J. Hydrometeorol., 11, 1423–1429,
<a href="https://doi.org/10.1175/2010JHM1285.1" target="_blank">https://doi.org/10.1175/2010JHM1285.1</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>39</label><mixed-citation>
      Miralles, D. G., Holmes, T. R. H., De Jeu, R. A. M., Gash, J. H., Meesters, A. G. C. A., and Dolman, A. J.: Global land-surface evaporation estimated from satellite-based observations, Hydrol. Earth Syst. Sci., 15, 453–469, <a href="https://doi.org/10.5194/hess-15-453-2011" target="_blank">https://doi.org/10.5194/hess-15-453-2011</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>40</label><mixed-citation>
      Miralles, D. G., Jiménez, C., Jung, M., Michel, D., Ershadi, A., McCabe, M. F., Hirschi, M., Martens, B., Dolman, A. J., Fisher, J. B., Mu, Q., Seneviratne, S. I., Wood, E. F., and Fernández-Prieto, D.: The WACMOS-ET project – Part 2: Evaluation of global terrestrial evaporation data sets, Hydrol. Earth Syst. Sci., 20, 823–842, <a href="https://doi.org/10.5194/hess-20-823-2016" target="_blank">https://doi.org/10.5194/hess-20-823-2016</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>41</label><mixed-citation>
      Miralles, D. G., Bonte, O., Koppa, A., Baez-Villanueva, O. M., Tronquo, E.,
Zhong, F., Beck, H. E., Hulsman, P., Dorigo, W., Verhoest, N. E. C., and
Haghdoost, S.: GLEAM4: global land evaporation and soil moisture dataset at
0.1° resolution from 1980 to near present, Sci. Data, 12, 416,
<a href="https://doi.org/10.1038/s41597-025-04610-y" target="_blank">https://doi.org/10.1038/s41597-025-04610-y</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>42</label><mixed-citation>
      Mu, Q., Zhao, M., and Running, S. W.: Improvements to a MODIS global
terrestrial evapotranspiration algorithm, Remote Sens. Environ., 115,
1781–1800, <a href="https://doi.org/10.1016/j.rse.2011.02.019" target="_blank">https://doi.org/10.1016/j.rse.2011.02.019</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>43</label><mixed-citation>
      Muñoz-Sabater, J., Dutra, E., Agustí-Panareda, A., Albergel, C., Arduini, G., Balsamo, G., Boussetta, S., Choulga, M., Harrigan, S., Hersbach, H., Martens, B., Miralles, D. G., Piles, M., Rodríguez-Fernández, N. J., Zsoter, E., Buontempo, C., and Thépaut, J.-N.: ERA5-Land: a state-of-the-art global reanalysis dataset for land applications, Earth Syst. Sci. Data, 13, 4349–4383, <a href="https://doi.org/10.5194/essd-13-4349-2021" target="_blank">https://doi.org/10.5194/essd-13-4349-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>44</label><mixed-citation>
      Oki, T. and Kanae, S.: Global Hydrological Cycles and World Water Resources,
Freshw. Resour., 313, <a href="https://doi.org/10.1126/science.1128845" target="_blank">https://doi.org/10.1126/science.1128845</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>45</label><mixed-citation>
      Salvucci, G. D. and Gentine, P.: Emergent relation between surface vapor
conductance and relative humidity profiles yields evaporation rates from
weather data, Proc. Natl. Acad. Sci. USA,
<a href="https://doi.org/10.1073/pnas.1215844110" target="_blank">https://doi.org/10.1073/pnas.1215844110</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>46</label><mixed-citation>
      Santanello Jr., J. A. and Friedl, M. A.: Diurnal covariation in soil
heat flux and net radiation, J. Appl. Meteorol., 42, 851–862, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>47</label><mixed-citation>
      Savoca, M. E., Senay, G. B., Maupin, M. A., Kenny, J. F., and Perry, C. A.:
Actual evapotranspiration modeling using the operational Simplified Surface
Energy Balance (SSEBop) approach, Reston, VA,
<a href="https://doi.org/10.3133/sir20135126" target="_blank">https://doi.org/10.3133/sir20135126</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>48</label><mixed-citation>
      Scipal, K., Holmes, T., De Jeu, R., Naeimi, V., and Wagner, W.: A possible
solution for the problem of estimating the error structure of global soil
moisture data sets, Geophys. Res. Lett., 35,
<a href="https://doi.org/10.1029/2008gl035599" target="_blank">https://doi.org/10.1029/2008gl035599</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>49</label><mixed-citation>
      Stocker, B. D., Tumber-Dávila, S. J., Konings, A. G., Anderson, M. C.,
Hain, C., and Jackson, R. B.: Global patterns of water storage in the
rooting zones of vegetation, Nat. Geosci., 16, 250–256,
<a href="https://doi.org/10.1038/s41561-023-01125-2" target="_blank">https://doi.org/10.1038/s41561-023-01125-2</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>50</label><mixed-citation>
      Stoffelen, A.: Toward the true near-surface wind speed: Error modeling and
calibration using triple collocation, J. Geophys. Res. Oceans, 103,
7755–7766, <a href="https://doi.org/10.1029/97JC03180" target="_blank">https://doi.org/10.1029/97JC03180</a>, 1998.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>51</label><mixed-citation>
      Su, C. H., Ryu, D., Crow, W. T., and Western, A. W.: Beyond triple
collocation: Applications to soil moisture monitoring, J.
Geophys. Res.-Atmos., 119, 6419–6439, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>52</label><mixed-citation>
      Sun, J., McColl, K. A., Wang, Y., Rigden, A. J., Lu, H., Yang, K., Li, Y.,
and Santanello, J. A.: Global evaluation of terrestrial near-surface air
temperature and specific humidity retrievals from the Atmospheric Infrared
Sounder (AIRS), Remote Sens. Environ., 252, 112146,
<a href="https://doi.org/10.1016/j.rse.2020.112146" target="_blank">https://doi.org/10.1016/j.rse.2020.112146</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>53</label><mixed-citation>
      Teuling, A. J., Seneviratne, S. I., Stöckli, R., Reichstein, M., Moors,
E., Ciais, P., Luyssaert, S., Van Den Hurk, B., Ammann, C., Bernhofer, C.,
Dellwik, E., Gianelle, D., Gielen, B., Grünwald, T., Klumpp, K.,
Montagnani, L., Moureaux, C., Sottocornola, M., and Wohlfahrt, G.:
Contrasting response of European forest and grassland energy exchange to
heatwaves, Nat. Geosci., 3, 722–727, <a href="https://doi.org/10.1038/ngeo950" target="_blank">https://doi.org/10.1038/ngeo950</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>54</label><mixed-citation>
      Thakur, H., Raghav, P., Kumar, M., and Wolkeba, F.: Surface Flux Equilibrium
Theory-Derived Evapotranspiration Estimate Outperforms ECOSTRESS, MODIS, and
SSEBop Products, Geophys. Res. Lett., 52, e2025GL114822,
<a href="https://doi.org/10.1029/2025GL114822" target="_blank">https://doi.org/10.1029/2025GL114822</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>55</label><mixed-citation>
      United States Census Bureau: “tl_2023_us_coastline”, TIGER/Line Shapefiles, Nation, U.S., Coastline, 2023, <a href="https://catalog.data.gov/dataset/tiger-line-shapefile-2023-nation-u-s-coastline" target="_blank"/> (last access: 1 August 2025), 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>56</label><mixed-citation>
      Wei, Y., Liu, S., Huntzinger, D. N., Michalak, A. M., Viovy, N., Post, W. M., Schwalm, C. R., Schaefer, K., Jacobson, A. R., Lu, C., Tian, H., Ricciuto, D. M., Cook, R. B., Mao, J., and Shi, X.: The North American Carbon Program Multi-scale Synthesis and Terrestrial Model Intercomparison Project – Part 2: Environmental driver data, Geosci. Model Dev., 7, 2875–2893, <a href="https://doi.org/10.5194/gmd-7-2875-2014" target="_blank">https://doi.org/10.5194/gmd-7-2875-2014</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>57</label><mixed-citation>
      Xia, Y., Mitchell, K., Ek, M., Sheffield, J., Cosgrove, B., Wood, E., Luo,
L., Alonge, C., Wei, H., Meng, J., Livneh, B., Lettenmaier, D., Koren, V.,
Duan, Q., Mo, K., Fan, Y., and Mocko, D.: Continental-scale water and energy
flux analysis and validation for the North American Land Data Assimilation
System project phase 2 (NLDAS-2): 1. Intercomparison and application of
model products, J. Geophys. Res.-Atmos., 117, 2011JD016048,
<a href="https://doi.org/10.1029/2011JD016048" target="_blank">https://doi.org/10.1029/2011JD016048</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>58</label><mixed-citation>
      Yamazaki, D., Ikeshima, D., Sosa, J., Bates, P. D., Allen, G. H., and
Pavelsky, T. M.: MERIT Hydro: A High-Resolution Global Hydrography Map Based
on Latest Topography Dataset, Water Resour. Res., 55, 5053–5073,
<a href="https://doi.org/10.1029/2019WR024873" target="_blank">https://doi.org/10.1029/2019WR024873</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>59</label><mixed-citation>
      Yang, Y., Roderick, M. L., Guo, H., Miralles, D. G., Zhang, L., Fatichi, S.,
Luo, X., Zhang, Y., McVicar, T. R., Tu, Z., Keenan, T. F., Fisher, J. B.,
Gan, R., Zhang, X., Piao, S., Zhang, B., and Yang, D.: Evapotranspiration on
a greening Earth, Nat. Rev. Earth Environ., 4, 626–641,
<a href="https://doi.org/10.1038/s43017-023-00464-3" target="_blank">https://doi.org/10.1038/s43017-023-00464-3</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>60</label><mixed-citation>
      Yilmaz, M. T. and Crow, W. T.: Evaluation of Assumptions in Soil Moisture
Triple Collocation Analysis, J. Hydrometeorol., 15, 1293–1302,
<a href="https://doi.org/10.1175/JHM-D-13-0158.1" target="_blank">https://doi.org/10.1175/JHM-D-13-0158.1</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>61</label><mixed-citation>
      Yin, X., Jiang, B., Liang, S., Li, S., Zhao, X., Wang, Q., Xu, J., Han, J.,
Liang, H., Zhang, X., Liu, Q., Yao, Y., Jia, K., and Xie, X.: Significant
discrepancies of land surface daily net radiation among ten remotely sensed
and reanalysis products, Int. J. Digit. Earth, 16, 3725–3752,
<a href="https://doi.org/10.1080/17538947.2023.2253211" target="_blank">https://doi.org/10.1080/17538947.2023.2253211</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>62</label><mixed-citation>
      Zhao, M., A, G., Liu, Y., and Konings, A. G.: Evapotranspiration frequently
increases during droughts, Nat. Clim. Change, 12, 1024–1030,
<a href="https://doi.org/10.1038/s41558-022-01505-3" target="_blank">https://doi.org/10.1038/s41558-022-01505-3</a>, 807 2022.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>63</label><mixed-citation>
      Zhu, W., Yu, X., Wei, J., and Lv, A.: Surface flux equilibrium estimates of
evaporative fraction and evapotranspiration at global scale: Accuracy
evaluation and performance comparison, Agric. Water Manag., 291, 108609,
<a href="https://doi.org/10.1016/j.agwat.2023.108609" target="_blank">https://doi.org/10.1016/j.agwat.2023.108609</a>, 2024.

    </mixed-citation></ref-html>--></article>
