Unveiling the impact of potential evapotranspiration method selection on trends in hydrological cycle components across Europe

Thakur, Vishal; Markonis, Yannis; Kumar, Rohini; Thomson, Johanna Ruth; Vargas Godoy, Mijael Rodrigo; Hanel, Martin; Rakovec, Oldrich

doi:https://doi.org/10.5194/hess-29-4395-2025

Articles | Volume 29, issue 18

https://doi.org/10.5194/hess-29-4395-2025

Articles | Volume 29, issue 18

Research article

16 Sep 2025

Research article |

| 16 Sep 2025

Unveiling the impact of potential evapotranspiration method selection on trends in hydrological cycle components across Europe

Vishal Thakur, Yannis Markonis, Rohini Kumar, Johanna Ruth Thomson, Mijael Rodrigo Vargas Godoy, Martin Hanel, and Oldrich Rakovec

Abstract

Hydrological models are essential tools for assessing changes in the hydrological cycle, enabling detailed quantification of runoff (Q), total water storage (TWS), and actual evapotranspiration (AET). Precipitation (P) and potential evapotranspiration (PET) are the two major drivers in modeling these components, with the influence of P more extensively studied than PET. This study evaluates the impact of PET method selection on AET, Q, and TWS using 12 PET formulations categorized as temperature-based, radiation-based, and combinational. We applied the mesoscale Hydrological Model (mHM) to simulate 40 years of hydrological components across 553 European catchments. PET effects were analyzed through trend analysis and the data concurrence index (DCI) across three catchment categories: energy-limited, mixed, and water-limited. Our results indicate that annual and seasonal trends are variably sensitive to method choice, depending on each component and catchment category. While PET shows strong agreement in trend direction, trend magnitudes vary among different PET methods. Jensen–Haise consistently produces the highest annual and seasonal PET trend magnitudes, whereas no single method consistently yields the lowest. AET trends generally align with PET but are weaker in magnitude on an annual scale. Seasonally, only energy-limited catchments show AET trends similar to PET. For Q and TWS, most European catchments exhibit strong trend agreement across PET methods. As expected, summer is the primary contributor to annual PET trends, while for AET, its influence is most notable in energy-limited catchments. Looking at statistically significant trends, there is general agreement for PET and AET, which decline for the other hydroclimatic variables. On an annual scale, varying patterns of hydrological cycle intensification (increases in P, AET, Q, and TWS) are observed across European catchments, highlighting the influence of PET method selection. Overall, this study highlights how the PET method selection affects the quantification of hydrological trends, emphasizing the importance of method selection for robust assessment of AET, Q, and TWS.

Download & links

Article (PDF, 8745 KB)

Supplement (16807 KB)

Download & links

Article (8745 KB)
Full-text XML
Supplement (16807 KB)
BibTeX
EndNote

How to cite.

Received: 31 Oct 2024 – Discussion started: 07 Nov 2024 – Revised: 17 May 2025 – Accepted: 01 Jul 2025 – Published: 16 Sep 2025

1 Introduction

Potential evapotranspiration (PET) is the potential to evaporate water from the land surface to the atmosphere without any limitation to water availability. Although the concept has been in use for centuries, Thornthwaite (1948) was the first to formally introduce the term “potential evapotranspiration” in the scientific literature. A related but distinct concept is “reference crop evapotranspiration”, which is sometimes used interchangeably with PET. However, these terms differ in their conceptual basis and applications. Reference crop evapotranspiration specifically estimates the water requirements of a standardized reference crop under ideal conditions, whereas potential evapotranspiration provides a broader representation of water and energy exchange processes over diverse landscapes and large regions (Xiang et al., 2020). PET is used in diverse research fields. In environmental studies, PET is used for aridification research and investigating extreme events, including meteorological, agricultural, and hydrological droughts (Park et al., 2018; Zhou et al., 2023; Shi et al., 2023 a). In hydrology, it is used to determine the long-term states of catchments, such as energy-limited and water-limited catchments, and it plays a key role in the Budyko framework for estimating long-term changes in hydrological components (Reaver et al., 2022). Furthermore, PET is extensively used in hydrological modeling to define the maximum rate of possible water loss through evaporation and transpiration. It is used as one of the important input variables to simulate key hydrological components, such as actual evapotranspiration (AET), runoff (Q), and total water storage (TWS).

Since Thornthwaite's study, more than 100 empirical PET equations have been developed, ranging from simple to complex types (Proutsos et al., 2023). They can be classified mainly into three categories based on input data. (1) Temperature-based methods utilize temperature as input (Shaw and Riha, 2011). Due to their simplicity and minimal data requirement, these are widely used in hydrological modeling (Arnold et al., 1998; Liu et al., 2008). (2) Radiation-based methods require solar radiation (shortwave or net radiation) (Xu and Singh, 2000). (3) The combinational type requires temperature, radiation, wind speed, relative humidity, vapor pressure, etc. (Vicente-Serrano et al., 2014; Allen, 1998). Out of these 100+ methods, the majority are temperature-based methods (40+), followed by radiation-based methods (30+) and combination-based methods (10+) (Proutsos et al., 2023). Many of these empirical methods were initially developed and tested for particular regional scales or climatic conditions. For instance, the Thornthwaite method is most suitable for humid climates, while the Hargreaves–Samani method is particularly effective in arid and semi-arid regions. Similarly, the Hamon method is suitable for all climates. All methods in these three categories incorporate several assumptions (climatic conditions and data availability), resulting in significant differences in their estimates (Lu et al., 2005).

PET influences AET and consequently impacts the estimation of infiltration, Q and TWS in hydrological models. PET can have direct as well as indirect influence on AET. In hydrological models, AET is estimated by either separately determining water surface evaporation, soil evaporation, and vegetation transpiration and then combining these based on land use patterns or by first assessing potential evapotranspiration and subsequently adjusting it to actual evapotranspiration using the soil moisture extraction function (Zhao et al., 2013). The mesoscale Hydrological Model (mHM) explicitly represents interception, where a portion of AET is derived from interception evaporation. This process is estimated as a fraction of PET using a power function derived from Deardorff (1978) and Liang et al. (1994). When the evaporative demand exceeds the intercepted water, the interception storage is fully depleted. Interception storage in the mHM is estimated as a function of leaf area index (LAI) that varies depending on vegetation type and season. AET in the mHM is mainly contributed by canopy evaporation, soil evaporation, and open water evaporation. AET, being a key component of the water balance, affects the estimation of other water balance components (Q and TWS). While Q remains relatively insensitive, AET and TWS are more responsive to the choice of PET method (Bai et al., 2016). Hence, uncertainty in PET estimation influences the quantification of change in water cycle components.

Many studies have investigated the sensitivity of the hydrological model output to PET. Oudin et al. (2005) evaluated 27 PET methods with 4 hydrological models, concluding that PET is insensitive to runoff generation, with similar conclusions made by Aouissi et al. (2016) and Birhanu et al. (2018). Assessment of four PET methods with two monthly hydrological models reported that runoff is unaffected by the PET method, whereas AET and total water storage depend on the PET method (Bai et al., 2016). The study also concluded that calibration against the runoff is the main cause of PET insensitivity, and AET and total water storage compensate for it. In contrast to previous studies, Ndiaye et al. (2024) compared 21 PET methods for runoff estimation with three conceptual lumped hydrological models (GR4J, GR5J, and GR6J) in the Senegal River Basin, stating that better performance is shown by combination-type methods. Similarly, Pimentel et al. (2023) compared three PET methods for their accuracy in simulating runoff and AET in the large-scale hydrological model (HYPE model). They found that Hargreaves–Samani performed best in the Amazonas, central Europe, and Oceania, and Priestley–Taylor performed best in higher latitudes. These studies focus on the sensitivity and choice of PET methods in estimating hydrological components. While these findings reveal how PET methods can impact the magnitude of hydrological components, the impact of PET method selection on changes in these hydrological components is not often investigated. Temporal changes in these hydrological components are crucial for climate change mitigation, water availability, energy availability, and agricultural produce.

Trends in PET and its implication for hydrological components (AET) are examined by Anabalón and Sharma (2017). They compare trends in six PET and AET datasets, mainly estimated by the Penman–Monteith or Priestley–Taylor PET methods. They found that PET trends were highly correlated with AET trends in energy-limited regions, while the AET trends were closely correlated with precipitation trends in water-limited regions. Additionally, they reported that PET and AET trends were inversely related in certain cases, mainly due to the prevailing influence of precipitation trends on AET trends. Similarly, Liu et al. (2022) identified a strong positive relationship between PET and AET changes in most global regions and an inverse relationship with total water storage change. The study is limited by using only the Penman–Monteith approach for PET and global datasets for AET and total water storage change. The inconsistency and lack of coherence between existing PET and AET datasets often necessitate using a single PET method compared to various AET datasets. Furthermore, previous studies have primarily focused on one-to-one trend comparisons than comprehensive analysis of all hydrological cycle components, including Q and TWS. Thus, research is needed to explore the impact of changes in PET methods on changes in different hydrological components of hydrological models.

In this study, our objective is to assess the trends of PET using 12 different PET methods and their influence on the trend of hydrological components (AET, Q, and TWS) across 553 European catchments. We further evaluate the agreement among PET methods by applying the data concurrence index (DCI) to the trends of each corresponding hydrological cycle component (AET, Q, and TWS). The mesoscale Hydrological Model (mHM) is used to evaluate the influence of changes in different PET methods, from simple to most advanced approaches, on hydrological components across a range of European catchments. We chose a concurrency index to assess agreement between the PET method and hydrological components at each catchment. The data concurrency index is used to compare directions between different datasets (Anabalón and Sharma, 2017). In our research, we use it to examine directional changes in PET estimates, AET, Q, and TWS across each catchment.

2 Methods and data

2.1 Study area and catchment classification

This study includes 553 European catchments ranging in size from 500 km² to 252 000 km². Catchments were selected based on the following criteria: first, a minimum area of 500 km²; second, at least 10 years of observed discharge data from GRDC database; and third, a closed water balance condition ( $(P - Q) / P < 1$ ). The selected catchments are divided into three categories based on the aridity index: energy-limited, mixed, and water-limited (Fig. 1a). This classification is based on the aridity index (AI), estimated as the ratio of mean PET to mean precipitation, a widely used metric that quantifies the dry or wet state of the catchment (Zhang et al., 2016; Massari et al., 2022). In our approach, which involves the application of multiple PET methods, a catchment is considered energy-limited if the AI is less than one for all the PET methods. Similarly, a catchment is water-limited if all PET methods report AI greater than one. If AI values appear to be both above and below one, depending on the PET method used, then the catchment is assigned to the mixed category (Fig. 1b). Three representative catchments from each category are indicated by dark-black lines in Fig. 1a and are plotted in Fig. 1b. This classification allows us to distinguish the differences in magnitudes of PET and the other key hydrological components among the catchments (Fig. 1c). By employing this methodology out of 553 catchments, we find that 189 catchments are energy-limited, 34 are water-limited, and the remaining 330 belong to the mixed category.

https://hess.copernicus.org/articles/29/4395/2025/hess-29-4395-2025-f01

Figure 1Catchment classification into energy-limited, mixed, and water-limited categories. (a) Spatial location of catchments; black borders indicate a representative catchment of each category. (b) Classification example within the Budyko space for the representative catchments. Colored points represent three representative catchments, with each set of 12 points per color corresponding to different PET methods of representative catchments. (c) Annual time series of simulated hydrological components corresponding to each representative catchment and PET estimation method (TH: Thornthwaite; BR: Baier–Robertson; BC: Blaney–Criddle; OD: Oudin; MB: McGuinness–Bordne; HM: Hamon; HS: Hargreaves–Samani; JH: Jensen–Haise; MD: Milly–Dunne; PT: Priestley–Taylor; PM: Penman–Monteith; CO₂: Penman–Monteith[CO₂].). All units are in mm yr⁻¹.

2.2 Meteorological and geomorphological data

The Ensemble Meteorological Dataset for Planet Earth (EM-Earth; Tang et al., 2022) and ERA5-Land (Muñoz-Sabater et al., 2021) were used to calculate different PET estimates and run the mesoscale Hydrological Model (mHM; Samaniego et al., 2010; Kumar et al., 2013 b). The EM-Earth dataset (Tang et al., 2022) is derived from observed SC-Earth station data (Tang et al., 2021) and ERA5 data (Hersbach et al., 2020). It incorporates a novel optimal interpolation technique and considers the temporal inconsistencies between the station and reanalysis data (Tang et al., 2022). ERA5-Land dataset is a reanalysis data product created by the European Centre for Medium-Range Weather Forecasts (ECMWF) and has been widely used in numerous hydrological modeling studies (Muñoz-Sabater et al., 2021). Both datasets are available at 0.1° × 0.1° spatial resolution, but EM-Earth has an hourly as well as daily time step, while ERA5-Land is at the hourly scale. Meteorological forcings, particularly precipitation and temperature from EM-Earth, have been found to be more accurate based on comparisons with several datasets in data-rich regions (Tang et al., 2022).

In our analysis, we use daily temperature and precipitation from EM-Earth and radiation (long- and shortwave), surface pressure, and wind components (U and V) from ERA5-Land for the period 1980–2019 (Table 1). The EM-Earth dataset provides high-quality precipitation and temperature data and has been shown to perform well over Europe (Tang et al., 2022). It has undergone climatology-based bias correction and accounts for precipitation undercatch. However, since EM-Earth does not include all necessary variables for PET estimation, we utilize ERA5-Land as a complementary dataset. ERA5-Land has been demonstrated to perform better than other reanalysis datasets, including ERA5 and ERA-Interim (Muñoz-Sabater et al., 2021). Nonetheless, its limitations in hydrological modeling have been acknowledged by Clerc-Schwarzenbach et al. (2024) and Tarek et al. (2020). Several recent global studies follow a similar strategy, combining precipitation and temperature from EM-Earth with radiation, wind speed, and other meteorological variables from ERA5-Land (Tang et al., 2023; Yin et al., 2024; Rakovec et al., 2023). These meteorological data combinations, along with the simulated hydrological components derived from them, demonstrate lower uncertainty across Europe (Tang et al., 2023). These are homogenized to daily temporal scale and 0.125° × 0.125° spatial scale to be compatible with the previous simulations run by the mHM (Pohl et al., 2023; Fang et al., 2024). Homogenization using the nearest neighbor technique and necessary mathematical operations (appropriate unit conversion of datasets) is performed using the climate data operator (CDO; Schulzweida, 2022).

Morphological data such as leaf area index (LAI), soil properties, and terrain characteristics (such as flow direction, flow accumulation, slope, and aspect) are sourced from the mHM European database (Rakovec et al., 2016). This database originally utilized data from different sources, such as soil properties from the International Soil Reference and Information Centre (ISRIC), terrain characteristics from the US Geological Survey (USGS) and the National Geospatial-Intelligence Agency (NGA), LAI from Global Inventory Modeling and Mapping Studies (GIMMS), and land cover from Global Land Cover (GlobCover) by the European Space Agency (ESA). CO₂ concentration is sourced from Cheng et al. (2022) and is reconstructed from the Carbon Dioxide Information Analysis Center (CDIAC) data.

Tang et al. (2022)Tang et al. (2022)Tang et al. (2022)Tang et al. (2022)Muñoz-Sabater et al. (2021)Muñoz-Sabater et al. (2021)Muñoz-Sabater et al. (2021)Muñoz-Sabater et al. (2021)Cheng et al. (2022)Tucker et al. (2005)Arino et al. (2012)Hartmann and Moosdorf (2012)

Table 1Summary of meteorological and morphological data. P is precipitation; T_avg is average air temperature; T_range is the temperature range, which is the difference between maximum and minimum air temperature; T_dew is dew point temperature of air; SW is shortwave radiation; LW is longwave radiation; U is the eastward component of wind speed at 10 m; V is the northward component of wind speed at 10 m; and Con $_{{CO}_{2}}$ is CO₂ concentration.

Download Print Version | Download XLSX

2.3 Methodology

2.3.1 PET methods/formulations

We incorporate 12 PET methods at a daily scale from all three categories of estimation: temperature, radiation, and combinational methods (Table 2). Temperature-based methods require temperature data, which can include average temperature, minimum temperature, or maximum temperature. Additionally, PET methods that incorporate extraterrestrial radiation are also considered under this category. Combinational-based methods require a larger number of variables compared to temperature- and radiation-based methods to estimate various physical terms, such as wind speed and surface pressure (Table 2). Most temperature-based methods use only daily average temperature (Thornthwaite, Oudin, Hamon, Jensen–Haise, McGuinness–Bordne, and Blaney–Criddle), while Baier–Robertson employs both minimum and maximum daily temperatures. Some of these methods also include an extraterrestrial radiation term in their formulation. However, since this radiation term is calculated based on latitude and follows a consistent annual cycle varying only with the calendar date, only temperature data are needed for PET calculation. We utilize only one radiation-based method, Milly–Dunne PET, which requires only net radiation data to estimate PET. The combinational methods, such as Penman–Monteith and Priestley–Taylor, have a stronger physical basis. In our analysis, all these physical terms are estimated following Allen (1998). Additionally, within the combinational category, we employ the modified Penman–Monteith (CO₂) method, which accounts for temporal variations in changing carbon dioxide concentrations. Formulation details, including mathematical equations and associated constants for each PET method, are provided in Table A1.

Hargreaves and Samani (1985)Thornthwaite (1948)Oudin et al. (2005)Hamon (1961)Bai et al. (2016)Jensen and Haise (1963)McGuinness and Bordne (1972)Blaney (1950)Milly and Dunne (2016)Priestley and Taylor (1972)Penman (1948)Yang et al. (2019)

Table 2List of PET methods and required input data. T_max is maximum air temperature (°C), T_min is minimum air temperature (°C), Pr is surface pressure (pa), R_n is net radiation (J m⁻²), u₂ is the wind speed at 2 m from the surface (m s⁻¹), and Con $_{{CO}_{2}}$ is CO₂ concentration (ppm).

Download Print Version | Download XLSX

2.3.2 The mesoscale Hydrological Model (mHM)

The mHM is a hydrological model which explicitly accounts for sub-grid variability in hydrological processes (Samaniego et al., 2010; Kumar et al., 2013 b; Thober et al., 2019). The mHM has been successfully applied and tested in more than 1000 European basins ranging in size from 4 km² to more than 100 000 km² at various spatial resolutions and grid cell sizes (1–100 km) (Samaniego et al., 2010; Kumar et al., 2013 b; Rakovec et al., 2016, 2019; Shrestha et al., 2024). Additionally, the model is currently applied at the global scale with comparable and sometimes even improved model performance with respect to other large-scale hydrological models (Samaniego et al., 2019). The mHM demonstrates robust performance and applicability across Europe (Kumar et al., 2020). It is also one of the several large-scale hydrological models used by the WMO for their annual State of Global Water Resources reports (World Meteorological Organization (WMO), 2023).

We run the mHM (v5.12.0) over 553 European catchments, using the meteorological data from EM-Earth and the 12 different PET estimation methods. Overall, 6636 (12 × 553) mHM simulations are performed for all the study basins. The model was set up for each catchment at a daily temporal resolution and a 0.125° × 0.125° spatial scale. All meteorological forcings were kept constant with only varying PET estimates. To calculate TWS, we aggregate soil moisture at different layers, canopy interception storage, snowpack, groundwater levels, sealed area reservoirs, and unsaturated zone reservoirs at each grid cell and time step. The hydrological components (AET, Q, and TWS) and PET are averaged over the catchment area and monthly time steps. In this research default parametrization is used for model setup. The default parameterization of the mHM has been shown to perform well in previous studies (Kumar et al., 2013 a; Rakovec et al., 2016). Furthermore, it has been demonstrated as one of the best-performing configurations compared to other large-scale hydrological models (Samaniego et al., 2019). Additionally, our assessment of model performance is consistent with the findings of Samaniego et al. (2019). Our model evaluation against discharge shows that the median Kling–Gupta efficiency (KGE) ranges from 0.60 to 0.75 across most PET methods (Fig. S1 in the Supplement).

2.3.3 Trend analysis

We use Theil–Sen's slope method to calculate the magnitude and direction of linear change in PET, AET, Q, and TWS (Sen, 1968). The Mann–Kendall trend test is used to test the statistical significance of the observed trend (Kendall, 1948). A trend is considered statistically significant at the 5 % level (p≤0.05). Sen's slope is non-parametric and insensitive to outliers and types of distribution. Due to its robust application, this method is widely used in hydrology-, climate-, and environment-related studies (Anabalón and Sharma, 2017; Thackeray et al., 2022). It accounts for all possible pairs of data points from a time series and finds the median value as the slope magnitude. Equations (1) and (2) show the calculation steps of Sen's slope:

\begin{matrix} (1) & S_{k} = \frac{X_{j} - X_{i}}{t_{j} - t_{i}} where 1 \leq i < j \leq n \\ (2) & S_{med} = \{\begin{cases} S_{[\frac{n + 1}{2}]} & if n is odd \\ \frac{S_{[\frac{n}{2}]} + S_{[\frac{n + 2}{2}]}}{2} & if n is even, \end{cases} \end{matrix}

where S_k is the linear slope for pair X_i and X_j, S_med is the median slope, X_i and X_j are data points from periods t_i and t_j, and n is the number of data points in the time series. Positive S_med represents a positive trend, with the magnitude indicating the rate of increase. Similarly, negative S_med represents a negative trend, with the magnitude indicating the rate of decrease.

Here, we use the trend R package to estimate Sen's slope over a 40-year period from 1980 to 2019 at annual and seasonal (winter, spring, summer, and autumn) scales for each catchment. The units of trend at the annual scale are expressed as mm yr⁻¹, while at the seasonal scale, they are represented as mm per season per year. For instance, a summer season trend of 1 mm per season per year indicates that each year, an additional 1 mm is added to the summer season. The trends of each PET method were analyzed exclusively within seasons (for example winter season compared with winter season), without cross-seasonal comparisons.

2.3.4 Data concurrence index (DCI)

The data concurrence index quantifies the level of concurrence between the significant trends in different datasets of the same variable (Anabalón and Sharma, 2017). DCI is applied in two forms in this study. The first is the original formulation, which includes only statistically significant trends. It is used to evaluate agreement among PET methods that yield significant changes in hydrological cycle components. The second is a modified version that considers all detected trends, regardless of significance. This captures the overall agreement across all PET methods for each catchment and each hydrological component. The detailed formulations for both cases are described in Eq. (3):

\begin{matrix} (3) & \begin{aligned} DCI = \{\begin{cases} \frac{1}{N} \sum_{i = 1}^{N} δ_{i} \times \frac{S_{i}}{| S_{i} |} \\ for statistically significant trends \\ \frac{1}{N} \sum_{i = 1}^{N} \frac{S_{i}}{| S_{i} |} \\ for all trends \end{cases} \\ with δ_{i} = \{\begin{cases} 1, & if S_{i} is statistically significant \\ 0, & otherwise \end{cases} \end{aligned}, \end{matrix}

where DCI is the data concurrence index, N denotes the number of datasets, and S_i is the magnitude of the slope.

The positive DCI represents a higher number of positive slopes than negative slopes and vice versa. For instance, a DCI of 1 for AET and Q implies positive change for all the PET methods. Similarly, a DCI of −1 for AET and Q implies a negative change for all the PET methods. A DCI of 0.5 indicates that 9 out of 12 methods, or 75 % of the methods, show a positive change, and similarly, a DCI of −0.5 indicates that 9 out of 12 methods show a negative change, or 75 % of the methods. A DCI of zero denotes an equal number of positive and negative slopes (six positive and six negative). Our analysis estimates DCI from PET, AET, Q, and TWS slopes at annual as well as seasonal scales.

3 Results

3.1 Trend comparison of PET methods at annual scales

By applying the Theil–Sen slope method, we observe that changes in PET depend on the choice of PET estimation formulation (Fig. 2). Considerable variability is observed among the PET methods, with median slopes ranging from slightly positive to 6 mm yr⁻¹ during the 1980–2019 period. The Jensen–Haise method shows the highest change among all methods across different catchment categories and also has the highest absolute average PET across European catchments (Fig. S2). Generally, changes in PET are higher in water-limited than in energy-limited catchments. This difference arises since temperature-based methods depend on temperature changes. Conversely, combinational methods are influenced by more than one meteorological variable (temperature, wind speed, radiation, etc.). When we consider only statistically significant trends (p<0.05), we observe that temperature-based PET methods consistently demonstrate statistically significant positive trends for most of the catchments (Table S2). Radiation-based and combinational methods account for fewer catchments than temperature-based PET methods (Table S2), implying that they generate weaker trends compared to the temperature-based PET methods. In addition, overall trend variability among the PET methods decreases from energy-limited to water-limited catchments, irrespective of trend significance.

AET trends generally align with PET trends in energy-limited catchments but with smaller magnitudes (Fig. 2). In these catchments, all PET methods lead to a positive AET trend in terms of median value. However, a few catchments in this category reveal a slight negative change for the Blaney–Criddle, Jensen–Haise, Milly–Dunne, and Priestley–Taylor methods. For mixed catchments, the median AET trend is positive for all PET methods except Blaney–Criddle. The negative AET trends are similar to those in energy-limited catchments. Overall, the AET trend patterns (high and low trends) for energy-limited and mixed catchments are similar to the trends in PET for these catchments, regardless of trend magnitude, with a few exceptions, such as Blaney–Criddle and Jensen–Haise (Fig. 2). In water-limited catchments, both positive and negative trends in AET are observed. The pattern remains similar to PET trends with a few exceptions, such as Blaney–Criddle. Across all PET methods, statistically significant positive AET trends were found in 162 energy-limited, 217 mixed, and 2 water-limited catchments (Fig. S3). Among these significant trends, Jensen–Haise yields the highest AET trend estimates in energy-limited and mixed catchments. These statistically significant trends closely follow the overall AET trend patterns in these catchment categories.

Q trends exhibit lower sensitivity to PET methods in energy-limited and mixed catchments compared to PET trends when considering all trends (Fig. 2). Despite the positive median, a substantial fraction of catchments exhibit a negative trend in energy-limited catchments. In contrast, for mixed catchments, most PET methods produce negative Q trends, though some catchments maintain positive trends. In water-limited catchments, there is variability in PET methods; for instance, Milly–Dunne has a larger trend, whereas Blaney–Criddle shows the lowest trend. Even though PET methods are insensitive in Q, variability exists among the PET methods within each catchment category. Overall, one-third of the catchments (183) show statistically significant Q trends: 28 energy-limited, 154 mixed, and one water-limited. All energy-limited catchments show positive Q trends, except those estimated with Jensen–Haise. In mixed catchments, all statistically significant trends are negative, with the exception of Blaney–Criddle. Despite fewer catchments with statistically significant trends, the variability in Q trends across PET methods persists, particularly in energy-limited and mixed catchments.

TWS trends are sensitive to PET methods in water-limited and mixed catchments, but energy-limited catchments remain largely unaffected (Fig. 2). Despite a consistent negative median trend across PET methods in energy-limited catchments, few catchments still exhibit positive trends. Mixed catchments display a similar pattern. Among PET methods, temperature-based approaches show greater variability than radiation or combinational types in mixed catchments. In water-limited catchments, trends span positive to near-zero negative, with Blaney–Criddle, Hargreaves–Samani, and Milly–Dunne showing notable positive trends. The Blaney–Criddle method also yields higher variability in trend estimates, especially in mixed and water-limited catchments. When focusing on statistically significant trends, TWS has a similar distribution to Q ( $30 / 172 / 1$ ). Most PET methods show decreasing trends in energy-limited and mixed catchments, with the exception again of Blaney–Criddle (Fig. S3). Compared to the full trend set, statistically significant trends reveal stronger inter-method differences, with Jensen–Haise showing the steepest decline and Thornthwaite the weakest.

Hydrological trend estimates for AET, Q, and TWS vary across PET methods and catchment types, regardless of whether all trends or only statistically significant trends are considered. Statistically significant trends reveal greater divergence between methods in energy-limited and mixed catchments. AET trends, though weaker in magnitude than PET trends, show similar spatial patterns, particularly in energy-limited and mixed regions. No single PET method stands out as consistently dominant across all components. For the trends beyond the statistically significant threshold a stronger pattern emerges. Approximately 70 % of the catchments exhibit statistically significant AET trends, compared to only 33 % for Q and 36 % and TWS. Despite widespread statistical insignificance for Q and TWS, distinct regional patterns appear. For instance, northern catchments display a mix of increasing and decreasing trends, while southern regions, especially the Iberian Peninsula, consistently demonstrate positive Q trends.

https://hess.copernicus.org/articles/29/4395/2025/hess-29-4395-2025-f02

Figure 2Boxplots represent the annual trends (mm yr⁻¹) of different PET methods for PET, AET, Q, and TWS across various categories of catchments. The whiskers represent the 10th and 90th percentiles, and the box encompasses the 25th and 75th percentiles, with the median represented by the middle line of the box. Abbreviations used for different PET methods are TH: Thornthwaite; BR: Baier–Robertson; BC: Blaney–Criddle; OD: Oudin; MB: McGuinness–Bordne; HM: Hamon; HS: Hargreaves–Samani; JH: Jensen–Haise; MD: Milly–Dunne; PT: Priestley–Taylor; PM: Penman–Monteith; CO₂: Penman–Monteith[CO₂].

Download

3.2 Trend comparison of PET methods at seasonal scales

Hydrological cycle components exhibit considerable seasonal variability in trend magnitude. During summer (JJA), nearly all PET methods exhibit positive trends, with the exception of Milly–Dunne in water-limited catchments (Fig. 3). Jensen–Haise consistently shows the highest trend and greatest variability across all catchments. Notable variability is observed among PET methods in water-limited catchments. These patterns persist in statistically significant trends, despite a smaller number of catchments per category (Fig. S4). Winter records the weakest PET trends, while spring trends are comparable to those in summer (Supplement Sect. S2). The summer season is the primary contributor to the annual trends in PET across all catchment categories (Fig. S5). The distribution of statistically significant trends aligns with these findings, with the highest catchment count in spring (460) and the lowest in winter (66).

In summer, AET exhibits an overall positive trend in energy-limited and mixed catchments for all trends and for statistically significant trends only (Figs. 3, S4). The Jensen–Haise method exhibits greater variability for energy-limited and mixed catchments. In water-limited catchments, despite positive PET trends, AET trends are negative across all PET methods, with Baier–Robertson showing the strongest decline, followed by Jensen–Haise (Fig. 3). In spring, AET increases across methods, with the highest trends observed for Jensen–Haise in energy-limited and Baier–Robertson in mixed catchments. Combinational methods show consistent trends across both types (Fig. S6). This pattern is maintained for statistically significant trends (Fig. S7), with all PET methods showing positive median trends in autumn (Figs. S8, S9). Summer primarily drives annual trends in energy- and water-limited catchments, while spring and summer contribute jointly in mixed catchments depending on the PET method (Fig. S5).

Q remains generally insensitive to PET method variation across all seasons and catchment types, with minor variability among methods within each category (Figs. 3, S6, S8, S10). In energy-limited catchments, trends remain close to zero across PET methods. Mixed catchments show broadly negative Q trends across methods, with Blaney–Criddle exhibiting the weakest decline. In water-limited regions, Q trends are similarly insensitive, though radiation and combinational methods tend to exhibit positive median trends, unlike temperature-based methods, which exhibit mixed results. Statistically significant Q trends reveal limited seasonality in water-limited catchments, with notable exceptions like Blaney–Criddle, which contributes fewer catchments (Figs. S4, S7, S9, S11). In energy-limited and mixed catchments, statistically significant Q trends demonstrate higher seasonal magnitude as weaker trends fail to surpass the significance threshold. No single PET method consistently dominates in trend magnitude, though Jensen–Haise and Blaney–Criddle frequently yield the highest number of statistically significant catchments. Spring emerges as the dominant contributor to annual Q trends in mixed and water-limited catchments, while in energy-limited areas, winter and summer are most influential, depending on the PET method (Fig. S5).

TWS trends across seasons show minimal sensitivity to PET method in energy-limited and mixed catchments under all-trend conditions, though slight variability exists among PET methods (Fig. 3). Water-limited catchments, however, display a mixed response. Median trend patterns are generally stable, but Blaney–Criddle consistently shows greater variability than other methods across all seasons and catchment categories. Statistically significant trends are uncommon in water-limited catchments, but trend magnitudes increase notably in energy-limited and mixed catchments during spring, summer, and autumn. In winter, temperature-based methods demonstrate positive trends, with Blaney–Criddle and Jensen–Haise showing the lowest values (Fig. S11). Slight variability across methods is also seen in non-winter seasons (Figs. S4, S7, S9). Seasonal contributions to annual trends vary by catchment category: spring contributes most in energy-limited catchments, summer in mixed catchments and in water-limited catchments. The dominant season depends on the PET method; summer leads for several methods (TH, BR, HM, PM, and CO₂), while spring dominates for the remaining methods.

https://hess.copernicus.org/articles/29/4395/2025/hess-29-4395-2025-f03

Figure 3The boxplot represents summer season (JJA) trends of different PET methods for AET and Q across three categories of catchments: energy-limited, mixed, and water-limited. The whiskers represent the 10th and 90th percentiles, and the box encompasses the 25th and 75th percentiles, with the median represented by the black line within the box. Abbreviations used for different PET methods are TH: Thornthwaite; BR: Baier–Robertson; BC: Blaney–Criddle; OD: Oudin; MB: McGuinness–Bordne; HM: Hamon; HS: Hargreaves–Samani; JH: Jensen–Haise; MD: Milly–Dunne; PT: Priestley–Taylor; PM: Penman–Monteith; CO₂: Penman–Monteith[CO₂]. Trend units are in mm per season per year.

Download

3.3 Catchment-wise DCI distribution across annual and seasonal scales

Even though there is a strong agreement across different PET methods in annual PET and AET trends, substantial variation exists in Q and TWS responses (Fig. 4). All catchments exhibit strong positive DCI for PET, indicating that at least 75 % of methods report a positive trend. AET follows a similar pattern, with high positive DCI values across northern (Scandinavia), central, western, eastern, southeastern (Balkans), and parts of southern (Iberian Peninsula) Europe, except for a few southern catchments with low as well as negative DCI. In contrast, Q reflects strong positive agreement in southern regions but mostly negative DCI in northern and eastern Europe. Central and western European catchments are marked by both strong positive and negative DCI, with few catchments showing disagreement among PET methods. TWS shows widespread disagreement, especially in southern, western, central, southeastern, and northern European catchments. Most central, eastern, and northern catchments show strong negative agreement for TWS. Overall, PET methods show high directional consistency for PET and AET but diverge notably for Q and TWS.

The above findings appear consistent when the selection of PET methods becomes balanced to eight methods: four temperature-based, one radiation-based, and three combinational methods. The latter two are fixed across all 70 possible method combinations. All the 70 combination results are consistent with the previous findings. For TWS, only a few catchments in southern and western Europe show strong negative DCI, consistent with earlier areas of disagreement (Figs. S12, S13). To assess the impact of weaker trends, we applied the DCI using only statistically significant trends. PET shows similar agreement patterns to the all-trend analysis, with minor disagreement in western Europe. For AET, disagreement emerges in most southern catchments and some in western, central, and eastern Europe, which previously showed agreement. Q and TWS exhibit widespread disagreement, largely due to weak trends across PET methods. Catchments with strong agreement under statistically significant trends align well with those under all-trend analysis. PET and AET trends remain predominantly positive in both statistically significant and all-trend evaluations.

To better understand the annual changes in the level of concurrence between all PET method trends, we decompose them into sub-seasonal values. Figure 5 shows strong positive agreement for PET and AET across all seasons, while Q and TWS exhibit predominantly negative agreement in central Europe, with evident regional variation. PET demonstrates high positive DCI in most catchments during spring, summer, and autumn, indicating consistent upward trends. In winter, agreement is highest in central, eastern, and southern Europe, but weaker elsewhere. AET shows strong seasonal agreement in central Europe, particularly in winter, spring, and summer, while disagreement emerges in northern and western regions during winter and in several southern and central catchments during autumn. AET also exhibits strong negative agreement in southern Europe during summer. Q consistently reflects a negative DCI across central Europe in all seasons, although southern catchments show positive agreement in spring and autumn. TWS trends show persistent negative DCI in central Europe across seasons, while southern Europe sees a shift from strong positive agreement in spring to strong negative agreement in summer, with weak agreement in winter and autumn. Northern Europe shows a mix of agreement and disagreement across all seasons.

Comparing sub-seasonal concurrence based on statistically significant trends (Fig. S14) highlights both consistencies and divergences compared to the all-trend results. Strong positive DCI for PET and AET is observed in most catchments during spring, which aligns with the findings from all trends. In summer, PET shows strong agreement in central and southern regions. In contrast, Q and, TWS exhibit substantial disagreement across all seasons, with most catchments showing low concurrence among PET methods. Similar inconsistencies are observed for PET and AET in winter. Some central and eastern catchments consistently exhibit strong negative DCI for Q and TWS in both spring and summer, reflecting similar patterns in the all-trend case. Applying statistical significance thresholds often shifts catchments from strong agreement (positive or negative) to disagreement, primarily because trends are no longer significant. Notably, no catchments switch from strong positive to strong negative DCI (or vice versa), confirming the consistency between statistically significant and all-trend results.

https://hess.copernicus.org/articles/29/4395/2025/hess-29-4395-2025-f04

Figure 4Spatial distribution of annual-scale data concurrence index (DCI) for PET, AET, Q, and TWS. PET represents potential evapotranspiration, AET represents actual evapotranspiration, Q represents runoff at the outlet of the catchment, and TWS represents total water storage.

https://hess.copernicus.org/articles/29/4395/2025/hess-29-4395-2025-f05

Figure 5Spatial distribution of seasonal-scale (winter (DJF), spring (MAM), summer (JJA), and autumn (SON)) DCI for PET, AET, Q, and TWS, where DCI represents data concurrence index, PET represents potential evapotranspiration, AET represents actual evapotranspiration, Q represents runoff at the outlet of the catchment, and TWS represents total water storage.

3.4 PET methods and patterns of hydrological cycle at annual and seasonal scales

In the previous section, we compare PET methods and their influence on individual hydrological components (P, AET, Q, and TWS), including agreement among PET methods. Here, we assess the influence of PET methods on patterns of key hydrological components across European catchments, identifying the most prevalent trend patterns where components (P, AET, Q, and TWS) concurrently increase or decrease. For instance, one pattern involves all components showing positive trends. Figure 6 summarizes these patterns, presenting the average and total number of catchments associated with each PET method. The analysis includes only the five most frequent patterns, covering the majority of catchments and excluding those patterns with minimal representation.

Most European catchments exhibit increasing trends across all hydrological components. The second-most common pattern involves a decrease in P, Q, and TWS, with an increase in AET. Temperature-based PET methods generally account for a larger number of catchments for these patterns than radiation and combinational methods, though exceptions exist, such as Baier–Robertson and Blaney–Criddle. Catchment distributions are spatially consistent across methods for these two patterns (Fig. S15), with Blaney–Criddle showing the highest counts for patterns involving uniform increases or decreases. In the case of statistically significant trends, only the all-positive pattern is prominent (Fig. S16), with an average of 80 catchments. This pattern reveals notable differences between temperature-based and combinational methods. Overall, PET methods differ in the number of catchments assigned to each pattern; however, combinational methods consistently demonstrate similar catchment counts across most patterns.

Seasonal analyses reveal distinct PET method preferences across hydrological component patterns. During winter, spring, and autumn, the prevailing pattern involves decreasing P, Q, and TWS with increasing AET (Figs. S17, S18, S20). In contrast, summer is characterized by increased P and AET and decreased Q and TWS. Substantial variation is observed among PET methods for each hydrological cycle pattern across all seasons. For instance, for a pattern where all components exhibit positive trends, Baier–Robertson captures the fewest catchments, while Blaney–Criddle captures the most. The pattern reverses for the combination of decreased P, Q, and TWS with increased AET. Blaney–Criddle consistently represents the highest number of catchments in all-positive and all-negative patterns during spring, summer, and autumn. Combinational methods generally show stable catchment counts, whereas temperature-based methods exhibit greater variability. Statistically significant trends, however, are associated with very few catchments, and no single pattern dominates.

https://hess.copernicus.org/articles/29/4395/2025/hess-29-4395-2025-f06

Figure 6Pattern of different hydrological cycle components and the corresponding influence of PET methods on an annual scale. Panel (a) represents different patterns of hydrological cycle components. Each vertical column in this table corresponds to one pattern of hydrological cycle components. For example, the first column is filled with “+” signs, indicating that all hydrological components (P, AET, Q, and TWS) exhibit positive changes. The “+” and “−” signs denote positive and negative changes in the respective components. In panel (b), each bar represents the average number of catchments for each hydrological cycle pattern. Panel (c) shows the number of catchments associated with each PET method for the corresponding hydrological cycle patterns. The color of each cell represents a specific PET method, and each column aligns with the hydrological cycle pattern represented in the corresponding column of panel (a), where P is precipitation, AET is actual evapotranspiration, Q is runoff, and TWS is total water storage. Abbreviations used for different PET methods are TH: Thornthwaite; BR: Baier–Robertson; BC: Blaney–Criddle; OD: Oudin; MB: McGuinness–Bordne; HM: Hamon; HS: Hargreaves–Samani; JH: Jensen–Haise; MD: Milly–Dunne; PT: Priestley–Taylor; PM: Penman–Monteith; CO₂: Penman–Monteith[CO₂].

Download

3.5 Relationship of PET and precipitation

Precipitation is an important component of the hydrological cycle. Figure 7 shows the changes in precipitation (P) without considering statistical significance. Annually, positive P trends dominate northern, western, southern, and southeastern catchments, while central Europe shows mixed positive and negative trends. In western Europe, a few catchments exhibit decreasing P trends (Fig. 7a). Seasonally, southern catchments experience increased P in winter, spring, and autumn but declines in summer. Southeastern Europe shows consistent P increases across all seasons, while eastern Europe exhibits negative P trends in summer and autumn and positive trends in winter and spring (Fig. 7a). P demonstrates a higher correlation with Q and TWS across all catchment categories (Fig. 7c). This suggests that P has a greater influence on Q and TWS than PET. In energy-limited catchments, AET is mainly driven by PET. In mixed catchments, both P and PET contribute to AET. In water-limited catchments, AET is mainly influenced by P. When we consider statistically significant P trends, only 129 catchments show significant trends at the annual scale. Across seasons, the number of statistically significant catchments varies from 20 to 61 (Fig. S21). Despite the limited number of statistically significant catchments, our findings regarding the influence of P and PET on AET, Q, and TWS remain consistent with all trends.

https://hess.copernicus.org/articles/29/4395/2025/hess-29-4395-2025-f07

Figure 7Spatial variation in precipitation (P) trends and their relationship with other hydrological components across catchment categories. Panel (a) shows the spatial distribution of increasing and decreasing annual P trends. Panel (b) illustrates the seasonal variation in increasing and decreasing P trends. Panel (c) represents the median correlation between P and AET, Q, and TWS, as well as between PET and AET, Q, and TWS, for each catchment category across all PET methods.

4 Discussion

4.1 Trends in P, PET, AET, Q, and TWS across Europe

PET methods consistently exhibit upward annual trends across European catchments (Fig. 2), aligning with findings by Anabalón and Sharma (2017), who reported similar increases using diverse PET datasets. Among the methods, Jensen–Haise consistently produces the highest absolute values, a pattern observed in various regions by other studies (Kingston et al., 2009; Hanselmann et al., 2024; Seiller and Anctil, 2016). This method relies on temperature and extraterrestrial radiation data, with the latter remaining constant annually. Thus, the observed trends are primarily temperature-driven. Shi et al. (2023 b) found that Jensen–Haise trends align closely with Penman–Monteith, which is used as a benchmark for evaluating other PET methods. In our study, we observe a notable distinction between the Jensen–Haise and Penman–Monteith PET methods, primarily driven by differences in meteorological forcings. While temperature is the dominant factor in Jensen–Haise, Penman–Monteith is more influenced by radiation, followed by temperature, vapor pressure deficit, and wind speed (Maček et al., 2018). Conversely, the Milly–Dunne method consistently shows the lowest trends in water-limited catchments, with lower net radiation-driven trends in southern Europe (Pfeifroth et al., 2018). Sensitivity analyses of each PET method with meteorological forcings are beyond the scope of this research.

The AET trend is proportional to the PET trend in energy-limited and mixed catchments. In water-limited regions, there is not enough water to evaporate, and it is mainly governed by available water (P) (Bruno and Duethmann, 2024). This results in a notable decline in AET when compared to PET. Anabalón and Sharma (2017) similarly reported stronger correlations between AET and PET in energy-limited regions, while AET in water-limited areas aligned more closely with precipitation. Notably, their study did not differentiate between PET methods but relied on existing datasets. In our analysis, AET exhibits the same directional changes as precipitation at both annual and seasonal scales in water-limited catchments (Fig. 7). Despite differences among catchment categories, PET methods demonstrate strong positive or negative agreement in AET trends.

Runoff trends vary with PET method selection in water-limited catchments but remain largely insensitive to PET methods in energy-limited and mixed catchments. This aligns with previous findings (Bai et al., 2015; Oudin et al., 2005; Seiller and Anctil, 2016), which reported insensitivity of runoff to PET formulations. This insensitivity is often attributed to hydrological model calibration, where PET impacts are counterbalanced by parameterization. Notably, despite the absence of individual PET method calibration in our study, runoff in energy-limited and mixed catchments remained insensitive to PET variation. This is likely due to the strong correlation between precipitation and runoff trends (Fig. 7c), which often outweighs the impact of PET (Berghuijs et al., 2017; Anabalón and Sharma, 2017). The strong negative agreement among PET methods for runoff in central European catchments is similarly related to their strong correlation with precipitation (Fig. 7a).

Total water storage (TWS) appears to be insensitive in energy-limited and mixed catchments, while it exhibits variability in the trend of different PET methods in water-limited catchments. Bai et al. (2016) observed that TWS in energy-limited catchments is more strongly impacted by PET than in water-limited catchments, though their study focused solely on Chinese catchments. More recently, Boeing et al. (2024) reported a decline in TWS over Germany, consistent with our findings that TWS decreases in energy-limited and mixed catchments for all PET methods. In hydrological models, TWS compensates for long-term changes; i.e., higher PET results in lower TWS in energy-limited catchments, whereas water-limited catchments are primarily governed by precipitation (Bai et al., 2016). This is in line with our findings on the opposite TWS trends relative to AET and a stronger agreement among PET methods in energy-limited and mixed catchments.

Precipitation generally increases across most catchments, with annual and seasonal patterns largely consistent. Exceptions occur in summer for the southern catchment and spring for eastern catchments, consistent with findings by Caloiero et al. (2018) and Markonis et al. (2019). Precipitation has a stronger influence on the hydrological cycle than AET in low- and mid-latitude regions compared to higher latitudes (Zhang et al., 2019). This spatial distribution explains the stronger correlations observed in mixed and energy-limited catchments, predominantly located in mid-latitudes and high latitudes. Globally, Q is more sensitive to P than PET, supporting the observed decline in Q despite rising PET across many catchments (Berghuijs et al., 2017).

4.2 Methodological sensitivity of PET estimation methods

PET methods vary in both absolute values and trends, even within the same category, due to structural and empirical differences in their formulations. For instance, while Jensen–Haise and Hargreaves–Samani both incorporate extraterrestrial radiation and air temperature, Hargreaves–Samani includes a diurnal temperature range term absent in Jensen–Haise. Thornthwaite, in contrast, relies solely on a heat index derived from monthly temperature. Among combinational methods, Penman–Monteith and its modified version differ by the inclusion of a CO₂ concentration term. Structurally similar methods like McGuinness–Bordne, Oudin, and Jensen–Haise diverge primarily due to empirically derived constants tailored to specific regions or climates (Proutsos et al., 2023). Moreover, catchment size does not significantly influence the overall findings related to PET and hydrological components (Fig. S22), consistent with Tang et al. (2023), who observed that spatial averaging over larger catchments reduces uncertainty and enhances reliability.

Even though our study's experimental design varies from the global analysis of Pimentel et al. (2023), both utilize large-scale hydrological models at the basin scale. Pimentel et al. (2023) evaluated three PET methods to identify the optimal approach for estimating PET, AET, and Q across the globe. Across Europe, they reported that Jensen–Haise in northern Europe (energy-limited), Hargreaves in central (mixed), and Priestley–Taylor alongside Hargreaves–Samani in southern Europe (water-limited) perform better in PET estimation. For AET estimation, Jensen–Haise remains the preferred method in northern Europe, with Hargreaves–Samani leading in central Europe and Priestley–Taylor in southern Europe. Priestley–Taylor was also deemed the most effective for runoff estimation. In contrast, our findings indicate a consistent distinction between Jensen–Haise and Penman–Monteith for PET across all catchment types. Similarly, for AET, Jensen–Haise consistently shows higher trends than Penman–Monteith in both energy-limited and water-limited catchments. For runoff, Priestley–Taylor and Penman–Monteith exhibit similar patterns in energy-limited and mixed catchments; however, in water-limited catchments, Priestley–Taylor yields more pronounced changes than Penman–Monteith.

4.3 Patterns of hydrological cycle components

When we look at the combination of changes among the hydrological cycle components (P, AET, Q, TWS) across European catchments, two dominant patterns of changes are observed: a water cycle intensification pattern of simultaneous increase in all components and an aridification pattern of simultaneous decline in all except AET. These two patterns can be seen in over 60 % of European catchments. However, when focusing only on significant trends, the pattern of intensification becomes more dominant, declining approximately to 15 % (80 catchments) compared to 0.4 % (2 catchments) aridification. The intensification pattern aligns with findings by Teuling et al. (2019), who reported rising P, AET, and Q in central–western Europe and declines in these components in the Mediterranean. Their analysis, based on the Penman–Monteith method, also indicated fewer catchments classified under this pattern compared to temperature-based methods, which show stronger responses. This is a vivid example of how PET method selection could amplify or dampen our estimates about hydrological cycle intensification.

In contrast, the aridification pattern (decrease in P, Q, and TWS and increase in AET) suggests that water reserves are being depleted to sustain evapotranspiration, a mechanism particularly evident in water-limited regions. Bruno and Duethmann (2024) noted that rising atmospheric demand, without sufficient water supply, results in reduced Q and TWS. Massari et al. (2022) similarly reported that increasing AET contributes to Q reductions in water-limited regions. Even with decreasing P and Q, continued declines in TWS appear to support increases in AET (Massari et al., 2022). For Europe, this is also very relevant to the compound warm season droughts that have been reported to increase since the beginning of the century (Markonis et al., 2021), as well as to the conditions that favor the onset and propagation of flash droughts (Shah et al., 2023). Since the evaporative demand is expected to further increase in the future (Rakovec et al., 2022), it is essential to acknowledge the uncertainties due to PET method selection.

4.4 Implications of PET method selection for hydroclimatic regime classification

The selection of PET methods significantly influences the hydroclimatic classification of catchments. Traditionally, catchments are categorized as water-limited or energy-limited based on the aridity index. However, our study introduces a third category, termed “mixed”, which lacks a physical basis but highlights the critical role of PET method selection in defining catchment types. This is when a PET method that consistently estimates higher values may shift a catchment from energy-limited to water-limited, whereas a method with lower estimates could reverse this shift. This underscores the importance of method selection, where variations in PET estimates can alter the hydroclimatic classification. Similarly, Zhang et al. (2016) introduced a less common classification termed “equitant”, which applies a single PET method to calculate the aridity index. Such methodological differences can lead to inconsistencies in catchment classification. For instance, Kuentz et al. (2017) use the Jensen–Haise method, Ajami et al. (2017) utilize the Priestley–Taylor method, and Zhang et al. (2016) adopt the Penman method for aridity index estimation. In our analysis, excluding PET methods that consistently estimate higher PET values, such as Blaney–Criddle, Jensen–Haise, and McGuinness–Bordne, resulted in 42 % of catchments transitioning from the mixed to the energy-limited category (Fig. S23). Detailed catchment shifts based on various combinations are outlined in Table S1.

4.5 Limitations and future research

Our study comes with certain limitations that pave the way for future research. One key limitation is the uncertainty associated with the input data used to calculate PET methods. Previous studies have shown that temperature-based methods are sensitive to temperature; radiation-based methods to radiation; and combinational methods to multiple variables, including temperature, radiation, relative humidity, and wind speed (Hua et al., 2020; Guo et al., 2017). In addition, we use a monthly time step, which tends to mask the influence of PET method selection on AET extremes. These extremes can behave differently from the mean state of AET, potentially leading to different implications for changes in hydrological cycle fluxes compared to those based on mean AET values (Markonis, 2025). Finally, we limited our analysis to one precipitation product to isolate the specific impact of the PET method. However, precipitation is widely recognized as the most sensitive meteorological input, with extensive studies highlighting its uncertainties (Mazzoleni et al., 2019; Markonis et al., 2024). Our findings also confirm its dominant influence over PET in certain catchment categories (Fig. 7). This identifies a potential gap for exploring the combination of precipitation with PET for more accurate simulations of hydrological cycle components. Although we selected meteorological datasets with comparatively lower uncertainty, data quality, whether from observational or reanalysis sources, remains an issue for hydrological assessments. While our focus was on methodological comparisons among PET methods, future research could benefit from multi-source assessments to enhance the robustness and reliability of hydrological modeling.

Large-scale hydrological models, including the mHM, typically rely on default parameterization. In the mHM, these parameters were initially developed using German basins, as outlined by Samaniego et al. (2010) and Kumar et al. (2013 a). Since then, the mHM has been extensively tested across various basins and hydrological variables (Rakovec et al., 2016; Samaniego et al., 2019; Boeing et al., 2024). For instance, Rakovec et al. (2016) evaluated discharge simulations across 400 European catchments using 36 parameter sets, demonstrating consistent model performance regardless of parameterization, thereby reinforcing confidence in the model's reliability. Similar approaches have been applied in global water models; Beck et al. (2017) employed ensemble parameters derived from 10 catchments, while Kumar et al. (2013 a) tested default parameters from European basins across 80 American catchments with varying climatic conditions. While these studies demonstrate the robustness of default parameterization, investigating how PET-specific calibration affects hydrological trends could provide valuable insights for future research. Notably, the Hargreaves–Samani PET method, used in developing these parameters in the mHM, demonstrated the best model performance in this study but did not consistently stand out compared to other PET methods in trend analysis across hydrological components. Moreover, the study is confined to temperate European catchments, leaving a gap in the assessment of arid and tropical climates, where distinct patterns may emerge, while the Penman–Monteith[CO₂] did not exhibit substantial differences compared to the Penman–Monteith method, indicating the need for further exploration of this method. It would be interesting to assess their impact under changing climate conditions and their implications for trend assessments.

5 Summary and conclusions

Twelve PET methods were used to evaluate their impacts on changes in the components of the hydrological cycle using the mesoscale Hydrological Model (mHM). These methods were applied across 553 European catchments, which vary in size and include different European climate types. These catchments were classified as water-limited, energy-limited, and mixed catchments based on their aridity index. Changes in PET and hydrological components were assessed using Sen’s slope for trend magnitude and the Mann–Kendall test for statistical significance. We analyzed our results using two cases: first, using all trends, and second, using only statistically significant trends. To assess the agreement between different PET methods, we used the data concurrence index for the period 1980 to 2019. The results demonstrate that the choice of PET method can substantially affect changes in AET, Q, and TWS, especially in water-limited and mixed catchments, with smaller changes and greater variability observed in water-limited catchments on an annual scale. Seasonal variations in changes and agreement between PET methods were also observed, as discussed in detail in Sect. 3.2 and 3.3. In general, there is agreement among the different methods that, since 1980, PET and AET are increasing over Europe, while runoff and total water storage exhibit mixed fluctuations depending on the method used and the catchment latitude. The key findings of our study are summarized as follows:

PET is increasing across European catchments. The majority of the PET methods indicate a positive trend in all categories of catchments, but the increase rates differ among the methods employed.
At the annual scale for all trends, the Jensen–Haise PET method stands out by consistently showing the highest trends for PET and AET across all catchment categories. The Milly–Dunne (energy-based) method is notable as the only one to exhibit a negative trend for water-limited catchments. Regarding Q and TWS, the PET methods display different changes and variability, with no method consistently showing either the lowest or highest trends for these hydrological components. Conclusions remain consistent when considering only statistically significant trends.
The trend patterns between PET and AET are similar across all methods for the hydrological cycle components. However, Q and TWS do not exhibit the same pattern and appear to be less sensitive to choice of PET methods. Most PET methods agree on the trend direction for PET and AET, but in a few catchments, the trends for Q and TWS show an opposite direction. The negative trends in Q and TWS are primarily due to negative precipitation trends, which have a stronger impact on these components in all catchment categories.
At the seasonal scale, PET methods reveal different trends for PET, with no method consistently showing the highest or lowest trends across all seasons. However, the Jensen–Haise method shows the highest PET trends during spring, autumn, and summer. AET trends follow a similar pattern to PET in all seasons. The PET methods show strong agreement on trend direction for central and southern European catchments, especially for PET, but there is less agreement for northern catchments in winter. Strong negative agreement is found for Q and TWS in summer and spring, while disagreement is observed for AET in central and southern catchments during autumn.
The summer season contributes more to the annual PET trends than any other season across all catchment categories. Similarly, for AET, the summer season has a higher contribution to the annual AET trend in energy-limited and water-limited catchments. For runoff (Q), the spring season contributes more in mixed and water-limited catchments. For TWS, the spring season has a higher contribution in energy-limited catchments.
Overall, the magnitude of trends varied between PET methods for PET and the hydrological components (AET, Q, and TWS). The use of a specific PET method in a hydrological model can notably affect studies focused on the hydrological cycle.
Precipitation primarily governs trends in all hydrological components and catchment types, except for AET in energy-limited catchments, which is largely influenced by the choice of PET variations.
The choice of PET method substantially influences hydrological patterns across European catchments on both annual and seasonal scales. Combinational methods generally account for fewer catchments than temperature-based methods in dominant hydrological patterns. This observation remains consistent for statistically significant trends as well.
In the case of statistically significant trends, the conclusions remain consistent with those from the all-trend analysis. The only difference is the reduced number of catchments, primarily omitted due to weaker trends.

Our research demonstrates the critical role of PET method selection and its implications for quantifying fluctuations in the hydrological cycle. Our findings reveal that two methods notably deviate from the others. Specifically, the Jensen–Haise method shows higher trend values, while the Milly–Dunne method exhibits lower trends in water-limited catchments. Consequently, we recommend exercising caution when applying these methods as they appear to be outliers. Despite these variations, the PET methods generally agree that atmospheric moisture demand is increasing across Europe, reflecting recent shifts in temperature and radiation. The observed variability in trend magnitudes emphasizes the importance of careful PET method selection to ensure robust and representative assessments of hydrological trends.

Appendix A: Potential evapotranspiration formulation

Hargreaves and Samani (1985)McGuinness and Bordne (1972)Hamon (1961)Oudin et al. (2005)Bai et al. (2016)Blaney (1950)Thornthwaite (1948)Jensen and Haise (1963)Priestley and Taylor (1972)Milly and Dunne (2016)Penman (1948)Yang et al. (2019)

Table A1Formulations of PET methods, where R_e is extraterrestrial radiation (MJ m⁻² d⁻¹), λ is the latent heat of vaporization (MJ kg⁻¹), ρ is water density (= 1000 kg m⁻³), d_a is air density (kg m⁻³), T_a is air temperature (°C), T_d is dew point temperature (°C), T_max is maximum air temperature (°C), T_min is minimum air temperature (°C), Δ is the slope of the vapor pressure curve (kPa °C⁻¹), γ is the psychrometric constant (kPa °C⁻¹), e_s is saturation vapor pressure (kPa), e_a is actual vapor pressure (kPa), u₂ is wind speed 2 m above the soil surface (m s⁻¹), R_s is net shortwave radiation (MJ m⁻² d⁻¹), R_n is net incoming solar radiation (MJ m⁻² d⁻¹), G is soil heat flux (MJ m⁻² d⁻¹), RH is relative humidity ( %), DL is day length (h d⁻¹), I is annual heat index, and CO₂ is carbon dioxide concentration (ppm). The following abbreviations are used throughout the paper to refer to the respective PET methods: TH (Thornthwaite), BR (Baier–Robertson), BC (Blaney–Criddle), OD (Oudin), MB (McGuinness–Bordne), HM (Hamon), HS (Hargreaves–Samani), JH (Jensen–Haise), MD (Milly–Dunne), PT (Priestley–Taylor), PM (Penman–Monteith), and CO₂ (Penman–Monteith[CO₂]).

Download Print Version | Download XLSX

Code and data availability

The data and code necessary to reproduce the final analysis and figures are available in the Zenodo repository (https://doi.org/10.5281/zenodo.17077902; Thakur, 2024).

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/hess-29-4395-2025-supplement.

Author contributions

VT: conceptualization, PET calculation, hydrological model simulations, postprocessing, formal analysis, and writing (original draft); YM: conceptualization, supervision, writing (review and editing); RK: conceptualization, writing (review and editing); JRT: writing (review and editing); MRVG: writing (review and editing); MH: writing (review and editing); OR: setup of hydrological model, supervision, conceptualization, writing (review and editing).

Competing interests

At least one of the (co-)authors is a member of the editorial board of Hydrology and Earth System Sciences. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

Computational resources were provided by the e-INFRA CZ project (ID:90254), supported by the Ministry of Education, Youth and Sports of the Czech Republic. Oldrich Rakovec also acknowledges the Research Excellence in Environmental Sciences (REES) project of the Faculty of Environmental Sciences, Czech University of Life Sciences Prague.

Financial support

This research has been supported by the Grantová Agentura České Republiky (grant no. 22-33266M) and the Česká Zemědělská Univerzita v Praze (grant no. 2023B0008).

Review statement

This paper was edited by Elena Toth and reviewed by Franziska Clerc-Schwarzenbach and two anonymous referees.

References

Ajami, H., Sharma, A., Band, L. E., Evans, J. P., Tuteja, N. K., Amirthanathan, G. E., and Bari, M. A.: On the non-stationarity of hydrological response in anthropogenically unaffected catchments: an Australian perspective, Hydrol. Earth Syst. Sci., 21, 281–294, https://doi.org/10.5194/hess-21-281-2017, 2017. a

Allen, R. G. (Ed.): Crop evapotranspiration: guidelines for computing crop water requirements, no. 56 in FAO irrigation and drainage paper, Food and Agriculture Organization of the United Nations, Rome, ISBN 978-92-5-104219-9, 1998. a, b

Anabalón, A. and Sharma, A.: On the divergence of potential and actual evapotranspiration trends: An assessment across alternate global datasets, Earth's Future, 5, 905–917, https://doi.org/10.1002/2016EF000499, 2017. a, b, c, d, e, f

Aouissi, J., Benabdallah, S., Lili Chabaâne, Z., and Cudennec, C.: Evaluation of potential evapotranspiration assessment methods for hydrological modelling with SWAT – Application in data-scarce rural Tunisia, Agr. Water Manage., 174, 39–51, https://doi.org/10.1016/j.agwat.2016.03.004, 2016. a

Arino, O., Ramos Perez, J. J., Kalogirou, V., Bontemps, S., Defourny, P., and Van Bogaert, E.: Global Land Cover Map for 2009 (GlobCover 2009), PANGAEA [data set], https://doi.org/10.1594/PANGAEA.787668, 2012. a

Arnold, J. G., Srinivasan, R., Muttiah, R. S., and Williams, J. R.: large area hydrologic modeling and assessment Part I: model development, J. Am. Water Resour. Assoc., 34, 73–89, https://doi.org/10.1111/j.1752-1688.1998.tb05961.x, 1998. a

Bai, P., Liu, X., Liang, K., and Liu, C.: Comparison of performance of twelve monthly water balance models in different climatic catchments of China, J. Hydrol., 529, 1030–1040, https://doi.org/10.1016/j.jhydrol.2015.09.015, 2015. a

Bai, P., Liu, X., Yang, T., Li, F., Liang, K., Hu, S., and Liu, C.: Assessment of the Influences of Different Potential Evapotranspiration Inputs on the Performance of Monthly Hydrological Models under Different Climatic Conditions, J. Hydrometeorol., 17, 2259–2274, https://doi.org/10.1175/JHM-D-15-0202.1, 2016. a, b, c, d, e, f

Beck, H. E., van Dijk, A. I. J. M., de Roo, A., Dutra, E., Fink, G., Orth, R., and Schellekens, J.: Global evaluation of runoff from 10 state-of-the-art hydrological models, Hydrol. Earth Syst. Sci., 21, 2881–2903, https://doi.org/10.5194/hess-21-2881-2017, 2017. a

Berghuijs, W. R., Larsen, J. R., van Emmerik, T. H. M., and Woods, R. A.: A Global Assessment of Runoff Sensitivity to Changes in Precipitation, Potential Evaporation, and Other Factors, Water Resour. Res., 53, 8475–8486, https://doi.org/10.1002/2017WR021593, 2017. a, b

Birhanu, D., Kim, H., Jang, C., and Park, S.: Does the Complexity of Evapotranspiration and Hydrological Models Enhance Robustness?, Sustainability, 10, 2837, https://doi.org/10.3390/su10082837, 2018. a

Blaney, H. F. and Criddle, W. P.: Determining water requirements in irrigated areas from climatological and irrigation data, USDA (SCS), Washington, D.C., 1–48, 1950. a, b

Boeing, F., Wagener, T., Marx, A., Rakovec, O., Kumar, R., Samaniego, L., and Attinger, S.: Increasing influence of evapotranspiration on prolonged water storage recovery in Germany, Environ. Res. Lett., 19, 024047, https://doi.org/10.1088/1748-9326/ad24ce, 2024. a, b

Bruno, G. and Duethmann, D.: Increases in Water Balance‐Derived Catchment Evapotranspiration in Germany During 1970s–2000s Turning Into Decreases Over the Last Two Decades, Despite Uncertainties, Geophys. Res. Lett., 51, e2023GL107753, https://doi.org/10.1029/2023GL107753, 2024. a, b

Caloiero, T., Caloiero, P., and Frustaci, F.: Long‐term precipitation trend analysis in Europe and in the Mediterranean basin, Water Environ. J., 32, 433–445, https://doi.org/10.1111/wej.12346, 2018. a

Cheng, W., Dan, L., Deng, X., Feng, J., Wang, Y., Peng, J., Tian, J., Qi, W., Liu, Z., Zheng, X., Zhou, D., Jiang, S., Zhao, H., and Wang, X.: Global monthly gridded atmospheric carbon dioxide concentrations under the historical and future scenarios, Sci. Data, 9, 83, https://doi.org/10.1038/s41597-022-01196-7, 2022. a, b

Clerc-Schwarzenbach, F., Selleri, G., Neri, M., Toth, E., van Meerveld, I., and Seibert, J.: Large-sample hydrology – a few camels or a whole caravan?, Hydrol. Earth Syst. Sci., 28, 4219–4237, https://doi.org/10.5194/hess-28-4219-2024, 2024. a

Deardorff, J. W.: Efficient prediction of ground surface temperature and moisture, with inclusion of a layer of vegetation, J. Geophys. Res.-Oceans, 83, 1889–1903, https://doi.org/10.1029/JC083iC04p01889, 1978. a

Fang, B., Bevacqua, E., Rakovec, O., and Zscheischler, J.: An increase in the spatial extent of European floods over the last 70 years, Hydrol. Earth Syst. Sci., 28, 3755–3775, https://doi.org/10.5194/hess-28-3755-2024, 2024. a

Guo, D., Westra, S., and Maier, H. R.: Sensitivity of potential evapotranspiration to changes in climate variables for different Australian climatic zones, Hydrol. Earth Syst. Sci., 21, 2107–2126, https://doi.org/10.5194/hess-21-2107-2017, 2017. a

Hamon, W. R.: Estimating Potential Evapotranspiration, J. Hydraul. Div., 87, 107–120, https://doi.org/10.1061/JYCEAJ.0000599, 1961. a, b

Hanselmann, N., Osuch, M., Wawrzyniak, T., and Alphonse, A. B.: Evaluating potential evapotranspiration methods in a rapidly warming Arctic region, SW Spitsbergen (1983–2023), J. Hydrol., 56, 101979, https://doi.org/10.1016/j.ejrh.2024.101979, 2024. a

Hargreaves and Samani: Reference Crop Evapotranspiration from Temperature, Appl. Eng. Agr., 1, 96–99, https://doi.org/10.13031/2013.26773, 1985. a, b

Hartmann, J. and Moosdorf, N.: The new global lithological map database GLiM: A representation of rock properties at the Earth surface, Geochem. Geophy. Geosy., 13, 2012GC004370, https://doi.org/10.1029/2012GC004370, 2012. a

Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz‐Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, https://doi.org/10.1002/qj.3803, 2020. a

Hua, D., Hao, X., Zhang, Y., and Qin, J.: Uncertainty assessment of potential evapotranspiration in arid areas, as estimated by the Penman-Monteith method, J. Arid Land, 12, 166–180, https://doi.org/10.1007/s40333-020-0093-7, 2020. a

Jensen, M. E. and Haise, H. R.: Estimating Evapotranspiration from Solar Radiation, J. Irr. Drain.-Div., 89, 15–41, https://doi.org/10.1061/JRCEA4.0000287, 1963. a, b

Kendall, M.: Rank correlation methods, Rank correlation methods, Griffin, Oxford, England, ISBN 0-85264-199-0 , 1948. a

Kingston, D. G., Todd, M. C., Taylor, R. G., Thompson, J. R., and Arnell, N. W.: Uncertainty in the estimation of potential evapotranspiration under climate change, Geophys. Res. Lett., 36, 2009GL040267, https://doi.org/10.1029/2009GL040267, 2009. a

Kuentz, A., Arheimer, B., Hundecha, Y., and Wagener, T.: Understanding hydrologic variability across Europe through catchment classification, Hydrol. Earth Syst. Sci., 21, 2863–2879, https://doi.org/10.5194/hess-21-2863-2017, 2017. a

Kumar, R., Livneh, B., and Samaniego, L.: Toward computationally efficient large-scale hydrologic predictions with a multiscale regionalization scheme, Water Resour. Res., 49, 5700–5714, https://doi.org/10.1002/wrcr.20431, 2013a. a, b, c

Kumar, R., Samaniego, L., and Attinger, S.: Implications of distributed hydrologic model parameterization on water fluxes at multiple scales and locations, Water Resour. Res., 49, 360–379, https://doi.org/10.1029/2012WR012195, 2013b. a, b, c

Kumar, R., Heße, F., Rao, P. S. C., Musolff, A., Jawitz, J. W., Sarrazin, F., Samaniego, L., Fleckenstein, J. H., Rakovec, O., Thober, S., and Attinger, S.: Strong hydroclimatic controls on vulnerability to subsurface nitrate contamination across Europe, Nat. Commun., 11, 6302, https://doi.org/10.1038/s41467-020-19955-8, 2020. a

Liang, X., Lettenmaier, D. P., Wood, E. F., and Burges, S. J.: A simple hydrologically based model of land surface water and energy fluxes for general circulation models, J. Geophys. Res.-Atmos., 99, 14415–14428, https://doi.org/10.1029/94JD00483, 1994. a

Liu, C., Wang, Z., Zheng, H., Zhang, L., and Wu, X.: Development of Hydro-Informatic Modelling System and its application, Sci. China Ser. E, 51, 456–466, https://doi.org/10.1007/s11431-008-0040-x, 2008. a

Liu, Y., Jiang, Q., Wang, Q., Jin, Y., Yue, Q., Yu, J., Zheng, Y., Jiang, W., and Yao, X.: The divergence between potential and actual evapotranspiration: An insight from climate, water, and vegetation change, Sci. Total Environ., 807, 150648, https://doi.org/10.1016/j.scitotenv.2021.150648, 2022. a

Lu, J., Sun, G., McNulty, S. G., and Amatya, D. M.: A COMPARISON OF SIX POTENTIAL EVAPOTRANSPIRATION METHODS FOR REGIONAL USE IN THE SOUTHEASTERN UNITED STATES, J. Am. Water Resour. Assoc., 41, 621–633, https://doi.org/10.1111/j.1752-1688.2005.tb03759.x, 2005. a

Maček, U., Bezak, N., and Šraj, M.: Reference evapotranspiration changes in Slovenia, Europe, Agr. Forest Meteorol., 260-261, 183–192, https://doi.org/10.1016/j.agrformet.2018.06.014, 2018. a

Markonis, Y.: On the Definition of Extreme Evaporation Events, Geophys. Res. Lett., 52, e2024GL113038, https://doi.org/10.1029/2024GL113038, 2025. a

Markonis, Y., Papalexiou, S. M., Martinkova, M., and Hanel, M.: Assessment of Water Cycle Intensification Over Land using a Multisource Global Gridded Precipitation DataSet, J. Geophys. Res.-Atmos., 124, 11175–11187, https://doi.org/10.1029/2019JD030855, 2019. a

Markonis, Y., Kumar, R., Hanel, M., Rakovec, O., Máca, P., and AghaKouchak, A.: The rise of compound warm-season droughts in Europe, Sci. Adv., 7, eabb9668, https://doi.org/10.1126/sciadv.abb9668, 2021. a

Markonis, Y., Vargas Godoy, M. R., Pradhan, R. K., Pratap, S., Thomson, J. R., Hanel, M., Paschalis, A., Nikolopoulos, E., and Papalexiou, S. M.: Spatial partitioning of terrestrial precipitation reveals varying dataset agreement across different environments, Commun. Earth Environ., 5, 217, https://doi.org/10.1038/s43247-024-01377-9, 2024. a

Massari, C., Avanzi, F., Bruno, G., Gabellani, S., Penna, D., and Camici, S.: Evaporation enhancement drives the European water-budget deficit during multi-year droughts, Hydrol. Earth Syst. Sci., 26, 1527–1543, https://doi.org/10.5194/hess-26-1527-2022, 2022. a, b, c

Mazzoleni, M., Brandimarte, L., and Amaranto, A.: Evaluating precipitation datasets for large-scale distributed hydrological modelling, J. Hydrol., 578, 124076, https://doi.org/10.1016/j.jhydrol.2019.124076, 2019. a

McGuinness, J. L. and Bordne, E. F.: A comparison of lysimeter-derived potential evapotranspiration with computed values, US Department of Agriculture, ISBN 0082-9811, 1972. a, b

Milly, P. C. D. and Dunne, K. A.: Potential evapotranspiration and continental drying, Nat. Clim. Change, 6, 946–949, https://doi.org/10.1038/nclimate3046, 2016. a, b

Muñoz-Sabater, J., Dutra, E., Agustí-Panareda, A., Albergel, C., Arduini, G., Balsamo, G., Boussetta, S., Choulga, M., Harrigan, S., Hersbach, H., Martens, B., Miralles, D. G., Piles, M., Rodríguez-Fernández, N. J., Zsoter, E., Buontempo, C., and Thépaut, J.-N.: ERA5-Land: a state-of-the-art global reanalysis dataset for land applications, Earth Syst. Sci. Data, 13, 4349–4383, https://doi.org/10.5194/essd-13-4349-2021, 2021. a, b, c, d, e, f, g

Ndiaye, P. M., Bodian, A., Dezetter, A., Ogilvie, A., and Goudiaby, O.: Sensitivity of global hydrological models to potential evapotranspiration estimation methods in the Senegal River Basin (West Africa), J. Hydrol., 53, 101823, https://doi.org/10.1016/j.ejrh.2024.101823, 2024. a

Oudin, L., Michel, C., and Anctil, F.: Which potential evapotranspiration input for a lumped rainfall-runoff model?, J. Hydrol., 303, 275–289, https://doi.org/10.1016/j.jhydrol.2004.08.025, 2005. a, b, c, d

Park, C.-E., Jeong, S.-J., Joshi, M., Osborn, T. J., Ho, C.-H., Piao, S., Chen, D., Liu, J., Yang, H., Park, H., Kim, B.-M., and Feng, S.: Keeping global warming within 1.5 °C constrains emergence of aridification, Nat. Clim. Change, 8, 70–74, https://doi.org/10.1038/s41558-017-0034-4, 2018. a

Penman, H. L.: Natural evaporation from open water, bare soil and grass, Proc. Roy. Soc. Lond. Ser. A, 193, 120–145, 1948. a, b

Pfeifroth, U., Sanchez‐Lorenzo, A., Manara, V., Trentmann, J., and Hollmann, R.: Trends and Variability of Surface Solar Radiation in Europe Based On Surface‐ and Satellite‐Based Data Records, J. Geophys. Res.-Atmos., 123, 1735–1754, https://doi.org/10.1002/2017JD027418, 2018. a

Pimentel, R., Arheimer, B., Crochemore, L., Andersson, J. C. M., Pechlivanidis, I. G., and Gustafsson, D.: Which Potential Evapotranspiration Formula to Use in Hydrological Modeling World‐Wide?, Water Resour. Res., 59, e2022WR033447, https://doi.org/10.1029/2022WR033447, 2023. a, b, c

Pohl, F., Rakovec, O., Rebmann, C., Hildebrandt, A., Boeing, F., Hermanns, F., Attinger, S., Samaniego, L., and Kumar, R.: Long-term daily hydrometeorological drought indices, soil moisture, and evapotranspiration for ICOS sites, Sci. Data, 10, 281, https://doi.org/10.1038/s41597-023-02192-1, 2023. a

Priestley, C. H. B. and Taylor, R. J.: On the assessment of surface heat flux and evaporation using large-scale parameters, Mon. Weather Rev., 100, 81–92, 1972. a, b

Proutsos, N., Tigkas, D., Tsevreni, I., Alexandris, S. G., Solomou, A. D., Bourletsikas, A., Stefanidis, S., and Nwokolo, S. C.: A Thorough Evaluation of 127 Potential Evapotranspiration Models in Two Mediterranean Urban Green Sites, Remote Sens., 15, 3680, https://doi.org/10.3390/rs15143680, 2023. a, b, c

Rakovec, O., Kumar, R., Mai, J., Cuntz, M., Thober, S., Zink, M., Attinger, S., Schäfer, D., Schrön, M., and Samaniego, L.: Multiscale and Multivariate Evaluation of Water Fluxes and States over European River Basins, J. Hydrometeorol., 17, 287–307, https://doi.org/10.1175/JHM-D-15-0054.1, 2016. a, b, c, d, e

Rakovec, O., Mizukami, N., Kumar, R., Newman, A. J., Thober, S., Wood, A. W., Clark, M. P., and Samaniego, L.: Diagnostic Evaluation of Large‐Domain Hydrologic Models Calibrated Across the Contiguous United States, J. Geophys. Res.-Atmos., 124, 13991–14007, https://doi.org/10.1029/2019JD030767, 2019. a

Rakovec, O., Samaniego, L., Hari, V., Markonis, Y., Moravec, V., Thober, S., Hanel, M., and Kumar, R.: The 2018–2020 Multi‐Year Drought Sets a New Benchmark in Europe, Earth's Future, 10, e2021EF002394, https://doi.org/10.1029/2021EF002394, 2022. a

Rakovec, O., Kumar, R., Shrestha, P. K., and Samaniego, L.: Global assessment of hydrological components using a seamless multiscale modelling system, EGU General Assembly 2023, Vienna, Austria, 24–28 Apr 2023, EGU23-11945, https://doi.org/10.5194/egusphere-egu23-11945, 2023. a

Reaver, N. G. F., Kaplan, D. A., Klammler, H., and Jawitz, J. W.: Theoretical and empirical evidence against the Budyko catchment trajectory conjecture, Hydrol. Earth Syst. Sci., 26, 1507–1525, https://doi.org/10.5194/hess-26-1507-2022, 2022. a

Samaniego, L., Kumar, R., and Attinger, S.: Multiscale parameter regionalization of a grid-based hydrologic model at the mesoscale, Water Resour. Res., 46, W05523, https://doi.org/10.1029/2008WR007327, 2010. a, b, c, d

Samaniego, L., Thober, S., Wanders, N., Pan, M., Rakovec, O., Sheffield, J., Wood, E. F., Prudhomme, C., Rees, G., Houghton-Carr, H., Fry, M., Smith, K., Watts, G., Hisdal, H., Estrela, T., Buontempo, C., Marx, A., and Kumar, R.: Hydrological Forecasts and Projections for Improved Decision-Making in the Water Sector in Europe, B. Am. Meteorol. Soc., 100, 2451–2472, https://doi.org/10.1175/BAMS-D-17-0274.1, 2019. a, b, c, d

Schulzweida, U.: CDO User Guide, Zenodo, https://doi.org/10.5281/ZENODO.7112925, 2022. a

Seiller, G. and Anctil, F.: How do potential evapotranspiration formulas influence hydrological projections?, Hydrolog. Sci. J., 61, 2249–2266, https://doi.org/10.1080/02626667.2015.1100302, 2016. a, b

Sen, P. K.: Estimates of the Regression Coefficient Based on Kendall's Tau, J. Am. Stat. Assoc., 63, 1379–1389, https://doi.org/10.1080/01621459.1968.10480934, 1968. a

Shah, J., Kumar, R., Samaniego, L., Markonis, Y., Hanel, M., Attinger, S., Hari, V., and Rakovec, O.: On the role of antecedent meteorological conditions on flash drought initialization in Europe, Environ. Res. Lett., 18, 064039, https://doi.org/10.1088/1748-9326/acd8d3, 2023. a

Shaw, S. B. and Riha, S. J.: Assessing temperature-based PET equations under a changing climate in temperate, deciduous forests, Hydrol. Process., 25, 1466–1478, https://doi.org/10.1002/hyp.7913, 2011. a

Shi, H., Luo, G., Hellwich, O., He, X., Kurban, A., De Maeyer, P., and Van de Voorde, T.: Global dryland aridity changes indicated by atmospheric, hydrological, and vegetation observations at meteorological stations, Hydrol. Earth Syst. Sci., 27, 4551–4562, https://doi.org/10.5194/hess-27-4551-2023, 2023a. a

Shi, L., Wang, B., Liu, D. L., Feng, P., Cleverly, J., Li, L., Zhang, G., and Yu, Q.: Performance of potential evapotranspiration models across different climatic stations in New South Wales, Australia, J. Hydrol., 50, 101573, https://doi.org/10.1016/j.ejrh.2023.101573, 2023b. a

Shrestha, P. K., Samaniego, L., Rakovec, O., Kumar, R., Mi, C., Rinke, K., and Thober, S.: Toward Improved Simulations of Disruptive Reservoirs in Global Hydrological Modeling, Water Resour. Res., 60, e2023WR035433, https://doi.org/10.1029/2023WR035433, 2024. a

Tang, G., Clark, M. P., and Papalexiou, S. M.: SC-Earth: A Station-Based Serially Complete Earth Dataset from 1950 to 2019, J. Climate, 34, 6493–6511, https://doi.org/10.1175/JCLI-D-21-0067.1, 2021. a

Tang, G., Clark, M. P., and Papalexiou, S. M.: EM-Earth: The Ensemble Meteorological Dataset for Planet Earth, B. Am. Meteorol. Soc., 103, E996–E1018, https://doi.org/10.1175/BAMS-D-21-0106.1, 2022. a, b, c, d, e, f, g, h, i

Tang, G., Clark, M. P., Knoben, W. J. M., Liu, H., Gharari, S., Arnal, L., Beck, H. E., Wood, A. W., Newman, A. J., and Papalexiou, S. M.: The Impact of Meteorological Forcing Uncertainty on Hydrological Modeling: A Global Analysis of Cryosphere Basins, Water Resour. Res., 59, e2022WR033767, https://doi.org/10.1029/2022WR033767, 2023. a, b, c

Tarek, M., Brissette, F. P., and Arsenault, R.: Evaluation of the ERA5 reanalysis as a potential reference dataset for hydrological modelling over North America, Hydrol. Earth Syst. Sci., 24, 2527–2544, https://doi.org/10.5194/hess-24-2527-2020, 2020. a

Teuling, A. J., de Badts, E. A. G., Jansen, F. A., Fuchs, R., Buitink, J., Hoek van Dijke, A. J., and Sterling, S. M.: Climate change, reforestation/afforestation, and urbanization impacts on evapotranspiration and streamflow in Europe, Hydrol. Earth Syst. Sci., 23, 3631–3652, https://doi.org/10.5194/hess-23-3631-2019, 2019. a

Thackeray, C. W., Hall, A., Norris, J., and Chen, D.: Constraining the increased frequency of global precipitation extremes under warming, Nat. Clim. Change, 12, 441–448, https://doi.org/10.1038/s41558-022-01329-1, 2022. a

Thakur, V.: Data for plots and analsysis, Zenodo [data set], https://doi.org/10.5281/zenodo.14008649, 2024. a

Thober, S., Cuntz, M., Kelbling, M., Kumar, R., Mai, J., and Samaniego, L.: The multiscale routing model mRM v1.0: simple river routing at resolutions from 1 to 50 km, Geosci. Model Dev., 12, 2501–2521, https://doi.org/10.5194/gmd-12-2501-2019, 2019. a

Thornthwaite, C. W.: An Approach toward a Rational Classification of Climate, Geograph. Rev., 38, 55, https://doi.org/10.2307/210739, 1948. a, b, c

Tucker, C. J., Pinzon, J. E., Brown, M. E., Slayback, D. A., Pak, E. W., Mahoney, R., Vermote, E. F., and El Saleous, N.: An extended AVHRR 8‐km NDVI dataset compatible with MODIS and SPOT vegetation NDVI data, Int. J. Remote Sens., 26, 4485–4498, https://doi.org/10.1080/01431160500168686, 2005. a

Vicente-Serrano, S. M., Azorin-Molina, C., Sanchez-Lorenzo, A., Revuelto, J., López-Moreno, J. I., González-Hidalgo, J. C., Moran-Tejeda, E., and Espejo, F.: Reference evapotranspiration variability and trends in Spain, 1961–2011, Global Planet. Change, 121, 26–40, https://doi.org/10.1016/j.gloplacha.2014.06.005, 2014. a

World Meteorological Organization (WMO): State of Global Water Resources 2022, Tech. rep., Geneva, Switzerland, https://wmo.int/publication-series/state-of-global-water-resources-2022 (last access: 2 July 2025), 2023. a

Xiang, K., Li, Y., Horton, R., and Feng, H.: Similarity and difference of potential evapotranspiration and reference crop evapotranspiration – a review, Agr. Water Manage., 232, 106043, https://doi.org/10.1016/j.agwat.2020.106043, 2020. a

Xu, C.-Y. and Singh, V. P.: Evaluation and generalization of radiation-based methods for calculating evaporation, Hydrol. Process., 14, 339–349, https://doi.org/10.1002/(SICI)1099-1085(20000215)14:2<339::AID-HYP928>3.0.CO;2-O, 2000. a

Yang, Y., Roderick, M. L., Zhang, S., McVicar, T. R., and Donohue, R. J.: Hydrologic implications of vegetation response to elevated CO₂ in climate projections, Nat. Clim. Change, 9, 44–48, https://doi.org/10.1038/s41558-018-0361-0, 2019. a, b

Yin, Z., Lin, P., Riggs, R., Allen, G. H., Lei, X., Zheng, Z., and Cai, S.: A synthesis of Global Streamflow Characteristics, Hydrometeorology, and Catchment Attributes (GSHA) for large sample river-centric studies, Earth Syst. Sci. Data, 16, 1559–1587, https://doi.org/10.5194/essd-16-1559-2024, 2024. a

Zhang, D., Liu, X., Zhang, Q., Liang, K., and Liu, C.: Investigation of factors affecting intra-annual variability of evapotranspiration and streamflow under different climate conditions, J. Hydrol., 543, 759–769, https://doi.org/10.1016/j.jhydrol.2016.10.047, 2016. a, b, c

Zhang, Y., He, B., Guo, L., Liu, J., and Xie, X.: The relative contributions of precipitation, evapotranspiration, and runoff to terrestrial water storage changes across 168 river basins, J. Hydrol., 579, 124194, https://doi.org/10.1016/j.jhydrol.2019.124194, 2019. a

Zhao, L., Xia, J., Xu, C.-y., Wang, Z., Sobkowiak, L., and Long, C.: Evapotranspiration estimation methods in hydrological models, J. Geograph. Sci., 23, 359–369, https://doi.org/10.1007/s11442-013-1015-9, 2013. a

Zhou, J., Li, Q., Ye, A., Xu, S., Yuan, Y., Xu, S., Zhang, D., Zhao, X., Zhu, Y., Zhao, Y., Xue, D., Dou, J., Liu, C., Shi, W., Wei, W., and Yang, X.: An improved methodology for quantifying the impact of human activities on hydrological drought change, J. Hydrol., 50, 101603, https://doi.org/10.1016/j.ejrh.2023.101603, 2023. a

Articles

Short summary

Understanding the changes in water movement in earth is crucial for everyone. To quantify this water movement there are several techniques. We examined how different methods of estimating evaporation impact predictions of various types of water movement across Europe. We found that, while these methods generally agree on whether changes are increasing or decreasing, they differ in magnitude. This means selecting the right evaporation method is crucial for accurate predictions of water movement.