The effects of spatial and temporal resolution of gridded meteorological forcing on watershed hydrological responses

Shuai, Pin; Chen, Xingyuan; Mital, Utkarsh; Coon, Ethan T.; Dwivedi, Dipankar

doi:https://doi.org/10.5194/hess-26-2245-2022

Articles | Volume 26, issue 8

https://doi.org/10.5194/hess-26-2245-2022

Articles | Volume 26, issue 8

Research article

02 May 2022

Research article |

| 02 May 2022

The effects of spatial and temporal resolution of gridded meteorological forcing on watershed hydrological responses

Pin Shuai, Xingyuan Chen, Utkarsh Mital, Ethan T. Coon, and Dipankar Dwivedi

Abstract

Meteorological forcing plays a critical role in accurately simulating the watershed hydrological cycle. With the advancement of high-performance computing and the development of integrated watershed models, simulating the watershed hydrological cycle at high temporal (hourly to daily) and spatial resolution (tens of meters) has become efficient and computationally affordable. These hyperresolution watershed models require high resolution of meteorological forcing as model input to ensure the fidelity and accuracy of simulated responses. In this study, we utilized the Advanced Terrestrial Simulator (ATS), an integrated watershed model, to simulate surface and subsurface flow and land surface processes using unstructured meshes at the Coal Creek Watershed near Crested Butte (Colorado). We compared simulated watershed hydrologic responses including streamflow and distributed variables such as evapotranspiration, snow water equivalent (SWE), and groundwater table driven by three publicly available, gridded meteorological forcings (GMFs) – Daily Surface Weather and Climatological Summaries (Daymet), the Parameter-elevation Regressions on Independent Slopes Model (PRISM), and the North American Land Data Assimilation System (NLDAS). By comparing various spatial resolutions (ranging from 400 m to 4 km) of PRISM, the simulated streamflow only becomes marginally worse when spatial resolution of meteorological forcing is coarsened to 4 km (or 30 % of the watershed area). However, the 4 km-resolution has much worse performance than finer resolution in spatially distributed variables such as SWE. Using the temporally disaggregated PRISM, we compared models forced by different temporal resolutions (hourly to daily), and sub-daily resolution preserves the dynamic watershed responses (e.g., diurnal fluctuation of streamflow) that are absent in results forced by daily resolution. Conversely, the simulated streamflow shows better performance using daily resolution compared to that using sub-daily resolution. Our findings suggest that the choice of GMF and its spatiotemporal resolution depends on the quantity of interest and its spatial and temporal scale, which may have important implications for model calibration and watershed management decisions.

Download & links

How to cite.

Received: 01 Oct 2021 – Discussion started: 12 Oct 2021 – Revised: 14 Mar 2022 – Accepted: 30 Mar 2022 – Published: 02 May 2022

1 Introduction

The accuracy of meteorological forcings such as precipitation plays a crucial role in simulating the watershed hydrological cycle. With the advancement of high-performance computing and the development of integrated hydrologic models (e.g., Amanzi-Advanced Terrestrial Simulator – ATS Coon et al., 2019, ParFlow, Kollet and Maxwell, 2006, and HydroGeoSphere, Aquanty, 2015), simulating the watershed hydrological cycle at high temporal and spatial resolution has become possible (Wood et al., 2011). These models often require gridded meteorological forcing (GMF), which is typically fused from various sources, including ground-based gages, radar, satellite remote sensing, as well as regional and global climate models. Due to different interpolation methods and data sources, GMF is available at different spatial and temporal resolutions and contains considerable uncertainties (Schreiner‐McGraw and Ajami, 2020).

Recently, GMF products, notably Daily Surface Weather and Climatological Summaries (Daymet) (Thornton et al., 1997, 2021), the Parameter-elevation Regressions on Independent Slopes Model (PRISM) (Daly et al., 2008), and the North American Land Data Assimilation System (NLDAS) (Mitchell, 2004; Xia et al., 2012)), have become popular for hydrologic modeling within the conterminous United States (CONUS) owing to their temporally and spatially complete coverage and relatively high spatiotemporal resolution. Past studies have compared and evaluated the performance of GMF against weather stations (Behnke et al., 2016; Daly et al., 2008; Muche et al., 2020). Daly et al. (2008) presented a detailed comparison between PRISM and Daymet and found that, for the products available in 2008, PRISM outperforms Daymet, especially in mountainous and coastal areas of the western US. Behnke et al. (2016) compared eight widely used meteorological forcing datasets, including Daymet, PRISM, and NLDAS against Global Historical Climatology Network-Daily (GHCN-D) stations across the CONUS. They found that different interpolation methods affected the accuracy of downscaled meteorological data, and care should be taken when selecting meteorological forcing for a given region. In a similar study, Muche et al. (2020) compared four GMFs (i.e., Daymet, PRISM, NLDAS, and the Global Land Data Assimilation System – GLDAS) as precipitation data sources and evaluated the precipitation estimates at GHCN-D stations within the Delaware Watershed at Perry Lake in eastern Kansas. They showed that precipitation from Daymet and PRISM were more closely matched with precipitation collected at GHCN-D than that from NLDAS and GLDAS.

Understanding the bias and fidelity of each meteorological forcing and the effects of meteorological forcing spatiotemporal resolution on simulated watershed responses is important for accurate simulations of watershed processes. Previous studies have evaluated the impact of different GMFs on model-simulated surface runoff (Muche et al., 2020; Behnke et al., 2016; Gao et al., 2017; Elsner et al., 2014). Using the Soil and Water Assessment Tool (SWAT), Muche et al. (2020) evaluated model performance on simulated streamflow against observation under different GMFs. They found that the simulated streamflow yielded a higher correlation when driven by PRISM and Daymet than those by NLDAS and GLDAS. Eum et al. (2014) evaluated hydrologic responses using the Variable Infiltration Capacity (VIC) model forced by three GMFs available in Canada. They found notable differences in simulated surface runoff during the snowmelt period but not so much during the snowfall period. However, these studies mostly focused on meteorological forcing effects on surface runoff and ignored other relevant hydrological processes (e.g., snowmelt, evapotranspiration – ET –, and subsurface flow). In addition, these studies used either semi-distributed models (e.g., SWAT) or coarse regional-scale land surface models (e.g., VIC), which do not fully utilize the GMFs at their finest resolutions.

Compared to semi-distributed models, fully distributed, integrated hydrologic models are favorable in simulating watershed hydrologic responses to changes in climate forcing as they can preserve the spatial heterogeneity of inputs from GMF and provide a spatially distributed representation of both surface and subsurface flow processes. Recently, Maina et al. (2020) used the ParFlow-Community Land Model (CLM), a fully distributed, process-based watershed model, to study the effect of spatial resolution of meteorological forcing (0.5 to 40.5 km) generated from the Weather Research and Forecasting (WRF) model on spatially resolved hydrologic responses, including snow water equivalent (SWE), ET, infiltration, surface ponded depth, and groundwater table. Using the Cosumnes Watershed as a test bed, they found that most hydrologic variables were seasonally and spatially dependent on the different spatial resolutions of the meteorological forcing. Although climate models such as WRF provide alternative GMF at any given spatiotemporal resolution, they require extensive expert knowledge in setting up and running the models and thus are less popular compared to publicly available GMFs (e.g., Daymet, PRISM, and NLDAS). To our knowledge, few, if any, studies have utilized the common GMFs to investigate the impact of spatial resolution of meteorological forcing on both watershed cumulative variables (e.g., streamflow) and distributed variables (e.g., SWE, ET, and groundwater level).

The temporal resolution of meteorological forcing, especially precipitation, plays an important role in the timing of runoff generation. It is particularly important for flood volume modeling (Ficchì et al., 2016), flood forecasting (Wetterhall et al., 2011), and hydrodynamic modeling in urban catchments (Ochoa-Rodriguez et al., 2015; Bruni et al., 2015). The temporal resolution of rainfall inputs has been shown to affect the simulation of surface runoff more strongly than variations in spatial resolution during storm events (Ochoa-Rodriguez et al., 2015). High temporal resolution is also important for studying watershed biogeochemical cycling since sub-daily meteorological forcing could induce diurnal snowmelt that produces regular infiltration of cold, chemically distinct snow water into the soil which alters the soil temperature and chemical composition of soil water and groundwater (Petrone et al., 2007; Woelber et al., 2018). Despite the importance of the temporal resolution of input forcing, the impact of GMF temporal resolution on watershed hydrodynamics has largely been overlooked. For example, a daily timestep is used routinely in watershed hydrologic modeling, and the simulated daily streamflow is generally used to compare to observed daily streamflow even though sub-daily streamflow measurement is collected at most United States Geological Survey (USGS) stream gages.

The objectives of this study are to intercompare three widely available GMFs (i.e., PRISM, Daymet, and NLDAS) and to evaluate the impact of meteorological forcing spatial and temporal resolution on simulated watershed hydrologic responses including streamflow, ET, SWE, soil moisture, ponded surface water depth, and groundwater table. We choose ATS as the integrated watershed model to couple surface and subsurface flows with land surface processes (Coon et al., 2019). The model can fully resolve the meteorological forcing at a much finer resolution (≤100 m) using unstructured triangular grids. We seek to understand the impact of meteorological forcing by comparing model simulations to field observations including GHCN-D stations, USGS stream gages, and remote-sensing products. We aim to answer the following questions.

How would different GMFs at their native resolutions impact the simulated streamflow, distributed variables such as SWE?
What are the effects of spatial and temporal resolution of GMF on simulated streamflow and spatially distributed variables?
Is spatial resolution more important than temporal resolution of the GMF for watershed hydrologic simulations?

To address these questions, we perform different numerical experiments using ATS by forcing the model with various spatial and temporal resolutions of GMFs. We choose a mountainous watershed due to its complex terrain and heterogeneous weather conditions, which provides an ideal test bed for studying the impact of meteorological forcing spatiotemporal resolution on watershed dynamic responses. The findings from this study are relevant for the use of the GMF dataset in watershed hydrologic simulations using fully distributed watershed models in mountainous watersheds. It also provides important implications for watershed calibration using inverse modeling.

2 Methods

2.1 Study site

Our study site is located in the Coal Creek Watershed (Hydrologic Unit Code (HUC) 140200010204) with an area of 53.2 km² located within the larger East Taylor Watershed (HUC 14020001) near Crested Butte, in southwestern Colorado (Fig. 1). The Coal Creek Watershed is a high alpine, snow-dominated catchment, characterized as warm summer, humid continental climate in the Köppen classification system (Koppen and Geiger, 1930). It receives ∼850 mm of precipitation annually, with ∼530 mm as snowfall which was estimated from the long-term Daymet forcing dataset (Thornton et al., 2021). The primary land cover types are evergreen forest (62.6 %) and shrub (20.5 %). This watershed has strong variations in topography and land cover, which is representative of many headwater, mountainous watersheds in the western US.

https://hess.copernicus.org/articles/26/2245/2022/hess-26-2245-2022-f01

Figure 1Map of the Coal Creek Watershed in relation to the larger East Taylor Watershed as well as its relative location in the western US. Also shown are the locations of GHCN-D stations, the USGS stream gage, the National Hydrography Dataset Plus (NHDPlus) stream network, and the digital elevation model (DEM). The marked points (A–D) and (1)–(4) are point locations used to observe groundwater table and surface ponded depth from the model, respectively.

2.2 ATS model setup

ATS is an integrated, distributed hydrologic code that solves the diffusion wave approximation of the St-Venant equations for surface flow coupled to the Richards equation for flow in variably saturated porous media in the subsurface (Coon et al., 2019, 2020). The Richards equation is described as

\begin{matrix} (1) & \frac{\partial}{\partial t} (ϕ s) + ▿ \cdot q = 0, \end{matrix}

with

\begin{matrix} (2) & q = - \frac{1}{μ} k_{r} κ (▿ p + ρ g), \end{matrix}

where ϕ is the effective porosity (–), s is the saturation (–), q is the Darcy flux (m s⁻¹), μ is the dynamic viscosity (Pa s⁻¹), k_r is the relative permeability (–), κ is the saturated hydraulic permeability (m²), p is the water pressure (Pa), and g is the gravitational constant (m s⁻²).

The diffusive wave approximation to overland flow is described as

\begin{matrix} (3) & \frac{\partial h}{\partial t} + ▿ \cdot (h v) = Q_{w} + Q_{ss}, \end{matrix}

with

\begin{matrix} (4) & v = - \frac{h^{2 / 3}}{n \cdot \max (ϵ, \sqrt{▿ z})} ▿ (z + h), \end{matrix}

where h is the depth of ponded water (m), v is the surface flow velocity (m s⁻¹), Q_w are all external source/sink terms (m s⁻¹), Q_ss is the exchange flux between surface and subsurface systems (m s⁻¹), n is Manning's coefficient (s m $^{- 1 / 3}$ ), z is surface elevation (m), and ϵ is a small positive regularization to keep the equations non-singular in places with zero bed slope (m).

The ATS meshes including surface land covers and subsurface structures and properties were developed using the Watershed Workflow package (Coon and Shuai, 2021), which brings together a variety of data streams, delineates the catchment, and generates a variable-resolution mesh with refined resolution at the stream network. Resolutions ranged from typical triangle areas of 5000 m² near the stream network to 50 000 m² away from the stream network. This triangular surficial mesh was then elevated using a digital elevation model (DEM) from the USGS National Elevation Dataset (NED) 30 m resolution dataset.

On the surface, 14 land cover types were delineated from the National Land Cover Database (NLCD 2016) product for the CONUS. The leaf area index (LAI) seasonal variations for each land cover type were retrieved from MODIS (https://modis.gsfc.nasa.gov/data, last access: 31 March 2021). Some of the plant functional types and their properties such as rooting profile and photosynthetic parameters were adopted from parameters used in the CLM 4.5 technical notes (Oleson et al., 2013).

In the subsurface, the model was discretized into 19 terrain-following layers with a total thickness of ∼28 m. A total of six soil layers encompassed the top 2 m of the domain. The depth to bedrock (DTB) was determined from SoilGrids (Shangguan et al., 2017) that varies from 3 m at its shallowest to 26 m at its deepest. The geologic layers were sandwiched between the soil and bedrock layers. The vertical resolution of the mesh gradually increased from 5 cm at the surface to 2 m at the 2 m depth, and it remained constant at 2 m until the bottom of the model domain at a depth of 28 m. The total number of cells is 171 760.

Based on the National Resources Conservation Service (NRCS) Soil Survey Geographic (SSURGO) soils database, 22 soil types were identified and mapped within the soil layer. Due to the edge-matching issues in the SSURGO soil database (Gatzke et al., 2011), the 22 soil types were regrouped into 9 types to remove the discontinuity of a soil type across soil survey area boundaries. Using a global surface geology dataset from GLobal HYdrogeology MaPS (GLHYMPS) 2.0 (Huscroft et al., 2018), 11 geologic material types were identified and mapped within the geologic layer. The spatial distribution of the soil and geological layers was shown in Fig. 2. The permeability and porosity for each soil type were retrieved from the SSURGO database, and the van Genuchten parameters were determined using Rosseta v3, a pedotransfer function that relates sand, silt, and clay percentage to van Genuchten parameters (Zhang and Schaap, 2017). The permeability and porosity for each geology type were retrieved from the GLHYMPS database. Bedrock functions as a confining layer and is assumed to have a very small permeability of $1 \times 10^{- 17}$ m².

https://hess.copernicus.org/articles/26/2245/2022/hess-26-2245-2022-f02

Figure 2(a) Land cover, (b) soil map, and (c) geology map of Coal Creek Watershed that are generated from Watershed Workflow. (d) ATS-simulated surface ponded depth and soil saturation on 1 October 2018. The zoomed-in plot shows the 3D unstructured triangular mesh.

The model was first run for 1000 years with constant precipitation (∼850 mm yr⁻¹) as the cold spinup that resulted in steady-state model outputs at the final timestep, which was then used as the initial condition for a 10-year (1 October 2004–1 October 2014) transient simulation (i.e., warm spinup) driven by the Daymet forcing. Model state at the end of the 10-year run was used as the initial condition for a 4-year transient run (1 October 2015–1 October 2019) driven by various GMFs. The water year 2015 was treated as a second warm spinup and was discarded from the analysis to avoid any influence from previous spinup runs. The study period features a high snow year (∼709 mm in water year 2017) and a low snow year (∼296 mm in water year 2018), allowing us to demonstrate how different meteorological forcings impact watershed responses under various weather conditions. ATS runs were taken at a sub-hourly timestep determined by the model while outputting streamflow and watershed-averaged variables at an hourly timestep. Due to large file size, spatially distributed variables such as SWE and ET were output at daily timesteps. Each run took ∼17 h wall-clock time using 64 processors on the Cori clusters at the National Energy Research Scientific Computing Center (NERSC). The models were not calibrated because the focus of this study was to evaluate the effect of meteorological forcings on model simulation instead of estimating the optimal parameters used in ATS.

2.3 Gridded meteorological forcing

For this comparison, three widely used GMFs were considered: PRISM (Daly et al., 2008), NLDAS-2, (Xia et al., 2012)), and Daymet v4 (Thornton et al., 1997, 2021). NLDAS-2 and Daymet v4 are hereafter referred to as NLDAS and Daymet, respectively. The detailed comparison between each meteorological dataset can be found in Table 1.

(Thornton et al., 1997)(Daly et al., 2008)(Cosgrove et al., 2003)

Table 1Meteorological dataset comparison.

Abbreviations: T_min: minimum temperature, T_max: maximum temperature, T_mean: mean temperature, Prcp: precipitation, S_rad: shortwave radiation, L_rad: longwave radiation, VP: vapor pressure, SWE: snow water equivalent, Day_l: day length, T_dmean: mean dew point temperature, V_pdmin: minimum vapor pressure deficit, V_pdmax: maximum vapor pressure deficit, SH: specific humidity, WS: wind speed, pET: potential evaporation. ^a Puerto Rico has had Daymet since 1950. ^b This native resolution is not free.

Download Print Version | Download XLSX

The Daymet climate forcing is a gridded, daily product with a spatial resolution of 1 km, covering continental North America, Puerto Rico, and Hawaii. It assimilates data from weather stations (primarily GHCN-D stations) and accounts for elevation, prevailing winds, storm tracks, and proximity to large water bodies (Thornton et al., 1997). Here, the latest Daymet version 4 product is used because this product has gone through significant bias corrections in station observations and the gridded product shows a better match with weather stations compared to the earlier versions (Thornton et al., 2021).

The PRISM forcing is developed by the PRISM climate group at Oregon State University and is recognized as the official climate dataset for the US Department of Agriculture. It utilizes a wide range of monitoring networks including GHCN-D stations and local/state weather stations to generate daily, spatially continuous climate data for the CONUS. PRISM provides a native grid resolution of 30 arcsec (∼800 m) for a fee but also provides a coarsened 4 km resolution free of charge. We used the native 30 arcsec resolution and downscaled (upscaled) the dataset to obtain finer (coarser) spatial resolutions.

The NLDAS dataset is a gridded, hourly product with a spatial resolution of $1 / 8$ th^∘ (∼12 km at the study site) for the entire North American region. The non-precipitation forcing variables are primarily derived from the North American Regional Reanalysis (NARR) by spatially interpolating data from the 32 km-resolution NARR grid to the $1 / 8$ th^∘ NLDAS grid while temporally disaggregated from 3-hourly to hourly frequency (Cosgrove et al., 2003). The precipitation is a product of a temporal disaggregation of a gage-only Climate Prediction Center (CPC) analysis of daily precipitation into hourly frequency, performed directly on the NLDAS grid and including an orographic adjustment based on the widely applied PRISM climatology.

All three datasets provide temperature and precipitation as the primary forcing with a few secondary forcing variables. In addition to temperature and precipitation, ATS requires solar radiation (both incoming shortwave radiation (S_rad) and longwave radiation – L_rad), relative humidity, and wind speed as forcing inputs. Relative humidity can be estimated based on vapor pressure and mean temperature (Bolton, 1980). L_rad can be estimated from S_rad and relative humidity. Because PRISM does not provide S_rad and L_rad, we used solar radiation from Daymet instead. Wind speed was assumed to be constant (i.e., 4 m s⁻¹) for both Daymet and PRISM. Compared to PRISM and Daymet, NLDAS provides the most complete set of variables to drive ATS simulations.

Different meteorological forcings have different definitions for a calendar day, and they are often different from the local time used in the observation data (see Table A1 in the Appendix). Time zone adjustment and lag corrections have been applied to account for the time lag difference between meteorological forcing and local gages. For example, PRISM lags Daymet by 1 d, so PRISM has been shifted forward 1 d to be consistent with Daymet. Both model simulation and gage observation have been converted to the Coordinated Universal Time (UTC) time zone for hourly streamflow comparison. For consistency, all simulated streamflows are in hourly resolution and are compared to hourly USGS streamflows in Sect. 3.

To study the effect of spatial resolution of meteorological forcing, precipitation and temperature from 800 m PRISM and 1 km Daymet have been downscaled (upscaled) into finer (coarser) spatial resolutions. The downscaling of 800 m PRISM or 1 km Daymet into 400 m used a data-driven downscaling approach (Mital et al., 2022). Specifically, random forests (Breiman, 2001) were used to extract the relationships between precipitation (or average temperature) and topography. These relationships were developed at 800 m (for PRISM) and 1 km (for Daymet) resolutions and were used as-is to generate the 400 m downscaled estimates. The downscaled precipitation grids were additionally filtered to ensure a smooth field in low-gradient areas without affecting high-gradient areas (Daly et al., 2008). The topographic variables considered were elevation, slope, aspect, latitude, and longitude. These variables were extracted from the NED 10 m-resolution product and upscaled to 400 and 800 m (for PRISM) via bilinear interpolation. Upscaling of topographic variables was done in maximum increments of 2× (e.g., 10 m → 20 m → 40 m and so on).

For consistency, spatial upscaling of 800 m PRISM into 1600 and 4000 m was performed using a coarsened function from python package xarray (http://xarray.pydata.org, last access: 1 April 2021) by applying a moving average based on a 2×2 window size. The same approach was used for spatial upscaling of 1 km Daymet to 2 and 4 km. To study the effect of temporal resolution of meteorological forcing, the daily PRISM dataset was disaggregated into hourly resolution using the temporal pattern of NLDAS. The hourly PRISM dataset was then aggregated into 12-hourly temporal resolution by taking the mean (for temperature) or sum (for precipitation) for the aggregated period.

In ATS, meteorological forcing is distributed linearly across its temporal resolution, and each model surface cell gets its meteorological forcing through spatially bilinear interpolation. For example, both Daymet and PRISM apply their meteorological forcing at the daily timescale, whereas NLDAS applies its meteorological forcing at an hourly timescale.

2.4 Observation data

Instantaneous streamflow data (every 15 min) have been available from 1 April through 15 November every year since 2014 at a USGS gage (station number 09111250) located at the watershed outlet. The 15 min streamflow was aggregated to hourly streamflow which was used to compare against model simulations in the Results section. Past Airborne Snow Observatory (ASO) survey has four flights covering this watershed in 2018 and 2019 to survey the snow depth and SWE. Remote sensing products such as the Moderate Resolution Imaging Spectroradiometer (MODIS) 8 d composite ET have been available at a 500 m resolution since 2000. Groundwater measurements and field-observed soil moisture data are not available within the study site.

To compare the accuracy of each meteorological forcing against field observations, all three meteorological forcings at their native resolutions were compared against GHCN-D weather stations within the East Taylor Watershed. In total, there were seven stations with long-term precipitation records and four stations with long-term temperature records (see GHCN-D station locations in Fig. 1). Both precipitation and temperature time series were extracted at each GHCN-D gage location from the GMF.

2.5 Model evaluation metrics

Model-simulated outputs were compared against observation data including hourly streamflow from a USGS gage and spatially distributed SWE from the ASO survey. The modified Kling–Gupta efficiency (KGE) and its three components (r, γ, β) were used to evaluate the model performance (Kling et al., 2012) in addition to the standard Nash–Sutcliffe efficiency (NSE). The theoretical version of the modified KGE metric is

\begin{matrix} (5) & KGE = 1 - \sqrt{(r - 1)^{2} + (γ - 1)^{2} + (β - 1)^{2}}, \end{matrix}

with

\begin{matrix} (6) & r = \frac{cov (S, O)}{σ_{s} σ_{o}}, \end{matrix}

\begin{array}{l} (7) & γ = \frac{σ_{s} / μ_{s}}{σ_{o} / μ_{o}}, \\ (8) & β = \frac{μ_{s}}{μ_{o}}, \end{array}

where S and O represent simulated and observed values, respectively, r is the correlation coefficient, γ is the variability ratio, β is the bias ratio, cov(S,O) is the covariance between simulated and observed values, σ is the standard deviation, and μ is the mean.

Using the modified KGE avoids the effect of input bias on the variability indicator, which has an advantage over the original KGE (Gupta et al., 2009; Kling et al., 2012), and it also allows diagnostic interpretation of the performance score. KGE decomposes model performance into correlation (r), variability (γ), and bias (β) terms. For example, the correlation measures the temporal dynamics of streamflow (i.e., timing), while the variability and bias measure the flow duration curve (i.e., magnitude). The KGE ranges from −∞ (poorest model skill) to 1 (perfect) when all three terms reach unity. Similarly, the NSE ranges from −∞ (poorest model skill) to 1 (perfect).

A Taylor diagram is used to show how closely a set of patterns (e.g., meteorological forcing) matches observations (Taylor, 2001). In each Taylor diagram, performance metrics such as standard deviation and Pearson's correlation coefficient (r) are shown together. The azimuthal angle represents correlation, and the radial distance represents the standard deviation. Also shown is the centered root mean square error (RMSE) between simulation and observation. The relationship between these statistics is shown below:

\begin{matrix} (9) & E^{2} = σ_{s}^{2} + σ_{o}^{2} - 2 σ_{s} σ_{o} r, \end{matrix}

where E is the centered RMSE, which is also measured by the geometric distance between simulation and observation data points on the Taylor diagram (unit is the same as the standard deviation). In cases where more than one observation point are plotted on the same diagram, the centered RMSE is omitted. Note that the centered RMSE is a mean-removed RMSE, and thus any bias in the data is not shown.

The closer the distance between simulation and observation data point on a Taylor diagram, the smaller the centered RMSE (observation data point has centered RMSE=0), the more similarity they show in terms of standard deviation, and the higher the correlation coefficient (observation data point has r=1).

3 Results

3.1 Comparison between meteorological forcing and weather stations

A Taylor diagram was used to compare the similarity in precipitation and temperature patterns between meteorological forcing and GHCN-D stations (Fig. 3). Compared to temperature, precipitation showed stronger spatial heterogeneity among stations indicated by the larger difference in standard deviation and correlation. The close clustering of temperature data points indicated that the difference between different stations in temperature patterns was small. For precipitation, PRISM showed a strong correlation (r>0.9) with GHCN-D at three stations, whereas Daymet only showed a strong correlation at one location and all NLDAS sites showed a relatively weak correlation ( $0.5 < r < 0.85$ ). For temperature, all three meteorological forcings showed a very strong correlation (r>0.95) with GHCN-D, though Daymet was slightly better than PRISM and NLDAS. Previous studies also reported similar findings at different watersheds that Daymet and PRISM showed better agreement with ground-based observational data than NLDAS (Muche et al., 2020), and the temperature was more accurately represented than precipitation (Behnke et al., 2016).

https://hess.copernicus.org/articles/26/2245/2022/hess-26-2245-2022-f03

Figure 3Taylor diagram showing the correlation coefficients and standard deviations between meteorological forcing and GHCN-D gages (black) for (a) precipitation and (b) temperature. The azimuthal angle represents correlation, and the radial distance represents the standard deviation. Each marker symbol represents a different GHCN-D station location.

The effects of spatial and temporal resolution of gridded meteorological forcing on watershed hydrological responses

2.1 Study site

2.2 ATS model setup

2.3 Gridded meteorological forcing

2.4 Observation data

2.5 Model evaluation metrics

3.1 Comparison between meteorological forcing and weather stations

3.2 ATS simulations driven by different meteorological forcing products

3.3 Effects of meteorological forcing spatial resolution

3.4 Effects of meteorological forcing temporal resolution

4.1 The choice of gridded meteorological forcing for integrated watershed simulation

4.2 Spatial vs. temporal resolution: which one is more important?

4.3 Limitations, implications, and transferability of the current study

A1 Calendar day definition used in the meteorological datasets

A2 Triple collocation analysis (TCA) of meteorological forcing

A3 Comparison of model outputs from different spatial resolutions of Daymet

A4 Impact of spatial and temporal resolution of solar radiation

A5 Additional results from PRISM comparison under different spatial resolution