Articles | Volume 29, issue 23
https://doi.org/10.5194/hess-29-6935-2025
https://doi.org/10.5194/hess-29-6935-2025
Research article
 | 
02 Dec 2025
Research article |  | 02 Dec 2025

High-resolution snow water equivalent estimation: a data-driven method for localized downscaling of climate data

Fatemeh Zakeri, Gregoire Mariethoz, and Manuela Girotto
Abstract

Estimating high-resolution daily snow water equivalent (SWE) in mountainous regions is challenging due to geographical complexity and the irregular availability of high-resolution meteorological data. This study introduces a method for downscaling SWE based on low-resolution climate models. It is based on the dependence between climate estimators and SWE. Although SWE changes rapidly, patterns often repeat under similar meteorological conditions. We implement this principle to downscale SWE to a 500 m resolution using a k-nearest neighbor algorithm with a customized distance metric.

To evaluate the performance of our approach, we conducted tests for California's Sierra Nevada and Colorado's Upper Colorado River Basin in the western United States using different low-resolution climate models (ec-earth3-veg, mpi-esm1-2, and cnrm-esm2-1) at both 100 and 9 km scales. We performed a cross-validation analysis and compared our results with commonly used gridded SWE datasets, as well as with point-scale time series. The results demonstrate that our approach enables the generation of downscaled SWE, which closely matches reanalysis data in terms of statistical properties. The outputs demonstrate that, for each region, performance depends on the choice and accuracy of the climate model inputs, such as precipitation and temperature data. Overall, the cnrm-esm2-1 model demonstrates superior accuracy in Colorado, outperforming other models at both the 100 and 9 km resolutions. Conversely, the ec-earth3-veg model achieves the best performance in California with 9 km resolution climate data. Across models, a 9 km resolution typically provides slightly better accuracy compared to a 100 km resolution. This opens up possibilities for applications in regions with limited in situ or meteorological measurements. The approach also has the potential to recreate unmeasured historical SWE values.

Share
1 Introduction

The snowpack in high-elevation regions serves as a primary source of streamflow, particularly during the spring and summer seasons (Bales et al., 2006). Rapidly changing weather conditions (Ranzi et al., 2024) and extreme events, such as atmospheric rivers, can lead to significant snowmelt and generate extreme runoff, posing threats to both water supply and infrastructure (Henn et al., 2020). Understanding snow and its role in the hydrological cycle is not merely a scientific question. It also has economic significance as it directly relates to water resources, agriculture, and energy production (Sturm et al., 2017). Therefore, accurate and detailed information on snow water equivalent (SWE) at a high temporal and spatial resolution is crucial for effective water resource management and decision-making (Bales et al., 2006; Fiddes et al., 2019; Siirila-Woodburn et al., 2021b).

Although ground stations are valuable for collecting SWE data, their limited presence in certain regions affects their representativeness. Moreover, variations in topography, land cover, and environmental conditions in mountainous areas render point-scale data insufficient for capturing the overall spatial characteristics of a watershed (Bales et al., 2006; Alonso-González et al., 2023). To address this lack of data, physically based snow models utilize an energy balance approach to estimate snowmelt. These models range in complexity, with more advanced models integrating detailed processes such as wind-induced snow transport, interactions with topography, and vegetation impacts. While complex models, such as those incorporating advection–diffusion equations or three-dimensional wind fields, more accurately represent snow properties, they often require extensive input data, which may not always be available (Liston and Sturm, 1998; Lehning et al., 2006; Vionnet et al., 2014). Simpler models, in contrast, may fail to capture critical aspects of snow dynamics (Bair et al., 2016; Clow et al., 2012). Moreover, to achieve high-resolution SWE (HR-SWE) estimates using these models, it is necessary to use meteorological and land-cover-related data that match the desired SWE output resolution. However, obtaining high-resolution data in mountainous regions remains challenging (Wundram and Löffler, 2008). Although generating HR-SWE using physical models can be time-consuming, recent studies have focused on reducing this computational burden. For instance, advances in SWE modeling have been achieved by implementing parallelized versions of snow models. This approach maintains the integrity of physical processes while utilizing parallelization to manage the computational demands of fine-resolution datasets over large domains (Mower et al., 2024). However, it is important to clarify that computational expenses become significant when operating at high spatial and temporal resolutions, particularly in an ensemble context, which is often required for robust meteorological predictions. Additionally, regardless of computational aspects, generating high-resolution snow data using physical models typically necessitates high-resolution meteorological inputs, such as temperature and precipitation fields, to ensure the quality of the downscaled outputs.

Given these challenges in obtaining high-resolution meteorological data, remote sensing has become increasingly important for monitoring and predicting snowpack conditions and their impacts on water resources (USBR, 2021). Satellites equipped with optical sensors, such as MODIS and Landsat, provide data on snow cover (Dietz et al., 2012; Painter et al., 2012; Largeron et al., 2020; Wu et al., 2021) and temperature (Lundquist et al., 2018), while lidar and microwave sensors (Tsang et al., 2022; Saberi et al., 2020) offer insights into snow depth or SWE (Lievens et al., 2019; Shi and Dozier, 2000; Pflug et al., 2024; Ma et al., 2023) and wetness (Shi and Dozier, 1995; Snapir et al., 2019). However, limitations such as revisit time and cloud cover restrict the availability of daily data.

To overcome the limitations of models and observations, data assimilation has emerged as a promising approach, because it capitalizes on the strengths of both observations and models, minimizing their respective uncertainties. It has proven useful in improving the accuracy of snow state estimates, snow physics, model parameters, and identifying sources of uncertainty. Data assimilation is particularly effective in harmonizing the different temporal and spatial resolutions of in situ and remotely sensed observations and bridging the scale gap between observations and models. In general, the assimilation of satellite and airborne observations leads to enhanced estimates of seasonal snow and related variables (Fang et al., 2022a; Margulis et al., 2016). However, a major shortcoming of snow data assimilation is its inability to provide observationally constrained daily HR-SWE for periods without satellite data. Key areas of ongoing research include understanding the impact of underlying spatial error correlations in data assimilation to improve the spatial estimates of SWE and the potential integration of multiple observations to boost snow modeling accuracy. Despite these challenges, the snow science community continues to improve the accuracy of seasonal snow estimations (Girotto et al., 2020).

Acknowledging the complexities involved in modeling snow processes and assimilating diverse datasets, we propose a localized climate data downscaling method to estimate HR-SWE. Statistical downscaling methods have demonstrated their capability to act as a bridge between large-scale climate forecasts and local-scale climate impacts (Abatzoglou and Brown, 2012; Tabari et al., 2021; Rettie et al., 2023). One of the statistical downscaling methods is bias-correction spatial disaggregation (BCSD) (Wood et al., 2004), which effectively reduces uncertainties in climate model outputs by adjusting biases using high-resolution observational data. These methods are particularly useful in non-mountainous regions, where data availability is typically higher. Their primary strength lies in their ability to correct model outputs while capturing local variability. However, their reliance on high-quality in situ data significantly restricts their applicability in remote or data-scarce areas. In contrast, our method overcomes these limitations by utilizing low-resolution climate data without requiring ground-based observations, making it well-suited for a wider range of conditions, including regions with limited data availability.

Some other widely used statistical downscaling methods in climatology, known as analog methods, are based on pattern recognition (Zorita and Von Storch, 1999). These methods are used to identify patterns in historical data that closely match the patterns simulated by atmosphere–ocean general circulation models. The observed surface climate conditions corresponding to these historical matches are then used as downscaled predictions. Analog methods have seen extensive application. For example, Abatzoglou and Brown (2012) demonstrate the effectiveness of the multivariate adapted constructed analog for wildfire assessments and show that it outperforms traditional spatial downscaling methods. Similarly, Pons et al. (2010) utilized analog-based downscaling to analyze snow trends in Northern Spain, successfully replicating observed variability and trends for seasonal and climate change projections. Caillouet et al. (2016) similarly demonstrated the utility of probabilistic downscaling in reconstructing high-resolution precipitation and temperature fields over France, effectively addressing seasonal biases. The AtmoSwing software by Horton (2019) demonstrates the flexibility of analog methods for operational forecasting and climate impact studies.

An analog-type method, named nearest neighbor resampling, also used by Lall and Sharma (1996), relies on identifying patterns in historical-point-based time series data and resampling them using a nearest neighbor approach to preserve the serial dependence structure of the data. The k-nearest neighbors (k-NN) algorithm is a simple, non-parametric machine-learning technique commonly used for classification and regression (Cover and Hart, 1967). It works by identifying the “k” most similar data points (neighbors) to a target data point based on a chosen distance metric, such as the Manhattan distance. The k-NN downscaling by Gangopadhyay et al. (2005) extends the analog method by weighting several similar historical analogs to create predictive ensembles, adding further flexibility to this approach. Building on this concept, Rajagopalan and Lall (1999) extended the methodology to multivariate weather simulations, incorporating variables such as precipitation, temperature, and wind speed to simulate daily weather sequences. Later, Yates et al. (2003) used an adapted version of these methods to generate daily weather sequences and alternative climate scenarios. In the weather generator models based on k-NN, in general, the day directly succeeding the identified analog day is selected as the next day in the generated sequence, and this process continues iteratively (Gangopadhyay et al., 2005). Similarly, recent advancements, such as the study by Yiou and Déandréis (2019), have extended analog methods to ensemble-based probabilistic forecasts, which were shown to demonstrate skill in predicting variables like the NAO index and temperature measurements at European stations. These innovations highlight the adaptability and growing utility of analog-based and k-NN approaches in climate and hydrological modeling.

Recognizing the potential for snow spatiotemporal patterns to repeat on days with similar climatological characteristics (Zakeri and Mariethoz, 2024; Pflug and Lundquist, 2020), this study aims to provide daily high-spatial-resolution SWE data based on climate predictors. Our proposed approach is based on establishing a statistical relationship between daily global low-resolution climate data, such as temperature, precipitation, low-resolution SWE (LR-SWE), and local reanalysis HR-SWE images as a training dataset. Then, based on this learned relationship, embedded in a k-NN algorithm, a unique daily HR-SWE dataset is obtained for the historical period (1950 to present) based on low-resolution climate data.

This method offers several key advancements over existing k-NN downscaling techniques. First, the adaptation made to the k-NN downscaling method, specifically by introducing far and near temporal intervals of climate data, is highly flexible to dynamic variables undergoing significant changes due to climate variability. Second, unlike most analog methods that restrict analog candidates to a specific temporal window near the query date, this approach does not impose such limitations. This flexibility is crucial for three reasons: (1) the inclusion of far and near temporal intervals makes such restrictions unnecessary, as the most suitable candidates are selected based on their match within the temporal window; (2) it plays a potentially important role in preserving extreme events, as restricting candidates to a narrow date range risks losing matches that represent rare but important extreme events; and (3) it can enable downscaling for periods where exact analogs may not exist in the historical record within a specific date range. However, suitable analogs may still be found in historical observations but on different dates. For example, with climate change, a specific snow day in winter may no longer match the query day, but an analog might be found in another season, such as fall or spring, due to warmer climatic conditions.

Moreover, the method can reconstruct HR-SWE data for historical periods where only low-resolution climate data are available, providing valuable insights into past snow conditions. Additionally, the method excels at capturing fine-scale SWE patterns in complex terrains, such as mountainous regions, significantly improving upon traditional statistical downscaling models that often struggle in such environments. Finally, our method does not require high-resolution climate input data. This substantially reduces computational demands compared to physical snow models.

The paper is structured as follows: Sect. 2 details the methodology; Sect. 3 describes the study area, database, and parameters; Sects. 4 and 5 present the evaluation approaches and the results; and Sects. 6 and 7 conclude with discussions and conclusions.

2 Methodology

2.1 Overview of the algorithm

The algorithm involves two primary datasets: training and target. The training dataset consists of HR-SWE images, low-resolution (LR) climate data, and LR-SWE. In contrast, the target dataset includes LR-SWE images alongside LR climate data, for which we aim to estimate HR-SWE images.

The fundamental strategy of the proposed method involves ranking the training data to estimate an HR-SWE image for a given target date. This ranking is based on the Manhattan distance between each date in the training dataset and the selected target date, as detailed in Sect. 2.3. The Manhattan distance is chosen, as it is more robust against outliers and often computationally less expensive than the Euclidean distance. Then, the algorithm estimates the downscaled SWE for the target date based on the k-nearest SWE candidates. As detailed in the recent work by Zakeri and Mariethoz (2024), this approach was initially designed to create synthetic satellite snow cover images for dates with no satellite data, based on the relationship between meteorological estimators from the ERA5-Land reanalysis dataset (Muñoz Sabater, 2019) and available clear sky Landsat/Sentinel-2 snow cover images. Here, we adapted this approach to downscale SWE. The innovation lies in the downscaling of the LR daily SWE data at the spatial scale of climate model simulations to produce much higher-resolution daily SWE estimates. This output is particularly important in areas with complex terrain, such as the western United States, where global models face challenges in accurately representing the regional climate. Indeed, on the one hand, the resolution of global climate simulations is often insufficient for capturing local influences (e.g., topography or vegetation) on SWE patterns. On the other hand, physical models are computationally expensive. Therefore, developing a tool to estimate HR-SWE that closely aligns with established regional HR-SWE reanalysis is essential for scientific and management purposes, offering a faster and less computationally demanding solution. Further details on the proposed methodology are provided in the subsequent sections.

2.2 Input data and preprocessing

2.2.1 Definition of temporal intervals for climate variables

SWE, representing the amount of water stored in the snowpack, is influenced by various factors, which can be classified into two main categories: climatic variables (including temperature, precipitation, and surface downwelling shortwave radiation) and environmental variables (including land cover, topography, and the presence of topography shadows). While environmental variables can impact SWE distribution and accumulation, their effects are assumed to be consistent and not subject to significant temporal variations within the specified regions. It is worth noting that the effects of environmental variables such as terrain shading may vary, driven by the Sun's position throughout the year; these can be captured in the baseline SWE models used to generate training data. In contrast, climatic variables exhibit significant temporal and spatial variability, making them primary drivers of SWE dynamics. Consequently, the downscaled SWE estimation focuses primarily on climatic variables. Their dominant influence ensures that the proposed model can capture the spatiotemporal variability essential for SWE estimation. The effects of environmental variables, while excluded from the direct downscaling process, can be accounted for in the baseline SWE models that provide the training data.

SWE is also affected by preceding meteorological conditions, such as temperature and precipitation patterns on previous days. For instance, the amount of SWE today may vary depending on the conditions experienced in the preceding days. To represent this, we introduce two distinct climate temporal intervals: a near interval and a far interval (as shown in Fig. 1). These intervals consider climate variables including minimum temperature, maximum temperature, precipitation, and surface downwelling shortwave radiation. Considering both far- and near-term variations allows accounting for complex relationships between climate dynamics and snow accumulation and melting. By incorporating these intervals, we can capture the influence of climate variables over different timescales and enhance SWE estimation accuracy. The specific lengths of these near and far climate intervals are determined through an optimization procedure outlined in Sect. 2.4.1.

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f01

Figure 1Illustration of the definition of near (shown in grey) and far (shown in white) daily intervals for climate variables, highlighting the far temporal interval (FI) and near temporal interval (NI) of climate data at a specific time (Ct).

Download

2.2.2 Input climate variables

The downscaling process relies on low-resolution climate information and LR-SWE data obtained from global or regional climate models that provide the predictors for our k-NN framework. As a result, the downscaling procedure can be described by Eq. (1):

(1) SWE ^ HR ( t ) f ( T min , LR FI , T min , LR NI , T max , LR FI , T max , LR NI , P LR FI , P LR NI , RSDS LR FI , RSDS LR NI , SWE LR ( t ) )

Here, SWE^HR(t), and SWELR(t) are the downscaled SWE and the LR-SWE, both at the query time (t). Tmin and Tmax represent the minimum and maximum temperatures, while P and RSDS denote precipitation and surface downwelling shortwave radiation. The superscript “FI” or “NI” indicates the far or near temporal intervals introduced in Fig. 1. The subscript notation (“LR”, “HR”) explicitly indicates the resolution (low resolution or high resolution). All the climate variables (Tmin, Tmax, P, RSDS, and SWE) are daily measurements. To ensure compatibility among datasets with varying units, all input data are rescaled to a range of 0–1 based on the absolute minimum and maximum values observed in the training dates. Hereafter, SWE^ refers to downscaled SWE. Details on the experiments conducted to identify the most effective climate predictors within these datasets are available in the Supplement (Table S1 and Fig. S1).

2.3 Downscaled SWE estimation

Estimating SWE^ for a date when HR-SWE data are not available relies on the existing HR-SWE data from the training dates. For each training date, the input variables described in Eq. (1) and their corresponding HR-SWE images are used. By utilizing a vector of inputs, we estimate SWE^ by selecting the k-nearest candidates in the input space.

In this study, the k-NN algorithm is applied to downscale low-resolution climate data to HR-SWE estimates by selecting training days with similar climate conditions. The flowchart (Fig. 2) illustrates the proposed downscaling method for estimating HR-SWE, and the steps are as follows:

  1. Gather the input variables, including the far and near intervals of temperature, precipitation, shortwave radiation, and LR-SWE for both the target date and the training dates.

  2. Calculate the distance between the input vector of the target date and the input vectors of the training dates.

  3. Select the k-nearest training dates based on their proximity to the target date in the input space.

  4. Retrieve the corresponding HR-SWE images associated with the selected k-nearest training dates.

  5. Aggregate the retrieved HR-SWE images to estimate SWE^ for the target date.

The metric used to identify the nearest neighbors between a given target date (t1) and a training date (t2) is the multivariate Manhattan distance, described in Eq. (2):

(2) d t 1 , t 2 = i = 1 E α i x i ( t 2 ) - x i ( t 1 )

The weights αi assigned to the input variables sum to 1 and are used to balance their influence in the selection of the k-nearest observations. The determination of optimal weights is further discussed in Sect. 2.4.2. Here, xi represents each of the E estimators introduced in Eq. (1). Unlike Euclidean distance, which calculates the shortest straight-line distance, Manhattan distance computes the sum of the absolute differences between variables. This makes it more robust against outliers and better suited to handling high-dimensional datasets, such as those containing multiple climate variables. In our method, the distance is used to rank training dates according to their similarity to the target date.

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f02

Figure 2Flowchart depicting the proposed downscaling algorithm for estimating high-resolution snow water equivalent (HR-SWE) on a target date using low-resolution (LR) climate data: (1) gather input variables: collect the input variables, including near (NI) and far (FI) temporal intervals of climate data (e.g., temperature, precipitation) for both the target date and the training dataset. (2) Calculate distances: compute the distance (d) between the input vector of the target date and the input vectors of each candidate in the training dataset using a defined distance metric (Manhattan distance). (3) Identify k-nearest neighbors (k-NNs): rank the training data in order of increasing distances to identify the k-nearest neighbors. (4) Aggregate HR-SWE images: retrieve the HR-SWE images corresponding to the k-NNs and aggregate them to estimate the HR-SWE for the target date.

Download

2.4 Estimating parameters and weights

To optimize parameter estimation and ensure better convergence, the parameters are estimated either using sensitivity analysis or an optimization algorithm. In this paper, the far and near intervals (FI and NI) for climate variables (Fig. 1), as well as the number of k-nearest observations, are determined using a sensitivity analysis. The weights assigned to the input features are established using an optimization algorithm.

2.4.1 Parameter sensitivity analysis

Dissimilarity, denoted as ε, is critical for determining the optimal near (NI) and far (FI) temporal intervals, as well as the optimal number of nearest observations (K), for estimating SWE^. This dissimilarity is quantified using the root mean squared error (RMSE), a standard measure of accuracy that reflects the average magnitude of the square of errors between the estimated and observed values. The RMSE is defined as follows:

(3) RMSE = i = 1 N SWE ^ i - SWE i 2 N

where SWE^i is the downscaled SWE at the ith pixel, SWEi is the reference HR-SWE at the ith pixel, and N is the total number of pixels. This formula underpins the evaluation of ε across different configurations of temporal intervals and k-nearest observation counts.

To identify the optimal temporal intervals (NI and FI), we conduct a sensitivity analysis within predefined ranges, set by NI1,7 d and FI1,90 d, aiming to minimize ε. Similarly, the optimal number of nearest observations (K) is determined by evaluating ε as a function of K, ranging from 1 to the maximum number of available training dates (K1,Kmax) to identify the configuration that yields the smallest dissimilarity between SWE^ and the reference HR-SWE. The maximum number of available training dates (Kmax) can vary for each case study from a few days to several months.

Mathematically, the optimization processes are represented as:

(4) NI = argmin NI ε NI FI = argmin FI ε FI K = argmin K ε K

where εFI, εNI, and εK are the sum of RMSEs for the respective configurations, reflecting the dissimilarity between SWE^ (using different sizes of NI, FI, and K) and the reference HR-SWE. The only distinction between RMSEFI, RMSENI, and RMSEK is based on the varying elements (FI, NI, or K), illustrating how each influences the SWE^ estimate. Determining these optimal parameters enables the refinement of our SWE estimation model, enhancing its accuracy by minimizing ε.

The parameters are selected through a sensitivity analysis to minimize ε, but they do not necessarily correspond to a global optimum. This means that while they perform well in the studied regions, they may not generalize well to other locations or climatic conditions. Therefore, we recommend performing a sensitivity analysis for each region. Additionally, the sensitivity analysis assumes that the influence of these intervals is consistent across different temporal scales, which may not always be valid, particularly in regions with highly variable climate patterns. Despite these assumptions, the chosen parameters strike a balance between computational efficiency and accuracy for the downscaling task. Moreover, the subsequent weight optimization can further reduce the impact of non-global optimal parameter selection.

2.4.2 Weights optimization

The weights αi defined in Eq. (2) are determined by minimizing the RMSE between the SWE^ and the reference HR-SWE using Bayesian optimization. In this model-based optimization, the global maximum or minimum of an unknown objective function is found sequentially (Shahriari et al., 2015; Snoek et al., 2012). The key aspect of this approach is the use of a probabilistic model of the response function, which is evaluated at a minimal cost through the acquisition function. The Bayesian optimization framework employs a Gaussian process prior over the objective function and iteratively refines the model through Bayesian posterior updating to determine the optimal solution.

3 Study areas, datasets, and parameters

3.1 Study areas

To evaluate the effectiveness of the proposed methodology, we selected two regions in the western United States, as indicated in Fig. 3: the California Sierra Nevada region (referred to as California) and the Upper Colorado River Basin (referred to as Colorado). These regions are heavily dependent on snowmelt as a vital source of water resources (Dawadi and Ahmad, 2012; Siirila-Woodburn et al., 2021a).

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f03

Figure 3(a) Map of elevation (in meters) across the western United States, highlighting two regions of interest (red squares). (b) Map of the elevation and locations of in situ sites in Sierra Nevada, California. (c) Map of the elevation and locations of in situ sites in the Upper Colorado River Basin, Colorado.

In the California region, elevation ranges from a minimum of 1200 m to a maximum of 3700 m, with an average elevation of 2200 m. In the Colorado region, elevation ranges from a minimum of 1900 m to a maximum of 4300 m, with an average elevation of 3000 m. By focusing on these specific regions, we can assess the performance and applicability of the proposed methodology in other areas where snow water resources are an important component of the hydrology system.

The California Sierra Nevada snowpack plays a critical role in water resource management, contributing approximately 30 % of the state's water supply through snowmelt. However, its sensitivity to warming temperatures is evident, with the 2015 SWE on 1 April dropping to just 5 % of historical averages. This dramatic decline underscores the combined effects of reduced precipitation and higher temperatures, which exacerbate drought severity and shift the timing of water availability (Belmecheri et al., 2016).

In the Upper Colorado River Basin, snowmelt accounts for 70 %–90 % of annual streamflow, making snowpack dynamics essential to hydrological processes and water management. Heldmyer et al. (2023) identify three snow drought types, “warm”, “dry”, and “warm-and-dry”, which affect SWE and streamflow timing. Warm droughts tend to reduce SWE at lower elevations, while dry conditions cause uniform SWE reductions across elevations. These droughts advance peak streamflow timing by 7–13 d, emphasizing the region's sensitivity to climatic changes in temperature and precipitation (Heldmyer et al., 2023).

3.2 Datasets

The reference HR-SWE data, which this study aims to produce, comes from the reanalyzed SWE dataset for the western United States (Fang et al., 2022a). This dataset, captured at a 16 arcsec ( 500 m) resolution, covers the period from the water year (i.e., 1 October to 30 September) 1984 to 2021 and is updated daily. It combines high-resolution remotely-sensed data with a Bayesian data assimilation (Margulis et al., 2019, 2015, 2016) framework. The dataset is derived from Landsat-based fractional snow-covered area observations, is updated daily, and incorporates a land surface model to estimate SWE and snow depth. This approach enables spatially and temporally continuous SWE estimates, which are verified against in situ and lidar-derived SWE measurements for accuracy. It offers ensemble median values of SWE, calculated from a discrete probability distribution of posterior weights, thus providing an extensive and detailed view of the snow water content over the years. The dataset can be accessed at https://doi.org/10.5067/PP7T2GBI52I2 (Fang et al., 2022b). Hereafter, we refer to this dataset as the UCLA SWE. It is used as the reference dataset for training and validating our model outputs.

To obtain the necessary climatic estimators, including daily minimum and maximum air temperature, daily total precipitation, and shortwave radiation, we utilize two distinct sources: the Coupled Model Intercomparison Project (CMIP) version 6 simulations (Meehl et al., 2014; Eyring et al., 2016) and the downscaled CMIP6 over the western United States using the Weather Research and Forecasting (WRF) model (Skamarock et al., 2019) datasets (referred to as WRF-CMIP6) (Rahimi et al., 2024, 2022). The CMIP6 simulations provide daily climate data at a spatial resolution of 100 km, while the WRF-CMIP6 datasets offer high-resolution daily climate simulations at different resolutions for the western United States. In this study, we use WRF-CMIP6 dataset at a 9 km resolution. Importantly, WRF-CMIP6 and CMIP6 are used solely as sources for input climatological variables, which are used as predictors in our statistical downscaling model.

Rahimi et al. (2024) produced high-resolution regional climate projections by dynamically downscaling outputs from different CMIP6 global climate models using the WRF model. The WRF configuration was previously optimized and evaluated using ERA5 reanalysis data in Rahimi et al. (2022), which was used as a benchmark for assessing model performance under historical conditions.

While CMIP6 global models provide valuable information on large-scale climate variability and long-term trends, their coarse spatial resolution limits their ability to resolve regional weather patterns and localized climate processes. The WRF-CMIP6 dataset addresses this by dynamically downscaling CMIP6 model outputs to a 9 km grid, enhancing the simulation of regional climate variability across the western United States. This high-resolution framework offers improved regional climate projections. Utilizing both CMIP6 and WRF-CMIP6 at different resolutions enables us to investigate how using a regional configuration of a climate model that dynamically downscales CMIP6 data, and a global climate model with different spatial resolutions, affects in the accuracy of our approach.

To illustrate the effect of resolution on SWE image accuracy, Fig. 4 depicts the UCLA SWE, WRF-CMIP6, and CMIP6 images for a day in the California region.

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f04

Figure 4Comparative illustration of SWE images highlighting the effect of spatial resolution on accuracy. Displayed from left to right are the UCLA SWE, WRF-CMIP6, and CMIP6 images captured over the California region (Fig. 3b, lon: 120 to 119, lat: 38 to 39) on 8 January 1995. Distinct variations between these images underscore the influence of scaling on image accuracy.

Download

Among the available models with resolutions of both 9 and 100 km, we have selected three models for testing our methodology (Table 1). These models have been utilized in other studies (Thrasher et al., 2022; Kouki et al., 2022). Although our methodology is not limited to these models, they were selected merely as examples. Hereafter, we will refer to them as cnrm-esm2-1, ec-earth3-veg, and mpi-esm1-2.

Table 1The climate models that are used in this study.

Download Print Version | Download XLSX

To ensure that the test period is not influenced by the training period, the periods 1985–2004 and 2011–2014 are designated for training dates, while the years 2005–2010 are allocated for testing. Additionally, to confirm that our testing dates are not grouped into nearly identical wet or dry categories, we employed the CONUS Drought Indices dataset (Abatzoglou, 2013). This includes drought indices derived from the 4 km daily Gridded Surface Meteorological (GRIDMET) dataset. The Standardized Precipitation Evapotranspiration Index (SPEI) is the specific drought index used in our analysis. We calculated the average aggregated SPEI for each year across each region to categorize the testing and training years in Supplement Table S2. The testing years are different in terms of the drought index.

3.3 Parameters for both case studies

Through a sensitivity analysis, we determine the NI and FI using Eq. (4). This analysis is carried out for both datasets: CMIP6 (100 km) and WRF-CMIP6 (9 km). The results of this analysis are presented in the Supplement Figs. S2 to S5, indicating that a FI of 60 d and a NI of 4 d are sufficient for accurate SWE estimation in all examined scenarios. Even though these values might not represent a global optimum, potential suboptimal effects can be considered negligible. Furthermore, these effects can also be reduced in subsequent optimization stages when determining weights.

Next, we conduct a sensitivity analysis for the parameter K, the number of nearest observations, using the methodology described in Sect. 2.4.1 and Eq. (4). Figures S6 and S7 illustrate that, on average, a value of K=130 is appropriate for both the California and Colorado regions. Considering the relatively small variations in accuracy, we decided not to optimize K for each dataset. Note that the value of 130 nearest observations is high. This suggests the presence of frequently repeating situations or patterns within the data. This is particularly advantageous for statistical analysis, as it provides a large number of data points, thereby improving the robustness and reliability of the results.

The optimized weights α obtained from the Bayesian optimization algorithm after 500 iterations are summarized in Table 2. Both regions exhibit similar weight patterns, with relatively high weights assigned to variables Tmin,LRNI, PLRNI, SWELR, and PLRFI.

Table 2Weight optimization (α, in Eq. 2) results using Bayesian optimization. The numbers greater than 0.1 are highlighted in bold. This value, 0.1, is chosen because it almost represents the scenario where weights are distributed equally. “cnrm” refers to cnrm-esm2-1, “earth3” to ec-earth3-veg, and “mpi” to mpi-esm1-2.

Download Print Version | Download XLSX

From a physical perspective, this weighting aligns with the processes governing SWE distribution and dynamics. Minimum temperature (Tmin,LRNI) significantly influences freezing and melting thresholds, which are critical for snowpack accumulation or reduction. Precipitation variables, both near and far intervals (PLRNI, PLRFI), directly contribute to SWE through their impact on snowfall volume. Meanwhile, the inclusion of SWELR as a relatively highly weighted variable can highlight its importance as an indicator.

4 Evaluation approach

Our method is evaluated through various accuracy metrics and independent datasets. First, we generate two quality metrics in conjunction with the SWE^ images. These metrics provide a quick assessment of the quality of the SWE^ products. For each target date, two quality metrics are provided: the average and the standard deviation of the similarity metric across the k best-selected SWE candidates. The lower the average and standard deviation of the similarity metric, the more accurate the SWE^ estimation.

4.1 Time series visualization

To further demonstrate the efficacy of our method, we showcase some SWE time series spanning 6 years at in situ locations from the Snow Telemetry (SNOTEL) network (https://www.wcc.nrcs.usda.gov/snow/, last access: January 2023). First, the pixels nearest the in situ locations are identified. Then, the SWE values at these locations in both the UCLA SWE data and other well-established SWE datasets (explained in Sect. 4.3) in the western United States are compared with the SWE^ values. We also present scatter plots colored by density (density scatter plots), illustrating the correlation between the reference (UCLA SWE) and SWE^ values at in situ locations. As in Fang et al. (2022a)'s study, SWE values below 10 mm are excluded from the comparison. This screening is done because very small SWE values (less than 10 mm) can be the result of measurement errors, minimal snow presence, or other factors that may not yield meaningful insights.

While the SNOTEL network provided the spatial locations for these comparisons, we did not use direct in situ SNOTEL SWE measurements in this analysis. The UCLA SWE dataset has been previously validated against in situ observations, including SNOTEL, and has been widely recognized as a benchmark for SWE validation in subsequent studies. For example, Ma et al. (2023) used the UCLA SWE dataset as a reference to assess the performance of machine learning approaches for estimating spatiotemporally continuous SWE. Similarly, Fang et al. (2023) used the UCLA SWE dataset as a baseline for evaluating snow water storage uncertainty. These applications highlight the dataset's reliability and its critical role in advancing snow hydrology research.

4.2 Cross-validation

We also validate our results using cross-validation. This involves removing 6 years of the UCLA SWE from the training dataset. We estimate SWE^ based on the remaining training data and compare the results with the reference UCLA SWE image using correlation, mean difference, and RMSE as evaluation criteria. These assessment metrics are summarized in Table S3.

In our quantitative comparison, we calculate the accuracy metrics at different elevation bands (elevation < 2000, 2000 < elevation < 3000 m, and elevation > 3000 m) and across various land cover types, including forest and non-forest areas. We utilize the MODIS Land Cover Type (MCD12Q1) Version 6.1 data for land cover information (https://lpdaac.usgs.gov, last access: January 2023, Friedl and Sulla-Menashe, 2022). This data, as of 2018, is maintained by the NASA EOSDIS Land Processes Distributed Active Archive Center (LP DAAC) at the USGS Earth Resources Observation and Science (EROS) Center in Sioux Falls, South Dakota. Elevations are derived from NASA SRTM Digital Elevation 30 m data (Farr et al., 2007).

4.3 Comparison with independent SWE datasets

To enhance the reliability of SWE^, additional verification is performed through comparisons with the following three independent SWE datasets. These comparisons are essential for validating the accuracy and consistency of our SWE^ images, using the evaluation criteria detailed in Table S3.

4.3.1 SNODAS 1 km SWE product

The first independent dataset includes the 1 km SWE data from the Snow Data Assimilation System (SNODAS), developed by the NOAA National Weather Service's National Operational Hydrologic Remote Sensing Center (NOHRSC) (Center, 2004). SNODAS is a comprehensive modeling and data assimilation system designed to provide estimates of snow cover that are crucial for hydrologic modeling and analysis. The dataset covers the period from 28 September 2003 to the present.

We first resample the SWE^ images by aggregating the values of all the pixels that intersect each 1 km SNODAS pixel. This ensures that both datasets have the same spatial resolution. Subsequently, we compare the SNODAS SWE products with the SWE^ images using the evaluation criteria introduced in Table S3. SWE pixel values less than 10 mm are not included in the comparison.

4.3.2 Daymet 1 km SWE product

The second independent dataset includes the 1 km SWE data from the Daymet V4 dataset: Daily Surface Weather and Climatological Summaries (Thornton et al., 2022). Daymet provides gridded estimates of daily weather parameters for North America, including minimum and maximum temperatures, precipitation, and SWE, with data spanning from 1 January 1980. We follow a similar procedure as in the comparison with the SNODAS SWE products.

4.3.3 University of Arizona (UA) 4 km SWE product

The third independent dataset is the 4 km SWE data provided by the University of Arizona (UA) (Broxton et al., 2019; Zeng et al., 2018). It assimilates in situ snow measurements from the SNOTEL and Cooperative Observer Network (COOP) networks (Fleming et al., 2023; National Research Council, 1998), incorporating detailed snowpack measurements. It also integrates modeled, gridded temperature and precipitation data, providing a comprehensive representation of SWE over the conterminous United States since 1981. For this comparison, we employ a similar procedure to that used in the comparisons with the previous two independent SWE products.

5 Results

5.1 Time series visualizations

To showcase the effectiveness of the proposed technique, SWE time series for a 6-year duration at the three highest-elevation in situ locations (see Table 3) in California are presented in Fig. 5 (and Supplement Fig. S8 for Colorado). These time series visually represent SWE^ values over time, enabling a comparison between the SWE^ values and the UCLA SWE, Daymet, SNODAS, and UA SWE data. The red line, representing our estimated values, generally tracks the blue line (UCLA SWE) well across the 6-year period for the three high-elevation locations. However, in some years and locations, such as at the “Leavitt Lake” and “Lobdell Lake” stations in 2005, the red line diverges slightly from the blue line, indicating variability in the accuracy of the estimates across different time periods. Nevertheless, even in 2005, our estimation is still within the range of other well-established SWE estimations. Moreover, in 2005, the other datasets do not align on the amount of SWE, indicating that this year is intrinsically challenging. This demonstrates that the proposed method can closely estimate the UCLA SWE on a point scale in most years.

Table 3Names and locations of in situ sites.

Download Print Version | Download XLSX

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f05

Figure 5The SWE time series spans a 6-year period at the three highest-elevation in situ locations (Table 3) in California. “UA” represents the University of Arizona SWE datasets. The downscaled SWE (SWE^) results were obtained from ec-earth3-veg climate data at a 9 km resolution.

Download

Additionally, density scatter plots are obtained using the “dscatter” function (Henson, 2024) and are illustrated in Fig. 6 for California (and Supplement Fig. S9 for Colorado), depicting the relationship between UCLA SWE and SWE^ values at in situ locations (see Table 3) obtained from three climate models as estimators: ec-earth3-veg, mpi-esm1-2, and cnrm-esm2-1 at both 9 and 100 km resolution.

Referencing Fig. 6, using ec-earth3-veg at a 9 km resolution as an estimator, the derived SWE^ in California shows a stronger correlation with the UCLA SWE at in situ locations, achieving an R value of approximately 0.87. This model also demonstrates improved accuracy, with a lower RMSE and MD (approximately 0.14 (m) for RMSE and about 0.02 (m) for MD). In contrast, as shown in Supplement Fig. S9, using cnrm-esm2-1 at a 100 km resolution as an estimator, the derived SWE^ in Colorado exhibits a stronger correlation with the UCLA SWE at in situ locations, marked by an R value close to 0.85, and the RMSE estimated at around 0.09 m.

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f06

Figure 6A density scatter plot in California from 2005 to 2010 compares in situ data locations (Table 3) from UCLA SWE images with the downscaled SWE (SWE^) values. The solid black line represents the 1:1 line. The displayed metrics include the correlation coefficient (R), mean difference (MD), and root mean squared error (RMSE). Only data with SWE values greater than 10 mm are included in the comparison.

Download

Figure 7 presents a comparison of the mean SWE^ and mean UCLA SWE over a 6-year period in California and Colorado. Among the models, cnrm-esm2-1 at 100 km resolution, in most months, outperforms ec-earth3-veg and mpi-esm1-2 in Colorado. In contrast, in California, ec-earth3-veg mostly exhibits superior performance at both the 9 and 100 km resolutions. However, all the models underestimate SWE in California, suggesting that testing other climate models for this region may be beneficial. Collectively, these findings emphasize the significance of selecting an appropriate model and its optimal resolution for SWE estimation.

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f07

Figure 7The average of the UCLA SWE (black dotted line, reference) and the downscaled SWEs (SWE^; blue and green lines) for each area over the 6-year period from 2005 to 2010 are shown. The three climate models used in the downscaling are represented by different colors: light blue for ec-earth3-veg, green for cnrm-esm2-1, and dark blue for mpi-esm1-2. Line styles indicate the spatial resolution of the climate data: solid lines correspond to results based on a 9 km resolution, and dashed lines represent results based on a 100 km resolution. The dotted black line represents the average UCLA SWE, which is the reference.

Download

Additionally, two videos (one for each study region) have been created to showcase a side-by-side comparison of UCLA SWE and SWE^ images: California (SWE^ obtained from ec-earth3-veg at 100 km), Colorado (SWE^ obtained from cnrm-esm2-1 at 9 km).

Several factors explain why certain models outperform others in specific locations. While a detailed comparison of model performances across various regions is beyond the scope of this study, other studies have explored this area. For example, Kouki et al. (2022) evaluated the ability of CMIP6 models to estimate SWE across the Northern Hemisphere and found that different models perform variably in specific regions based on their ability to simulate particular climatic and geographical conditions. In terms of the contribution of temperature and precipitation to SWE biases, precipitation plays a more dominant role, especially in winter. However, temperature becomes more significant during spring, when snowmelt occurs. In regions where temperatures are closer to 0 °C, biases in temperature can substantially affect snowmelt modeling. Overall, the results underscore that precipitation is the primary driver of SWE biases in winter. However, temperature plays a crucial role during the snowmelt season in spring, particularly in regions with more temperate climates, such as the southern parts of North America and Europe.

5.2 Cross-validation

Cross-validation accuracy assessments of SWE^ are outlined in Supplement Tables S4 and S5. Figures 8 and 9, and Figs. S10 and S11 in the Supplement further present these assessments, considering variations in elevation and land cover for California and Colorado. Overall, for most months, using climate data with a higher resolution (9 km) as an estimator yields only slightly higher or identical accuracies compared to climate data with a lower resolution (100 km). In a broad comparison across resolutions in Colorado and California, the cnrm-esm2-1 and ec-earth3-veg models generally exhibit the best performance. Nonetheless, it is crucial to recognize variability in model performance across different regions. For instance, in both California and Colorado, all models generally perform better or comparably at higher elevations (elevation > 3000 m) compared to medium elevations (2000 m < elevation < 3000 m). While RMSE values tend to be smaller at lower elevations due to lower SWE magnitudes, accuracy at higher elevations is more hydrologically significant, as these zones contribute the majority of seasonal snowmelt runoff.

In Colorado, all models exhibit comparable performance in both forested and non-forested areas. In contrast, in California, models tend to perform slightly better in non-forested areas. Moreover, Figs. 9 and S11 illustrate the average spatial error in March and May, respectively, for California and Colorado. In California, SWE^ underestimates reference SWE, with the extent of underestimation varying across different climate models. For instance, in California, cnrm-esm2-1 at 9 km achieves the lowest underestimation. In general, the cnrm-esm2-1 model at 9 km performs better as an estimator for Colorado, with a mean RMSE of 0.07 m and a standard deviation of 0.05 m. Conversely, for California, the ec-earth3-veg model at 9 km provides the highest accuracy, yielding a mean RMSE of 0.13 m and a standard deviation of 0.1 m.

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f08

Figure 8Accuracy assessment of downscaling cross-validation over the 6-year period from 2005 to 2010, conducted in California. Column titles indicate the results for the downscaled SWE (SWE^) using the respective climate models as estimators. “9 km” and “100 km” refer to the resolutions of the climate data. The evaluation metrics include the correlation coefficient (R), mean difference (MD), and root mean squared error (RMSE). These metrics are considered in different elevation values and different land covers. In the legend, 2000 m < elevation < 3000 m is shown as elevation < 3000 m.

Download

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f09

Figure 9Spatial assessment of the downscaling. The first three rows represent the mean difference between the UCLA SWE and downscaled SWE (SWE SWE^) over California throughout the 6-year testing period (2005–2010) for two months (March and May). The last row represents the averaged reference SWE (UCLA SWE) or averaged downscaled SWE (SWE^) throughout the 6-year testing period (2005–2010) for two different months (March and May). The evaluation metric includes mean difference (MD).

Download

5.3 Comparison with 1 km SWE SNODAS dataset

The comparison of the UCLA SWE and SWE^ with the 1 km resolution SNODAS dataset is presented in Supplement Table S6, Figs. 10 and 11, and Figs. S12 and S13.

In California, during the snowiest months, the ec-earth3-veg estimator excels in terms of RMSE, correlation, and mean difference at both resolutions. In Colorado, the mpi-esm1-2 estimator mostly demonstrates superior performance. Interestingly, in Colorado, the 100 km resolution slightly outperforms the 9 km resolution across some models, but in others, the 9 km resolution marginally surpasses the 100 km resolution. In other words, in our experiments, SWE downscaling performance using 100 km CMIP6 inputs was in some cases comparable to using 9 km WRF-CMIP6 climate predictors, though higher resolution inputs typically offered modest improvements. According to Figs. 11 and S12, generally, in California, the ec-earth3-veg model outperforms others at both 9 and 100 km resolutions in both March and May, although it tends to underestimate SWE. In contrast, in Colorado, the mpi-esm1-2 model at 9 km performs better in March and May, and it mostly overestimates SWE. In general, in Colorado, the mpi-esm1-2 model at 100 km showed the most favorable results, with a mean RMSE of 0.08 m and a standard deviation of 0.07 m. In California, the ec-earth3-veg model at 9 km achieved the best performance, with a mean RMSE of 0.23 m and a standard deviation of 0.20 m.

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f10

Figure 10The average mean difference of SWE over California for the 6-year testing period (2005–2010) compares UCLA SWE (black dotted line) and downscaled SWE (SWE^) data (blue and green lines) against SNODAS. The SWE^ data, derived from different climate models, are represented by distinct colors: light blue for ec-earth3-veg, green for cnrm-esm2-1, and dark blue for mpi-esm1-2. Solid lines indicate estimates at a 9 km resolution, while dashed lines correspond to a 100 km resolution, distinguishing the spatial resolutions of the climate data.

Download

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f11

Figure 11Comparison with SNODAS data. The first four rows present the average difference between SNODAS and downscaled SWE (SWE^) or UCLA SWE (SNODAS-SWE^ or SNODAS-UCLA) over California for March and May during the 6-year testing period (2005–2010). The last row compares the average UCLA SWE or SNODAS SWE over California for March and May during the same 6-year testing period (2005–2010). The evaluation metric includes mean difference (MD).

Download

5.4 Comparison with 1 km SWE Daymet dataset

This section presents a comparison of SWE^ from the three climate models with the 1 km SWE data from the Daymet dataset. Supplement Table S7, along with Figs. 12 and 13,and Figs. S14 and S15 present the comparison of UCLA SWE and SWE^ against the 1 km SWE Daymet dataset.

In California, the ec-earth3-veg model leads in performance at both the 9 and 100 km resolutions, boasting the lowest RMSE and highest correlation. For Colorado, the cnrm-esm2-1 estimator predominantly demonstrates superior performance. When comparing both the UCLA SWE and SWE^ to the 1 km SWE Daymet dataset, the results indicate a close alignment between the two datasets . Figures 13 and S13 demonstrate that in California and Colorado, during the months of March and May, the cnrm-esm2-1 estimator yields the best results. In both regions, this model tends to overestimate SWE in March and underestimate it in May. In general, the cnrm-esm2-1 estimator at 100 km is the most accurate for Colorado, presenting a mean RMSE of 0.11 m and a standard deviation of 0.04 m. For California, the mpi-esm1-2 model at 100 km performs best, with a mean RMSE of 0.19 m and a standard deviation of 0.06 m. However, ec-earth3-veg achieves the same mean RMSE, with a standard deviation of 0.07 m.

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f12

Figure 12Same as Fig. 10, but for comparison against the Daymet data.

Download

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f13

Figure 13Same as Fig. 11, but for comparison against the Daymet data.

Download

5.5 Comparison with 4 km SWE from the University of Arizona dataset

This section compares the SWE^ from three models against the 4 km SWE data from the University of Arizona (UA). Supplement Table S8, Figs. 14 and 15, and Figs. S16 and S17 present the comparison of UCLA SWE and SWE^ with the 4 km SWE data from the UA dataset.

A deeper analysis of model performance highlights differences. In Colorado, the cnrm-esm2-1 estimator stands out at both 9 and 100 km resolutions. Meanwhile, in California, the ec-earth3-veg model mostly surpasses other models, achieving the lowest mean RMSE at both resolutions.

Overall, the 9 km resolution slightly outperforms the 100 km resolution in both states. This suggests that a finer resolution might yield slightly more precise SWE^ estimates. Additionally, when comparing UCLA SWE and SWE^ to the 4 km UA SWE, the results demonstrate a tight congruence between the two datasets. Furthermore, Figs. 15 and S15 show that in California, for March and May, the ec-earth3-veg model at a 100 km resolution performs better. Meanwhile, in Colorado, the cnrm-esm2-1 estimator performs best in these two months. The ec-earth3-veg model tends to underestimate SWE in California, while in Colorado, cnrm-esm2-1 generally overestimates SWE in May. In general, the cnrm-esm2-1 estimator at the 9 km scale provides the best results for Colorado, having a mean RMSE of 0.07 m and a standard deviation of 0.05 m. In California, the ec-earth3-veg estimator at the 9 km scale is the most precise, with a mean RMSE of 0.14 m and a standard deviation of 0.13 m.

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f14

Figure 14Same as Fig. 10, but for comparison against University of Arizona (UA) data.

Download

https://hess.copernicus.org/articles/29/6935/2025/hess-29-6935-2025-f15

Figure 15Same as Fig. 11, but for comparison against University of Arizona (UA) data.

Download

6 Discussion

In general, using the cnrm-esm2-1 model as an estimator results in better accuracy in Colorado at both the 100 and 9 km resolutions compared to other models. For instance, in Colorado, the use of the cnrm-esm2-1 model at a 9 km resolution demonstrated close agreement with the observed SWE, with an average RMSE of 0.07 m. This performance highlights the model's strong compatibility with the climatic and geographical complexities of Colorado. Conversely, in California, the ec-earth3-veg model excels at a 9 km resolution, providing the most accurate results with an average RMSE of 0.13 m compared to the reference datasets. This suggests that its higher resolution better captures the region's complex environmental and topographical variations.

It also appears that the finer resolution of 9 km provides slightly better accuracy than the 100 km resolution across all models, although the difference is not substantial. However, the difference between 9 and 100 km inputs can vary by region and model.

This underscores the importance of selecting an appropriate climate model for SWE estimation, beyond merely choosing a higher-resolution model. Moreover, CMIP6 models are designed for long-term climate projections and capture broad climate trends rather than predicting specific weather events. Despite this, the downscaled SWE using the proposed approach based on CMIP6 inputs, in some cases, can be comparable to those using WRF-CMIP6 climate predictors, which dynamically downscale CMIP6. This can be largely because the proposed methodology relies on long-term climate data through the use of far and near temporal intervals, and CMIP6 effectively captures broad climatic trends and seasonality, including changes in temperature and precipitation patterns. This comparison emphasizes that model selection and input quality are key drivers for SWE estimation performance, sometimes more than input resolution alone. Accordingly, achieving an accurate HR-SWE estimation relies significantly on the choice and accuracy of the climate model inputs, such as precipitation and temperature data. As a result, the accuracy of the downscaled SWE remains highly sensitive to biases in these input variables. For example, precipitation biases can be a dominant factor influencing SWE estimation errors, and temperature biases can become more significant during transitional periods, such as the spring melt season. While we test both CMIP6 and WRF-CMIP6 climate variables as inputs, a full comparison to downscale SWE is beyond the scope of this study.

The following comparison provides a broader perspective on how our proposed method compares with other statistical downscaling techniques. BCSD methods are effective in reducing uncertainties in climate model outputs by adjusting model biases using high-resolution observations. These methods are particularly valuable for ensuring that model outputs align with observed climatology and capture local variability. However, they depend heavily on the availability of high-quality in situ data, which limits their application in remote or data-sparse regions. In contrast, our method excels in areas with sparse data, as it uses low-resolution climate data without requiring ground observations, making it adaptable to a broader range of conditions.

Analog-type statistical downscaling approaches offer a simple, computationally efficient way to project high-resolution data based on historical relationships between large-scale climate patterns and local climate variables. These methods are useful in regions where historical climate patterns are stable and well-documented. On the other hand, our method introduces several key improvements over traditional k-NN downscaling techniques. First, the adaptation of the k-NN approach through the incorporation of far and near temporal intervals of climate data enhances its ability to handle dynamic variables such as snow, which are subject to significant changes due to climate variability. Unlike conventional analog methods, which constrain analog candidates to a specific temporal window near the query date, this method eliminates such restrictions.

This flexibility is crucial for three reasons: (1) the inclusion of far and near temporal intervals allows the selection of the most suitable candidates across a broader temporal range, eliminating the need for narrow constraints; (2) this flexibility may help preserve rare but important extreme events in the downscaled data, since restricting candidates to a narrow temporal window increases the risk of excluding suitable analogs.; and (3) it facilitates downscaling for the periods where no exact analogs exist in the historical record within a specific date range. Instead, suitable analogs may still be found in the historical data, but during different periods. For instance, as climate change progresses, a day in winter may no longer have a match on the same calendar day in the past. Instead, an analog might be found on another calendar day, such as in a warmer season like fall or spring.

Using very low-resolution climate data as input reduces both memory requirements and computation time compared to physical snow models, which also require high-resolution climate data as input to estimate HR-SWE. This efficiency enables the generation of HR-SWE estimates over large spatial domains with reduced computational overhead. This is particularly beneficial when applying the method to large areas, long temporal scales, or ensembles of climate data.

7 Conclusion

This research proposes a methodology to downscale low-resolution daily snow water equivalent (LR-SWE) by utilizing low-resolution daily climate data to produce daily, high-resolution SWE (HR-SWE) data, covering the period from 1950 to the present. We test our approach in two distinct areas (California and Colorado) in the western United States. Utilizing existing low-resolution climate factors obtained from global CMIP6 climate data (100 km), downscaled CMIP6 for the western United States (WRF-CMIP6, 9 km), and available 500-meter HR-SWE images from 1984 to 2021 (UCLA SWE dataset) for the western United States, we implement a downscaling algorithm to estimate HR-SWE images. WRF-CMIP6 is a regional configuration of a climate model that dynamically downscales CMIP6 data to enhance spatial resolution. Utilizing both CMIP6 and WRF-CMIP6 at different resolutions as input estimators for our k-NN statistical downscaling framework enables us to assess the impact of using a regional climate model that dynamically downscales CMIP6 data, alongside a global climate model with different spatial resolutions, on the accuracy of our approach. We also select three climate models (ec-earth3-veg, mpi-esm1-2, and cnrm-esm2-1) for testing the proposed method using different models.

To perform a comprehensive accuracy assessment, we compared the downscaled SWE (SWE^) with established datasets like the 1 km SWE Daymet, 1 km SWE SNODAS, and the 4 km SWE from the University of Arizona. Moreover, we performed cross-validation accuracy assessments. Our SWE^ data closely mirrored reference HR-SWE conditions. Even when using lower-resolution climate datasets like CMIP6 (100 km), the method in some cases remained robust, leveraging recurring SWE patterns and climate data to generate more detailed SWE images.

A comprehensive analysis over all months reveals notable patterns in the performance of various climate models at different resolutions in California and Colorado. Overall, the cnrm-esm2-1 model tends to provide higher accuracy in Colorado when used as an estimator, outperforming other models at both the 100 and 9 km resolutions. In contrast, the ec-earth3-veg model performs best in California at a 9 km resolution. A finer resolution (9 km) generally offers slightly better accuracy than a 100 km resolution across models, though the difference is modest, emphasizing the importance of model selection over merely increasing resolution. However, the difference between the 9 and 100 km climate inputs can vary by region and model.

Ongoing research will focus on applying our downscaling framework to predict HR-SWE based on future climate predictions. The data generated could be invaluable not only in climate change studies but also for water resources management. This underscores the need for precise model selection and the potential for applying our findings to future climate scenarios, improving water resource management amid climate change. Moreover, our approach may be applicable in regions where high-resolution SWE data are available for certain years, but high-resolution meteorological data are limited or unavailable. However, it would require region-specific sensitivity analysis and weight optimization.

Code and data availability

The code for generating downscaled SWE maps, along with some sample data, are available at https://github.com/fatemehzakeri/SWE-Downscaling-High-Resolution-Snow-Water-Equivalent-Mapping (last access: June 2024, Zakeri, 2025).

Video supplement

The videos of high-resolution SWE estimations for California and Colorado are available at https://doi.org/10.5446/68236 (Zakeri et al., 2024a) and https://doi.org/10.5446/68237 (Zakeri et al., 2024b), respectively.

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/hess-29-6935-2025-supplement.

Author contributions

FZ: conceptualization, methodology, software, validation, investigation, data curation, formal analysis, writing (original draft). GM and MG: conceptualization, methodology, validation, formal analysis, writing (review and editing), supervision, project administration, and resources. FZ and GM: funding acquisition.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

Our study utilized datasets including the western United States UCLA snow reanalysis dataset; Daymet V4 dataset; SNODAS dataset; University of Arizona’s SWE dataset; SNOTEL dataset; MODIS Land Cover Type MCD12Q1 dataset; NASA SRTM Digital Elevation 30 m; the CONUS Drought Indices dataset; CMIP6 climate predictions; and downscaled CMIP6 data using the Weather Research and Forecasting (WRF) model. We thank the respective institutions and researchers for providing these datasets.

This project was carried out as a joint effort between the University of Lausanne (UNIL) and the University of California, Berkeley (UCB), and was made possible thanks to the UNIL Mobility Fellowship with fellowship number MD0012. We express our gratitude to UNIL for providing the funding and to UCB for hosting the first author. Moreover, this work was primarily carried out as part of the scientific project “Deep-time Synthetic Data Cubes to Enable Long-term Hydrological Modeling”, funded by the Swiss National Science Foundation (SNSF), with project number 200021_204130. The authors would like to gratefully acknowledge the SNSF.

Financial support

This research has been supported by the Université de Lausanne (Mobility Fellowship no. MD0012).

Review statement

This paper was edited by Yue-Ping Xu and reviewed by two anonymous referees.

References

Abatzoglou, J. T.: Development of gridded surface meteorological data for ecological applications and modelling, Int. J. Climatol., 33, 121–131, 2013. 

Abatzoglou, J. T. and Brown, T. J.: A comparison of statistical downscaling methods suited for wildfire applications, Int. J. Climatol., 32, 772–780, 2012. 

Alonso-González, E., Aalstad, K., Pirk, N., Mazzolini, M., Treichler, D., Leclercq, P., Westermann, S., López-Moreno, J. I., and Gascoin, S.: Spatio-temporal information propagation using sparse observations in hyper-resolution ensemble-based snow data assimilation, Hydrol. Earth Syst. Sci., 27, 4637–4659, https://doi.org/10.5194/hess-27-4637-2023, 2023. 

Bair, E. H., Rittger, K., Davis, R. E., Painter, T. H., and Dozier, J.: Validating reconstruction of snow water equivalent in California's Sierra Nevada using measurements from the NASA Airborne Snow Observatory, Water Resour. Res., 52, 8437–8460, 2016. 

Bales, R. C., Molotch, N. P., Painter, T. H., Dettinger, M. D., Rice, R., and Dozier, J.: Mountain hydrology of the western United States, Water Resour. Res., 42, https://doi.org/10.1029/2005WR004387, 2006. 

Belmecheri, S., Babst, F., Wahl, E. R., Stahle, D. W., and Trouet, V.: Multi-century evaluation of Sierra Nevada snowpack, Nat. Clim. Change, 6, 2–3, 2016. 

Broxton, P., Zeng, X., and Dawson, N.: Daily 4 km Gridded SWE and Snow Depth from Assimilated In-Situ and Modeled Data over the Conterminous US, Version 1, NASA National Snow and Ice Data Center Distributed Active [data set], https://doi.org/10.5067/0GGPB220EX6A, 2019. 

Caillouet, L., Vidal, J.-P., Sauquet, E., and Graff, B.: Probabilistic precipitation and temperature downscaling of the Twentieth Century Reanalysis over France, Clim. Past, 12, 635–662, https://doi.org/10.5194/cp-12-635-2016, 2016. 

National Operational Hydrologic Remote Sensing Center: Snow Data Assimilation System (SNODAS) Data Products at NSIDC, Version 1, National Snow and Ice Data Center [data set], https://doi.org/10.7265/N5TB14TC, 2004. 

Clow, D. W., Nanus, L., Verdin, K. L., and Schmidt, J.: Evaluation of SNODAS snow depth and snow water equivalent estimates for the Colorado Rocky Mountains, USA, Hydrol. Process., 26, 2583–2591, 2012. 

Cover, T. and Hart, P.: Nearest neighbor pattern classification, IEEE T. Inform. Theory, 13, 21–27, 1967. 

Dawadi, S. and Ahmad, S.: Changing climatic conditions in the Colorado River Basin: Implications for water resources management, J. Hydrol., 430, 127–141, 2012. 

Dietz, A. J., Wohner, C., and Kuenzer, C.: European Snow Cover Characteristics between 2000 and 2011 Derived from Improved MODIS Daily Snow Cover Products, Remote Sensing, 4, 2432–2454, https://doi.org/10.3390/rs4082432, 2012. 

EC-Earth Consortium (EC-Earth): EC-Earth-Consortium EC-Earth3-Veg model output prepared for CMIP6 ScenarioMIP, Earth System Grid Federation [data set], https://doi.org/10.22033/ESGF/CMIP6.727, 2019. 

Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer, R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geoscientific Model Development, 9, 1937–1958, https://doi.org/10.5194/gmd-9-1937-2016, 2016. 

Fang, Y., Liu, Y., and Margulis, S. A.: A western United States snow reanalysis dataset over the Landsat era from water years 1985 to 2021, Scientific Data, 9, 677, https://doi.org/10.1038/s41597-022-01768-7, 2022a. 

Fang, Y., Liu, Y., and Margulis, S. A.: Western United States UCLA Daily Snow Reanalysis, WUS_UCLA_SR, Version 1, NASA National Snow and Ice Data Center Distributed Active Archive Center [data set], https://doi.org/10.5067/PP7T2GBI52I2, 2022b. 

Fang, Y., Liu, Y., Li, D., Sun, H., and Margulis, S. A.: Spatiotemporal snow water storage uncertainty in the midlatitude American Cordillera, The Cryosphere, 17, 5175–5195, https://doi.org/10.5194/tc-17-5175-2023, 2023. 

Farr, T. G., Rosen, P. A., Caro, E., Crippen, R., Duren, R., Hensley, S., Kobrick, M., Paller, M., Rodriguez, E., and Roth, L.: The shuttle radar topography mission, Rev. Geophys., 45, https://doi.org/10.1029/2005RG000183, 2007. 

Fiddes, J., Aalstad, K., and Westermann, S.: Hyper-resolution ensemble-based snow reanalysis in mountain regions using clustering, Hydrol. Earth Syst. Sci., 23, 4717–4736, https://doi.org/10.5194/hess-23-4717-2019, 2019. 

Fleming, S. W., Zukiewicz, L., Strobel, M. L., Hofman, H., and Goodbody, A. G.: SNOTEL, the Soil Climate Analysis Network, and water supply forecasting at the Natural Resources Conservation Service: Past, present, and future, J. Am. Water Resour. As., 00, 1–15, https://doi.org/10.1111/1752-1688.13104, 2023. 

Friedl, M. and Sulla-Menashe, D.: MCD12Q1 MODIS/Terra+Aqua Land Cover Type Yearly L3 Global 500m SIN Grid V061, NASA EOSDIS Land Processes DAAC [data set], https://doi.org/10.5067/MODIS/MCD12Q1.061, 2022. 

Gangopadhyay, S., Clark, M., and Rajagopalan, B.: Statistical downscaling using k-nearest neighbors, Water Resour. Res., 41, W02024, https://doi.org/10.1029/2004WR003444, 2005. 

Girotto, M., Musselman, K. N., and Essery, R. L.: Data assimilation improves estimates of climate-sensitive seasonal snow, Current Climate Change Reports, 6, 81–94, 2020. 

Heldmyer, A. J., Bjarke, N. R., and Livneh, B.: A 21st-Century perspective on snow drought in the Upper Colorado River Basin, J. Am. Water Resour. As., 59, 396–415, 2023. 

Henn, B., Musselman, K. N., Lestak, L., Ralph, F. M., and Molotch, N. P.: Extreme runoff generation from atmospheric river driven snowmelt during the 2017 Oroville Dam spillways incident, Geophys. Res. Lett., 47, e2020GL088189, https://doi.org/10.1029/2020GL088189, 2020. 

Henson, R.: Flow Cytometry Data Reader and Visualization, MATLAB Central File Exchange [code], https://www.mathworks.com/matlabcentral/fileexchange/8430-flow-cytometry-data-reader-and-visualization (last access: January 2023), 2024. 

Horton, P.: AtmoSwing: Analog Technique Model for Statistical Weather forecastING and downscalING (v2.1.0), Geosci. Model Dev., 12, 2915–2940, https://doi.org/10.5194/gmd-12-2915-2019, 2019. 

Kouki, K., Räisänen, P., Luojus, K., Luomaranta, A., and Riihelä, A.: Evaluation of Northern Hemisphere snow water equivalent in CMIP6 models during 1982–2014, The Cryosphere, 16, 1007–1030, https://doi.org/10.5194/tc-16-1007-2022, 2022. 

Lall, U. and Sharma, A.: A nearest neighbor bootstrap for resampling hydrologic time series, Water Resour. Res., 32, 679–693, 1996. 

Largeron, C., Dumont, M., Morin, S., Boone, A., Lafaysse, M., Metref, S., Cosme, E., Jonas, T., Winstral, A., and Margulis, S. A.: Toward Snow Cover Estimation in Mountainous Areas Using Modern Data Assimilation Methods: A Review, Frontiers in Earth Science, 8, https://doi.org/10.3389/feart.2020.00325, 2020. 

Lehning, M., Völksch, I., Gustafsson, D., Nguyen, T. A., Stähli, M., and Zappa, M.: ALPINE3D: a detailed model of mountain surface processes and its application to snow hydrology, Hydrological Processes: An International Journal, 20, 2111–2128, 2006. 

Lievens, H., Demuzere, M., Marshall, H. P., Reichle, R. H., Brucker, L., Brangers, I., de Rosnay, P., Dumont, M., Girotto, M., Immerzeel, W. W., Jonas, T., Kim, E. J., Koch, I., Marty, C., Saloranta, T., Schober, J., and De Lannoy, G. J. M.: Snow depth variability in the Northern Hemisphere mountains observed from space, Nat. Commun., 10, 4629, https://doi.org/10.1038/s41467-019-12566-y, 2019. 

Liston, G. E. and Sturm, M.: A snow-transport model for complex terrain, J. Glaciol., 44, 498–516, 1998. 

Lundquist, J. D., Chickadel, C., Cristea, N., Currier, W. R., Henn, B., Keenan, E., and Dozier, J.: Separating snow and forest temperatures with thermal infrared remote sensing, Remote Sens. Environ., 209, 764–779, https://doi.org/10.1016/j.rse.2018.03.001, 2018. 

Ma, X., Li, D., Fang, Y., Margulis, S. A., and Lettenmaier, D. P.: Estimating spatiotemporally continuous snow water equivalent from intermittent satellite observations: an evaluation using synthetic data, Hydrol. Earth Syst. Sci., 27, 21–38, https://doi.org/10.5194/hess-27-21-2023, 2023. 

Margulis, S. A., Girotto, M., Cortés, G., and Durand, M.: A particle batch smoother approach to snow water equivalent estimation, J. Hydrometeorol., 16, 1752–1772, 2015. 

Margulis, S. A., Cortés, G., Girotto, M., and Durand, M.: A Landsat-era Sierra Nevada snow reanalysis (1985–2015), J. Hydrometeorol., 17, 1203–1221, 2016. 

Margulis, S. A., Liu, Y., and Baldo, E.: A joint landsat-and modis-based reanalysis approach for midlatitude montane seasonal snow characterization, Frontiers in Earth Science, 7, 272, https://doi.org/10.3389/feart.2019.00272, 2019. 

Meehl, G. A., Moss, R., Taylor, K. E., Eyring, V., Stouffer, R. J., Bony, S., and Stevens, B.: Climate model intercomparisons: Preparing for the next phase, Eos, Transactions American Geophysical Union, 95, 77–78, 2014. 

Mower, R., Gutmann, E. D., Liston, G. E., Lundquist, J., and Rasmussen, S.: Parallel SnowModel (v1.0): a parallel implementation of a distributed snow-evolution modeling system (SnowModel), Geosci. Model Dev., 17, 4135–4154, https://doi.org/10.5194/gmd-17-4135-2024, 2024. 

Muñoz Sabater, J.: ERA5-Land hourly data from 1950 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], https://doi.org/10.24381/cds.e2161bac, 2019. 

National Research Council: Future of the National Weather Service Cooperative Observer Network, Washington, DC, The National Academies Press, https://doi.org/10.17226/6197, 1998. 

Painter, T. H., Bryant, A. C., and Skiles, S. M.: Radiative forcing by light absorbing impurities in snow from MODIS surface reflectance data, Geophys. Res. Lett., 39, L17502, https://doi.org/10.1029/2012gl052457, 2012. 

Pflug, J. and Lundquist, J.: Inferring distributed snow depth by leveraging snow pattern repeatability: Investigation using 47 lidar observations in the Tuolumne watershed, Sierra Nevada, California, Water Resour. Res., 56, e2020WR027243, https://doi.org/10.1029/2021JD035699, 2020. 

Pflug, J. M., Wrzesien, M. L., Kumar, S. V., Cho, E., Arsenault, K. R., Houser, P. R., and Vuyovich, C. M.: Extending the utility of space-borne snow water equivalent observations over vegetated areas with data assimilation, Hydrol. Earth Syst. Sci., 28, 631–648, https://doi.org/10.5194/hess-28-631-2024, 2024. 

Pons, M., San-Martín, D., Herrera, S., and Gutiérrez, J. M.: Snow trends in Northern Spain: analysis and simulation with statistical downscaling methods, Int. J. Climatol., 30, 1795–1806, https://doi.org/10.1002/joc.2016, 2010. 

Rahimi, S., Krantz, W., Lin, Y. H., Bass, B., Goldenson, N., Hall, A., Lebo, Z. J., and Norris, J.: Evaluation of a Reanalysis-Driven Configuration of WRF4 Over the Western United States From 1980 to 2020, J. Geophys. Res.-Atmos., 127, e2021JD035699, https://doi.org/10.1029/2021JD035699, 2022. 

Rahimi, S., Huang, L., Norris, J., Hall, A., Goldenson, N., Krantz, W., Bass, B., Thackeray, C., Lin, H., Chen, D., Dennis, E., Collins, E., Lebo, Z. J., Slinskey, E., Graves, S., Biyani, S., Wang, B., Cropper, S., and the UCLA Center for Climate Science Team: An overview of the Western United States Dynamically Downscaled Dataset (WUS-D3), Geosci. Model Dev., 17, 2265–2286, https://doi.org/10.5194/gmd-17-2265-2024, 2024. 

Rajagopalan, B. and Lall, U.: A k-nearest-neighbor simulator for daily precipitation and other weather variables, Water Resour. Res., 35, 3089–3101, 1999. 

Ranzi, R., Colosio, P., and Galeati, G.: Climatology of snow depth and water equivalent measurements in the Italian Alps (1967–2020), Hydrol. Earth Syst. Sci., 28, 2555–2578, https://doi.org/10.5194/hess-28-2555-2024, 2024. 

Rettie, F. M., Gayler, S., Weber, T. K., Tesfaye, K., and Streck, T.: High-resolution CMIP6 climate projections for Ethiopia using the gridded statistical downscaling method, Scientific data, 10, 442, https://doi.org/10.1038/s41597-023-02337-2, 2023. 

Saberi, N., Kelly, R., Flemming, M., and Li, Q.: Review of snow water equivalent retrieval methods using spaceborne passive microwave radiometry, Int. J. Remote Sens., 41, 996–1018, 2020. 

Seferian, R.: CNRM-CERFACS CNRM-ESM2-1 model output prepared for CMIP6 CMIP. Earth System Grid Federation [data set], https://doi.org/10.22033/ESGF/CMIP6.1391, 2018. 

Shahriari, B., Swersky, K., Wang, Z., Adams, R. P., and De Freitas, N.: Taking the human out of the loop: A review of Bayesian optimization, P. IEEE, 104, 148–175, 2015. 

Shi, J. and Dozier, J.: Inferring snow wetness using C-band data from SIR-C's polarimetric synthetic aperture radar, IEEE T. Geosci. Remote, 33, 905–914, 1995. 

Shi, J. and Dozier, J.: Estimation of snow water equivalence using SIR-C/X-SAR. II. Inferring snow depth and particle size, IEEE T. Geosci. Remote, 38, 2475–2488, 2000. 

Siirila-Woodburn, E. R., Rhoades, A. M., Hatchett, B. J., Huning, L. S., Szinai, J., Tague, C., Nico, P. S., Feldman, D. R., Jones, A. D., and Collins, W. D.: A low-to-no snow future and its impacts on water resources in the western United States, Nature Reviews Earth and Environmental, 2, 800–819, 2021a. 

Siirila-Woodburn, E. R., Rhoades, A. M., Hatchett, B. J., Huning, L. S., Szinai, J., Tague, C., Nico, P. S., Feldman, D. R., Jones, A. D., Collins, W. D. J. N. R. E., and Environment: A low-to-no snow future and its impacts on water resources in the western United States, Nature Reviews Earth Environment, 2, 800–819, 2021b. 

Skamarock, W. C., Klemp, J. B., Dudhia, J., Gill, D. O., Liu, Z., Berner, J., Wang, W., Powers, J. G., Duda, M. G., and Barker, D. M.: A description of the advanced research WRF model version 4, National Center for Atmospheric Research, Boulder, CO, USA, 145, 550, https://doi.org/10.5065/1dfh-6p97, 2019. 

Snapir, B., Momblanch, A., Jain, S., Waine, T. W., and Holman, I. P.: A method for monthly mapping of wet and dry snow using Sentinel-1 and MODIS: Application to a Himalayan river basin, Int. J. Appl. Earth Obs., 74, 222–230, 2019. 

Snoek, J., Larochelle, H., and Adams, R. P.: Practical bayesian optimization of machine learning algorithms, Adv. Neur. In., 25, https://proceedings.neurips.cc/paper/2012/file/05311655a15b75fab86956663e1819cd-Paper.pdf (last access: January 2023), 2012. 

Sturm, M., Goldstein, M. A., and Parr, C.: Water and life from snow: A trillion dollar science question, Water Resour. Res., 53, 3534–3544, 2017. 

Tabari, H., Paz, S. M., Buekenhout, D., and Willems, P.: Comparison of statistical downscaling methods for climate change impact analysis on precipitation-driven drought, Hydrol. Earth Syst. Sci., 25, 3493–3517, https://doi.org/10.5194/hess-25-3493-2021, 2021. 

Thornton, M., Shrestha, R., Wei, Y., Thornton, P., Kao, S., and Wilson, B.: Daymet: daily surface weather data on a 1-km grid for North America, Version 4 R1, ORNL DAAC [data set], https://doi.org/10.3334/ORNLDAAC/2129, 2022. 

Thrasher, B., Wang, W., Michaelis, A., Melton, F., Lee, T., and Nemani, R.: NASA global daily downscaled projections, CMIP6, Scientific data, 9, 262, https://doi.org/10.1038/s41597-022-01393-4, 2022. 

Tsang, L., Durand, M., Derksen, C., Barros, A. P., Kang, D.-H., Lievens, H., Marshall, H.-P., Zhu, J., Johnson, J., King, J., Lemmetyinen, J., Sandells, M., Rutter, N., Siqueira, P., Nolin, A., Osmanoglu, B., Vuyovich, C., Kim, E., Taylor, D., Merkouriadi, I., Brucker, L., Navari, M., Dumont, M., Kelly, R., Kim, R. S., Liao, T.-H., Borah, F., and Xu, X.: Review article: Global monitoring of snow water equivalent using high-frequency radar remote sensing, The Cryosphere, 16, 3531–3573, https://doi.org/10.5194/tc-16-3531-2022, 2022. 

U.S. Bureau of Reclamation (USBR): Emerging Technologies in Snow Monitoring,, Report to Congress, https://www.usbr.gov/research/docs/news/Emerging_Snow_Monitoring_Report_508.pdf (last access: January 2023), 2021. 

Vionnet, V., Martin, E., Masson, V., Guyomarc'h, G., Naaim-Bouvet, F., Prokop, A., Durand, Y., and Lac, C.: Simulation of wind-induced snow transport and sublimation in alpine terrain using a fully coupled snowpack/atmosphere model, The Cryosphere, 8, 395–415, https://doi.org/10.5194/tc-8-395-2014, 2014. 

Wieners, K.-H., Giorgetta, M., Jungclaus, J., Reick, C., Esch, M., Bittner, M., Legutke, S., Schupfner, M., Wachsmann, F., Gayler, V., Haak, H., de Vrese, P., Raddatz, T., Mauritsen, T., von Storch, J.-S., Behrens, J., Brovkin, V., Claussen, M., Crueger, T., Fast, I., Fiedler, S., Hagemann, S., Hohenegger, C., Jahns, T., Kloster, S., Kinne, S., Lasslop, G., Kornblueh, L., Marotzke, J., Matei, D., Meraner, K., Mikolajewicz, U., Modali, K., Müller, W., Nabel, J., Notz, D., Peters-von Gehlen, K., Pincus, R., Pohlmann, H., Pongratz, J., Rast, S., Schmidt, H., Schnur, R., Schulzweida, U., Six, K., Stevens, B., Voigt, A., and Roeckner, E.: MPI-M MPIESM1.2-LR model output prepared for CMIP6 CMIP, Earth System Grid Federation [data set], https://doi.org/10.22033/ESGF/CMIP6.742, 2019. 

Wood, A. W., Leung, L. R., Sridhar, V., and Lettenmaier, D.: Hydrologic implications of dynamical and statistical approaches to downscaling climate model outputs, Climatic Change, 62, 189–216, 2004. 

Wu, X., Naegeli, K., Premier, V., Marin, C., Ma, D., Wang, J., and Wunderle, S.: Evaluation of snow extent time series derived from Advanced Very High Resolution Radiometer global area coverage data (1982–2018) in the Hindu Kush Himalayas, The Cryosphere, 15, 4261–4279, https://doi.org/10.5194/tc-15-4261-2021, 2021.  

Wundram, D. and Löffler, J.: High-resolution spatial analysis of mountain landscapes using a low-altitude remote sensing approach, Int. J. Remote Sens., 29, 961–974, 2008. 

Yates, D., Gangopadhyay, S., Rajagopalan, B., and Strzepek, K.: A technique for generating regional climate scenarios using a nearest-neighbor algorithm, Water Resour. Res., 39, 1199, https://doi.org/10.1029/2002WR001769, 2003. 

Yiou, P. and Déandréis, C.: Stochastic ensemble climate forecast with an analogue model, Geosci. Model Dev., 12, 723–734, https://doi.org/10.5194/gmd-12-723-2019, 2019. 

Zakeri, F.: fatemehzakeri/SWE-Downscaling-High-Resolution-Snow-Water-Equivalent-Mapping: k-NN SWE Downscaling (Version V1), Zenodo [code], https://doi.org/10.5281/zenodo.17108974, 2025. 

Zakeri, F. and Mariethoz, G.: Synthesizing long-term satellite imagery consistent with climate data: Application to daily snow cover, Remote Sens. Environ., 300, 113877, https://doi.org/10.1016/j.rse.2023.113877, 2024. 

Zakeri, F., Mariethoz, G., and Girotto, M.: High-Resolution SWE Estimation for California: Using 'ec-earth3-veg' Estimator at 100 km Resolution, TIB AV Portal [video], https://doi.org/10.5446/68236, 2024a. 

Zakeri, F., Mariethoz, G., and Girotto, M.: High-Resolution SWE Estimation for Colorado: Analysis Using 'cnrm-esm2-1' Model at 9 km Resolution, TIB AV Portal [video], https://doi.org/10.5446/68237, 2024b. 

Zeng, X., Broxton, P., and Dawson, N.: Snowpack change from 1982 to 2016 over conterminous United States, Geophys. Res. Lett., 45, 12940–12947, 2018. 

Zorita, E. and Von Storch, H.: The analog method as a simple statistical downscaling technique: Comparison with more complicated methods, J. Climate, 12, 2474–2489, 1999. 

Download
Short summary
Our study presents a method to estimate high-resolution snow water equivalent (HR-SWE) using low-resolution climate data (LR-CD). By using a data-driven approach, we analyze historical weather patterns from LR-CD to generate HR-SWE maps. Machine learning and statistical relationships between LR-CD and HR-SWE enable estimation for dates without HR-SWE data. This method enhances water resource management and climate impact assessments, especially in data-scarce regions.
Share