Decision tree-based detection of blowing  snow events in the European Alps

Xie, Zhipeng; Ma, Weiqiang; Ma, Yaoming; Hu, Zeyong; Sun, Genhou; Han, Yizhe; Hu, Wei; Su, Rongmingzhu; Fan, Yixi

doi:https://doi.org/10.5194/hess-25-3783-2021

Articles | Volume 25, issue 7

https://doi.org/10.5194/hess-25-3783-2021

Articles | Volume 25, issue 7

Research article

02 Jul 2021

Research article |

| 02 Jul 2021

Decision tree-based detection of blowing snow events in the European Alps

Zhipeng Xie, Weiqiang Ma, Yaoming Ma, Zeyong Hu, Genhou Sun, Yizhe Han, Wei Hu, Rongmingzhu Su, and Yixi Fan

Abstract

Blowing snow processes are crucial in shaping the strongly heterogeneous spatiotemporal distribution of snow and in regulating subsequent snowpack evolution in mountainous terrain. Although empirical formulae and constant threshold wind speeds have been widely used to estimate the occurrence of blowing snow in regions with sparse observations, the scarcity of in situ observations in mountainous regions contrasts with the demands of models for reliable observations at high spatiotemporal resolution. Therefore, these methods struggle to accurately capture the high local variability of blowing snow. This study investigated the potential capability of the decision tree model (DTM) to detect blowing snow in the European Alps. The DTMs were constructed based on routine meteorological observations (mean wind speed, maximum wind speed, air temperature and relative humidity) and snow measurements (including in situ snow depth observations and satellite-derived products). Twenty repetitions of a random sub-sampling validation test with an optimal size ratio (0.8) between the training and validation subsets were applied to train and assess the DTMs. Results show that the maximum wind speed contributes most to the classification accuracy, and the inclusion of more predictor variables improves the overall accuracy. However, the spatiotemporal transferability of the DTM might be limited if the divergent distribution of wind speed exists between stations. Although both the site-specific DTMs and site-independent DTM show great ability in detecting blowing snow occurrence and are superior to commonly used empirical parameterizations, specific assessment indicators varied between stations and surface conditions. Events for which blowing snow and snowfall occurred simultaneously were detected the most reliably. Although models failed to fully reproduce the high frequency of local blowing snow events, they have been demonstrated to be a promising approach requiring limited meteorological variables and have the potential to scale to multiple stations across different regions.

Download & links

Article (PDF, 6537 KB)

Download & links

How to cite.

Received: 01 Mar 2021 – Discussion started: 11 Mar 2021 – Revised: 11 May 2021 – Accepted: 05 Jun 2021 – Published: 02 Jul 2021

1 Introduction

Wind plays a key role in the snow distribution in mountainous terrain, as it shapes both the spatial heterogeneity of snowfall and the erosion, transport and deposition of surface snow via blowing snow processes. In addition to their impacts on the strong spatiotemporal heterogeneity of the surface snow distribution, blowing snow processes also have important consequences for the subsequent evolution of the snowpack (Déry and Yau, 2002; Leonard and Maksym, 2011) and the surface water and energy budgets (Lenaerts et al., 2012a; Liston, 2004; Pomeroy and Gray, 1995; Sexstone et al., 2018). Meanwhile, wind-induced snow transport can also be a major hazard, causing severe reductions to visibility near the ground and triggering snow avalanches (Lehning and Fierz, 2008), with the potential for loss of life, property damage, and disruption of transportation. Blowing snow events result in large-scale snow mass divergence or convergence from open, wind-exposed surfaces to wind-sheltered areas such as densely vegetated surfaces and topographic depressions (Essery and Pomeroy, 2004). Micro- and meso-scale variability in snow cover and snowmelt strongly influences the surface radiation balance, surface discharge, ecology, and soil freeze/thaw and can be largely attributed to the spatial heterogeneity of surface snow redistribution caused by blowing snow (Liston, 2004; Mott et al., 2018). Therefore, wind-driven snow redistribution is widely recognized as driving patterns in snow accumulation and snowpack evolution in alpine basins and represents an important interaction between the land and the overlying atmosphere.

Several specific instruments facilitate direct observation of blowing snow at the local scale, for example, the mechanical traps used by Budd et al. (1966), the optical sensors deployed in the Antarctic and Alps (Snow Particle Counters, SPC; Sato et al., 1993; Nishimura and Nemoto, 2005; Vionnet et al., 2013), and the acoustic sensors (i.e., FlowCapt and SPC) used to provide reliable measurements of blowing snow mass flux (Chritin et al., 1999; Trouvilliez et al., 2015). However, direct near-surface blowing snow observations are extremely sparse in time and space. Alternative methods using empirical formulae to parameterize blowing snow occurrence have been proposed (e.g., He and Ohara, 2017; Li and Pomeroy, 1997a; Schmidt, 1980). One of the most important parameters is the threshold wind speed for snow transport, as it determines the occurrence of blowing snow. A blowing snow event takes place when the wind exceeds the threshold wind speed. Previous studies have demonstrated that cohesive resistance increases dramatically when snow becomes wet, as the meltwater increases the associated cohesion between the particles (e.g., Li and Pomeroy, 1997a; Schmidt, 1980), and sintering of snow particles has a significant bearing on the cohesive force development as well (He and Ohara, 2017; Schmidt, 1980). Therefore, the presence of liquid water and the associated snow metamorphism and aging processes typically increase the bond strength in the surface snow layer (Bromwich, 1988; Li and Pomeroy, 1997a). As summarized by Schmidt (1980), the threshold wind speed highly depends on the cohesion between snow particles and was greatly influenced by temperature, humidity and deposition time.

The threshold wind speed is important for predicting the initialization of a blowing snow event. Threshold wind speed at the height of 10 m was found to be 9.9 m s⁻¹ for wet snow and 7.7 m s⁻¹ for dry snow, and a formula that expresses the threshold wind speed as a function of air temperature has been proposed based on field observations from the Canadian Prairies (Li and Pomeroy, 1997a). Moreover, other parameterizations have also been established using the relationship between threshold wind speed and the microstructural properties of surface snow, such as snow density, the bond diameter between snow particles, and the particle mean radius (Gallée et al., 2001, 2013; Guyomarc'h and Mérindol, 1998; He and Ohara, 2017; Lehning et al., 2000; Schmidt, 1980, 1981). These parameterizations are widely used in numerical models to describe wind-driven snow transport processes. Rather than being constant, it is widely accepted that the threshold wind speed varies with temperature, humidity, particle size, and deposition time (He and Ohara, 2017). Though there are proposed relationships between the threshold wind speed and meteorological conditions, parameterizations have only been validated for very limited areas (Gallée et al., 2001; Li and Pomeroy, 1997a; Schmidt, 1981), and there is no standard method for determining the meteorological conditions under which blowing snow events occur (Li and Pomeroy, 1997b).

Recent attempts have been made to retrieve blowing snow occurrences from satellite remote-sensing data (Palm et al., 2011, 2018). Results demonstrate the validity of the remote retrieval algorithms in detecting the blowing snow events over the Antarctic, providing insights into the spatial and temporal variability of blowing snow events independently of modeling approaches. The satellite-based technique provides the opportunity to derive blowing snow occurrences with wide spatial coverage, but it is hampered by the presence of clouds and the coarse vertical resolution (Gossart et al., 2017). Moreover, satellite blowing snow detection is associated with pronounced uncertainty and cannot detect the presence of blowing snow events at fine temporal resolution (Palm et al., 2011), preventing its widespread application in remote areas outside Antarctica.

Progress has been made in obtaining large spatial-scale blowing snow estimates using various multiple data sources, such as visual observations (Mahesh et al., 2003), ground-based ceilometer observations (Gossart et al., 2017), and snow depth and simultaneous meteorological observations (Guyomarc'h et al., 2019; Guyomarc'h and Mérindol, 1998; Vionnet et al., 2013). However, direct observations are scarce, both in time and space. Snow depth measurements are more common than visual blowing snow observations or ground-based ceilometer observations but are not routinely included in conventional meteorological observation systems. Meanwhile, numerical modeling provides a useful tool to estimate blowing snow occurrences but relies not only on accurate forcing datasets (e.g., temperature and wind speed), but also on knowledge of the surface snow properties, which are difficult to accurately define. This is particularly notable in mountainous regions such as the Alps, where the surface is strongly heterogeneous and environmental conditions are very variable.

Table 1List of stations used in this study.

Download Print Version | Download XLSX

Whether a blowing snow event occurs or not is an important state variable for detailed simulations of blowing snow processes. Standard meteorological instruments (distinct from specific instruments such as SPC and FlowCapt, which are less commonly deployed) are often used in blowing snow studies. In this study, we use a machine-learning-based decision tree model (DTM) to detect the presence of blowing snow by exploiting routine meteorological observations (such as wind speed, air temperature, precipitation and relative humidity) and snow measurements (in situ snow depth observations and satellite-derived products) from 10 ISAW stations (http://isaw.ch/, last access: 12 December 2020). This study aims to develop a simple but efficient tool to detect blowing snow occurrences and to advance our understanding of the relationships between blowing snow processes and ambient meteorological conditions.

2 Data and methods

2.1 Data

Data were obtained from ISAW and include measurements of blowing snow fluxes and surface meteorological variables. These include mean and maximum wind speed (WS and WSMAX) at 3.5 m height, wind direction (WD), air temperature (T), relative humidity (RH), snow depth (SD), and precipitation about 30 observation stations. The surface meteorological data are measured at every minute, and hourly averages are stored. Although the available meteorological variables vary between stations (for example, RH is only available at Fmor, Fcmb, Fber, Fhue and Fgie), each ISAW station is equipped with the FlowCapt acoustic sensor (Chritin et al., 1999) to measure blowing snow fluxes. In this study, 10 stations that include all the above-mentioned observations were selected (Table 1), of which the Fsal station was used in the sensitivity test and not used in constructing the DTMs. This maximizes the number of different dimensions (corresponding to the different observed fields) that can be used to construct an efficient DTM for identifying blowing snow events.

To minimize uncertainty in the classification stemming from the use of poor-quality data, strict data selection criteria were applied to observations. First, using a threshold value of 50 % change within 1 h, the main change range check was applied to the relative humidity to detect its abnormal change. In addition, a threshold check was performed for the hourly measured air temperature, wind speed, and maximum wind speed. For example, data with T outside the range −50 to 50 ^∘C or WSMAX greater than 40 m s⁻¹ were considered unreliable and were discarded. Periods when both WS and WSMAX were zero for more than 3 consecutive hours and when non-zero WS remain unchanged for more than 5 h were also removed. Since blowing snow fluxes measured by the FlowCapt sensors are sensitive to soil particles, false signals are frequently detected; therefore, only data from winter and spring (from November to April) were used, minimizing the uncertainty resulting from this issue. Additional suspicious data were discarded when a blowing snow event was recorded by the FlowCapt sensor without concurrent snowfall and in the absence of snow cover or when the positive air temperature lasted for more than 24 h. In this study, periods of blowing snow occurrence were identified when positive blowing snow flux was observed. This is different from the work of Trouvilliez et al. (2015), who used a threshold of 1 g m⁻² s⁻¹ to remove non-significant blowing snow occurrences, and the work of Vionnet et al. (2013), who only analyzed events of durations longer than 4 h. The presence of snow on the ground was determined based on the snow depth measurements from two snow depth sensors, the MODIS daily snow cover product (MOD10A1 and MYD10A1, Hall and Riggs, 2021) and the CryoLand fractional snow cover product over the Alps (http://cryoland.enveo.at, last access: 12 December 2020). For detailed procedures, please refer to the schematic flowchart in Fig. 1a.

https://hess.copernicus.org/articles/25/3783/2021/hess-25-3783-2021-f01

Figure 1Schematic flowchart of (a) the procedures to identify the presence of snow, (b) flowchart of a simple decision tree model to detect blowing snow occurrence (only WSMAX and T were used to construct the DTM, A denotes the threshold maximum wind speed, and B1 and B2 denote the threshold air temperature), and (c) logical framework of this study. BS and No_BS denote with and without blowing snow occurrence, respectively.

Download

https://hess.copernicus.org/articles/25/3783/2021/hess-25-3783-2021-f02

Figure 2The frequencies of occurrence of the three types of blowing snow at each station and at all stations combined. SF denotes snowfall condition and NoSF_SC_DRY denotes surface covered by dry snow without concurrent snowfall condition, and NoSF_SC_WET denotes the surface covered by wet snow without a concurrent snowfall condition. The blowing snow frequency denotes the ratio between occurrences of blowing snow for a given atmospheric condition divided by the total number of occurrences of this atmospheric condition.

Download

As discussed above, internal physical properties of the snowpack, such as snow particle bonding, cohesion, and its kinetic properties, greatly influence the strength of snow resistance, which determines the initiation and persistence of blowing snow events (Li and Pomeroy, 1997b; Pomeroy and Gray, 1990; Schmidt, 1980). Previous studies have shown a sharp contrast in the threshold wind speed for snow transport between fresh snow and aged snow (Huang et al., 2008; Liston et al., 2007; Xie et al., 2019), owing to the strength of the bond between snow particles depending on destructive metamorphism, melting, snow loading, and increased compaction caused by overburden (Li and Pomeroy, 1997a, b; Oleson et al., 2013). Newly fallen snow particles are characteristically soft and powdery, with relatively low density, making new snow particles much more likely to be lifted by the wind. The occurrence of snowfall within the hourly measurement interval is therefore the primary factor used to distinguish a blowing snow event in the classification samples. Snowfall occurs when a precipitation greater than 0 coincides with an air temperature ≤0 ^∘C. As cohesive resistance increases dramatically once snow becomes wet, there are very considerable differences in the frequencies of blowing snow occurrence in dry snow and wet snow (Li and Pomeroy, 1997b). Thus, the wet/dry snow condition is also used as an attribute in establishing the classification tree model. Wet snow refers to the snow which has either melted or received liquid precipitation since the last snowfall, while dry snow is defined as snow that has not received temperatures of 0 ^∘C or above or liquid precipitation (Li and Pomeroy, 1997a). To accurately capture the different effects of ambient atmospheric conditions on the occurrence of blowing snow, the quality-controlled data were categorized into three types: snowfall (SF), surface covered by wet snow without concurrent snowfall (NoSF_SC_WET), and surface covered by dry snow without concurrent snowfall (NoSF_SC_DRY). The occurrence frequencies of the three types of blowing snow at each station and at all stations combined are shown in Fig. 2. A higher occurrence of blowing snow events was detected under the concurrent falling snow condition than other conditions.

Table 2Summary of three tests conducted to verify the performance of the spatiotemporal prediction of the DTM.

Download Print Version | Download XLSX

2.2 Method

Decision tree analysis uses a machine-learning algorithm to build a tree-like classification structure and regression model to identify a set of characteristics that can best differentiate between individual classes based on a categorical feature variable. This method has become increasingly popular in industrial applications and scientific research. It has been proven to be a very useful and efficient technique in processing remote-sensing images (Yang et al., 2017), predicting natural hazard events (Park and Lee, 2014; Ragettli et al., 2017), and estimating hydro-meteorological variables such as winter lake ice (Sharma et al., 2019) and snow depth (Gharaei-Manesh et al., 2016).

A classification tree is composed of decision nodes that represent attributes of the samples to be classified, branches that represent the different possible outcomes of a decision node, and leaves that represent the possible classification (Fig. 1b shows a simple decision tree model). Building a classification tree requires two steps: a learning step and a classification step. In the learning step, a classification model is developed based on multi-dimensional training data with labeled attributes. In this step, the maximum depth of the tree can be used as a control variable for pre-pruning to optimize the decision tree. In the classification step, independent data are used to verify the accuracy of the constructed model.

In this study, we use the scikit-learn package (Pedregosa et al., 2011), an open-source Python module for machine learning, to build DTMs and to identify the occurrence of blowing snow events based on routine meteorological observations. Information on the construction of the decision tree model is detailed in the next section, for instance, the selection of key characteristic variables used to build the tree and identification of the optimal ratio between the training and validation sets. We conducted three sensitivity tests to verify the spatiotemporal prediction of the established DTMs (Table 2). In Test 1, 80 % of the out-of-bag (OOB) observations from each site and all stations were used to construct the site-specific decision tree model (SSDTM) and site-independent decision tree model (SIDTM). This test offers a comprehensive assessment of the decision tree model in identifying the occurrence of blowing snow at both temporal and spatial scales. In Test 2 and Test 3, only five stations were selected to train the model. The main difference between these two tests lies in the distribution characteristics of the feature variables among stations; three stations (Fmor, Fcmb and Fmon) were both involved in these two tests. Tests 2 and 3 serve as a complementary test for further accuracy assessment of the spatiotemporal prediction. The logical framework of this study is presented in Fig. 1c.

To reduce the classification uncertainty attributable to training data selection, 20 repetitions of a random sub-sampling validation method were applied (with the optimal ratio between the training and validation sets) in the construction of each decision tree model. In each cross-validation, the vast majority of available observations were used for training, and the remaining set was used to validate the model. At the end of the cross-validation, 20 testing probabilities were created and averaged before the final analysis. The accuracy of the model was first calculated by comparing the actual and predicted classifications. The correspondence of predicted and observed blowing snow events was then quantitatively assessed using the overall accuracy (OA) index, false alarm rate (FAR), probability of detection (POD), Heidke skill score (HSS) and missing rate (MR). These evaluation metrics are defined from the contingency table of dichotomous events in Table 3 and can be written in the form

\begin{array}{l} (1) & OA = (a + d) / (a + b + c + d), \\ (2) & FAR = b / (a + b), \\ (3) & POD = a / (a + c), \\ (4) & HSS = \frac{2 (a d - b c)}{(a + c) (c + d) + (a + b) (b + d)}, \\ (5) & MR = c / (a + c) . \end{array}

The overall agreement between estimated and actual blowing snow events is captured in OA, which ranges from 0 to 1, with 1 representing a perfect classification. The FAR measures the fraction of forecasted events that did not actually occur, and the MR denotes the proportion of blowing snow events that actually occurred but were not captured by the DTM model (both range from 0 to 1, with an optimal performance of 0), and the POD is the fraction of observed blowing snow events that were correctly identified by the models (range from 0 to 1, with 1 representing the perfect score). When one category is dominant, previous studies have reported that the OA is not sufficient, as it can be hedged by forecasting common events more frequently (Roebber et al., 2003; Notarnicola et al., 2013). The HSS accounts for this bias by characterizing the skill of the compared dataset with regards to the no-skill random forecasts. The HSS ranges from −1 to 1, with 1 representing a perfect classification skill, 0 representing a random classification and negative values corresponding to a decision tree-based classification that is less accurate than a random classification.

Table 3Contingency table for blowing snow events and the corresponding indices used for the computation of evaluation metrics.

Download Print Version | Download XLSX

3 Results

3.1 Sensitivity to the proportion of training samples

Training a model is the first step in making good predictions. Splitting the available dataset into a training portion and a validation portion is therefore necessary to build a solid basis with which to train and test a model. Theoretically, the DTM should be trained on a larger portion of the data to more accurately capture the underlying spread and pattern of data. However, real datasets are often imbalanced, with random noise; therefore, a certain portion of the validation dataset must be retained to verify the model's ability to derive the underlying pattern of observations and to ensure the reliability of the assessment.

Sensitivity tests were conducted and evaluated to confirm the appropriate ratio between the training subset and validation subset. Starting with observations from all stations (All), a varying proportion of these stations was retained as the training dataset (range from 0.5 to 0.9, at 0.01 increments), from which 2400 DTMs were established (here, $2400 = 40 \times 3 \times 20$ : this represents the 40 training sets from 0.5 to 0.9; three groups SF, NoSF_SC_DRY and NoSF_SC_WET; and 20 repetitions of the random sampling). As listed in Table 4, the overall classification accuracy of these models ranged from 0.848 to 0.935, indicating that models predicted the occurrence of blowing snow events very accurately. The variation range and standard deviation of overall accuracy changed slightly with decreasing sample size: the accuracy range ranged from $1.06 \times 10^{- 3}$ to $7.19 \times 10^{- 3}$ , and the standard deviation increased from $2.23 \times 10^{- 4}$ to $15.26 \times 10^{- 4}$ as sample size decreased. According to the sensitivity analysis, training sample size had a small influence on the classification accuracy of the DTM. However, because the reliability of the accuracy assessment decreased with decreasing validation sample size, the training set proportion of 0.8 was recommended in this study.

3.2 Sensitivity to the feature variables

The current decision tree uses a greedy algorithm, meaning that an optimal node construction and attribute combinations were selected to build the classification tree model. To construct an efficient model capable of being applied across broad spatial and temporal scales, a major challenge is to select the fewest number of feature variables to construct a model with the highest classification accuracy. Although many factors (including land surface characteristics and ambient meteorological conditions) can influence the occurrence of a blowing snow event, it is unrealistic to consider all factors in the estimation because of the requirement for spatiotemporal transferability. Therefore, the possible predictor variables used in this study comprise WS, WSMAX, T and RH.

https://hess.copernicus.org/articles/25/3783/2021/hess-25-3783-2021-f03

Figure 3The mean overall accuracy (a–c) and scores (d–f) of the site-independent DTM (All) and the site-specific DTMs trained with different feature variable combinations in snowfall conditions (SF), dry snow cover conditions (NoSF_SC_DRY) and wet snow cover conditions (NoSF_SC_WET), and the mean scores of models constructed based on feature variable combinations with and without the RH included (g–i). The pound sign (#) indicates that the SSDTM was not constructed using RH. WS is mean wind speed; WSMAX is the maximum wind speed; T is air temperature; RH is relative humidity.

Download

Table 4Performance statistics of decision tree models constructed under different situations with varied ratios for training and testing datasets (ranging from 0.5 to 0.9).

^* The situations are listed based on sample size.

Download Print Version | Download XLSX

To assess the relative importance of each single feature variable and to determine a suitable rule for training samples, nine combinations of feature variables for each of the three conditions (SF, NoSF_SC_WET and NoSF_SC_DRY) were selected from each station and all stations (All) to train the decision tree model. That is to say, in theory, there will be a total of 5400 DTMs trained ( $9 \times 10 \times 20 \times 3$ ). However, the RH observations are only available at Fmor, Fcmb and Fber stations (RH observations from Fhue and Fgie stations were discarded due to their frequent and dramatic fluctuations over short periods); therefore, 4680 DTMs were eventually obtained. Figure 3 displays the accuracy of these DTMs in estimating the blowing snow occurrence in the validation samples. The total score, which is a synthetic demonstration of the 20 random sampling tests of the DTMs, is also shown in Fig. 3. The total scores are calculated by ranking the accuracies of DTMs trained at each station with different combinations of predictor variables (the model with the highest accuracy scores the highest: 9 for All, Fmor, Fcmb and Fber and 7 for the other stations without RH observations). Furthermore, as the mean accuracy was also included, the maximum values of scores are 189 and 140 for stations with and without RH observations, respectively. As shown in Fig. 3, even though the same attribute combination was used, model accuracy varied widely not only between stations, but also between different snow conditions. Generally, models presented higher overall accuracies under dry snow conditions than the other two conditions. Although models derived for snowfall conditions produced accuracies comparable with models for the ground surface covered by wet snow, the latter were more effective in accurately detecting blowing snow occurrence than the former.

3.2.1 Air temperature and relative humidity

Of all the attribute combinations evaluated, models trained merely with either T or RH presented the lowest accuracy (Fig. 3a–c), indicating that the use of T or RH alone cannot fully capture the variance in the validation samples. However, significant improvements were achieved when either WS or WSMAX were accompanied by T or RH, even though the single factors performed poorly when used alone. Taking Fcmb station in snowfall conditions as an example, model accuracy increased from 0.54 when the model trained merely with T to 0.8 or 0.82 when WS or WSMAX was added, respectively. These results suggest that neither T nor RH is the guarantee of model accuracy, although the model with more predictor variables used generally achieved relatively high accuracy.

3.2.2 Mean and maximum wind speed

Models trained with a combination that included WAMAX outperformed the other models, revealing that WSMAX rather than WS contributed the most to the model accuracy, highlighting the importance of WSMAX in constructing a reliable DTM. The result is reasonable, as the fastest wind speed acts as the primary driving force that allows wind shear stress to overcome snow cohesion, bonding and frictional resistance (He and Ohara, 2017). Wind transport of snow can be initiated once the fastest wind speed exceeds the threshold wind speed, and the blowing snow process can then be sustained by relatively low wind speeds. In other words, the fastest wind speed and the mean wind speed control the occurrence and persistence of blowing snow events, respectively. Generally, model accuracy improved as more predictor variables were used. However, strongly correlated feature variables might slightly affect the model accuracy; this was evident when WS was added to WSMAX in snowfall conditions. Overall, this comparison indicates the superiority of DTM as a means of blowing snow identification, which is achieved by making full use of all available feature variables.

The model accuracy analyzed above presents the overall performance of models in identifying the occurrence of blowing snow. Synthetic scores displayed in Fig. 3d–f show that the attribute combinations including WSMAX generally achieved a higher ranking. Model scores in either wet snow cover or dry snow cover further indicated the need for assimilating more attribute information to improve the classification accuracy of models, particularly in wet snow conditions. These results are closely consistent with the mean overall accuracy results discussed above, implying that they are representative of the mean overall accuracy. Moreover, synthetic scores could reveal further information that the averaged accuracy cannot clearly illustrate. For instance, except for the models trained with only T or RH, there were no notable differences in mean accuracy among the various combinations. However, a substantial divergence of synthetic scores was seen among different combinations. In snowfall conditions, WSMAX scored highest at most of the stations, while the combination of WS, WSMAX and T ranked highest in the other two conditions. When the land surface was covered by dry snow, the model scores were greatly increased with the inclusion of T, demonstrating the key role of T in influencing the blowing snow occurrences. The contribution of T was more important to the model accuracy than WS or WAMAX, as the blowing snow occurrence regime of dry snow is more sensitive to temperature variation than to shear stress (Li and Pomeroy, 1997b). The scores shown in Fig. 3g–i are the integrated scores across stations. One noticeable distinction between models trained with and without RH inclusion was in the optimal attribute combination. When RH was included in the feature variables, the models trained with the combination of WS, WSMAX, T and RH yielded the highest score across all conditions, while the optimal combination varied with conditions when RH was unavailable. This comparison suggested that redundant information in the combinations of WS, WSMAX and T might slightly weaken the efficiency of the DTM and further demonstrated the importance of WSMAX in constructing an accurate and efficient DTM.

Finally, models were trained with only WSMAX in snowfall conditions, while the combination of WS, WSMAX, T and RH (if available) was used for further analysis in snow-covered conditions.

3.3 Validation of the SSDTMs and SIDTM

In this section, validations were conducted to assess the predictive performance of DTMs. Similarly to the sensitivity tests conducted above, 20 DTMs were trained at each station, based on randomly selected datasets comprising 80 % of the sample observations. The optimal combinations of feature variables determined in Sect. 3.2 were used. Each DTM was evaluated by the remaining 20 % observations at the corresponding training station and by all available observations from the other stations. The divergence between the SSDTMs and SIDTM was compared for assessing the potential of the DTM to be scaled to multiple stations across regions. There were 660 ( $11 \times 20 \times 3$ ) and 240 ( $4 \times 20 \times 3$ ) DTMs in total, trained with and without RH, respectively. The mean OA, FAR, POD and HSS of the 20 random sampling tests are compared and analyzed in this section.

https://hess.copernicus.org/articles/25/3783/2021/hess-25-3783-2021-f04

Figure 4The overall accuracy (OA) of the SIDTM and SSDTMs trained without (a–c) and with (d–f) RH, and their difference (g–i) in SF, No_SF_DRY and No_SF_WET conditions, respectively. The x axis represents the SIDTM and SSDTMs constructed based on 80 % of corresponding observations, and the y axis represents the validation stations.

Download

https://hess.copernicus.org/articles/25/3783/2021/hess-25-3783-2021-f05

Figure 5The same as Fig. 4 but for the probability of detection (POD).

Download

According to the results in Fig. 4a–c, either the SSDTMs or the SIDIM exhibited high overall accuracy throughout the range of conditions (from 0.66 to 0.96 and 0.72 to 0.96 for models trained with and without RH considered, respectively). However, there were differences among stations and conditions. The DTMs trained and evaluated with observations in wet snow conditions showed the best predictive performance among three conditions except for the evaluations conducted at Fmor station. Overall classification accuracy at stations where SSDIMs were trained (those lying in the diagonal rising up to the right) was not always higher than the accuracy evaluated at other stations, demonstrating the high ability of DTMs in accurately capturing the blowing snow occurrences outside the training range (both temporal and spatial). When comparing the overall accuracy between models trained with and without RH at those stations with RH observations available (Fig. 4d–i), the inclusion of RH is seen to increase the performance of DTMs to a certain extent.

https://hess.copernicus.org/articles/25/3783/2021/hess-25-3783-2021-f06

Figure 6The relative probability and cumulative probability of blowing snow occurrences, plotted with the maximum wind speed. The histogram is the relative probability and the orange solid curve is the cumulative probability, respectively. The orange dash line is the 50 % cumulative probability.

Download

Although OA represents the overall classification skill of the models, POD is an important metric characterizing the models' ability to detect blowing snow events. As shown in Fig. 5, the DTMs constructed under snowfall conditions were generally more accurate in detecting blowing snow occurrences than the models established under the other two conditions, and DTMs exhibited the lowest detective capacity under wet snow conditions. However, there were clear differences for between both the SSDTMs and SIDTM in detecting the blowing snow events that occurred at different stations. At Fmor, Fcmb and Fhue stations, all the DTMs showed consistently high skill in accurately identifying the true blowing snow events (the POD values were above 0.82), even higher than identifying the blowing snow events that occurred at the stations used for training the DTMs. Taking the SSDTM trained at Fgie station as an example, 98 %, 97 % and 96 % of blowing snow events were accurately detected at Fmor, Fcmb and Fhue stations, dropping to 45 % at Fgie station (Fig. 5a). In snowfall conditions, the POD values dropped sharply when DTMs were evaluated at Fgie and Fcel stations (only up to 45 % and 68 % of blowing snow events occurring at Fgie and Fcel stations were accurately identified, respectively). Although these differences narrowed dramatically in snow-covered conditions (Fig. 5b and c), the capabilities of both the SSDTMs and SIDTM remain relatively low in detecting the blowing snow events that occurred at Fgie station; the performance deteriorated further when the surface was covered by wet snow (only up to 29 % of blowing snow events were detected). The low POD values corresponding to the high miss rate indicate that blowing snow events occurring at Fgie station were seriously underestimated by both the SSDIMs and the SIDIM. The differences in POD shown in Fig. 5g–i illustrate that the detective ability can be improved when RH serves as a feature variable to train the DTM, as was particularly noticeable in snow-covered conditions (with maximum increases of 24 % and 18 % for dry snow cover and wet snow cover, respectively). For snowfall conditions, however, the improvement was more limited and was only achieved in 5 of 13 tests (Fig. 5g).

https://hess.copernicus.org/articles/25/3783/2021/hess-25-3783-2021-f07

Figure 7The POD (a), FAR (b), MR (c), OA (d) and HSS (e) of SIDTM and SSDTMs assessed at Fsal station, where the maximum wind speed was relatively low (similar to Fgie, Fcel and Fber stations).

Download

The relatively low PODs at Fgie, Fcel and Fber stations under snowfall conditions reflect the significantly lower maximum wind speeds at these stations when compared with other stations. The relative probability and cumulative probability of maximum wind speed at each station, with concurrent blowing snow (Fig. 6), indicate that about 50% of the blowing snow events occurred at Fgie, Fcel and Fber stations occurred when maximum wind speed was below 10 m s⁻¹ (Fig. 6g–i), demonstrating that most of the blowing snow events at these three stations were initiated by relatively low maximum wind speed; this was much lower than that at other stations (Fig. 6a–f). DTMs trained with higher maximum wind speeds generally choose a larger threshold WSMAX for the occurrence of blowing snow, thus resulting in an underestimation of blowing snow events when the models were applied to stations with low maximum wind speed. To verify this speculation, DTMs were assessed at Fsal station, since relatively low maximum wind speeds were also reported at this station. As expected, all the DTMs significantly underestimated the frequency of blowing snow events at this station (Fig. 7). However, in snow cover scenarios, and except for Fgie and Fsal stations, the distribution of maximum wind speeds was broadly consistent among the stations (figure not shown), and thus the differences between the models evaluated at these stations were small (Fig. 7).

The FAR values evaluated at each station were compared in Fig. 8. In general, FAR was slightly lower when falling snow was detected than that in other conditions. For example, except for the model trained at Fgie station, the FAR values for both the SSDTMs and SIDTM evaluated at Fmor, Fhue and Fbon stations were below or equal to 10 %, accompanied by high OA and POD values, indicating a good retrieval performance in estimating blowing snow occurrence. However, the blowing snow events under wet snow conditions were more likely to be falsely identified. One obvious distinction in POD values between the SIDTM and SSDTMs trained with RH was that the SIDTM showed a robust improvement in reducing the probability of false detection whatever the circumstances, while its effectiveness for SSDTMs varied between stations and conditions.

https://hess.copernicus.org/articles/25/3783/2021/hess-25-3783-2021-f08

Figure 8The same as Fig. 4 but for the FAR.

Download

Although blowing snow events frequently occur in the study region, blowing snow is still a rather rare weather phenomenon. Analysis of OA, POD and FAR demonstrated the great ability of the DTMs in accurately identifying the occurrence of blowing snow; however, when considering the impact of the imbalanced dataset, the HSS index, which is particularly suitable for the evaluation of forecast skill for rare events (Doswell et al., 1990), should be analyzed. Despite the considerable fluctuations in HSS across stations and conditions as shown in Fig. 9, the generally high HSS indicated that the DTMs showed promising agreement with in situ blowing snow measurements (except for Fgie station, where HSS values less than 0.4 were frequently observed). Generally, the consistency between the actual blowing snow events and estimates using the DTM was improved when RH was included in the DTM training. In conclusion, even though the datasets inevitably suffer from imbalanced observations, both the constructed SIDTM and SSDTMs are promising in detecting blowing snow occurrence with considerable accuracy.

https://hess.copernicus.org/articles/25/3783/2021/hess-25-3783-2021-f09

Figure 9The same as Fig. 4 but for the HSS.

Download

3.4 Spatiotemporal transferability assessments

Spatial and temporal transferability refers to how applicable a classifier model is across broad spatial and temporal scales. Due to the temporal independence of the randomly chosen 80 % of samples, the accuracy of the temporal prediction of models can be assessed based on the remaining set. Furthermore, the spatially independent datasets from other validation stations can be used to assess the accuracy of spatial prediction. As mentioned in the previous analysis, the constructed DTMs performed commendably in temporal extrapolation and are very much applicable to the estimation of blowing snow occurrence at other stations outside the training range. In this section, sensitivity tests were conducted to further evaluate and explore the spatiotemporal transferability of the DTMs.

As shown in the assessment of SSDTM across different stations, all the SSDTMs showed consistently accurate performance in estimating the occurrence of blowing snow events at the station the SSDTM trained, demonstrating the high capacity of models in the temporal prediction of blowing snow events. Meanwhile, the model accuracy obtained at each validation station was comparable to, or even better than, that assessed at the respective station where each model was trained at. For example, the model trained at Fcmb station yielded an overall classification accuracy of 0.82 when evaluated at Fcmb station (Fig. 4a), but the model achieved a markedly higher accuracy when applied at Fmor (0.89), Fhue (0.92) and Fmon (0.92) stations. Notably, this pattern was more pronounced in HSS (Fig. 9).

https://hess.copernicus.org/articles/25/3783/2021/hess-25-3783-2021-f10

Figure 10The assessment indicators of DTMs trained based on observations from five stations with a homogeneous distribution (a–e), inhomogeneous distribution (f–j) of the feature variable, and the DTM trained based on observations from all the stations.

Download

To explore the accuracy of temporal prediction further, two datasets were created and used in two sensitivity experiments (Test 3 listed in Table 2). Both datasets included observations from five stations, and the major difference between them was whether the differences in the distribution of feature variables across stations were significant. One dataset was composed of observations from Fmor, Fcmb, Fmon, Fsal and Fcel stations (Mix), with substantial differences in the distributions of feature variables between the first three stations and the last two stations, particularly the WSMAX. The other dataset contained a more homogeneous distribution of observations compared with the Mix (Homo; observations from Fmor, Fcmb, Fmon, Fbon and Fche were included). When models were trained with one of the datasets, observations from the stations which were not included in the training dataset were used to assess its predictive performance. The results of these two sensitivity tests will also be compared with the performance of models constructed using 80 % of observations from all stations to explore the effect of different numbers of training stations on the model accuracy.

As shown in Fig. 10, the model trained with either the Mix (Fig. 10i) or Homo (Fig. 10d) dataset presented similar OA to the SIDTM (Fig. 8n) when evaluated at the same stations. The accuracies of these models were comparable to that of the SIDTM and even to models constructed based only on observations from an individual station. Moreover, the source of the training samples had a minimal impact on the model performance, as shown in the assessment conducted at Fhue station (comparing the first and second rows of Fig. 10). These results indicated that the DTM shows robust spatial transferability and, importantly, was independent of the source of the training data and the number of stations used. When compared at Fber, Fgie, Fcel and Fsal stations, the model trained with observations from all stations (third row) outperformed the model trained with the Homo dataset (second row) in terms of the POD and HSS indices; improvement was also noted in the MR. However, poorer performance was noted at Fhue, Fbon and Fche stations when the model was trained with observations from all stations instead of the Homo dataset, as evident form the lower POD and HSS and larger MR (particularly in dry snow cover conditions). This divergence may partly be explained by the larger threshold maximum wind speed for the occurrence of blowing snow in models trained with observations from all stations, which prevents the identification of blowing snow events at stations with relatively high maximum wind speed (see Fhue station in Fig. 11: the frequency of blowing events follows WSMAX). Meanwhile, blowing snow is more likely to be predicted at stations with relatively low maximum wind speeds (i.e., the Fgie and Fber stations) due to the lower threshold maximum wind speed. The occurrence of blowing snow events was underestimated at the relatively low maximum wind speed stations (e.g., Fgie station). The accuracy of estimating blowing snow events increased as the threshold WSMAX decreased.

https://hess.copernicus.org/articles/25/3783/2021/hess-25-3783-2021-f11

Figure 11The same as Fig. 6 but only for the DTM constructed based on observations from all stations (All) and observations from five stations with a heterogeneous distribution (Mix) and homogeneous distribution (Homo) of feature variables, respectively. DTMs were assessed at stations with relatively high maximum wind speed (Fhue) and with relatively low maximum wind speed (Fgie and Fber).

Download

The DTMs were established on the principle of maximum inclusiveness, enabling their applicability across broad spatial and temporal scales by synthesizing all the features in the available observations. However, conventional algorithms are often biased towards the majority class without considering the data distribution. When there is a need to handle heterogeneous data from various sources, the model must seek a good compromise between accuracy, efficiency and a good fit to all the training samples; therefore, the probability of compromise increases as data heterogeneity increases. This is an important issue to be aware of when using DTMs. In this case, data preprocessing (e.g., the scenario classification method used in this study) is an important and effective step in reducing the heterogeneity of data and improving the effectiveness of the model.

4 Discussion

4.1 Influence of training data on the DTM

At stations where a low probability of blowing snow detection was observed (i.e., Fber and Fgie: Fig. 5), a considerable improvement in the OA, POD, HSS and MR was achieved when the DTM was trained with the Mix dataset instead of the Homo dataset. For example, a 17.3 % increase in the POD, an increase in HSS from 0.59 to 0.64, and a decrease in MR (−0.09) were achieved at Fber station under dry snow cover conditions. Distinct difference was noted at Fber, Fgie, Fcel and Fsal between models trained with or without heterogeneous training samples. However, the source of the training dataset and the distribution characteristics of predictor variables exerted only a slight impact on the model performance at other stations (e.g., Fhue). In general, using training data with heterogeneous information can effectively improve the classification accuracy while promoting the spatiotemporal transferability of the DTM.

Models did not show consistent improvement in estimating blowing snow occurrence at all stations when trained with 80 % of all observations. For example, as mentioned earlier, the model showed a marked decrease in POD and increase in MR in dry snow cover conditions at Fhue, Fche and Fbon stations, revealing that heterogeneity of the training data is a key factor in influencing the performance of the DTM. Meanwhile, the results also highlighted the importance of a reliable and impartial training dataset; strongly imbalanced datasets should be avoided.

4.2 Influence of meteorological and environmental factors on the DTM

The occurrence of blowing snow events depends largely on a combination of meteorological and environmental factors, such as wind speed, wind direction, air temperature, topography and vegetation. Wind speed serves as the driving force for the initiation of blowing snow. As it is difficult to determine the occurrence of blowing snow on a deterministic, physical basis (Li and Pomeroy, 1997b), wind speed has been widely used in empirical formulae as a critical indicative parameter to simulate the wind transport of snow (Schmidt, 1980; Pomeroy and Gray, 1990). A DTM is a black-box model, in which it is difficult to envisage how the different components operate and interact. However, sensitivity tests have demonstrated that the maximum wind speed played a decisive role in the model predictions. The threshold wind speed for snow transport refers to the minimum wind speed required to initiate the saltation of snow. Once the fastest wind speed exceeds the threshold wind speed, the wind shear stress overcomes the snow cohesion, bonding and frictional resistance, thereby initiating a blowing snow event. The blowing snow process can then be sustained by a relatively low wind speed. Thus, the maximum wind speed (instead of the mean wind speed) contributes most strongly to the classification accuracy of the DTM.

Air temperature, one of the most critical parameters affecting the microstructural structure and internal physical properties of the snowpack, is associated with snow cohesive resistance. Cohesive resistance increases considerably when snow becomes wet, as water increases the cohesive bonding force between particles. This leads to a sharp contrast of the threshold wind speed for snow transport between wet snow and dry snow. As demonstrated by Li and Pomeroy (1997b), condensation and crystal growth occur in the snowpack when the saturation vapor pressure is low, leading to a gradual increase in snow particle bonding resistance and lowering the probability of blowing snow occurrence. Relative humidity, the ratio of vapor pressure and saturation vapor pressure, therefore has important implications for snow aging processes (i.e., metamorphism, despite Armstrong and Brun (2008) reporting that snow metamorphism in alpine snowpack is mainly driven by temperature gradient). Wind direction is also an important factor influencing blowing snow events and is closely associated with topography and wind speed (Roebber et al., 2003). Wind speed can vary considerably with wind direction, as site-specific topography may preclude blowing snow under certain wind directions at particular stations. Preliminary studies have suggested an improvement in blowing snow estimates when taking into consideration the wind direction (Baggaley and Hanesiak, 2005; Vionnet et al., 2018). However, for the consideration of the highly site-specific wind speed and large variations of prevailing wind speed across stations, the impact of wind direction is not considered in this study in constructing the DTMs. Generally, snow is eroded from wind-exposed surfaces (e.g., flat surfaces, hilltops, windward slopes, and sparsely vegetated surfaces) and deposited in wind-sheltered areas such as densely vegetated surfaces and topographic depressions (Li and Pomeroy 1997a; Liston and Sturm, 1998; Xie et al., 2019). The topography is also very site-specific, and quantifying its potential impacts on blowing snow occurrence is challenging. Thus, the temporal transferability of the DTM is likely to drop sharply, and more widespread adoption of the DTM will be hindered once the wind direction and topography are used as feature variables. Vegetation can be effectively quantified by LAI or NDVI; however, the sparse stations in the study region limit its usage here.

4.3 Potential sources of error in the DTM

The FlowCapt sensor is sensitive to soil particles, resulting in false alarms for blowing snow events (Vionnet et al., 2018). Therefore, one of the greatest uncertainties is attributed to unreliable blowing snow events recorded by the FlowCapt sensors. Although records corresponding to detected blowing snow events with an absence of simultaneous snow cover and snowfall were removed, unreliable blowing snow events may still exist in the dataset with strict quality control applied. Internal defects of the FlowCapt sensor (in terms of hardware and numerical processing) are another important source of uncertainty (Trouvilliez et al., 2015), although the suitability of the instrument in measuring blowing snow has been evaluated and results have demonstrated its reliability in blowing snow studies (Chritin et al., 1999; Cierco et al., 2007; Das et al., 2012; Trouvilliez et al., 2015). Nevertheless, the rate of snow transported recorded by the FlowCapt can be underestimated (Trouvilliez et al., 2015). Therefore, as the occurrence of blowing snow events was determined based only on the FlowCapt measurements, inevitable uncertainties exist in this study. To minimize the underestimation of blowing snow measured by the FlowCapt, all records with blowing snow fluxes exceeding 0 were classified as blowing snow events in this study. This is different from Trouvilliez et al. (2015), who used a higher (non-zero) threshold value to remove non-significant blowing snow occurrence when processing FlowCapt measurements.

The problem of strongly skewed data distribution is rather common in real-world applications and introduces unique challenges when training machine-learning models. The term “imbalanced data” typically refers to the problem where the number of different classes of data is not equally distributed. In this study, for example, the blowing snow events are generally outnumbered by the non-blowing snow events. Learning from imbalanced data has been the subject of many papers, workshops, special sessions and dissertations. However, there is no definite solution. In practice, data imbalance is addressed by a number of methods: using ensemble cross-validation to justify the model robustness, undersampling the majority class or oversampling the minority class (Zhou and Liu, 2006), or assigning different weights to balance the ratio for each category (Jo and Japkowicz, 2004).

https://hess.copernicus.org/articles/25/3783/2021/hess-25-3783-2021-f12

Figure 12The POD (a), FAR (b), OA (c) and HSS (d) of indirect methods using constant threshold wind speed (7.7 m s⁻¹ for dry snow transport and 9.9 m s⁻¹ for wet snow transport, abbreviated as Constant_dry and Constant_wet, respectively), and dynamic threshold adapts to the evolution of air temperature (abbreviating as Ut(10), Ut(10)_wet and Ut(10)_dry shares the same expression but used to detect wet snow and dry snow transport, respectively). Constant_overall and Ut(10)_overall are the synthetical metrics of wet snow and dry snow conditions.

Download

4.4 Comparison with other indirect methods

It was extremely difficult to distinguish unreliable records in the quality-controlled data, and it is inevitable that the use of these records in the construction of the DTMs negatively affects the model's skill in detecting blowing snow events. Despite these shortcomings, the calculated accuracies for both the SIDTM and SSDTMS were superior to (and in most cases much better than) the empirical parameterization schemes using (i) constant threshold wind speed (7.7 m s⁻¹ for dry snow transport and 9.9 m s⁻¹ for wet snow transport, abbreviated as Constant_dry and Constant_wet, respectively; Li and Pomeroy, 1997a), and (ii) the dynamic threshold adapts to the evolution of air temperature (abbreviating as Ut(10), Ut(10)_wet and Ut(10)_dry shares the same expression but used to detect wet snow and dry snow transport, respectively; Li and Pomeroy, 1997a). As shown in Fig. 12, the dynamic threshold wind speed outperforms overwhelmingly the constant threshold wind speed in detecting blowing snow occurrence, while the former's performance evaluation metrics (except for the FAR) characterizing the ability and efficiency in blowing snow detection are inferior to the DTMs obviously (corresponding metrics are shown in Figs. 4, 5, 8 and 9, respectively), particularly the POD and HSS. A larger FAR achieved by DTMs indicates a high probability of false blowing snow detection by the data-driven model. When compared to the results from the S2M-Sytron (Vionnet et al., 2018), an avalanche hazard forecast model driven by high spatial and temporal resolution meteorological forcing data downscaled by SAFRAN (Durand et al., 2009), the values for POD and HSS in this study were similar to, or better than, the skill reported for numerical simulations using the S2M-Sytron forced with downscaled SAFRAN input. However, due to the limited capacity of DTMs to detect real blowing snow occurrences, particularly under snow-covered conditions, large differences exist when compared to R2 (an S2M-Sytron simulation using an updated parameterization for falling snow properties) and R3 (uses the observed 10 m wind speed and direction based on R2). The wide gap between the data-driven model and the physical constraints model highlights the need to conduct further experimental investigations and analyses to illustrate the limitations of the DTM in detecting blowing snow occurrence. This could shed more light on future developments.

We note that only those blowing snow events with a snow flux exceeding a threshold of 1 g m⁻² s⁻¹ were analyzed and that if this threshold value is applied in this study, the ability of the DTM to accurately detect blowing snow occurrence is projected to increase. In summary, a simple DTM constructed from conventional meteorological observations is therefore shown to be capable of detecting blowing snow events with a skill superior to the commonly used empirical parameterizations. However, there is still considerable room for improvement when compared with numerical models containing detailed representations of physical processes, but handling the data imbalance issue appropriately and minimizing the potential uncertainty resulting from blowing soil particles are a top priority.

4.5 Possible future directions

Machine-learning techniques can be a suitable way to reduce the process complexity and computational cost of traditional physically based blowing snow models. The complex interactive processes of ambient atmospheric conditions make a proper representation of blowing snow processes in the conventional blowing snow model challenging. A major difference between the process-based algorithms and machine-learning-based models is that the model structure for the former is based on underlying physical principles, whereas a machine-learning-based model is completely data-driven. Although machine-learning models have outperformed simple statistical models and have been widely used in the Earth sciences, they tend to violate physical law constraints, leading to unrealistic predictions (Zhao et al., 2019). Physically based models, although more complicated, tend to have superior interpretability (Pan et al., 2020). Complementing a physically based model and machine-learning-based model is a feasible way for further development. Integrating these two types of models with divergent model structures and introducing physical constraints on machine-learning-based models are two popular strategies (Pal and Sharma, 2021). One common way to integrate the constructed DTMs with blowing snow models in detecting blowing snow occurrence is to ensure the application of DTM only when the climatology shares a similar distribution property to the training data. In this case, a huge amount of training data with different distribution properties is an essential prerequisite for the performance of DTMs applied over a large area; otherwise, in case of rare events, the detection of blowing snow occurrence relies on the physically based blowing snow model. On the other hand, using blowing snow model outputs to pose physical constraints on the DTMs makes the hybrid model capable of learning the nonlinear relations from the training data while obeying the physical laws.

5 Conclusions

The accurate classification of blowing snow events is important in numerical models which include blowing snow, as it determines whether or not to invoke the parameterizations for sublimation and transport associated with blowing snow processes in the model (Lenaerts et al., 2012b; Xie et al., 2019). This paper described the construction and evaluation of the machine-learning-based DTM in detecting blowing snow occurrence in the European Alps. Here, DTMs were trained with routine meteorological observations (WS, WSMAX, T and RH).

An optimal ratio of 0.8 between the training subset and validation subset was chosen here when accuracy, efficiency and reliability of the DTM were taken into consideration. In snowfall conditions, SSDTMs and the SIDTM were trained based on WSMAX and RH (at stations where RH observations are available), while in snow-covered conditions, the models were trained with WS, WSMAX, T and RH (at stations where RH observations are available). Twenty repetitions of a random sub-sampling validation test showed that the maximum wind speed contributes the most to the classification accuracy of the DTMs, and models constructed using additional characteristic attributes achieved higher classification accuracy for blowing snow event detection. Both the SSDTMs and SIDTM showed strong capabilities for accurately detecting blowing snow; however, notable variations were seen between stations and conditions. The actual blowing snow events occurring in snowfall conditions were detected accurately at all stations except for Fber, Fgie and Fcel. However, in non-snowfall conditions, ambient meteorological conditions exerted complex, nonlinear impacts on the properties and structures of snow particles, hindering the accurate detection of blowing snow occurrence. The relatively low PODs at Fgie, Fcel and Fber were attributed to the significantly lower maximum wind speed than that at other stations, which cannot be well captured by the DTM. Overall, 73 % and 69 % of blowing snow events occurring under snowfall conditions and dry snow cover conditions were accurately detected by the SIDTM, but this proportion dropped to 41 % for wet snow surfaces.

The constructed DTMs demonstrated good performance in temporal extrapolation and were also able to accurately detect blowing snow occurrence at stations outside the training range. The spatial transferability is likely to decline when models are trained with strongly heterogeneous feature variables. Therefore, in some cases, a few representative predictor variables should be selected, and data preprocessing (e.g., the scenario classification method used in this study) should be applied to reduce the heterogeneity of the dataset and improve the effectiveness of the DTM. In summary, both the SSDTMs and SIDTM are useful tools in detecting the occurrence of blowing snow events and achieve acceptable accuracy in terms of their spatiotemporal predictions.

Progress towards the accurate estimation of blowing snow events at local scales relies largely on physically based blowing snow models driven by high-resolution meteorological inputs that include a detailed representation of the effects of ambient atmospheric conditions on the initiation and persistence of blowing snow processes. However, using such models can be challenging due to the high computation cost of such simulations and the difficulty in obtaining reliable field observations for the required input. The DTM, constructed from limited available observations, may provide a useful alternative method. Therefore, DTMs can facilitate research into blowing snow in data-scarce areas such as the Tibetan Plateau, where 10 FlowCapt instruments have been set up and are currently in operation.

Data availability

Observation data from the ISAW stations can be accessed at http://www.iav.ch (last access: 12 December 2020) (IAV, 2020).

Code availability

The code to construct a decision tree model in detecting the occurrence of blowing snow is available from GitHub (https://github.com/zpxie-cas/DTM, last access: 12 December 2020) (Github, 2020).

Author contributions

ZX, WM, YM and ZH were responsible for the conceptualization. ZX, GS, YH, WH, RS and YF were responsible for data searching and processing. ZX performed the formal analysis and prepared the manuscript with contributions from all the co-authors.

Competing interests

The authors declare that they have no conflict of interest.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Acknowledgements

We would like to express our special appreciation and thanks to the scikit-learn community and two anonymous reviewers for providing many constructive comments.

Review statement

This paper was edited by Xing Yuan and reviewed by two anonymous referees.

Financial support

This research has been supported by the National Natural Science Foundation of China (grant no. 41905012), the Second Tibetan Plateau Scientific Expedition and Research (STEP) program (grant no. 2019QZKK0103), and the China Postdoctoral Science Foundation (grant no. 2018M641489).

References

Armstrong, R. L. and Brun, E.: Snow and climate: physical processes, surface energy exchange and modeling, Cambridge University Press, Cambridge, 2008.

Baggaley, D. G. and Hanesiak, J. M.: An empirical blowing snow forecast technique for the Canadian arctic and the Prairie provinces, Weather Forecast., 20, 51–62, https://doi.org/10.1175/waf-833.1, 2005.

Bromwich, D. H.: Snowfall in High Southern Latitudes, Rev. Geophys., 26, 149–168, 1988.

Budd, W., Dingle, W., and Radok, U.: The Byrd snow drift project: outline and basic results, Stud. Antarct. Meteorol., 9, 71–134, 1966.

Chritin, V., Bolognesi, R., and Gubler, H.: FlowCapt: a new acoustic sensor to measure snowdrift and wind velocity for avalanche forecasting, Cold Reg. Sci. Technol., 30, 125–133, 1999.

Cierco, F.-X., Naaim-Bouvet, F., and Bellot, H.: Acoustic sensors for snowdrift measurements: How should they be used for research purposes?, Cold Reg. Sci. Technol., 49, 74–87, https://doi.org/10.1016/j.coldregions.2007.01.002, 2007.

Das, R. K., Datt, P., and Acharya, A.: An assessment of the FlowCapt acoustic sensor for measuring snowdrift in the Indian Himalayas, J. Earth Syst. Sci., 121, 1483–1491, https://doi.org/10.1007/s12040-012-0234-2, 2012.

Déry, S. J. and Yau, M. K.: Large-scale mass balance effects of blowing snow and surface sublimation, J. Geophys. Res.-Atmos., 107, ACL 8-1–ACL 8-17, 2002.

Doswell, C. A., Davies-Jones, R., and Keller, D. L.: On summary measures of skill in rare event forecasting based on contingency tables, Weather Forecast., 5, 576–585, 1990.

Durand, Y., Laternser, M., Giraud, G., Etchevers, P., Lesaffre, B., and Merindol, L.: Reanalysis of 44 Yr of Climate in the French Alps (1958–2002): Methodology, Model Validation, Climatology, and Trends for Air Temperature and Precipitation, J. Appl. Meteorol. Clim., 48, 429–449, 2009.

Essery, R. and Pomeroy, J.: Vegetation and topographic control of wind-blown snow distributions in distributed and aggregated simulations for an Arctic tundra basin, J. Hydrometeorol., 5, 735–744, 2004.

Gallée, H., Guyomarc'h, G., and Brun, E.: Impact of snow drift on the Antarctic ice sheet surface mass balance: possible sensitivity to snow-surface properties, Bound.-Lay. Meteorol., 99, 1–19, 2001.

Gallée, H., Trouvilliez, A., Agosta, C., Genthon, C., Favier, V., and Naaim-Bouvet, F.: Transport of snow by the wind: A comparison between observations in Adélie Land, Antarctica, and simulations made with the regional climate model MAR, Bound.-Lay. Meteorol., 146, 133–147, 2013.

Gharaei-Manesh, S., Fathzadeh, A., and Taghizadeh-Mehrjardi, R.: Comparison of artificial neural network and decision tree models in estimating spatial distribution of snow depth in a semi-arid region of Iran, Cold Reg. Sci. Technol., 122, 26–35, 2016.

Gossart, A., Souverijns, N., Gorodetskaya, I. V., Lhermitte, S., Lenaerts, J. T. M., Schween, J. H., Mangold, A., Laffineur, Q., and van Lipzig, N. P. M.: Blowing snow detection from ground-based ceilometers: application to East Antarctica, The Cryosphere, 11, 2755–2772, https://doi.org/10.5194/tc-11-2755-2017, 2017.

Github: zpxie-cas/DTM, available at: https://github.com/zpxie-cas/DTM, last access: 12 December 2020.

Guyomarc'h, G. and Mérindol, L.: Validation of an application for forecasting blowing snow, Ann. Glaciol., 26, 138–143, 1998.

Guyomarc'h, G., Bellot, H., Vionnet, V., Naaim-Bouvet, F., Déliot, Y., Fontaine, F., Puglièse, P., Nishimura, K., Durand, Y., and Naaim, M.: A meteorological and blowing snow data set (2000–2016) from a high-elevation alpine site (Col du Lac Blanc, France, 2720 m a.s.l.), Earth Syst. Sci. Data, 11, 57–69, https://doi.org/10.5194/essd-11-57-2019, 2019.

Hall, D. K. and Riggs, G. A.: MODIS/Aqua Snow Cover Daily L3 Global 500 m SIN Grid, Version 61, NASA National Snow and Ice Data Center Distributed Active Archive Center, Boulder, Colorado, USA, https://doi.org/10.5067/MODIS/MYD10A1.061, 2021.

He, S. W. and Ohara, N.: A New Formula for Estimating the Threshold Wind Speed for Snow Movement, J. Adv. Model. Earth Syst., 9, 2514–2525, 2017.

Huang, Q., Hanesiak, J., Savelyev, S., Papakyriakou, T., and Taylor, P. A.: Visibility during blowing snow events over Arctic sea ice, Weather Forecast., 23, 741–751, 2008.

IAV: Acoustics & vibration and signal processing engineering, available at: http://www.iav.ch, last access: 12 December 2020.

Jo, T. and Japkowicz, N.: Class imbalances versus small disjuncts, ACM Sigkdd Explorations Newsletter, 6, 40–49, 2004.

Lehning, M., and Fierz, C.: Assessment of snow transport in avalanche terrain, Cold Reg. Sci. Technol., 51, 240–252, 2008.

Lehning, M., Doorschot, J., and Bartelt, P.: A snowdrift index based on SNOWPACK model calculations, Ann. Glaciol., 31, 382–386, 2000.

Lenaerts, J. T. M., van den Broeke, M. R., van Angelen, J. H., van Meijgaard, E., and Déry, S. J.: Drifting snow climate of the Greenland ice sheet: a study with a regional climate model, The Cryosphere, 6, 891–899, https://doi.org/10.5194/tc-6-891-2012, 2012a.

Lenaerts, J. T. M., van den Broeke, M. R., Dery, S. J., van Meijgaard, E., van de Berg, W. J., Palm, S. P., and Rodrigo, J. S.: Modeling drifting snow in Antarctica with a regional climate model: 1. Methods and model evaluation, J. Geophys. Res.-Atmos., 117, D05108, https://doi.org/10.1029/2011JD016145, 2012b.

Leonard, K. C. and Maksym, T.: The importance of wind-blown snow redistribution to snow accumulation on Bellingshausen Sea ice, Ann. Glaciol., 52, 271–278, 2011.

Li, L. and Pomeroy, J. W.: Estimates of threshold wind speeds for snow transport using meteorological data, J. Appl. Meteorol., 36, 205–213, 1997a.

Li, L. and Pomeroy, J. W.: Probability of occurrence of blowing snow, J. Geophys. Res.-Atmos., 102, 21955–21964, 1997b.

Liston, G. E.: Representing subgrid snow cover heterogeneities in regional and global models, J. Climate, 17, 1381–1397, 2004.

Liston, G. E. and Sturm, M.: A snow-transport model for complex terrain, J. Glaciol., 44, 498–516, 1998.

Liston, G. E., Haehnel, R. B., Sturm, M., Hiemstra, C. A., Berezovskaya, S., and Tabler, R. D: Instruments and methods simulating complex snow distributions in windy environments using SnowTran-3D, J. Glaciol., 53, 241–256, 2007.

Mahesh, A., Eager, R., Campbell, J. R., and Spinhirne, J. D.: Observations of blowing snow at the South Pole, J. Geophys. Res.-Atmos., 108, 4707, https://doi.org/10.1029/2002JD003327, 2003.

Mott, R., Vionnet, V., and Grünewald, T.: The Seasonal Snow Cover Dynamics: Review on Wind-Driven Coupling Processes, Front. Earth Sci., 6, 197, https://doi.org/10.3389/feart.2018.00197, 2018.

Nishimura, K. and Nemoto, M.: Blowing snow at Mizuho station, Antarctica, Philos. Trans. A, 363, 1647–1662, 2005.

Notarnicola, C., Duguay, M., Moelg, N., Schellenberger, T., Tetzlaff, A., Monsorno, R., Costa, A., Steurer, C., and Zebisch, M.: Snow Cover Maps from MODIS Images at 250 m Resolution, Part 2: Validation, Remote Sens., 5, 1568–1587, https://doi.org/10.3390/rs5041568, 2013.

Oleson, K., Lawrence, D., Bonan G., Drewniak B., Huang, M., Samuel Levis, K., Li, F., Riley, W., Subin, Z., Swenson, S., Thornton, P., Bozbiyik, A., Fisher, R., Heald, C., Kluzek, E., Lamarque, J., Lawrence, P., Leung, L., Lipscomb, W., Muszalg, S., Ricciuto, D., Sacks, W., Sun, Y., Tang, J., and Yang, Z.: Technical description of version 4.5 of the Community Land Model (CLM), NCAR Tech. Notes, NCAR/TN-478CSTR, NCAR, available at: http://www.cesm.ucar.edu/models/cesm1.2/ clm/CLM45_Tech_Note.pdf (last access: 12 December 2020), 2013.

Pal, S. and Sharma, P.: A Review of Machine Learning Applications in Land Surface Modeling, Earth, 2, 174–190, 2021.

Palm, S. P., Yang, Y. K., Spinhirne, J. D., and Marshak, A.: Satellite remote sensing of blowing snow properties over Antarctica, J. Geophys. Res.-Atmos., 116, D16123, https://doi.org/10.1029/2011JD015828, 2011.

Palm, S. P., Kayetha, V., and Yang, Y. K.: Toward a Satellite-Derived Climatology of Blowing Snow Over Antarctica, J. Geophys. Res.-Atmos., 123, 10282–10294, 2018.

Pan, S., Pan, N., Tian, H., Friedlingstein, P., Sitch, S., Shi, H., Arora, V. K., Haverd, V., Jain, A. K., Kato, E., Lienert, S., Lombardozzi, D., Nabel, J. E. M. S., Ottlé, C., Poulter, B., Zaehle, S., and Running, S. W.: Evaluation of global terrestrial evapotranspiration using state-of-the-art approaches in remote sensing, machine learning and land surface modeling, Hydrol. Earth Syst. Sci., 24, 1485–1509, https://doi.org/10.5194/hess-24-1485-2020, 2020.

Park, I. and Lee, S.: Spatial prediction of landslide susceptibility using a decision tree approach: a case study of the Pyeongchang area, Korea, Int. J. Remote Sens., 35, 6089–6112, 2014.

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.: Scikit-learn: Machine Learning in Python, J. Mach. Learn. Res., 12, 2825–2830, 2011.

Pomeroy, J. and Gray, D.: Snowcover–Accumulation, Relocation and Management, National Hydrology Research Institute Science Report No. 7, National Hydrology Research Institute, Saskatoon, Canada, 144 pp., 1995.

Pomeroy, J. W. and Gray, D. M.: Saltation of Snow, Water Resour. Res., 26, 1583–1594, 1990.

Ragettli, S., Zhou, J., Wang, H., Liu, C., and Guo, L.: Modeling flash floods in ungauged mountain catchments of China: A decision tree learning approach for parameter regionalization, J. Hydrol., 555, 330–346, 2017.

Roebber, P. J., Bruening, S. L., Schultz, D. M., and Cortinas, J. V.: Improving snowfall forecasting by diagnosing snow density, Weather Forecast., 18, 264–287, https://doi.org/10.1175/1520-0434(2003)018<0264:Isfbds>2.0.Co;2, 2003.

Sato, T., Kimura, T., Ishimaru, T., and Maruyama, T.: Field test of a new snow-particle counter (SPC) system, Ann. Glaciol., 18, 149–154, https://doi.org/10.3189/S0260305500011411, 1993.

Schmidt, R. A.: Threshold Wind-Speeds and Elastic Impact in Snow Transport, J. Glaciol., 26, 453–467, 1980.

Schmidt, R. A.: Estimates of Threshold Windspeed from Particle Sizes in Blowing Snow, Cold Reg. Sci. Technol., 4, 187–193, 1981.

Sexstone, G. A., Clow, D. W., Fassnacht, S. R., Liston, G. E., Hiemstra, C. A., Knowles, J. F., and Penn, C. A.: Snow Sublimation in Mountain Environments and Its Sensitivity to Forest Disturbance and Climate Warming, Water Resour. Res., 54, 1191–1211, 2018.

Sharma, S., Blagrave, K., Magnuson, J. J., O'Reilly, C. M., Oliver, S., Batt, R. D., Magee, M. R., Straile, D., Weyhenmeyer, G. A., and Winslow, L.: Widespread loss of lake ice around the Northern Hemisphere in a warming world, Nat. Clim. Change, 9, 227–231, 2019.

Trouvilliez, A., Naaim-Bouvet, F., Bellot, H., Genthon, C., and Gallée, H.: Evaluation of the FlowCapt Acoustic Sensor for the Aeolian Transport of Snow, J. Atmos. Ocean. Tech., 32, 1630–1641, 2015.

Vionnet, V., Guyomarc'h, G., Bouvet, F. N., Martin, E., Durand, Y., Bellot, H., Bel, C., and Pugliese, P.: Occurrence of blowing snow events at an alpine site over a 10-year period: Observations and modelling, Adv. Water Resour., 55, 53–63, 2013.

Vionnet, V., Guyomarc'h, G., Lafaysse, M., Naaim-Bouvet, F., Giraud, G., and Deliot, Y.: Operational implementation and evaluation of a blowing snow scheme for avalanche hazard forecasting, Cold Reg. Sci. Technol., 147, 1–10, 2018.

Xie, Z., Hu, Z., Ma, Y., Sun, G., Gu, L., Liu, S., Wang, Y., Zheng, H., and Ma, W.: Modeling blowing snow over the Tibetan Plateau with the Community Land Model: Method and preliminary evaluation, J. Geophys. Res.-Atmos., 124, 9332–9355, 2019.

Yang, C., Wu, G., Ding, K., Shi, T., Li, Q., and Wang, J.: Improving land use/land cover classification by integrating pixel unmixing and decision tree methods, Remote Sens., 9, 1222, https://doi.org/10.3390/rs9121222, 2017.

Zhao, W. L., Gentine, P., Reichstein, M., Zhang, Y., Zhou, S., Wen, Y., Lin, C., Li, X., and Qiu, G. Y.: Physics-constrained machine learning of evapotranspiration, Geophys. Res. Lett., 46, 14496–14507, 2019.

Zhou, Z. H. and Liu, X. Y.: Training cost-sensitive neural networks with methods addressing the class imbalance problem, IEEE T. Knowled. Data Eng., 18, 63–77, https://doi.org/10.1109/tkde.2006.17, 2006.

Articles

Short summary

Ground information on the occurrence of blowing snow has been sorely lacking because direct observations of blowing snow are sparse in time and space. In this paper, we investigated the potential capability of the decision tree model to detect blowing snow events in the European Alps. Trained with routine meteorological observations, the decision tree model can be used as an efficient tool to detect blowing snow occurrences across different regions requiring limited meteorological variables.