A gridded multi-site precipitation generator for complex terrain: An evaluation in the Austrian Alps
- 1Department of Atmospheric and Cryospheric Sciences, University of Innsbruck, Innsbruck, Austria
- 2Unit for Engineering Mathematics, University of Innsbruck, Innsbruck, Austria
- 1Department of Atmospheric and Cryospheric Sciences, University of Innsbruck, Innsbruck, Austria
- 2Unit for Engineering Mathematics, University of Innsbruck, Innsbruck, Austria
Abstract. For climate change impact assessment, many applications require very high-resolution data of precipitation consistent both in space and time for current as well as future climate. In this regard, stochastic weather generators are designed as a statisti- cal downscaling tool that can provide such data. Here, we adopt the framework of a precipitation generator of Kleiber et al. (2012), which is based on latent and transformed Gaussian processes, and propose an extension to that for a mountainous region with complex topography. The model is used to generate two-dimensional fields of precipitation with 1 km spatial and daily temporal resolution in a small region with highly complex terrain in the Austrian Alps. This study aims at evaluating the model for its ability to simulate realistic precipitation fields over the region using historical observations from a network of 29 meteorological stations as an input, discusses its added value over the original set-up and its limitations. Results show that the model generates realistic fields of precipitation with good spatial and temporal variability. The model is able to generate some of the difficult areal statistics useful for impact assessment such as areal dry and wet spells of different lengths and areal monthly mean of precipitation with great accuracy. The model also captures the inter-seasonal and intra-seasonal variability very well while the inter-annual variability is well captured in summer but largely underestimated in autumn and winter. The proposed model adds substantial value over the original modeling framework, specifically for the precipitation amount. The model is not able to reproduce realistic spatio-temporal characteristics of precipitation in autumn. We conclude that with further development, the model is a promising tool for downscaling precipitation in complex terrain for a wide range of applications in impact assessment studies.
- Preprint
(5231 KB) -
Supplement
(713 KB) - BibTeX
- EndNote
Hetal Dabhi et al.
Status: closed
- AC1: 'Comment on hess-2022-21', Hetal Dabhi, 24 Jan 2022
-
RC1: 'Comment on hess-2022-21', Anonymous Referee #1, 09 Feb 2022
This article suggests a novel GLM-based space-time rainfall generator for Alpine region. While the suggested model shows the limitations in reproduction of reality, I think this is a meaningful attempt in our field given the extreme challenging nature of the space-time field generation. The model is original, and the article is well structured. Therefore, I believe that the article is suitable for publication in this journal after a few revisions.
- I suggest authors to compare extreme values too. e.g. extreme values by the model of this study vs. AGPD, both point values and areal values. This is because one of the primary reasons of developing weather generators is to analyze disaster (from a probabilistic viewpoints).
- Please consider excluding trivial precipitation (e.g. less than 1mm) from your analysis to calculate occurrence-related values (Pww, Pdd, Pw, etc), and reanalyze your result. This is a common issue with all stochastic rainfall generator drawing rainfall depths from a modelled mathematical distribution. You may get better results.
- L181-184. I would like to see the map of the interpolated scale and shape parameter. Interpolating parameter, in many cases, causes problems. The map should look smooth and should show dependency to the terrain. In addition, I suggest authors to consider obtaining these parameter maps based on the AGPD data or the KED-based rainfall map from your point observations to exclude the process of spatial interpolation.
- Figure 7 and Figure 18. I would also like to see the shades of the observed precipitations, which may be significantly greater than the current blue shades. This is not because I want to criticize, but because I would like you clearly show and mention the challenges of the stochastic rainfall generators (underestimation of large-scale variability) and to suggest potential remedies. Park et al. (2019) and Kim et al. (2020) discusses this issue in detail.
- Figure 9. Why not show on the log-log axis? Too many small value pairs.
- Figure 12 and Figure 15 look like a collection of chessboards rather than a heat map. Would you remove the white squares?
- Figure 11, 12, and 13 (Figure 14, 15, and 16 too): No need to show all the months. Please consider squeezing into one figure showing simulation (1st row), AGPD (2nd row), and differences (3rd row) for 4 seasonal months (columns)
- L 300-302, Figure 17. I am not sure which of the two variables that the authors are precisely comparing. Would you let me know how, for example, correlation coefficients were derived (e.g. x and y values of the scatter plot)?
Park, J., Onof, C., & Kim, D. (2019). A hybrid stochastic rainfall model that reproduces some important rainfall characteristics at hourly to yearly timescales. Hydrology and Earth System Sciences, 23(2), 989-1014.
Kim, D., & Onof, C. (2020). A stochastic rainfall model that can reproduce important rainfall properties across the timescales from several minutes to a decade. Journal of Hydrology, 589, 125150
- AC2: 'Reply on RC1', Hetal Dabhi, 06 May 2022
-
RC2: 'Comment on hess-2022-21', Anonymous Referee #2, 24 Feb 2022
The paper entitled 'A gridded multi-site precipitation generator for complex terrain: An evaluation in the Austrian Alps' by Hetal Dabhi and co-authors describes an extension of the daily stochastic spatio-temporal precipitation generator of Kleiber et al. (2012) to mountainous areas with complex topography, and illustrates the proposed framework in the Austrian Alps based on a network of 29 meteorological stations.
The paper is well written and structured, and the topic is relevant for the journal Hydrology and Earth System Sciences. The selected case study seems appropriate to test the proposed precipitation generator in an area with complex topography.
Despite the above qualities, the manuscript also contains major shortcomings, which, in my opinion, must be corrected before the paper can be considered for publication:
(1) It is claimed that the paper provides an extension of the model of Kleiber et al. (2012) to areas with complex topography, but the proposed extensions (Kriging with external drift (KED) of model parameters and altitude dependent covariance function) are not very convincing. According to assessment results, these extensions do not offer significant improvements compared to the original model despite the addition of a lot of model complexity. In my opinion, the authors should therefore test the original model in their case study, and propose extensions only if the added complexity generates clear improvements in simulation results. If it is not the case, I would recommend to stick to the simplest possible model, and introduce the current paper as a case study testing the performance of the original model in the presence of complex topography, which I believe is already an interesting contribution.
(2) Almost no information is provided about the spatial interpolation of model parameters (using different versions of Kriging, in particular KED), which is however a critical step of the model set-up. I think that it is inevitable to give more details about the Kriging step, including: (i) mention if a nugget term is used, and if yes what is the nugget contribution to the total variance, (ii) mention which variogram model is used, which method is used to fit or infer variograms, and maybe show some examples of adjusted variograms, (iii) prove by some data analysis than model parameters are (linearly) dependent of altitude to justify the use of KED, and finally (iv) display maps of kriged model parameters.
(3) A cross-validation is missing to evaluate the performance of the interpolation of model parameters by Kriging, and to assess the density of stations required for model calibration.
(4) The proposed model fails to reproduce precipitation at one of the three stations selected for illustration (i.e. Prutz), and according to Fig 9 also performs poorly for Dresdner Hütte, Kühtai, Nauders and Sankt Leonhard 2 (i.e. 17% of stations in total), and the reasons why the model fails at these locations are not investigated in enough details. The only explanation given for Prutz is that this station exhibits 'precipitation characteristics quite distinct from those of neighbouring (or more distant) stations', but nothing is said about why a model calibrated at station locations is unable to reproduce precipitation at the exact same locations. For me it is very probable that the Kriging of model parameters introduces errors when clustered stations exhibit distinct statistics (which is the case for Prutz, that forms a cluster with Ried in Oberinntal and Fendels) - and possibly also where sharp gradients of precipitation occur (Nauders or St Leonhard im Pitztal 2) as well as at the edges of the study area (Marienberg, St Martin or Kühtai) - but one cannot see it in the present manuscript because the Kriging step is completely overlooked.
(5) Regarding the problem at Prutz station, I would be interested in seeing (in supplementary material for instance) the raw time series for Prutz, Ried in Oberinntal, Fendels and Ladis to be sure that the modeling problem does not simply originate from instrumental errors at Prutz station... the fact that this station has so much differences with Ried in Oberinntal which is located 1km apart, at the exact same altitude, and with similar neighboring topography is very surprising to me.
In addition to these major concerns, I also have some minor comments detailed hereafter. Line and figure numbers refer to: https://hess.copernicus.org/preprints/hess-2022-21/hess-2022-21.pdf.
L 121: Threshold to define a wet day: I suppose that the value of 0.1mm corresponds to the lowest resolution of the rain gauges used in the case study. But I was not able to find this information. Please mention the resolution of the rain gauges in section 3.1.
L 125-128 and Eq4: Making the covariance of the occurrence altitude dependent seems a relatively arbitrary choice, and leads to a complex (and hardly tractable) model. This must be supported by some preliminary data analysis, showing that such altitude dependent covariance of precipitation occurrence actually exists in your study dataset. In addition, I wonder if including station altitude as a covariate in the vector Xo would not be a more convenient modeling choice. This may lead to a simpler model (also easier to 'validate' using the AIC/BIC model selection procedure introduced L260).
Eq 5 (L156): It is not clear to me how this equation derives from Eq 2 and Eq 4 considered at a single site, and using previous day's occurrence as a covariate. Could you give more details (maybe in supplementary or a reference)?
L166: Using KED with altitude as drift to interpolate regression parameters means that these parameters are all (linearly) correlated with altitude. This should be shown by a data analysis. In addition this leads to a complex model, and I wonder (as in my comment about Eq 5) if it would not be easier and equally effective to include altitude in Xo, and test if altitude is a relevant covariate.
Eq 6 and paragraph L179-188: the mean function of the latent process \mu_{A}, and the parameters of the Gamma distribution are space and time dependent, and in addition are interpolated by KED. This is a lot of parameters! You should quantify and acknowledge the complexity of your model. I'm not sure that so much model complexity (and degree of freedom) is necessary, but I'm ready to be convinced by a careful data analysis showing that all these dependencies are indeed present in your dataset. If i'm not mistaken, in Kleiber et al (2012), the mean function of the latent process Wa used to model precipitation amount is fixed to zero (and not regressed on covariates with regression parameters additionally interpolated by KED), which makes the model of precipitation amount way simpler. Such addition of complexity must be supported by data.
Figure 1: I think station 22 (Pitztaler Gletscher) should be in red instead of station 24 (Obergurgl).
L218-220: Could this extreme value be an outlier? Prutz is surrounded by very nearby stations (few kilometer apart) in all directions, and none of them measure more than 35mm this day (compared to 156mm in Prutz). I agree that summer convective rains can be very localized, but I'm still surprised by this observation, and I think this requires more investigation. And this also rises concerns about the quality of data at Prutz.
L223-224: different precipitation features at St Leohard 2: more details are needed to ensure that this station operates properly (same comment for Prutz).
L229-232: I do not understand this paragraph. How this 7-days window increases the amount of data? And how simulation will add robustness to the observations? This is very unclear. Maybe because I do not understand what you name data.
Eq 7 and Eq 8: A lot of temporal covariates are tested, but only one covariate linked to atmospheric circulation. Why such unbalance? I agree that NAOI can influence precipitation in the Austrian Alps, but it is definitely not the only covariate one can think about. In my opinion you should test other climate covariates, or justify why you think NAOI is enough.
L288-291: More information about the APGD dataset is required to allow the reader to understand the main features of this reference dataset. In addition, it should be mentioned somewhere that APGD is not a perfect reference. Finally, the use of a 5km spatial resolution reference does not make it possible to assess the fine scale patterns generated by your 1km resolution model. Hence, all fine scale patterns seen in Fig 11 and Fig 14 may only be artifacts of using KED driven by altitude. This must be mentioned somewhere in the paper, or Fig 11 and Fig 14 should be aggregated at 5km resolution to avoid over-interpretation of the results.
Fig 5 and Fig 6: Frequency -> Frequency (%).
Fig 9: Very useful figure. It could be improved by: (1) using the same range of values for abscissa and ordinates, and for all stations. (2) Add station Id in addition to station names, and maybe order stations according to their Id (as in Table 1). (3) Mention in caption which quantiles are used (percentiles I presume).
Section 4.2: It would also be interesting to display q-q plots of areal daily precipitation amount.
L420: The influence of topography on precipitation occurrence (and also amount) may be an artifact of the model. I don't say that it is the case, but just that you do not prove it in this paper.
Section 4.3: Results in this section prove that the anisotropic model is not relevant in the present case. They also show very few improvement when using KED instead of OK. When considering how much model complexity and arbitrary hypotheses about orographic precipitation enhancement are added with KED I wonder if a direct application of the model of Kleiber et al, (2012), with maybe altitude as a covariate (to be selected by BIC/AIC) would not be a better option.
L532-533: It would be interesting to compare the original and the extended model in a schematic, including the number of parameters involved.
L678-827: many typos in the doi of the references.
- AC3: 'Reply on RC2', Hetal Dabhi, 06 May 2022
-
RC3: 'Comment on hess-2022-21', Anonymous Referee #3, 25 Mar 2022
This paper discusses the results obtained using a space-time generator for daily rainfall in the Austrian Alps. This is an interesting topic and the paper is generally well written. The proposed model is very close to the one proposed by Kleiber et al. (2012) and the methodological contribution of the paper is rather limited. However, the authors have done an impressive work to validate the model on a complex data set. I think that this may be of interest for the readers of the journal Hydrology and Earth System Sciences. I suggest that the authors take into account the following comments before submitting a new version of their paper.
Line 3. “...as well as future climat “. Indeed, this would be great, but this is not discussed later in the paper. I think that it should be removed from the abstract or discussed in the paper.
Line 5 . “... propose an extension...”. Please detail, contributions should be clear when reading the abstract.
Line 38 “...typically of 1 km for spatial and daily for temporal scale”. Could you precise an application where such scales are involved? I am not a specialist in hydrology, but I have the feeling that if you consider such a high spatial resolution, then the temporal resolution should also be finer?
Line 90. “Such WGs are of limited use if the observed gridded data are not available which is often the case”. One option would be to “interpolate” the data on a grid before fitting the Wgs. Could you comment on the benefit of the proposed approach versus interpolation? It may be possible to obtain high quality gridded data using an interpolation method which merges all the available sources of information (meteorological stations, but also radar, models and so on), whereas the inclusion of such information in the proposed Wgs does not seem straightforward?
Line 100 “However, Kleiber et al. (2012) tested the model only for the multi-site precipitation generation, i.e. at locations with observation and not for the generated gridded data of precipitation.“ Does it make an important difference? If yes, please detail.
General comment on Section 2. I find that the statistical methodology is not described precisely enough. The reader sometimes has to guess how the model is defined, and I do not think that there are enough details for someone who would be interested in reproducing the results. This is especially true in Section 2.2, but I think that more details should also be given in Section 2.1. Please also
- explain how the model is fitted to the data, eventually provide the codes,
- give the number of parameters involved in the model,
- comment on the computational time to fit and simulate the model.
Line 164 “The Gaussian process itself provides a spatial interpolation method ‘kriging’ so that the model parameters βO associated with each covariate, which are estimated at observation locations, can be interpolated to any location of interest.” Not clear for me, please reformulate.
Line 230 “To reduce uncertainty and add more robustness to the observations, we increase the sample size of the observed data by considering a 7-days window centred at the day of interest.” This sentence is mysterious for me, please reformulate.
Table 1. Is this table useful?
Line 267. “We select the covariates using both AIC and BIC …”. If you consider AIC/BIC, then the model was fitted using Maximum Likelihood? Or only part of it?
General comment on Section 3.3. The authors have done an impressive work for validating their model. However, I have the feeling that the two following aspects are important but not discussed and thus should be further considered :
- Cross-validation. If I understand correctly, no cross-validation is performed although it is usually done when validating spatial Wgs. Cross-validation would consist in removing some stations when fitting the model and, then check if the model is able to generate realistic precipitations at these stations by comparing simulation and ‘true’ data. It may give confidence in the ability of the model to generate precipitation at locations where no data are available.
- Spatial dependance. The spatial dependence structure, which is an important aspect for many hydrological applications, is discussed only in the discussion (Line 585-610) and supplementary material. This should be discussed in Section 4.
Line 292. It is not clear for me why the authors use tolerance intervals instead of confidence intervals. Confidence (or fluctuation) intervals have the advantage of being widely used when validating Wgs and easily understood by most readers.
Line 300. “To quantify the model performance…”. Is it useful to have all these criteria? Do they bring complementary information? It takes space in the paper (with tables and plots) but it is barely discussed in the paper.
Line 395. The Kolmogorov-Smirnov test and the Wilkoxon-Mann-Whitney test are valid for continuous distributions, whereas rain gauge measurements are usually discrete (e.g. every 0.2 mm for tipping-buckets). Could you comment on the validity of the tests in such situation?
Line 496. “It is evident that by allowing the elevation as a covariate in the kriging interpolation for prediction at each grid point, the amount of precipitation is considerably improved”. Is it so obvious? Maybe there are some improvements, but I don’t have the feeling that they are ‘considerable’!
General comment on Section 5 and 6. There are repetitions in Section 5 and 6. I suggest that you concatenate both Sections and try to make it shorter.
- AC4: 'Reply on RC3', Hetal Dabhi, 06 May 2022
Status: closed
- AC1: 'Comment on hess-2022-21', Hetal Dabhi, 24 Jan 2022
-
RC1: 'Comment on hess-2022-21', Anonymous Referee #1, 09 Feb 2022
This article suggests a novel GLM-based space-time rainfall generator for Alpine region. While the suggested model shows the limitations in reproduction of reality, I think this is a meaningful attempt in our field given the extreme challenging nature of the space-time field generation. The model is original, and the article is well structured. Therefore, I believe that the article is suitable for publication in this journal after a few revisions.
- I suggest authors to compare extreme values too. e.g. extreme values by the model of this study vs. AGPD, both point values and areal values. This is because one of the primary reasons of developing weather generators is to analyze disaster (from a probabilistic viewpoints).
- Please consider excluding trivial precipitation (e.g. less than 1mm) from your analysis to calculate occurrence-related values (Pww, Pdd, Pw, etc), and reanalyze your result. This is a common issue with all stochastic rainfall generator drawing rainfall depths from a modelled mathematical distribution. You may get better results.
- L181-184. I would like to see the map of the interpolated scale and shape parameter. Interpolating parameter, in many cases, causes problems. The map should look smooth and should show dependency to the terrain. In addition, I suggest authors to consider obtaining these parameter maps based on the AGPD data or the KED-based rainfall map from your point observations to exclude the process of spatial interpolation.
- Figure 7 and Figure 18. I would also like to see the shades of the observed precipitations, which may be significantly greater than the current blue shades. This is not because I want to criticize, but because I would like you clearly show and mention the challenges of the stochastic rainfall generators (underestimation of large-scale variability) and to suggest potential remedies. Park et al. (2019) and Kim et al. (2020) discusses this issue in detail.
- Figure 9. Why not show on the log-log axis? Too many small value pairs.
- Figure 12 and Figure 15 look like a collection of chessboards rather than a heat map. Would you remove the white squares?
- Figure 11, 12, and 13 (Figure 14, 15, and 16 too): No need to show all the months. Please consider squeezing into one figure showing simulation (1st row), AGPD (2nd row), and differences (3rd row) for 4 seasonal months (columns)
- L 300-302, Figure 17. I am not sure which of the two variables that the authors are precisely comparing. Would you let me know how, for example, correlation coefficients were derived (e.g. x and y values of the scatter plot)?
Park, J., Onof, C., & Kim, D. (2019). A hybrid stochastic rainfall model that reproduces some important rainfall characteristics at hourly to yearly timescales. Hydrology and Earth System Sciences, 23(2), 989-1014.
Kim, D., & Onof, C. (2020). A stochastic rainfall model that can reproduce important rainfall properties across the timescales from several minutes to a decade. Journal of Hydrology, 589, 125150
- AC2: 'Reply on RC1', Hetal Dabhi, 06 May 2022
-
RC2: 'Comment on hess-2022-21', Anonymous Referee #2, 24 Feb 2022
The paper entitled 'A gridded multi-site precipitation generator for complex terrain: An evaluation in the Austrian Alps' by Hetal Dabhi and co-authors describes an extension of the daily stochastic spatio-temporal precipitation generator of Kleiber et al. (2012) to mountainous areas with complex topography, and illustrates the proposed framework in the Austrian Alps based on a network of 29 meteorological stations.
The paper is well written and structured, and the topic is relevant for the journal Hydrology and Earth System Sciences. The selected case study seems appropriate to test the proposed precipitation generator in an area with complex topography.
Despite the above qualities, the manuscript also contains major shortcomings, which, in my opinion, must be corrected before the paper can be considered for publication:
(1) It is claimed that the paper provides an extension of the model of Kleiber et al. (2012) to areas with complex topography, but the proposed extensions (Kriging with external drift (KED) of model parameters and altitude dependent covariance function) are not very convincing. According to assessment results, these extensions do not offer significant improvements compared to the original model despite the addition of a lot of model complexity. In my opinion, the authors should therefore test the original model in their case study, and propose extensions only if the added complexity generates clear improvements in simulation results. If it is not the case, I would recommend to stick to the simplest possible model, and introduce the current paper as a case study testing the performance of the original model in the presence of complex topography, which I believe is already an interesting contribution.
(2) Almost no information is provided about the spatial interpolation of model parameters (using different versions of Kriging, in particular KED), which is however a critical step of the model set-up. I think that it is inevitable to give more details about the Kriging step, including: (i) mention if a nugget term is used, and if yes what is the nugget contribution to the total variance, (ii) mention which variogram model is used, which method is used to fit or infer variograms, and maybe show some examples of adjusted variograms, (iii) prove by some data analysis than model parameters are (linearly) dependent of altitude to justify the use of KED, and finally (iv) display maps of kriged model parameters.
(3) A cross-validation is missing to evaluate the performance of the interpolation of model parameters by Kriging, and to assess the density of stations required for model calibration.
(4) The proposed model fails to reproduce precipitation at one of the three stations selected for illustration (i.e. Prutz), and according to Fig 9 also performs poorly for Dresdner Hütte, Kühtai, Nauders and Sankt Leonhard 2 (i.e. 17% of stations in total), and the reasons why the model fails at these locations are not investigated in enough details. The only explanation given for Prutz is that this station exhibits 'precipitation characteristics quite distinct from those of neighbouring (or more distant) stations', but nothing is said about why a model calibrated at station locations is unable to reproduce precipitation at the exact same locations. For me it is very probable that the Kriging of model parameters introduces errors when clustered stations exhibit distinct statistics (which is the case for Prutz, that forms a cluster with Ried in Oberinntal and Fendels) - and possibly also where sharp gradients of precipitation occur (Nauders or St Leonhard im Pitztal 2) as well as at the edges of the study area (Marienberg, St Martin or Kühtai) - but one cannot see it in the present manuscript because the Kriging step is completely overlooked.
(5) Regarding the problem at Prutz station, I would be interested in seeing (in supplementary material for instance) the raw time series for Prutz, Ried in Oberinntal, Fendels and Ladis to be sure that the modeling problem does not simply originate from instrumental errors at Prutz station... the fact that this station has so much differences with Ried in Oberinntal which is located 1km apart, at the exact same altitude, and with similar neighboring topography is very surprising to me.
In addition to these major concerns, I also have some minor comments detailed hereafter. Line and figure numbers refer to: https://hess.copernicus.org/preprints/hess-2022-21/hess-2022-21.pdf.
L 121: Threshold to define a wet day: I suppose that the value of 0.1mm corresponds to the lowest resolution of the rain gauges used in the case study. But I was not able to find this information. Please mention the resolution of the rain gauges in section 3.1.
L 125-128 and Eq4: Making the covariance of the occurrence altitude dependent seems a relatively arbitrary choice, and leads to a complex (and hardly tractable) model. This must be supported by some preliminary data analysis, showing that such altitude dependent covariance of precipitation occurrence actually exists in your study dataset. In addition, I wonder if including station altitude as a covariate in the vector Xo would not be a more convenient modeling choice. This may lead to a simpler model (also easier to 'validate' using the AIC/BIC model selection procedure introduced L260).
Eq 5 (L156): It is not clear to me how this equation derives from Eq 2 and Eq 4 considered at a single site, and using previous day's occurrence as a covariate. Could you give more details (maybe in supplementary or a reference)?
L166: Using KED with altitude as drift to interpolate regression parameters means that these parameters are all (linearly) correlated with altitude. This should be shown by a data analysis. In addition this leads to a complex model, and I wonder (as in my comment about Eq 5) if it would not be easier and equally effective to include altitude in Xo, and test if altitude is a relevant covariate.
Eq 6 and paragraph L179-188: the mean function of the latent process \mu_{A}, and the parameters of the Gamma distribution are space and time dependent, and in addition are interpolated by KED. This is a lot of parameters! You should quantify and acknowledge the complexity of your model. I'm not sure that so much model complexity (and degree of freedom) is necessary, but I'm ready to be convinced by a careful data analysis showing that all these dependencies are indeed present in your dataset. If i'm not mistaken, in Kleiber et al (2012), the mean function of the latent process Wa used to model precipitation amount is fixed to zero (and not regressed on covariates with regression parameters additionally interpolated by KED), which makes the model of precipitation amount way simpler. Such addition of complexity must be supported by data.
Figure 1: I think station 22 (Pitztaler Gletscher) should be in red instead of station 24 (Obergurgl).
L218-220: Could this extreme value be an outlier? Prutz is surrounded by very nearby stations (few kilometer apart) in all directions, and none of them measure more than 35mm this day (compared to 156mm in Prutz). I agree that summer convective rains can be very localized, but I'm still surprised by this observation, and I think this requires more investigation. And this also rises concerns about the quality of data at Prutz.
L223-224: different precipitation features at St Leohard 2: more details are needed to ensure that this station operates properly (same comment for Prutz).
L229-232: I do not understand this paragraph. How this 7-days window increases the amount of data? And how simulation will add robustness to the observations? This is very unclear. Maybe because I do not understand what you name data.
Eq 7 and Eq 8: A lot of temporal covariates are tested, but only one covariate linked to atmospheric circulation. Why such unbalance? I agree that NAOI can influence precipitation in the Austrian Alps, but it is definitely not the only covariate one can think about. In my opinion you should test other climate covariates, or justify why you think NAOI is enough.
L288-291: More information about the APGD dataset is required to allow the reader to understand the main features of this reference dataset. In addition, it should be mentioned somewhere that APGD is not a perfect reference. Finally, the use of a 5km spatial resolution reference does not make it possible to assess the fine scale patterns generated by your 1km resolution model. Hence, all fine scale patterns seen in Fig 11 and Fig 14 may only be artifacts of using KED driven by altitude. This must be mentioned somewhere in the paper, or Fig 11 and Fig 14 should be aggregated at 5km resolution to avoid over-interpretation of the results.
Fig 5 and Fig 6: Frequency -> Frequency (%).
Fig 9: Very useful figure. It could be improved by: (1) using the same range of values for abscissa and ordinates, and for all stations. (2) Add station Id in addition to station names, and maybe order stations according to their Id (as in Table 1). (3) Mention in caption which quantiles are used (percentiles I presume).
Section 4.2: It would also be interesting to display q-q plots of areal daily precipitation amount.
L420: The influence of topography on precipitation occurrence (and also amount) may be an artifact of the model. I don't say that it is the case, but just that you do not prove it in this paper.
Section 4.3: Results in this section prove that the anisotropic model is not relevant in the present case. They also show very few improvement when using KED instead of OK. When considering how much model complexity and arbitrary hypotheses about orographic precipitation enhancement are added with KED I wonder if a direct application of the model of Kleiber et al, (2012), with maybe altitude as a covariate (to be selected by BIC/AIC) would not be a better option.
L532-533: It would be interesting to compare the original and the extended model in a schematic, including the number of parameters involved.
L678-827: many typos in the doi of the references.
- AC3: 'Reply on RC2', Hetal Dabhi, 06 May 2022
-
RC3: 'Comment on hess-2022-21', Anonymous Referee #3, 25 Mar 2022
This paper discusses the results obtained using a space-time generator for daily rainfall in the Austrian Alps. This is an interesting topic and the paper is generally well written. The proposed model is very close to the one proposed by Kleiber et al. (2012) and the methodological contribution of the paper is rather limited. However, the authors have done an impressive work to validate the model on a complex data set. I think that this may be of interest for the readers of the journal Hydrology and Earth System Sciences. I suggest that the authors take into account the following comments before submitting a new version of their paper.
Line 3. “...as well as future climat “. Indeed, this would be great, but this is not discussed later in the paper. I think that it should be removed from the abstract or discussed in the paper.
Line 5 . “... propose an extension...”. Please detail, contributions should be clear when reading the abstract.
Line 38 “...typically of 1 km for spatial and daily for temporal scale”. Could you precise an application where such scales are involved? I am not a specialist in hydrology, but I have the feeling that if you consider such a high spatial resolution, then the temporal resolution should also be finer?
Line 90. “Such WGs are of limited use if the observed gridded data are not available which is often the case”. One option would be to “interpolate” the data on a grid before fitting the Wgs. Could you comment on the benefit of the proposed approach versus interpolation? It may be possible to obtain high quality gridded data using an interpolation method which merges all the available sources of information (meteorological stations, but also radar, models and so on), whereas the inclusion of such information in the proposed Wgs does not seem straightforward?
Line 100 “However, Kleiber et al. (2012) tested the model only for the multi-site precipitation generation, i.e. at locations with observation and not for the generated gridded data of precipitation.“ Does it make an important difference? If yes, please detail.
General comment on Section 2. I find that the statistical methodology is not described precisely enough. The reader sometimes has to guess how the model is defined, and I do not think that there are enough details for someone who would be interested in reproducing the results. This is especially true in Section 2.2, but I think that more details should also be given in Section 2.1. Please also
- explain how the model is fitted to the data, eventually provide the codes,
- give the number of parameters involved in the model,
- comment on the computational time to fit and simulate the model.
Line 164 “The Gaussian process itself provides a spatial interpolation method ‘kriging’ so that the model parameters βO associated with each covariate, which are estimated at observation locations, can be interpolated to any location of interest.” Not clear for me, please reformulate.
Line 230 “To reduce uncertainty and add more robustness to the observations, we increase the sample size of the observed data by considering a 7-days window centred at the day of interest.” This sentence is mysterious for me, please reformulate.
Table 1. Is this table useful?
Line 267. “We select the covariates using both AIC and BIC …”. If you consider AIC/BIC, then the model was fitted using Maximum Likelihood? Or only part of it?
General comment on Section 3.3. The authors have done an impressive work for validating their model. However, I have the feeling that the two following aspects are important but not discussed and thus should be further considered :
- Cross-validation. If I understand correctly, no cross-validation is performed although it is usually done when validating spatial Wgs. Cross-validation would consist in removing some stations when fitting the model and, then check if the model is able to generate realistic precipitations at these stations by comparing simulation and ‘true’ data. It may give confidence in the ability of the model to generate precipitation at locations where no data are available.
- Spatial dependance. The spatial dependence structure, which is an important aspect for many hydrological applications, is discussed only in the discussion (Line 585-610) and supplementary material. This should be discussed in Section 4.
Line 292. It is not clear for me why the authors use tolerance intervals instead of confidence intervals. Confidence (or fluctuation) intervals have the advantage of being widely used when validating Wgs and easily understood by most readers.
Line 300. “To quantify the model performance…”. Is it useful to have all these criteria? Do they bring complementary information? It takes space in the paper (with tables and plots) but it is barely discussed in the paper.
Line 395. The Kolmogorov-Smirnov test and the Wilkoxon-Mann-Whitney test are valid for continuous distributions, whereas rain gauge measurements are usually discrete (e.g. every 0.2 mm for tipping-buckets). Could you comment on the validity of the tests in such situation?
Line 496. “It is evident that by allowing the elevation as a covariate in the kriging interpolation for prediction at each grid point, the amount of precipitation is considerably improved”. Is it so obvious? Maybe there are some improvements, but I don’t have the feeling that they are ‘considerable’!
General comment on Section 5 and 6. There are repetitions in Section 5 and 6. I suggest that you concatenate both Sections and try to make it shorter.
- AC4: 'Reply on RC3', Hetal Dabhi, 06 May 2022
Hetal Dabhi et al.
Hetal Dabhi et al.
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
538 | 171 | 24 | 733 | 46 | 12 | 8 |
- HTML: 538
- PDF: 171
- XML: 24
- Total: 733
- Supplement: 46
- BibTeX: 12
- EndNote: 8
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1