Stochastic daily rainfall generation on tropical islands with complex topography
 ^{1}Water Resources Research Center, University of Hawai`i at Mānoa, 96822 Honolulu, Hawai`i, USA
 ^{2}GePaSud Laboratory, University of French Polynesia, 98702 Faa'a, Tahiti, French Polynesia
 ^{3}Department of Atmospheric Sciences, School of Ocean and Earth Science and Technology, University of Hawai`i at Mānoa, 96822 Honolulu, Hawai`i, USA
 ^{4}Department of Geography, University of Hawai`i at Mānoa, 96822 Honolulu, Hawai`i, USA
 ^{a}now at: Biostatistics and Spatial Processes (BioSP), INRAE, 84914 Avignon CEDEX 9, France
 ^{1}Water Resources Research Center, University of Hawai`i at Mānoa, 96822 Honolulu, Hawai`i, USA
 ^{2}GePaSud Laboratory, University of French Polynesia, 98702 Faa'a, Tahiti, French Polynesia
 ^{3}Department of Atmospheric Sciences, School of Ocean and Earth Science and Technology, University of Hawai`i at Mānoa, 96822 Honolulu, Hawai`i, USA
 ^{4}Department of Geography, University of Hawai`i at Mānoa, 96822 Honolulu, Hawai`i, USA
 ^{a}now at: Biostatistics and Spatial Processes (BioSP), INRAE, 84914 Avignon CEDEX 9, France
Correspondence: Lionel Benoit (lionel.benoit@inrae.fr)
Hide author detailsCorrespondence: Lionel Benoit (lionel.benoit@inrae.fr)
Stochastic rainfall generators are probabilistic models of rainfall space–time behavior. During parameterization and calibration, they allow the identification and quantification of the main modes of rainfall variability. Hence, stochastic rainfall models can be regarded as probabilistic conceptual models of rainfall dynamics.
As with most conceptual models in earth sciences, the performance of stochastic rainfall models strongly relies on their adequacy in representing the rain process at hand. On tropical islands with high elevation topography, orographic rain enhancement challenges most existing stochastic models because it creates localized precipitations with strong spatial gradients, which break down the stationarity of rain statistics.
To allow for stochastic rainfall modeling on tropical islands, despite nonstationarity of rain statistics, we propose a new stochastic daily multisite rainfall generator specifically for areas with significant orographic effects. Our model relies on a preliminary classification of daily rain patterns into rain types based on rainfall space and intensity statistics, and sheds new light on rainfall variability at the island scale. Within each rain type, the distribution of rainfall through the island is modeled by combining a nonparametric resampling of past analogs of a latent field describing the spatial distribution of rainfall, and a parametric gamma transform function describing rain intensity.
When applied to the stochastic simulation of rainfall on the islands of O`ahu (Hawai`i, United States of America) and Tahiti (French Polynesia) in the tropical Pacific, the proposed model demonstrates good skills in jointly simulating sitespecific and islandscale rain statistics. Hence, it provides a new tool for stochastic impact studies in tropical islands, in particular for watershed water resource management.
 Article
(2529 KB) 
Supplement
(3146 KB)  BibTeX
 EndNote
Stochastic rainfall generators are probabilistic tools aimed at simulating synthetic rainfall that mimic as closely as possible the statistical signature of rain observations (Richardson, 1981; Wilks and Wilby, 1999; Ailliot et al., 2015). More specifically, stochastic rainfall modeling consists of statistical learning (i.e., inference) of the joint space–time probability density function (pdf) of rainfall at all sites and times of interest, and sampling this pdf to efficiently generate synthetic rainfall. The ability of stochastic rainfall generators to emulate long and realistic rainfall sequences makes them an appropriate tool for the simulation of design storms (Niemi et al., 2016). Simulated rainfall can then be used as inputs for impact models assessing the effects of rainfall on different environmental processes including hydrology (Paschalis et al., 2014), water resources (Cappelaere et al., 2020), geomorphology (Peleg et al., 2020), and agronomy (Mavromatis and Hansen, 2001). The probabilistic approach followed by stochastic rainfall generators enables a comprehensive study of rainfall variability and, in turn, assessment of uncertainty propagation along the whole modeling chain (Gabellani et al., 2007). This makes stochastic rainfall generation a key tool for management of raininduced risk, in particular for flood (Caseri et al., 2016) and drought risks (Supit et al., 2012). In addition, the focus of stochastic rainfall models on the statistical signature of rainfall creates new ways to characterize rainfall space–time behavior (Marra and Morin, 2018), and assess the impact of rainfall variability on the hydrosphere (Morin et al., 2019). Finally, when conditioned to climate model outputs, stochastic rainfall generation can be used for the downscaling of future precipitation projections (Maraun et al., 2010), resulting in localscale and highresolution scenarios of the possible evolution of rainfall in the context of climate change (Jha et al., 2014; Volosciuk et al., 2017).
To enable fast and computationally efficient simulations and thereby allow for investigation of rainfall variability and associated uncertainty through the simulation of large ensembles, stochastic rainfall generators adopt an empirical approach that bypasses the detailed physical modeling of raingeneration processes (Bauer et al., 2015). To avoid the pitfall of physically unrealistic simulations, stochastic rainfall models embed a significant part of our conceptual knowledge about rainfall behavior in their parameterization (i.e., they implement statistical relationships that reflect as closely as possible the physical processes at work). However, rainfall properties (Krajewski et al., 2003) and, in turn, the performance of stochastic rainfall generators (Breinl et al., 2017; Vu et al., 2018) strongly depend on the climate of the area of interest. Hence, different models have been proposed for different climates, with each model focusing on a specific aspect of rainfall, for instance, rainfall seasonality in monsoonal climates (Greene et al., 2011), rainfall spatial–temporal correlation in temperate climates (Paschalis et al., 2013), or rainfall occurrence and extreme intensities in arid regions (Wilcox et al., 2021).
On high tropical islands or islands with high elevations and significant topography, rainfall is strongly location dependent due to complex interactions between atmospheric circulation and island topography, which trigger different mechanisms of orographic rain enhancement (Foresti and Pozdnoukhov, 2012; Houze, 2012). On monthly to annual scales, the effect of orographic lifting of relatively steady trade winds generates welldefined rain patterns (Lyons, 1982). In these patterns, highlands are usually wetter than lowlands, windward slopes wetter than leeward sides, and in case of successive mountain ridges, the first to be reached by the wet air masses is the wettest (Giambelluca et al., 2013; Laurent et al., 2019). To this firstorder quasistatic picture is added the important variability of daily rainfall patterns associated with processes ranging from synopticscale disturbances (Hopuare et al., 2018; Longman et al., 2021) to largescale atmospheric circulations (Hopuare et al., 2015; Frazier et al., 2018; Brown et al., 2020). Large deviations from the longterm rainfall patterns are thus observed, and usually dry leeward slopes can become the wettest part of the island, for example during Kona storms (seasonal cyclones) in the Hawai`i archipelago (Caruso and Businger, 2006).
Orographic effects lead to nonstationary (i.e., nonhomogeneous) rain statistics in both space and time, which challenges most existing stochastic rainfall models (Nerini et al., 2017). In the context of high tropical islands, the first difficulty arises from the longterm patterns of orographic rain enhancement that create nonstationarities in space. Rainfall generators must therefore account not only for the correlation between locations (Leblois and Creutin, 2013; Paschalis et al., 2013), but also for the location of the rain within the island. When orographic effects are directly related to topography, generalized linear models (GLMs) have been leveraged to account for spatial rain patterns by linking model parameters to local topographic information such as altitude or slope aspect (Ambrosino et al., 2014; Chandler, 2020). On tropical islands, however, the complexity of the rain–topography relationships (e.g., successive mountain ridges becoming drier at similar altitude) hinders the direct regression of model parameters on topographic data. An interesting option to overcome the nonuniqueness of topography–rainfall relationships is to interpolate the parameters of the rainfall model in space, in order to let data inform nonstationarity (Kleiber et al., 2012; Bennett et al., 2018). This, however, increases model complexity, and therefore requires training datasets with high spatial resolution, which are not yet available for most target areas (Benoit et al., 2021). Finally, the difficulty of accounting for orographic effects in the context of high tropical islands is further increased by the temporal variability of rainfall patterns (e.g., wet leeward slopes during Kona storms), which calls for the development of models able to identify and capture the main modes of variability of orographic effects over time.
A rainfall generator compatible with marine tropical climates and complex rain–topography interactions encountered on tropical islands is, to the best of our knowledge, still lacking in the toolbox of stochastic rainfall generators. To fill this gap, the present paper proposes a new rainfall model dedicated to high tropical islands with significant and complex topography, which aims to account for both the longterm quasistatic patterns of rain accumulation and the daytoday fluctuations of the rainfall spatial distribution. Specifically, the goal is to develop a daily resolution multisite stochastic rainfall generator able to simulate (1) sitespecific rain occurrence, persistence, intensity, and seasonality; (2) spatial patterns of daily rain accumulation; and (3) areal rain statistics at the island scale.
To achieve these objectives, the remainder of the article is structured as follows: Section 2 briefly reviews the main features of tropical island rainfall and describes our stochastic rainfall model. Section 3 illustrates the performance of the model for the island of O`ahu (Hawai`i, USA) in the tropical Pacific, and a similar test study is repeated in the Supplement for the island of Tahiti (French Polynesia) to demonstrate the versatility of the model. Finally, Sect. 4 discusses how the focus on orographic rain enhancement has influenced the design of the model and provides concluding remarks.
2.1 Rainfall features of interest
Because stochastic rainfall models are data driven, their structure depends on the rain features one wants to reproduce in simulations. Hence, identification of the main features of daily rainfall on high tropical islands is a prerequisite for the design of the present model. For illustration purposes, we focus throughout the main text on the island of O`ahu, Hawai`i (lon = 158^{∘} W, lat = 21.5^{∘} N, area = 1545 km^{2}, max altitude = 1220 m). The available rain gauge observation dataset consists of daily records from a network of 86 rain gauges spread over the island (Fig. 1a), and covers a 20year period 1991–2011. It corresponds to a compilation of qualitycontrolled and gapfilled daily observations (Longman et al., 2018). Gap filling was performed using the normal ratio method (Paulhus and Kohler, 1952), and only stations with less than 5 % gapfilled data are kept for this study in order to minimize the impact of gap filling on our results. To contextualize the observed rain patterns, several meteorological covariates (e.g., pressure, temperature, humidity, and wind) are investigated at the island scale. We use the ECMWF Reanalysis v5 (ERA5) dataset (Hersach et al., 2018) at 00:00 HST to inform these covariates and average the values of the 12grid cells (pixel size = $\mathrm{0.25}{}^{\circ}\times \mathrm{0.25}{}^{\circ}$) encompassing the island of O`ahu.
Figure 1 displays the main features of rainfall over the island of O`ahu. It shows the strong impact of trade windinduced orographic rain enhancement on the spatial distribution of annual precipitation (Fig. 1a), with windward (northeast) sides significantly wetter than leeward (southwest) ones, and highlands generally wetter than lowlands. Note some important details of this annual rainfall pattern, for example that the rain maximum is observed leeward of the main crest of the Ko`olau range, and that the Ko`olau mountains are significantly wetter than the Wai`anae range, despite higher elevation. In addition to prevailing orographic rainfall triggered by the interactions of trade winds with island topography (eastnortheasterly trade winds blow more than 280 d per year over the Hawaiian archipelago; Longman et al., 2015), the island of O`ahu also experiences spatially widespread rain events, mostly triggered by regional atmospheric disturbances such as cold fronts originating from midlatitudes and Kona storms (Longman et al., 2021). These atmospheric disturbances mostly occur during (boreal) winter, which corresponds to the local rainy season (spanning October–March). They represent the main source of precipitation for dry leeward locations and are responsible for the enhanced seasonality of rain accumulation in these areas (Fig. 1a).
The diversity of raingeneration mechanisms (e.g., orographic lifting, cold fronts, or Kona lows) coupled with the steep island topography of volcanic origin result in a complex distribution of rainfall in space and time, which produces highly variable islandscale rain statistics (i.e., statistics summarizing rain behavior throughout the island for a given day). Figure 1b–d shows that at the scale of the island of O`ahu, daily rainfall is strongly intermittent in space (only 3 % of the days record rain at all gauge locations, and half of the time at least 20 % of the gauges measure no rain, Fig. 1b), highly skewed (islandscale rain accumulation average < 2.25 mm d^{−1} 50 % of the time, but islandscale maximum accumulation > 15 mm d^{−1} 50 % of the time and reaches 500 mm d^{−1}, Fig. 1c), and strongly variable in space (coefficient of variation > 1.3 50 % of the time and >2.9 10 % of the time).
2.2 Model description
2.2.1 Model overview
To model the statistical features of daily rainfall in tropical islands while accounting for the main mechanisms of orographic rain enhancement, in particular the variability of rainfall–topography relationships under the influence of changing atmospheric conditions, the proposed model splits rainfall behavior into three components, i.e., temporal variability, intensity (i.e., marginal distribution), and spatial distribution. Figure 2 summarizes the structure of the model, which is briefly introduced in the later part of this subsection, and will be discussed in detail in Sects. 2.2.2 to 2.2.4.
The temporal variability of rain statistics and its relationships with the state of the atmosphere are modeled following a raintyping approach (Fig. 2a; Ailliot et al., 2015; Benoit et al., 2018b). In this framework, days with similar rain statistics are first pooled together in a finite number of rain types, which represent summaries of islandscale daily rain statistics. Next, rain type occurrence is modeled by a Markov chain describing how rain types transition to each other (Bárdossy and Plate, 1991; Wilby, 1994). To preserve climatological consistency and model the influence of atmospheric circulation on orographic rain enhancement, rain type occurrence is conditioned to local meteorological covariates (Benoit et al., 2020), making the Markov chain nonhomogeneous (Hughes and Guttorp, 1999; Vrac et al., 2007). It is important to note here that we do not use the widespread hidden Markov model (HMM) approach (Ailliot et al., 2009; Greene et al., 2011) to introduce rain types in our stochastic rainfall model, but rather resort to a twostep approach in which rain types are first determined by a direct classification of rain gauge observations and a statistical rainfall model is subsequently built for each rain type. This choice was motivated by the use of nonparametric methods to model the distribution of rainfall conditional to rain types, which prevents the formulation of the likelihood of the full statistical model and in turn the use of HMM to model rain types.
Conditional to each rain type, the distribution of rain across the island is modeled following a metaGaussian approach (also referred to as transGaussian or transformed Gaussian; Allard and Bourotte, 2015; Baxevani and Lennartsson, 2015; Papalexiou and Serinaldi, 2020). In this framework, a latent field with standardized Gaussian marginal distribution is nonlinearly transformed to match the marginal distribution of daily rainfall across the island, and the spatial dependencies of the latent field are used to encode the spatial distribution of rainfall (Fig. 2b and c). The latent field is often assumed to follow a multivariate Gaussian distribution, which allows for a parsimonious modeling of the spatial dependencies using geostatistics (Lantuéjoul, 2002) and, in turn, spatial interpolation of rain gauge observations (Benoit et al., 2018a). In this study, however, a careful investigation of intergauge spatial dependencies (Sect. S1 in the Supplement) shows the inadequacy of the multivariate Gaussian distribution to model the latent field. Instead, a nonparametric resampling of past analogs (Gangopadhyay and Clark, 2005; Yiou, 2014) of the daily latent field is used to model spatial dependencies. Aside from the choice of the marginal distribution of the latent field (Gaussian vs. uniform), this model is comparable to the use of empirical spatial copulas (Bárdossy and Pegram, 2009) coupled with a parametric marginal distribution of nonzero rain accumulation. This approach has the advantage of faithfully reproducing the complex spatial distribution of rainfall due to orographic effects, but at the expense of the ability for spatial interpolation. The present model should therefore be regarded as a multisite stochastic rainfall generator (and not a rain field generator), and the spatial interpolation of the simulated multisite rainfall is outside the scope of this paper.
2.2.2 MetaGaussian representation of islandscale daily rainfall
Rain intensity and spatial distribution are modeled jointly following a metaGaussian approach (Fig. 2b and c). For a given day, the observed rain accumulations ${R}_{i=\mathrm{1}\phantom{\rule{0.125em}{0ex}}\mathrm{\dots}\phantom{\rule{0.125em}{0ex}}{N}_{T}}$ across a network of N_{T} gauges are linked to their latent counterparts Z_{i} (which follow a standardized Gaussian marginal distribution, i.e., $Z\sim N(\mathrm{0},\mathrm{1})$) through a nonlinear transform function ψ. This transformation is performed by first assuming that nonzero rain accumulations observed throughout the island in a given day follow a Gamma distribution:
where N_{d} and N_{w} are the number of dry and wet gauges, respectively, Φ^{−1} is the inverse cumulative distribution function (cdf) of the univariate standardized Gaussian distribution, and $\mathrm{\Gamma}({R}_{i};k,\mathit{\theta})$ is the cdf of the gamma distribution with shape parameter k>0 and scale parameter θ>0.
In many instances, gauges measuring no rain (i.e., R_{i}=0) represent a significant part of the network, which creates a concentration of zero values in rain accumulation distribution and prevents a correct Gaussian transform using the function of Eq. (1). To circumvent this problem, the latent values corresponding to dry gauges are assigned based on the distance of the dry gauges to the closest wet gauge, such as the marginal distribution of the latent values matches the left portion of a standardized normal distribution. The idea behind this applicationspecific solution to deal with the spatial intermittence of rainfall is that a location far from any wet gauge should remain dry even after combining the associated latent field with slightly different parameters of the transform function (Eqs. 1 and 2) during the simulation step (Sect. 2.3.3). In case of a gauge measuring no rain (R_{i}=0), the corresponding latent value is given by
where Dw_{i} is the distance of the gauge i observing no rain to the closest gauge measuring nonzero rain. This transformation has the advantage of creating spatial patterns of censored latent values (i.e., corresponding to dry gauges) that are coherent with the ones of noncensored latent values (i.e., corresponding to wet gauges), and the transformation creates smooth transitions between wet and dry domains.
Once latent values (Z_{i}) are derived from rain observations (R_{i}), the spatial distribution of rain across the island is defined by the spatial distribution of the latent field (Bárdossy and Pegram, 2009), i.e., the joint cdf of Z_{i}. As mentioned in Sect. 2.1, the spatial distribution of daily rainfall on high tropical islands is complex and strongly nonstationary due to orographic effects, which prevents the use of a simple parametric form (such as the multivariate Gaussian distribution used in most metaGaussian models of precipitation (Benoit et al., 2018a; Papalexiou and Serinaldi, 2020) for the spatial dependencies. Hence, in the present case, the nonparametric resampling of past latent fields is used to simulate the spatial distribution of rainfall (Rüschendorf, 2009).
2.2.3 Rain typing
Based on the above metaGaussian representation of daily rain fields, days with similar rain statistics are pooled into rain types (Fig. 2a) using a nonsupervised clustering applied on the sixdimensional feature space defined by the following:

The three parameters of the transform function (ψ; i.e., ${p}_{\mathrm{0}}=\frac{{N}_{\mathrm{d}}}{{N}_{T}}$, k, θ), which inform the marginal distribution of daily rainfall over the island.

The first three components of the Karhunen–Loève expansion (Huang et al., 2001) of the latent field Z (PC_{1}, PC_{2}, PC_{3}), which inform the spatial distribution of rainfall across the island.
Based on this feature space $\mathit{Y}=({p}_{\mathrm{0}},k,\mathit{\theta},{\mathrm{PC}}_{\mathrm{1}},{\mathrm{PC}}_{\mathrm{2}},{\mathrm{PC}}_{\mathrm{3}}{)}^{T}$, the clustering is performed using a Gaussian mixture model (GMM; Fraley and Raftery, 2002), which approximates the pdf of Y as a weighted sum of multivariate normal distributions:
where p_{Y} is the joint pdf of the random vector Y, N_{C} is number of components in the GMM, b_{l} is a weight assigned to the lth component, and μ_{l} and Σ_{l} are the mean vector and covariance matrix of the multivariate normal distribution of the lth component. Here, the parameters embedded in the vector Y are assumed to be only slightly correlated, and the covariance matrices (Σ_{l}) are therefore assumed to be diagonal. The number of components of the GMM (N_{C}) is selected by minimization of the Bayesian information criterion (BIC; Schwartz, 1978) estimated for different numbers of components in order to select a parsimonious classification (i.e., with as few rain types as possible) while properly fitting the pdf of Y (i.e., p_{Y}). Once the pdf p_{Y} is known, the probability that an observed vector y_{obs} belongs to the lth component C_{l} is given by
and the classification is obtained by assigning each day (d_{i}) with a rain type (RT) that corresponds to the most probable mixture component:
2.2.4 Rain type occurrence
Once rain types have been defined based on rainfall statistical properties, their occurrence is conditioned to the vector MC_{d} of meteorological covariates observed at day d (Fig. 2a), and rain type occurrence is modeled by a nonhomogeneous Markov chain of order 1 (Hughes and Guttorp, 1999; Vrac et al., 2007):
where RT_{d} is the state of the Markov chain (i.e., the rain type) at day d, $p({\mathrm{RT}}_{d}=j\mathrm{}{\mathrm{RT}}_{d\mathrm{1}}=i,\mathit{M}{\mathit{C}}_{d})$ is the probability to transition from rain type i to rain type j, Σ_{ij} and μ_{ij} are the covariance matrix and the mean vector of the meteorological covariates when the transition from type i to type j occurs, respectively, and γ_{ij} is the baseline (i.e., longterm average) probability of transition from type i to type j. This model allows the transition probability $p({\mathrm{RT}}_{d}=j\mathrm{}{\mathrm{RT}}_{d\mathrm{1}}=i,\mathit{M}{\mathit{C}}_{d})$ to vary proportionally to the conditional density of MC_{d} given the transition, and conditions the occurrence of rain types to the state of the atmosphere characterized by the covariates.
2.3 Model implementation
2.3.1 Selection of meteorological covariates
The set of meteorological covariates used for conditioning of the nonhomogeneous Markov chain must be chosen so that (i) the covariates are only weakly correlated to each other, which ensures model parsimony (i.e., minimal redundancy between covariates); and (ii) the temporal variations of the covariates are correlated with variations in rain type occurrence (Sect. S2), which indirectly informs the seasonality and interannual variability of rainfall patterns. Note that in the present framework, the conditioning to covariates (i.e., the nonhomogeneous part of the Markov chain) is used to inform the lowfrequency fluctuations of rain type occurrence (seasonal to interannual time scales), with higher frequencies (weekly to daily time scales) being informed by the baseline transition probabilities (γ_{ij}). Hence, meteorological covariates are aggregated at the monthly scale prior to use for conditioning of the nonhomogeneous Markov chain. The monthly aggregated covariates inform monthly anomalies in atmospheric conditions and, in turn, the likelihood of rain types occurring during a given month.
In the present case, we selected the meteorological covariates according to our initial knowledge about raingeneration mechanisms on high tropical islands, and their links to the state of the atmosphere (Elison Timm et al., 2014; Réchou et al., 2019; Sanfilippo, 2020); this led to the following five covariates:

Geopotential height at 700 hPa (m^{2} s^{−2}). This covariate is correlated with the presence of synopticscale weather systems in the vicinity of the island and identifies regional atmospheric disturbances.

Temperature difference between 950 and 700 hPa (K). This covariate is correlated with the lower atmospheric instability and identifies days prone to shallow convection.

Specific humidity at 700 hPa (kg kg^{−1}). This covariate informs the presence of humidity above the height of the trade wind inversion and is negatively correlated with the strength of the inversion and positively correlated with the potential for deep convection and cold rain.

Meridional and

longitudinal humidity fluxes at 950 hPa (i.e., specific humidity multiplied by the u (east–west) or v (north–south) components of the wind field, m s^{−1} kg kg^{−1}). These covariates provide the amount of moisture crossing over the mountain barrier available for precipitation and are a proxy for orographic precipitation.
2.3.2 Model calibration
The model is calibrated from a training dataset made of N days of rain accumulation recorded by a network of N_{T} rain gauges (Fig. 1a). Data must be available for all stations and all days of the calibration period, and a preliminary gapfilling step is required in case of incomplete data (Longman et al., 2018; Oriani et al., 2020). Once a complete training dataset is available, the first step of model calibration consists of inferring the parameters of the transform function (ψ) for each day of the training period. This is performed by calculating the proportion of dry gauges and then estimating the parameters of the gamma distribution of the wet gauges using a maximum likelihood approach. Once the three parameters of ψ are known, this function can be inverted to derive the latent values at each gauge location.
After calibration of the transform function and derivation of the latent values for each day of the calibration dataset, days with similar rain statistics are pooled together by rain typing. The first three principal components of the latent field are preliminarily derived from the Karhunen–Loève transform of all latent values. Next, the parameters of the GMM are inferred using an expectationminimization approach (Fraley and Raftery, 2002). Finally, rain typing (i.e., clustering) is performed by assigning to each day the type that corresponds to the most probable component of the GMM.
After rain typing, the time series of observed rain types is analyzed in relation to observations of the meteorological covariates to calibrate the nonhomogeneous Markov chain. The baseline transition matrix (γ_{ij}) is first estimated by counting the transitions between each pair of rain types occurring during the calibration period and normalizing the result by the total number of transitions. Next, the parameters of the mean vector (μ_{ij}) and the covariance matrix (Σ_{ij}) used to make the Markov chain nonhomogeneous are estimated by the method of moments applied to covariate observations.
Conditional to each rain type, the joint distribution of the parameters of ψ is inferred by multivariate kernel density estimation using a trivariate Gaussian kernel. The bandwidth of the kernel is selected following Scott's rule (Scott, 1979, 2010), i.e., in the present case
where H is the bandwidth matrix of the kernel, N the number of days in the calibration dataset, and σ_{i} the standard deviation of the ith parameter (here i=1 … 3).
Finally, because the latent field is simulated using an analog approach (see Sect. 2.3.3 for details), this part of the model does not require formal inference.
2.3.3 Stochastic rainfall generation
After model calibration, stochastic rainfall generation is performed following the steps summarized in Fig. 2. Starting from a time series of meteorological covariates, rain types are first simulated using the nonhomogeneous Markov chain described in Eq. (6). Next, conditional to this simulated rain type time series, the parameters of the transform function are sampled from their joint distribution defined by Eq. (7). Subsequently, one realization of the latent field is simulated using an analog approach (Gangopadhyay and Clark, 2005; Mezghani and Hingray, 2009; Yiou, 2014), i.e., by randomly picking the latent field of a day belonging to the same rain type as the day to simulate from the calibration dataset. Finally, the simulated rain field is obtained by backtransformation of the simulated latent field (Eqs. 1 and 2) using the simulated parameters of the transform function.
2.4 Model assessment
The ability of the model to identify climatologically relevant rain types is assessed qualitatively by applying rain typing to the full study dataset of Sect. 2.1 and scrutinizing the emergent spatial–temporal rainfall patterns for each type. The resulting classification is subsequently interpreted in terms of rain generation processes by confronting rain types with cooccurring meteorological covariates. However, in doing so, one should keep in mind that the raintyping procedure is fully statistical and that the rain type description is based on emerging statistical patterns, not on physical modeling (e.g., using a numerical weather model to reproduce the observed patterns).
When discussing rain types and their link to raingeneration processes, special attention is paid to

the emergence of spatial patterns in relation to orographic effects,

the seasonality of rain type occurrence in relation to the regional annual rain cycle, and

the relationship of rain types to the state of the atmosphere quantified by the set of climate covariates described in Sect. 2.4 and used here at a daily resolution (i.e., not aggregated at the monthly scale as is the case for the conditioning of the nonhomogeneous Markov chain).
After the assessment of the climatological realism of rain types, the ability of the model to stochastically generate rainfall is assessed using a leaveoneyearout crossvalidation procedure. Data from one year are iteratively removed from the study dataset of Sect. 2.1 and the stochastic model is calibrated using the remaining data (i.e., 19 years of data are used for model calibration). The model is fully recalibrated, which includes rain typing, inference of the transform function, and creation of a training dataset of latent fields. After model calibration, daily rainfall is simulated for each day of the target year, i.e., the year excluded from the calibration dataset. Fifty simulations are generated to assess the uncertainty associated with stochastic rainfall generation. The same procedure is repeated for each year of the study dataset, which leads to a 20year long validation set made of 50 simulations for each gauge of the O`ahu rain monitoring network.
Simulated rainfall is compared to observations following a multicriteria approach. First, simulation results are evaluated qualitatively by visual inspection of rainfall time series for the four target stations of Fig. 1a. Next, a quantitative assessment is performed using the following evaluation statistics:

Sitespecific rain statistics. The following statistics are considered: quantiles 10 %, 50 %, and 90 % of monthly rain accumulation to assess seasonality; annual rain accumulation to assess interannual variability; quantile–quantile (q–q) plot of the percentiles of daily rain accumulation to assess the probability distribution of daily rainfall; and q–q plot of the percentiles of wetspell duration to assess rain persistence.

Spatial patterns of rain distribution across the island. The following statistics are mapped to investigate the spatial distribution of rainfall: quantiles 10 %, 30 %, 50%, 70 %, and 90 % of daily rain to assess how the probability distribution of rainfall varies in space.

Areal rain statistics with q–q plots of the percentiles of (i) the proportion of dry rain gauges, (ii) mean and (iii) max of daily rain, and (iv) the coefficient of variation of rain accumulation across the island to assess islandscale statistics.
3.1 Rain types in O`ahu
Figure 3 displays the 22 rain types identified for O`ahu Island during the period 1991–2011. Although this number may seem high compared to the number of rain types inferred for midlatitude continental climates (usually less than 10 types, see, e.g., Vrac et al., 2007; Benoit et al., 2018b), we believe that it reflects the tremendous variability of rainfall observed in the Hawai`i archipelago (Giambelluca et al., 2013), which led Hawaiians to use more than one hundred different words to describe rainfall (Akana and Gonzalez, 2015). In addition to the large number of rain types required to account for rainfall variability on tropical islands, one key attribute of the resulting classification is that although no information is given to the classifier about geographical coordinates, time of occurrence, or meteorological covariates, the identified rain types display welldefined patterns of spatial rain distribution (Fig. 3a), seasonality of occurrence (Fig. 3a), and correlation with the regional state of the atmosphere (Sect. S2).
To better identify the main modes of rainfall variability over O`ahu, rain types are pooled into three hyperclasses (H1–3) that can be linked to the three main raingeneration processes in the area (Fig. 3):

H1: almost dry days (Fig. 3, rain types a–g). During these days, most rain gauges report no rain, and no gauge reports more than 5 mm d^{−1} on average. These types of weather conditions are associated with a stable atmosphere and a low moisture flux (Fig. S2).

H2: trade wind days (Fig. 3, rain types h–q). This category displays welldefined spatial patterns of rain accumulation caused by orographic lifting, and are associated with a stable atmosphere, a welldefined trade wind inversion, and an important influx of moisture below the inversion layer under the influence of eastnortheasterly trade winds (Fig. S2). When scrutinizing intertype variability within this category, note that the location of the rain maximum shifts westward with increasing moisture flux, likely due to stronger trade winds causing an overshoot of orographic rain enhancement whereby rain forms over the mountains but falls further downwind on the leeward side (Daly et al., 2017). In addition, for similar wind conditions and, therefore, spatial patterns (compare for instance types j–l), rain intensity is correlated to the instability of the atmosphere (Fig. S2).

H3: regional atmospheric disturbance days (Fig. 3, rain types r–v). These types display either unstructured (types r–t) or relatively homogeneous (types u–v) spatial patterns of rain accumulation and are associated with low pressure, unstable atmosphere, and absent (or weak) trade wind inversion. This allows high moisture content at high altitude (Fig. S2). These rain types mostly occur during winter, i.e., the local rainy season. When scrutinizing intertype variability within this category, note that rain intensity increases with atmospheric instability and the presence of humidity at high altitude, and that the spatial patterns tend to become more structured when the lowlevel moisture influx increases (probably due to stronger and more uniform winds).
Hence, rain typing provides new insights on islandscale rain climatology (Fig. 3b). In particular, this step helps us gain a better understanding of how different atmospheric conditions lead to different raingeneration processes that, when interacting with island topography, generate contrasting orographic effects. In the case of the island of O`ahu, orographic rain enhancement occurring during days influenced by trade winds is the main explanation for the high annual rain accumulations in the Ko`olau mountains (up to 5000 mm annual rainfall), while widespread rainfall linked to regional atmospheric disturbances is the main source of rain at leeward locations, despite their relative temporal scarcity.
3.2 Simulation of sitespecific rainfall time series
Figure 4 displays stochastic rainfall generation outputs for the four rain gauges of Fig. 1a which experience different rainfall climatologies. Visual inspection of the simulated time series shows that our stochastic rainfall generator is able to simulate synthetic rainfall that is almost indistinguishable from the observed rainfall. The dry–wet ratio as well as the marginal distribution of daily rain accumulation seem properly replicated, except for the dry leeward gauge (first row in Fig. 4), where the 20year maximum tends to be overestimated (see also Fig. 5d). In terms of temporal variability, the simulated time series properly capture the seasonal cycle visible for the two leeward gauges (rows 1 and 2 in Fig. 4), as well as the interannual variability visible for the two coastal gauges (rows 1 and 4 in Fig. 4). Finally, the rainfall generator captures the relatively steady behavior of the wet gauge located in the Ko`olau range (row 3 in Fig. 4).
3.3 Simulation of sitespecific rain statistics
To complement the qualitative assessment of sitespecific rainfall time series in Fig. 4, Fig. 5 investigates the statistical features of the results of the crossvalidation procedure (50 realizations are drawn) for the same four rain gauges.
The results in Fig. 5 show that the proposed model correctly simulates rainfall seasonality (Fig. 5b) and interannual variability (Fig. 5c). Note that simulations capture both the stronger seasonality at leeward locations (compared to windward locations) as well as the near absence of seasonality at the wettest gauge located in the Ko`olau Mountains (Fig. 5, third row). The interannual variability of rain accumulation is also properly simulated, in particular at leeward locations, where the impact of winter storms is the highest. These results suggest that the nonhomogeneous Markov chain of order 1 conditioned to monthly aggregated meteorological covariates adequately models the longterm variability of rain accumulation, and that the selected covariates properly capture rain type occurrence in a tropical marine climate.
However, rain persistence is slightly underestimated at some locations, especially for the high percentiles, i.e., longlasting wet spells (Fig. 5e). This result exposes limitations in the use of the nonhomogeneous Markov chain of order 1 for modeling weekly scale temporal variability of rainfall. This may be explained by the fact that daily scale and seasonalscale rainfall fluctuations are informed, respectively, by the Markov chain of order 1 and conditioning to monthly aggregated meteorological covariates, but that the weekly scale is not explicitly included in the model. Although the resulting errors are of low magnitude, they should be kept in mind in case of applications requiring a precise estimation of rain persistence, e.g., crop models in semiarid environments that can be found in some leeward areas of the target islands.
The simulations properly reproduce sitespecific marginal distributions of daily rain accumulation (Fig. 5d), except for the driest gauge (top row in Fig. 5), where the 20year maximum tends to be overestimated. The satisfactory simulation of rainfall distribution at several sites suggests that a typedependent gamma distribution is an adequate model for the nonzero daily rain accumulations across the island. It is noteworthy that all percentiles of the marginal distribution of rain accumulation are properly reproduced in simulations, which suggests that our model is able to simulate the whole spectrum of daily precipitation – from dry days to intense rainfall.
3.4 Simulation of islandscale rain fields
Figure 6 displays the results of the crossvalidation procedure focusing on islandscale features. Figure 6a compares observed and simulated spatial patterns for five quantiles of daily rain accumulation across the island of O`ahu. Results show very good model performance in reproducing the spatial patterns of daily rainfall. This result was expected because the use of empirical copulas combined with rain typing is almost equivalent to resampling the observed spatial patterns conditional to meteorological covariates. However, satisfactory simulation results ensure that the rain typebased resampling of latent fields is unbiased and that the choice and calibration of the metaGaussian model are relevant for the study island.
Figure 6b–e assesses the ability of the model to simulate four key rain statistics – the proportion of dry gauges, mean and max of daily rain accumulation, and coefficient of variation of daily rain across the island – aggregated over all rain gauges of the rain monitoring network of O`ahu. Results show a slight underestimation of the low percentiles of the proportion of dry gauges (Fig. 6b), which corresponds to a slight overestimation of the frequency of proportions of dry rain gauges below 5 % (i.e., 4 gauges out of 86 in the present setting). Thus, our model tends to simulate rain at all 86 gauges when a very small number of gauges (less than 4) actually record no rain, leading to a slight drizzle effect in space. Careful examination of the spatial patterns of daily rain percentiles (Fig. 6a) shows that the patterns of spatial intermittency are properly simulated, which suggests that the drizzle effect is randomly distributed amongst locations, thus reducing its potential impact for applications. This level of accuracy in the simulation of the rain fraction shows that a truncated Gaussian latent field is an appropriate model for rain intermittency. In addition, the correct simulation of the spatial patterns of dry locations in Fig. 6a suggests that the distancebased modeling of the censored latent values (Eq. 2) coupled with empirical copulas is a proper model for the spatial distribution of dry locations. Similarly, the good agreement between observed and simulated coefficients of variation (Fig. 6e) coupled with the correct simulation of spatial patterns of nonzero daily rain accumulation in Fig. 6a suggest that the selected metaGaussian framework captures the spatial distribution of nonzero rain accumulations.
Finally, Fig. 6c and d shows that islandscale daily mean and maximum rain accumulations are properly simulated, despite an overestimation of the last percentile of the maximum, i.e., the 20year maximum observed over the whole island. This result suggests that the metaGaussian framework coupled with the kernel estimation of the transform function parameters performs reasonably well to reproduce the marginal distribution of islandscale rain accumulation. However, the attempt to reproduce both islandscale statistics and sitespecific marginal distributions (from dry days to heavy rainfall) results in an inaccurate simulation of the islandscale 20year extreme precipitation. This limitation calls for additional developments before the proposed model can be used for simulating extremes in a spatial context (Opitz et al., 2021).
3.5 Model versatility
To investigate the versatility of our stochastic daily rainfall model, the case study performed in Sects. 3.1–3.3 for the island of O`ahu (Hawai`i, USA) located in the North Pacific was repeated in Sect. S4 (in the Supplement) for the island of Tahiti (French Polynesia) located in the South Pacific. Figure 7 provides a brief overview of the results. This additional crossvalidation shows that our model also performs very well for Tahiti, despite a wetter (annual rain reaches 10 000 mm in Tahiti) and more seasonal climate than the O`ahu case study. In addition, the model adapts automatically to different dataset sizes (86 rain gauges × 21 years for O`ahu, 26 gauges × 11 years for Tahiti) and rain climatologies due to the selection of different numbers of rain types. These results suggest that our model may be adapted to most high tropical islands across the globe.
4.1 Stochastic modeling of orographic rainfall patterns
Validation results in Sect. 3 show that the proposed model is able to simulate realistic multisite rainfall time series, which accurately reproduce sitespecific and islandscale daily rain statistics for two different tropical islands. This has been made possible by a twostep modeling approach (rain typing and metaGaussian representation of islandscale daily rainfall), which takes into account our conceptual knowledge about orographic rain enhancement in tropical islands.
The first component consists of rain types, which summarize islandscale rain statistics. Unlike weather typebased approaches (Ailliot et al., 2015; Réchou et al., 2019), we define rain types based on rain features only, i.e., no information about meteorological covariates or largescale circulation are included during the classification step. This leads to a classification centered on rainfall intensity and spatial distribution, which allows us to explore how islandscale rainfall variability is impacted by orographic effects (Sect. 3.1). The links between rain types and local climate are established in a second step by conditioning the nonhomogeneous Markov model of rain type occurrence to meteorological covariates. We conceptualize rain types as the main modes of islandscale daily rainfall variability, which is assumed to be primarily influenced by orographic effects caused by interactions between changing atmospheric conditions and fixed island topography. In this context, one interesting contribution of this study is the refinement of the meteorological predictors proposed by Sanfilippo (2020) for rain type occurrence in a tropical marine climate, in particular to distinguish between shallow convection occurring during typical trade wind situations and deeper convection in the vicinity of atmospheric disturbances.
The second component of the model consists of a metaGaussian representation of islandscale daily rainfall. By explicitly separating rain intensity and spatial distribution, this representation contributed to the performance of the raintyping procedure detailed above and in the identification of rain types with welldefined spatial patterns. When used for stochastic rainfall generation, the adopted metaGaussian representation performed well in simulating sitespecific rain statistics as well as islandscale spatial patterns of daily rain accumulation. This good performance can be explained by two factors. First, the determination of the censored latent values based on the distance to the closest wet gauge (Eq. 2) generates realistic spatial patterns of dry areas and dry–wet transition (Schleiss et al., 2014). This contributes to the proper modeling of the spatial intermittency of daily rain fields in tropical islands, which is caused by the drying effect of sinking air masses after crossing mountains. The second innovation of the model is the joint use of empirical copulas and a parametric transform function to model the spatial patterns of nonzero rain. It has the advantage of faithfully preserving the spatial rainfall patterns while generating unobserved values through the kernel density estimation of the transform function parameters distribution. The choice of mimicking the observed spatial rainfall patterns as closely as possible is justified by the complexity of orographic effects and associated rain gradients on tropical islands (Giambelluca et al., 2013; Laurent et al., 2019; Benoit et al., 2021).
4.2 Concluding remarks
In this paper we present a new stochastic daily rainfall generator dedicated to high tropical islands. The combination of (i) rain types, (ii) a nonhomogeneous Markov model of rain type occurrence conditioned to meteorological covariates, and (iii) a metaGaussian representation of the spatial distribution of daily rainfall allowed us to generate realistic daily rain fields honoring both sitespecific and islandscale rain statistics. The performance of the model was carefully tested and illustrated for the islands of O`ahu (Hawai`i, USA) and Tahiti (French Polynesia), both located in the tropical Pacific. Crossvalidation results prove the ability of the model to capture and simulate the main features of daily rainfall over these two high tropical islands.
The main strength of our model is its ability to simulate diverse patterns of multisite daily rainfall, as well as their linkage to regional atmospheric conditions. It represents a new tool for stochastic investigation and modeling of orographic rain enhancement on tropical islands with complex topography. The main limitations, however, are (i) the inaccurate simulation of extreme rainfalls, which calls for caution when using our model for flood risk assessment, and (ii) the restriction to multisite simulation, which calls for an additional step of stochastic interpolation of the multisite patterns when gridded outputs are required.
Because of the abovementioned strengths and limitations, the main envisioned applications relate to impact studies that require detailed knowledge of daily precipitation in tropical islands, in particular when the spatial pattern of rainfall plays an important role. This includes watershed water resource management and ecohydrological studies.
Our model can also be used for stochastic downscaling of future precipitation projections and thereby contribute to the current efforts to better understand, manage, and secure tropical island water resources in a changing climate. An essential future investigation in this direction will be to assess how well GCMs simulate the set of meteorological covariates we selected to drive our stochastic rainfall generator, in particular the vertical temperature gradient used to inform shallow convection.
The implementation of the proposed stochastic rainfall model is open source (MATLAB implementation) and freely available in the following repository (https://doi.org/10.5281/zenodo.6462982; Benoit, 2022). The dataset of daily rainfall observations on O`ahu is open data and freely available on the Hawai`i Climate Data Portal (https://www.hawaii.edu/climatedataportal/; HCDP, 2022). An extract of this dataset is available in MATLAB format as a code demo in the same repository as the source code of the model. The dataset of daily rainfall observations on Tahiti is available upon request from Météo France (contact.polynesiefrancaise@meteo.fr) and Groupement d'Etudes et de Gestion du Domaine Public de Polynésie Française (secretariat@equipement.gov.pf).
The supplement related to this article is available online at: https://doi.org/10.5194/hess2621132022supplement.
LB, LS, and TWG designed the experiment. MPL and LS compiled the daily rainfall datasets of O`ahu and Tahiti, respectively. LB and ADN selected the meteorological covariates and designed the nonhomogeneous Markov chain. LB and MPL designed the metaGaussian model and the raintyping method. LB implemented the model and performed the numerical experiments. LB wrote the paper with input and corrections from all coauthors.
The contact author has declared that neither they nor their coauthors have any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The authors are grateful to the Hawai`i Climate Data Portal for providing the daily rainfall dataset of the island of O`ahu, and to the French Weather Agency (Direction Interrégionale en Polynésie française – Météo France) and the Polynesian public service named Direction de l'Equipement (Groupement d'Etudes et de Gestion du Domaine Public de Polynésie Française – GEGDP) for providing the daily rainfall dataset of the island of Tahiti. The authors are grateful to May Izumi from the Publication services of the School of Ocean and Earth Science and Technology, University of Hawai`i, for proofreading this manuscript.
This research has been supported by the Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung (grant no. P2LAP2_191395). The work of Lydie Sichoix is supported by the Government of French Polynesia – Ministère de la Recherche through the project ECRQEST, grant number 05832 MED 08/26/2019.
This paper was edited by Lelys Bravo de Guenni and reviewed by Geoff Pegram and one anonymous referee.
Ailliot, P., Thompson, C., and Thomson, P.: Space–time modelling of precipitation by using a hidden Markov model and censored Gaussian distributions, Appl. Stat., 58, 405–426, https://doi.org/10.1111/j.14679876.2008.00654.x, 2009.
Ailliot, P., Allard, D., Monbet, V., and Naveau, P.: Stochastic weather generators: an overview of weather type models, Journal de la Société Française de Statistiques, 156, 101–113, 2015.
Akana, C. L. and Gonzalez, K.: Hanau ka Ua Hawaiian Rain Names, Kamehameha Publishing, 327 pp., ISBN 13:9780873362467, 2015.
Allard, D. and Bourotte, M.: Disaggregating daily precipitations into hourly values with a transformed censored latent Gaussian process, Stoch. Environ. Res. Risk A., 29, 453–462, https://doi.org/10.1007/s0047701409134, 2015.
Ambrosino, C., Chandler, R. E., and Todd, M. C.: Rainfallderived growing season characteristics for agricultural impact assessments in South Africa, Theor. Appl. Climatol., 115, 411–426, https://doi.org/10.1007/s007040130896y, 2014.
Bárdossy, A. and Plate, E. J.: Modelling daily rainfall using a semiMarkov representation of circulation pattern occurence, J. Hydrol., 122, 33–47, https://doi.org/10.1016/00221694(91)90170M, 1991.
Bárdossy, A. and Pegram, G. G. S.: Copula based multisite model for daily precipitation simulation, Hydrol. Earth Syst. Sci., 13, 2299–2314, https://doi.org/10.5194/hess1322992009, 2009.
Bauer, P., Thorpe, A., and Brunet, G.: The quiet revolution of numerical weather prediction, Nature, 525, 47–55, https://doi.org/10.1038/nature14956, 2015.
Baxevani, A. and Lennartsson, J.: A spatiotemporal precipitation generator based on a censored latent Gaussian field, Water Resour. Res., 51, 4338–4358, https://doi.org/10.1002/2014WR016455, 2015.
Bennett, B., Thyer, M., Leonard, M., Lambert, M., and Bates, B. C.: A comprehensive and systematic evaluation framework for a parsimonious daily rainfall field model, J. Hydrol., 556, 1123–1138, https://doi.org/10.1016/j.jhydrol.2016.12.043, 2018.
Benoit, L.: StochasticRainfallGenerator_TropicalIslands: Release v1 (Version v1), Zenodo [code], https://doi.org/10.5281/zenodo.6462982, 2022.
Benoit, L., Allard, D., and Mariethoz, G.: Stochastic Rainfall Modeling at Subkilometer Scale, Water Resour. Res., 54, 4108–4130, https://doi.org/10.1029/2018WR022817, 2018a.
Benoit, L., Vrac, M., and Mariethoz, G.: Dealing with nonstationarity in subdaily stochastic rainfall models, Hydrol. Earth Syst. Sci., 22, 5919–5933, https://doi.org/10.5194/hess2259192018, 2018b.
Benoit, L., Vrac, M., and Mariethoz, G.: Nonstationary stochastic rain type generation: accounting for climate drivers, Hydrol. Earth Syst. Sci., 24, 2841–2854, https://doi.org/10.5194/hess2428412020, 2020.
Benoit, L., Lucas, M. P., Tseng, H., Huang, Y.F., Tsang, Y.P., Nugent, A. D., Giambelluca, T. W., and Mariethoz, G.: High SpaceTime Resolution Observation of Extreme Orographic Rain Gradients in a Pacific Island Catchment, Front. Earth Sci., 8, 546246, https://doi.org/10.3389/feart.2020.546246, 2021.
Breinl, K., Di Baldassarre, G., Girons Lopez, M., Hagenlocher, M., Vico, G., and Rutgersson, A.: Can weather generation capture precipitation patterns across different climates, spatial scales and under data scarcity?, Scient. Rep., 7, 5449, https://doi.org/10.1038/s4159801705822y, 2017.
Brown, J. R., Lengaigne, M., Lintner, B. R., Widlansky, M. J., van der Wiel, K., Dutheil, C., Linsley, B. K., Matthew, A. J., and Renwick, J.: South Pacific Convergence Zone dynamics, variability and impacts in a changing climate, Nat. Rev. Earth Environ., 1, 530–543, https://doi.org/10.1038/s4301702000782, 2020.
Cappelaere, B., Feurer, D., Vischel, T., Ottlé, C., Issoufou, H. B.A., SauxPicart, S., Maïnassara, I., Oï, M., Chazarin, J.P., Barral, H., Coudert, B., and Demarty, J.: Modeling Land Surface Fluxes from Uncertain Rainfall: A Case Study in the Sahel with FieldDriven Stochastic Rainfields, Atmosphere, 11, 465, https://doi.org/10.3390/atmos11050465, 2020.
Caruso, S. J. and Businger, S.: Subtropical Cyclogenesis over the Central North Pacific, Weather Forecast., 21, 193–205, https://doi.org/10.1175/WAF914.1, 2006.
Caseri, A., Javelle, P., Ramos, M. H., and Leblois, E.: Generating precipitation ensembles for flood alert and risk management, J. Flood Risk Manage., 9, 402–415, https://doi.org/10.1111/jfr3.12203, 2016.
Chandler, R. E.: Multisite, multivariate weather generation based on generalized linear models, Environ. Model. Softw., 134, 104867, https://doi.org/10.1016/j.envsoft.2020.104867, 2020.
Daly, C., Slater, M. E., Roberti, J. A., Laseter, S. H., and Swift L. W.: Highresolution precipitation mapping in a mountainous watershed: ground truth for evaluating uncertainty in a national precipitation dataset, Int. J. Climatol., 37, 124–137, https://doi.org/10.1002/joc.4986, 2017.
Elison Timm, O., Giambelluca, T. W., and Diaz, H. F.: Statistical downscaling of rainfall changes in Hawai`i based on the CMIP5 global model projections, J. Geophys. Res.Atmos., 120, 92–112, https://doi.org/10.1002/2014JD022059, 2014.
Foresti, L. and Pozdnoukhov, A.: Exploration of alpine orographic precipitation patterns with radar image processing and clustering techniques, Meteorol. Appl., 19, 407–419, https://doi.org/10.1002/met.272, 2012.
Fraley, C. and Raftery, A. E.: ModelBased Clustering, Discriminant Analysis, and Density Estimation, J. Am. Stat. Assoc., 97, 611–631, https://doi.org/10.1198/016214502760047131, 2002.
Frazier, A. G., Elison Timm, O., Giambelluca, T. W., and Diaz, H. F.: The influence of ENSO, PDO and PNA on secular rainfall variations in Hawai`i, Clim. Dynam., 51, 2127–2140, https://doi.org/10.1007/s0038201740034, 2018.
Gabellani, S., Boni, G., Ferraris, L., Von Hardenberg, J., and Provenzale, A.: Propagation of uncertainty from rainfall to runoff: A case study with a stochastic rainfall generator, Adv. Water Resour., 30, 2061–2071, https://doi.org/10.1016/j.advwatres.2006.11.015, 2007.
Gangopadhyay, S. and Clark, M.: Statistical downscaling using Knearest neighbors, Water Resour. Res., 41, W02024, https://doi.org/10.1029/2004WR003444, 2005.
Giambelluca, T. W., Chen, Q., Frazier, A. G., Price, J. P., Chen, Y.L., Chu, P.S., Eischeid, J. K., and Delparte, D. M.: Online Rainfall Atlas of Hawai`i, B. Am. Meteorol. Soc., 94, 313–316, https://doi.org/10.1175/BAMSD1100228.1, 2013.
Greene, A. M., Robertson, A. W., Smyth, P., and Triglia, S.: Downscaling projections of Indian monsoon rainfall using a nonhomogeneous hidden Markov model, Q. J. Roy. Meteorol. Soc., 137, 347–359, https://doi.org/10.1002/qj.788, 2011.
HCDP – Hawai`i Climate Data Portal: https://www.hawaii.edu/climatedataportal/, last access: 15 April 2022.
Hersbach, H., de Rosnay, P., Bell, B., Schepers, D., Simmons, A., Soci, C., Abdalla, S., AlonsoBalmaseda, M., Balsamo, G., Bechtold, P., Berrisford, P., Bidlot, J.R., de Boisséson, E., Bonavita, M., Browne, P., Buizza, R., Dahlgren, P., Dee, D., Dragani, R., Diamantakis, M., Flemming, J., Forbes, R., Geer, A. J., Haiden, T., Hólm, E., Haimberger, L., Hogan, R., Horányi, A., Janiskova, M., Laloyaux, P., Lopez, P., MunozSabater, J., Peubey, C., Radu, R., Richardson, D., Thépaut, J.N., Vitart, F., Yang, X., Zsótér, E., and Zuo, H.: Operational global reanalysis: progress, future directions and synergies with NWP, ERA Rep. Ser. 27, ECMWF, 1–63, https://doi.org/10.21957/tkic6g3wm, 2018.
Hopuare, M., Pontaud, M., Céron, J.P., Ortega, P., and Laurent, V.: Climate change, Pacific climate drivers and observed precipitation variability in Tahiti, French Polynesia, Clim. Res., 63, 157–170, https://doi.org/10.3354/cr01288, 2015.
Hopuare, M., Guglielmino, M., and Ortega, P.: Interactions between intraseasonal and diurnal variability of precipitation in the South Central Pacific: The case of a small high island, Tahiti, French Polynesia, Int. J. Climatol., 39, 670–686, https://doi.org/10.1002/joc.5834, 2018.
Houze, R. A.: Orographic effects on precipitating clouds, Rev. Geophys., 50, RG000365, https://doi.org/10.1029/2011RG000365, 2012.
Huang, S. P., Quek, S. T., and Phoon, K. K.: Convergence study of the truncated Karhunen–Loeve expansion for simulation of stochastic processes, Int. J. Numer. Meth. Eng., 52, 1029–1043, https://doi.org/10.1002/nme.255, 2001.
Hughes, J. P. and Guttorp, P.: A nonhomogeneous hidden Markov model for precipitation occurence, Appl. Stat., 48, 15–30, 1999.
Jha, S. K., Mariethoz, G., Evans, J., McCabe, M. F., and Sharma, A.: A space and time scaledependent nonlinear geostatistical approach for downscaling daily precipitation and temperature, Water Resour. Res., 51, 6244–6261, https://doi.org/10.1002/2014WR016729, 2014.
Kleiber, W., Katz, R. W., and Rajagopalan, B.: Daily spatiotemporal precipitation simulation using latent and transformed Gaussian processes, Water Resour. Res., 48, W01523, https://doi.org/10.1029/2011WR011105, 2012.
Krajewski, W. F., Ciach, G., and Habib, E.: An analysis of smallscale rainfall variability in different climatic regimes, Hydrolog. Sci. J., 48, 151–162, https://doi.org/10.1623/hysj.48.2.151.44694, 2003.
Lantuéjoul, C.: Geostatistical Simulation: Models and Algorithms, Springer Science & Business Media, ISBN 9783662048085, 2002.
Laurent, V., Maamaatuaiahutapu, K., Brodien, I., Lombardo, S., Tardy, M., and Varney, P. : Atlas climatologique de la Polynésie française, Délégation Interrégionale de Polynésie Française, Météo France, ISBN 9782111551916, 232 pp., 2019.
Leblois, E. and Creutin, J. D.: Spacetime simulation of intermittent rainfall with prescribed advection field: Adaptation of the turning band method, Water Resour. Res., 49, 3375–3387, https://doi.org/10.1002/wrcr.20190, 2013.
Longman, R. J., Diaz, H. F., and Giambelluca, T. W.: Sustained Increases in LowerTropospheric Subsidence over the Central Tropical North Pacific Drive a Decline in HighElevation Rainfall in Hawaii, J. Climate, 28, 8743–8759, https://doi.org/10.1175/JCLID150006.1, 2015.
Longman, R. J., Giambelluca, T. W., Nullet, M. A., Frazier, A. G., Kodoma, K., Crausbay, S. D., Krushelnycky, P. D., Cordell, S., Clark, M. P., Newman, A. J., and Jeffrey, R. A.: Compilation of climate data from heterogeneous networks across the Hawaiian Islands, Scient. Data, 5, 180012, https://doi.org/10.1038/sdata.2018.12, 2018.
Longman, R. J., Elison Timm, O., Giambelluca, T. W., and Kaiser, L.: A 20Year Analysis of DisturbanceDriven Rainfall on O`ahu, Hawai`i, Mon. Weather Rev., 6, 1767–1783, https://doi.org/10.1175/MWRD200287.1, 2021.
Lyons, S. W.: Empirical Orthogonal Function Analysis of Hawaiian Rainfall, J. Appl. Meteorol., 21, 1713–1729, https://doi.org/10.1175/15200450(1982)021<1713:EOFAOH>2.0.CO;2, 1982.
Maraun, D., Wetterhall, F., Ireson, A. M., Chandler, R. E., Kendon, E. J., Widmann, M., Brienen, S., Rust, H. W., Sauter, T., Themeßl, M., Venema, V. K. C., Chun, K. P., Goodess, C. M., Jones, R. G., Onof, C., Vrac, M., and ThieleEich, I.: Precipitation downscaling under climate change: Recent developments to bridge the gap between dynamical models and the end user, Rev. Geophys., 48, RG3003, https://doi.org/10.1029/2009RG000314, 2010.
Marra, F. and Morin, E.: Autocorrelation structure of convective rainfall in semiaridarid climate derived from highresolution XBand radar estimates, Atmos. Res., 200, 126–138, https://doi.org/10.1016/j.atmosres.2017.09.020, 2018.
Mavromatis, T. and Hansen, J. W.: Interannual variability characteristics and simulated crop response of four stochastic weather generators, Agr. Forest Meteorol., 109, 283–296, https://doi.org/10.1016/S01681923(01)002726, 2001.
Mezghani, A. and Hingray, B.: A combined downscalingdisaggregation weather generator for stochastic generation of multisite hourly weather variables over complex terrain: Development and multiscale validation for the Upper Rhone River basin, J. Hydrol., 377, 245–260, https://doi.org/10.1016/j.jhydrol.2009.08.033, 2009.
Morin, E., Ryb, T., Gavrieli, I., and Enzel, Y.: Mean, variance, and trends of Levant precipitation over the past 4500 years from reconstructed Dead Sea levels and stochastic modeling, Quatern. Res., 91, 751–767, https://doi.org/10.1017/qua.2018.98, 2019.
Nerini, D., Besic, N., Sideris, I. V., Germann, U., and Foresti, L.: A nonstationary stochastic ensemble generator for radar rainfall fields based on the shortspace Fourier transform, Hydrol. Earth Syst. Sci., 21, 2777—2797, https://doi.org/10.5194/hess2127772017, 2017.
Niemi, T. J., Guillaume, J. H. A., Kokkonen, T., Hoang, T. M. T., and Seed, A. W.: Role of spatial anisotropy in design storm generation: Experiment and interpretation, Water Resour. Res., 52, 69–89, https://doi.org/10.1002/2015WR017521, 2016.
Opitz, T., Allard, D., and Mariethoz, G.: Semiparametric resampling with extremes, Spatial Stat., 42, 100445, https://doi.org/10.1002/2015WR017521, 2021.
Oriani, F., Stisen, S., Demirel, M. C., and Mariethoz, G.: Missing Data Imputation for Multisite Rainfall Networks: A Comparison between Geostatistical Interpolation and PatternBased Estimation on Different Terrain Types, J. Hydrometeorol., 21, 2325–2341, https://doi.org/10.1175/JHMD190220.1, 2020.
Papalexiou, S. M. and Serinaldi, F.: Random Fields Simplified: Preserving Marginal Distributions, Correlations, and Intermittency, With Applications From Rainfall to Humidity, Water Resour. Res., 56, e2019WR026331, https://doi.org/10.1029/2019WR026331, 2020.
Paschalis, A., Molnar, P., Fatichi, S., and Burlando, P.: A stochastic model for highresolution spacetime precipitation simulation, Water Resour. Res., 49, 8400–8417, https://doi.org/10.1002/2013WR014437, 2013.
Paschalis, A., Fatichi, S., Molnar, P. M., Rimkus, S., and Burlando, P.: On the effects of small scale space–time variability of rainfall on basin flood response, J. Hydrol., 514, 313–327, https://doi.org/10.1016/j.advwatres.2013.11.006, 2014.
Paulhus, J. L. and Kohler, M. A.: Interpolation of missing precipitation records, Mon. Weather Rev., 80, 129–133, https://doi.org/10.1175/15200493(1952)080<0129:IOMPR>2.0.CO;2, 1952.
Peleg, N., Skinner, C., Fatichi, S., and Molnar, P.: Temperature effects on the spatial structure of heavy rainfall modify catchment hydromorphological response, Earth Surf. Dynam., 8, 17–36, https://doi.org/10.5194/esurf8172020, 2020.
Réchou, A., Flores, O., Jumaux, G., Duflot, V., Bousquet, O., Pouppeville, C., and Bonnardot, F.: Spatiotemporal variability of rainfall in a high tropical island: Patterns and largescale drivers in Réunion Island, Q. J. Roy. Meteorol. Soc., 145, 893–909, https://doi.org/10.1002/qj.3485, 2019.
Richardson, C. W.: Stochastic simulation of daily precipitation, temperature, and solar radiation, Water Resour. Res., 17, 182–190, https://doi.org/10.1029/WR017i001p00182, 1981.
Rüschendorf, L.: On the distributional transform, Sklar's theorem, and the empirical copula process, J. Stat. Plan. Infer., 139, 3921–3927, https://doi.org/10.1016/j.jspi.2009.05.030, 2009.
Sanfilippo, K. M.: Predictor selection and model evaluation for future rainfall projection in Hawai`i, University of Hawai`i at Mānoa, Honolulu, 124 pp., http://hdl.handle.net/10125/73340 (last access: 15 April 2022), 2020.
Schleiss, M., Chamon, S. and Berne, A.: Nonstationarity in Intermittent Rainfall: The “Dry Drift”, J. Hydrometeorol., 15, 1189–1204, https://doi.org/10.1175/JHMD13095.1, 2014.
Schwartz, G.: Estimating the dimension of a model, Ann. Stat., 6, 461–464, https://doi.org/10.1214/aos/1176344136, 1978.
Scott, D. W.: On optimal and databased histograms, Biometrika, 66, 605–610, https://doi.org/10.1093/biomet/66.3.605, 1979.
Scott, D. W.: Scott's rule, WIREs Comput. Stat., 2, 497–502, https://doi.org/10.1002/wics.103, 2010.
Supit, I., van Diepen, C. A., de Wit, A. J. W., Wolf, J., Kabat, P., Baruth, B., and Ludwig, F.: Assessing climate change effects on European crop yields using the Crop Growth Monitoring System and a weather generator, Agr. Forest Meteorol., 164, 96–111, https://doi.org/10.1016/j.agrformet.2012.05.005, 2012.
Volosciuk, C., Maraun, D., Vrac, M., and Widmann, M.: A combined statistical bias correction and stochastic downscaling method for precipitation, Hydrol. Earth Syst. Sci., 21, 1693–1719, https://doi.org/10.5194/hess2116932017, 2017.
Vrac, M., Stein, M., and Hayhoe, K.: Statistical downscaling of precipitation through nonhomogeneous stochastic weather typing, Clim. Res., 34, 169–184, https://doi.org/10.3354/cr00696, 2007.
Vu, T. M., Mishra, A. K., Konapala, G., and Liu, D.: Evaluation of multiple stochastic rainfall generators in diverse climatic regions, Stoch. Environ. Res. Risk A., 32, 1337–1353, https://doi.org/10.1007/s0047701714580, 2018.
Wilby, R. L.: Stochastic weather type simulation for regional cliamte change assessment, Water Resour. Res., 30, 3395–3403, https://doi.org/10.1029/94WR01840, 1994.
Wilcox, C., Aly, C., Vischel, T., Panthou, G., Blanchet, J., Quantin, G., and Lebel, T.: Stochastorm: A Stochastic Rainfall Simulator for Convective Storms, J. Hydrometeorol., 22, 387–404, https://doi.org/10.1175/JHMD200017.1, 2021.
Wilks, D. S. and Wilby, R. L.: The weather generation game: a review of stochastic weather models, Prog. Phys. Geogr., 23, 329–357, https://doi.org/10.1177/030913339902300302, 1999.
Yiou, P.: AnaWEGE: a weather generator based on analogues of atmospheric circulation, Geosci. Model Dev., 7, 531–543, https://doi.org/10.5194/gmd75312014, 2014.