Impact of high-resolution sea surface temperature representation on the forecast of small Mediterranean catchments’ hydrological responses to heavy precipitation

Operational meteo-hydrological forecasting chains are affected by many sources of uncertainty. In coastal areas characterized by complex topography, with several medium-to-small size catchments, quantitative precipitation forecast becomes even more challenging due to the interaction of intense air–sea exchanges with coastal orography. For such areas, which are quite common in the Mediterranean Basin, improved representation of sea surface temperature (SST) space–time patterns can be particularly important. The paper focuses on the relative impact of different resolutions of SST representation on regional operational forecasting chains (up to river discharge estimates) over coastal Mediterranean catchments, with respect to two other fundamental options while setting up the system, i.e. the choice of the forcing general circulation model (GCM) and the possible use of a three-dimensional variational assimilation (3D-Var) scheme. Two different kinds of severe hydro-meteorological events that affected the Calabria region (southern Italy) in 2015 are analysed using the WRF-Hydro atmosphere–hydrology modelling system in its uncoupled version. Both of the events are modelled using the 0.25 resolution global forecasting system (GFS) and the 16 km resolution integrated forecasting system (IFS) initial and lateral atmospheric boundary conditions, which are from the European Centre for Medium-Range Weather Forecasts (ECMWF), applying the WRF mesoscale model for the dynamical downscaling. For the IFS-driven forecasts, the effects of the 3D-Var scheme are also analysed. Finally, native initial and lower boundary SST data are replaced with data from the Medspiration project by Institut Français de Recherche pour L’Exploitation de la Mer (IFREMER)/Centre European Remote Sensing d’Archivage et de Traitement (CERSAT), which have a 24 h time resolution and a 2.2 km spatial resolution. Precipitation estimates are compared with both ground-based and radar data, as well as discharge estimates with stream gauging stations’ data. Overall, the experiments highlight that the added value of high-resolution SST representation can be hidden by other more relevant sources of uncertainty, especially the choice of the general circulation model providing the boundary conditions. Nevertheless, in most cases, high-resolution SST fields show a non-negligible impact on the simulation of the atmospheric boundary layer processes, modifying flow dynamics and/or the amount of precipitated water; thus, this emphasizes the fact that uncertainty in SST representation should be duly taken into account in operational forecasting in coastal areas.

Abstract.Operational meteo-hydrological forecasting chains are affected by many sources of uncertainty.In coastal areas characterized by complex topography, with several medium-to-small size catchments, quantitative precipitation forecast becomes even more challenging due to the interaction of intense air-sea exchanges with coastal orography.For such areas, which are quite common in the Mediterranean Basin, improved representation of sea surface temperature (SST) space-time patterns can be particularly important.The paper focuses on the relative impact of different resolutions of SST representation on regional operational forecasting chains (up to river discharge estimates) over coastal Mediterranean catchments, with respect to two other fundamental options while setting up the system, i.e. the choice of the forcing general circulation model (GCM) and the possible use of a three-dimensional variational assimilation (3D-Var) scheme.Two different kinds of severe hydro-meteorological events that affected the Calabria region (southern Italy) in 2015 are analysed using the WRF-Hydro atmosphere-hydrology modelling system in its uncoupled version.Both of the events are modelled using the 0.25 • resolution global forecasting system (GFS) and the 16 km resolution integrated forecasting system (IFS) initial and lateral atmospheric boundary conditions, which are from the European Centre for Medium-Range Weather Forecasts (ECMWF), applying the WRF mesoscale model for the dynamical downscaling.For the IFS-driven forecasts, the effects of the 3D-Var scheme are also analysed.Finally, native initial and lower boundary SST data are replaced with data from the Medspiration project by Institut Français de Recherche pour L'Exploitation de la Mer (IFRE-MER)/Centre European Remote Sensing d'Archivage et de Traitement (CERSAT), which have a 24 h time resolution and a 2.2 km spatial resolution.Precipitation estimates are compared with both ground-based and radar data, as well as discharge estimates with stream gauging stations' data.Overall, the experiments highlight that the added value of high-resolution SST representation can be hidden by other more relevant sources of uncertainty, especially the choice of the general circulation model providing the boundary conditions.Nevertheless, in most cases, high-resolution SST fields show a non-negligible impact on the simulation of the atmospheric boundary layer processes, modifying flow dynamics and/or the amount of precipitated water; thus, this emphasizes the fact that uncertainty in SST representation should be duly taken into account in operational forecasting in coastal areas.

Introduction
Operational river flood forecasting is a highly challenging activity for several reasons that go beyond strictly scientific aspects.Hydrometeorological forecasting requires extremely complex systems, where issues like communication of warning, accessibility of the results, and administrative and/or institutional factors can be as important as monitoring and modelling activities (Pagano et al., 2014;Silvestro et al., 2017).Nevertheless, the cornerstone of such systems, and undoubtedly the most demanding part from a scientific point of view, is still the meteorological-hydrological mod-Published by Copernicus Publications on behalf of the European Geosciences Union.
elling chain, supported by in situ or remotely sensed measurements.
Increasingly refined modelling chains have been developed in recent years (e.g.UK Environmental Prediction research, Lewis et al., 2019a;Canadian Great Lakes, Gronewold et al., 2011; the US Navy's Coupled Ocean/Atmosphere Mesoscale Prediction System COAMPS ® , Hodur, 1997).Despite their complexity, these systems all have to deal with some inherent limitations of the meteorological and hydrological models.The main sources of errors in weather forecasts are connected to both inaccuracy in defining the initial state, due to the lack of available measures or observation/assimilation errors, and approximations of the models, whose structures are not capable of properly representing the phenomena of interest (Allen et al., 2002;Buizza, 2018).These problems are exacerbated by the chaotic nature of the atmosphere.Even though hydrological models are much simpler than meteorological models with respect to their structure (Liu et al., 2012;Pagano et al., 2014), they also have to struggle with different sources of uncertainty that, according to Renard et al. (2010), can be grouped into four categories: (1) input uncertainty, (2) output uncertainty (e.g.runoff estimates are not straightforward), (3) structural model uncertainty, and (4) parametric uncertainty.Furthermore, as catchments are very seldom perfect natural systems, some effects of human disturbances can virtually not be modelled.
The main link between atmospheric and hydrological compartments in a forecasting chain is precipitation forecast, which is an output variable for weather models and constitutes the main input for hydrological models.Quantitative precipitation forecast (QPF) is a major challenge for operational meteorology, because the reliability of precipitation forecasts crucially affects streamflow forecasts' skill (for a review see Cuo et al., 2011; for recent applications see e.g.Davolio et al., 2015Davolio et al., , 2017;;Tao et al., 2016;Li et al., 2017).Among the various strategies adopted for addressing this issue, in recent years, several studies that were focused on coastal areas have assessed the importance of sea surface temperature (SST) initial and boundary conditions as relevant drivers of QPF, which, as previously stated, is consequently capable of influencing the streamflow forecast.This impact can be particularly strong in topographically complex coastal areas, characterized by several small catchments, such as in the Mediterranean Basin, for which several cooperative research efforts have been activated, including the MEDiterranean EXperiment (MEDEX; Jansa et al., 2014) and the HYdrological cycle in the Mediterranean eXperiment (HyMeX; Drobinski et al., 2018).
Several studies have recently focused on the effects of sea surface-atmosphere interactions over heavy precipitation at midlatitudes, particularly in the Mediterranean area (e.g.Manzato et al., 2015;Romaniello et al., 2015;Rainaud et al., 2016).Some of them showed that large variations in the average values of SST boundary conditions significantly affected the location and intensity of high-impact events (Lebeaupin et al., 2006;Miglietta et al., 2011;Senatore et al., 2014;Meredith et al., 2015;Pastor et al., 2015;Miglietta et al., 2017;Pytharoulis, 2018).Furthermore, using coupled atmosphere-ocean simulations, Berthou et al. (2014Berthou et al. ( , 2015) ) highlighted the major effects of long-term SST changes in the representation of Mediterranean intense rain events, although features at shorter timescales can also contribute significantly.Lebeaupin et al. (2006) found that higher-resolution SST fields have poor effects on convection in the case study they analysed (southern France).Ivatek-Šahdan et al. (2018), examining several events in the eastern Adriatic, also found that more realistic SST fields did not substantially improve precipitation estimates; furthermore, they showed that the impact of high-resolution SST varied in different cases.Conversely, Katsafados et al. (2011) found noticeable deviations among the forecast skills of simulations with SST boundary conditions at different resolutions in a test case in the eastern Mediterranean, whereas Cassola et al. (2016) verified that high-resolution SST fields can positively impact QPF in the forecasting range of 36-48 h in a study in north-western Italy.Finally, Berthou et al. (2016), in southern France, and Stocchi and Davolio (2017), in the Adriatic Sea, highlighted that SST-atmosphere interactions mainly affect precipitation patterns and intensity via complex (and varying event-by-event) modifications of the stability of the upstream atmospheric boundary layer.
The main objective of this paper is to contribute to the current discussion on the impact of SST representation by extending the analysis over the whole meteo-hydrological forecasting chain, i.e. going beyond precipitation forecasts and evaluating sensitivity on streamflow forecasts.Furthermore, SST sensitivity is assessed in the context of the overall uncertainty linked to initial and boundary conditions in regional modelling, using different forcing GCMs, with and without data assimilation.To this aim, different accuracy levels of SST representation are used in an operational meteorological-hydrological forecasting chain over a coastal Mediterranean area including, in addition to the native SST fields of the general circulation models (GCMs), higher-resolution fields: the Medspiration level 4 ultra-highresolution foundation SST "SSTfnd" from the Medspiration project by the Centre European Remote Sensing d'Archivage et de Traitement (CERSAT) and the Institut Français de Recherche pour L'Exploitation de la Mer (IFREMER; Merchant et al., 2008;Robinson et al., 2012).Furthermore, two GCM forecasts are used, namely the global forecasting system (GFS) provided by the US National Weather Service (NWS) and the integrated forecasting system (IFS) developed at the European Centre for Medium-Range Weather Forecasts (ECMWF), as well as a three-dimensional variational assimilation (3D-Var) scheme.
The study area, corresponding to the Calabrian Peninsula (southern Italy), due to its particular position in the middle of the Mediterranean Sea and its complex and steep orography, quite regularly experiences severe precipitation events and is particularly prone to significant ground effects (Federico et al., 2003a(Federico et al., , b, 2008;;Chiaravalloti and Gabriele, 2009;Llasat et al., 2013;Gascòn et al., 2016;Avolio and Federico, 2018), the most recent of these (at the time of publication) being a flash flood event on 20 August 2018 that caused 10 casualties (Avolio et al., 2019).According to Avolio and Federico (2018), severe precipitation events over Calabria can be classified as either short-lived events, which last less than 24 h, or long-lived events, which last more than 24 h.Following this classification, in this paper, two case studies of events that occurred in 2015 are considered, the former is characterized by convective, very localized precipitation (11-12 August;CFM, 2015a) and the latter by more persistent and widespread stratiform precipitation (30 October-2 November; CFM, 2015b).
The meteorological-hydrological forecasting chain is based on the WRF-Hydro modelling system (Gochis et al., 2015).This open-source community model, originally developed as the hydrological extension of the Weather Research and Forecast (WRF; Skamarock et al., 2008) model, provides a coupling architecture that allows the user to connect vertical water fluxes between the Earth surface and the atmosphere, which are simulated at coarse resolution by the atmospheric model, to lateral surface and sub-surface fluxes, simulated at high-resolution by the hydrological model, in both a one-way (i.e. with no feedback from the routing models to the atmosphere) and two-way (with feedback) manner.The WRF-Hydro system has dramatically evolved in recent years (Salas et al., 2018;Lin et al., 2018;Lahmers et al., 2019), and has been operationally adopted into the NOAA National Water Model (NWM, Cohen et al., 2018) across the continental US, as well as being used for research applications (e.g.Yucel et al., 2015;Senatore et al., 2015;Arnault et al., 2016;Verri et al., 2017).
The paper is organized as follows.Section 2 describes the study area; the two events analysed; and the numerical model, including its set-up and details on the space and time resolutions of the boundary conditions.In Sect. 3 the results of the meteorological and hydrological outputs are analysed separately for the two events.Finally, Sect. 4 discusses and summarizes the main findings and outlines future research lines.

Study area and description of the events
Calabria is a peninsula characterized by a complex orography.Its geographical and morphological features produce a very irregular precipitation distribution (average annual precipitation varies between 600 and 1500 mm; Federico et al., 2010) and foster the occurrence of extreme weather events, which often caused deaths (Petrucci et al., 2018).Among the relatively numerous recent events, this study focuses on two case studies that occurred in 2015 and were characterized by distinctive features.
The first high-impact event (case study 1) was very localized in space and time and hit the north-eastern part of the region on the morning of 12 August 2015.The analysis at the synoptic scale (Fig. 1a, b) shows that a main lowpressure system which originated from the Atlantic moved over the French and Spanish coasts in the early hours of 12 August 2015, while a cut-off low occurred over the central Mediterranean, giving rise to a new low-pressure vortex with reduced dimensions that caused intense local rainfall.The observed precipitation patterns (Fig. 1c) involved only small areas of the mainland, specifically the Corigliano and Rossano municipalities.The data provided by the Italian national radar network (integrated into the same map in Fig. 1c), although underestimating ground observations, show that most of the precipitation occurred over the Ionian Sea.The Corigliano rain gauge measured high rainfall values (Fig. 1d).During the 48 h from 00:00 UTC on 11 August 2015 until 00:00 UTC on 13 August 2015, 255.2 mm of rain was recorded, with a maximum of 246.4 mm in 24 h (from 18:00 UTC on 11 August to 18:00 UTC on 12 August), 223.2 mm in 12 h (from 01:45 UTC on 12 August to 13:45 UTC on 12 August), 167.4 mm in 6 h, 107.2 mm in 3 h, and 51.4 mm in 1 h.The hydrological impact concerned some small/very small coastal catchments, the most important of which was the Citrea Creek (11.4 km 2 -catchment boundaries highlighted in Fig. 1e), which overflowed causing several tens of millions of euros in damage.
The second event (case study 2) involved a much larger area and developed over 4 d, from 30 October to 2 November 2015.The synoptic analysis (Fig. 2e) shows another cutoff low remaining stationary over Sicily for much of the period and attracting humid and warm air from the Ionian Sea to the south-east (a detailed synoptic description of the event is provided by Avolio and Federico, 2018).The orographic effect in this event turned out to be decisive, with the Calabrian mountain ranges acting as a real barrier; therefore, a large part of the rainfall occurred on the Ionian (eastern) side of the region.While on 30 October 2015 only the northern part of the region was affected (Fig. 2a; about 200 mm in 24 h at the Oriolo station), the highest precipitation during the entire event was recorded on the southern coast (Fig. 2b, c, d), with a maximum of about 740 mm (Chiaravalle Centrale station) and a daily maximum of about 370 mm (Sant'Agata del Bianco).In Fig. 2a-d, rain gauge observations overlap the precipitation fields detected by the weather radars, also extending over the sea.The hydrological impact of the event concerned the whole eastern side of the region.Two catchments are selected for this study, namely the Ancinale River that is closed at the Razzona gauging station (116 km 2 , Fig. 2f) and the Bonamico Creek that is closed at the Casignana gauging station (138 km 2 , Fig. 2g).These catchments are chosen because they are two of the biggest with available water level observations (unfortunately no discharge data are available), and they are located to the north and south of the rainiest area, respectively.Specifically, Chiaravalle Centrale station is located at the Ancinale River outlet.

WRF
The Advanced Research WRF (ARW) model, version 3.7.1, is used in two one-way nested domains (Fig. 3).The external domain, D01, covers a large area of the  • N, 3.59-28.59• E) with a 10 km (187× 205 grid points) horizontal resolution, whereas the innermost domain, D02, is centred over the Calabrian Peninsula (37.10-40.87• N, 13.88-18.71• E), with a 2 km (200 × 200 grid points) horizontal resolution.The model runs on 44 vertical atmospheric layers, up to a 50 hPa pressure top (about 20 000 m), and on 4 soil layers, down to 2 m below the surface.The time step of the model simulation is 60 s in D01 and 12 s in D02.
Physical parameterization of the model is the same as that used by Senatore et al. (2014) and is reported in Table 1.Boundary and initial conditions are provided by two operational forecast GCMs, namely the GFS in forecast mode with a spatial resolution of 0.25 • (about 27 km) and the highresolution (HRES) IFS-ECMWF in forecast mode with a spatial resolution of about 16 km.In both cases, boundary conditions are provided every 6 h.As a further step, both initial and lower boundary SST data are replaced by the Medspiration L4 ultra-high-resolution SSTfnd (obtained as a daily mean with a resolution of 0.022 • ).The high-resolution Medspiration SST fields are ingested into the WRF initial and lower boundary condition files of both domains via GISbased techniques, following Senatore et al. (2014).
Furthermore, two relevant options allowed by the WRF modelling system are always activated for all SST boundary conditions: the sst_update option, allowing dynamical lower boundary (i.e.SST) conditions, and the sst_skin option, based on Zeng and Beljaars (2005), which permits the simulation of SST dynamics.It is noteworthy that the higher resolution of the SST fields does not imply a greater accuracy, which can be objectively assessed via a comparison with in situ observations.For this purpose, a preliminary search is performed in the Copernicus Marine Environment Monitoring Service (CMEMS), in particular in the Coriolis Ocean database for ReAnalysis (CORA; Cabanes et al., 2013), using the latest version (5.2, April 2019).Useful data (i.e.continuous measurements with a sub-daily time step at the sea-surface interface) are only found at the border of the external domain (D01) for both of the case studies (Fig. S1a in the Supplement).
Finally, both as an additional comparison and with the aim of highlighting its relative impact with respect to the effects of different boundary conditions provided by different GCMs and/or more detailed SST fields, a data assimilation technique is also used for both of the test cases.Specifically, a Table 1.Main WRF physical options selected for the study.
A summary of all of the simulations carried out is reported in Table 2.

WRF-Hydro
In this work, WRF-Hydro version 3.0 is used in one-way mode.Therefore, the atmospheric model outputs are used as input of the hydrological model utilizing an hourly time step.According to the WRF parameterization, the land surface model (LSM) is unified NOAH and is used at the same resolution as the D02 domain, whereas an increased horizontal resolution of 200 m is used (2000 × 2000 grid points) for the lateral routing of surface and subsurface water; this results in an aggregation factor of 1/10 from the atmospheric to the hydrological model.
No observed discharge or flow depth data are available for case study 1; therefore, model calibration is not performed.In case study 2, model calibration is performed manually with respect to the available water level data for the two selected catchments (Ancinale and Bonamico) with the aim of reproducing the timing of the hydrological responses to heavy precipitation and, primarily, to correctly simulate the peak flow time, which is a paramount variable for civil protection activities.
The humidity and temperature conditions in the four soil layers at the beginning of the analysed event (30 October 2015 at 00:00 UTC) are achieved using offline simulations with a spin-up time of 1 month.The meteorological forcing for this period is basically given by the spatial interpolation of ground-based observations (provided by the monitoring network managed by the Centro Funzionale Multi-rischi -ARPACAL, Calabria region).The interpolation techniques adopted are the same as those described in Senatore et al. (2015) except that precipitation fields are interpolated via inverse distance weighting (IDW) instead of exponential kriging.Furthermore, during the event (i.e. from 30 October to 2 November 2015) precipitation fields (Fig. 4a, b) are only achieved by merging hourly ground-based rainfall observations to hourly radar data estimates provided by the Italian weather radar network managed by the National Department of Civil Protection.The merging procedure follows Sinclair and Pegram (2005) with the difference that a simpler double IDW interpolation method is used instead of a double kriging interpolation.The merging technique guarantees an increase in the total "observed" rainfall volume of +4.6 % over the Ancinale River and +10.6 % over the Bonamico Creek in comparison with a simple IDW interpolation.
The parameters involved in the calibration procedure are broadly the same as those used in previous studies with WRF-Hydro (e.g.Yucel et al., 2015;Senatore et al., 2015).Specifically, the LSM parameters calibrated are the infiltration factor (REFKDT), the coefficient governing deep drainage (SLOPE), and the thicknesses of the four soil layers.In addition, two spatially distributed parameters of the hydrological model, namely the overland flow roughness scaling factor (OVROUGHRTFAC) and the initial retention height scaling factor (RETDEPRTFAC), are calibrated along with the Manning roughness coefficients (one value for each stream order).
The calibrated parameters are shown in Table 3, whereas resulting hydrographs and uncalibrated hydrographs are shown in Fig. 4c and d.The more impulsive behaviour of the Bonamico Creek, typical of Calabrian "fiumare", is sim-  ulated using lower values of the infiltration factor and lower soil layer thicknesses.Nevertheless, in order to allow timely simulation of peak flows, a small delay of the initial response is necessary via an increase in the RETDEPRTFAC value, which is compatible with noteworthy initial ponding in the wide alluvial bed and infiltration in the gravelly soil.Conversely, the abundance of organic matter in the soils of the dense forests within the Ancinale River catchment, which (especially in autumn) can store considerable quantities of water, most probably contributes substantially to the smoother response of the Ancinale River. Figure 4c and d highlight that the calibration procedure mainly influences the results for the Ancinale River, especially in terms of total volumes.
As for the hydrographs, adopting typical stage-discharge power relationships (i.e.q = a • h b , where q is the discharge, h is the water level, and a and b are the two calibration coefficients) the coefficients of determination (R 2 ) between simulated discharge values and observed water levels are equal to 0.942 and 0.831 for the Ancinale River and the Bonamico Creek respectively.Concerning the reliability of the simulated discharge amount, as reference observations are missing, an indirect validation of the peak flows achieved is performed using the Hydrologic Engineering Center's (CEIWR-HEC) River Analysis System (HEC-RAS) (Hydrologic Engineering Center, 2016).Cross-sections for both of the outlets of the catchments and for four upstream and downstream points, approximately 50 m apart, are determined by merging data from an ultra-high-resolution (5 m) digital terrain model provided by the Calabria Region Cartographic Centre with the heights given in very recent official maps (Technical Cartography of Calabria Region) at a scale of 1 : 5000.Such cross-sections are further validated by field sample measurements.One-dimensional steady flow simulations reaching observed peak heights provide peak discharges broadly comparable to the results achieved with the model.
For the sake of brevity, hereafter the WRF-Hydro hydrographs calibrated using observed precipitation fields (shown in Fig. 4) will be referred to as "observed hydrographs" or simply "observations".
A. Senatore et al.: Impact of high-resolution sea surface temperature though originating from a high-resolution dataset, do not provide a clear improvement of skin SST representation compared to the original GCM fields.It is to be noticed, however, that the SST boundary conditions in D01 are aggregated at a 10 km resolution.Panels in Fig. S2 focus on D02 and show the skin SST fields from 11 August at 18:00 UTC to 12 August at 18:00 UTC with a time step of 6 h, for all simulations carried out in this case study.The main features highlighted by the skin SST maps are the strong underestimation of native IFS fields close to the coastline (this is due to a known interpolation problem along coastlines that lowers temperatures to unrealistic values; Linus Magnusson, personal communication, 2019) and the overestimation, especially in the Tyrrhenian Sea (up to more than 2 K), of the native GFS fields.The other skin SST fields mostly differ by less than ±0.5 K from each other.It is noteworthy that skin SST fields in the simulations using the Medspiration product are not identical due to the fact that, with the method of Zeng and Beljaars (2005), skin SST values are influenced by the surface winds and net radiation fluxes modelled by the different simulations.
A comparison of the time evolution of the average skin SST values in the whole D02 domain would be biased by the non-negligible IFS underestimation near the coastline.Instead, an analysis performed on selected significant points could provide more interesting insights.For this reason, focusing on the Ionian Sea, in Fig. 5, point 1, which is closer to Corigliano-Rossano, and point 2, which is off the Calabrian southern coast (the exact location of both points is given in Fig. 3b), are examined.Concerning daily values, a clear but slight (< 0.5 K) overestimation of GFS-O is shown for both days in point 1 and on the second day in point 2. Some hourly differences are more evident: e.g. in point 1 GFS-O values are up to 1.5 K higher than other models on 12 August at around 12:00 UTC, whereas in point 2 peak values of IFS-O and IFS-M on 11 August at around 12:00 UTC are about 1 K higher than other models.Nevertheless, the differences among models during the night between 11 and 12 August (i.e.right before and during the rain event) are generally low.The only noteworthy difference is given (in point 2) by the small underestimation of Medspiration simulations (i.e.GFS-M, IFS-M, and IFS-DA-M) of about 0.3-0.4K, shown by a sudden reduction of their skin SST values, most probably due to the change of the Medspiration SST field (from 11 to 12 August).This behaviour, which is clearly not realistic, highlights a weakness occurring while directly ingesting such external data in the WRF simulation.
The accumulated precipitation modelled by all simulations for the 24 h period from 11 August at 18:00 UTC to 12 August at 18:00 UTC is shown in Fig. 6.Overall, all models miss the location of the event, moving it further south, off the Ionian coast.GFS-based simulations forecast more rainfall than IFS-based simulations (average respective values in the domain of 10.1 and 8.9 mm with the native SST fields and 10.4 and 9.5 mm with the Medspiration SST fields for GFS and IFS), but are centred more to the south.IFS-based simulations forecast rainfall clusters with more elongated shapes in the south-north direction, allowing more precipitation to reach the central and northern Ionian coasts (namely, the Corigliano-Rossano area).Even though simulations based on the 3D-Var scheme still miss the correct location of the event, they both provide more rainfall in the domain (average values of 11.2 mm with the native SST fields and 11.0 mm with the Medspiration SST fields) and also show a well-defined rainfall cluster close to the central Ionian coast.Both IFS and 3D-Var simulations overestimate land precipitation in that area.
According to the generally small differences identified in the SST fields, Fig. 6 clearly shows that ingesting highresolution SST information provides (in terms of spatial distribution of accumulated precipitation) much less relevant (and partially chaotic) effects than changing initial and boundary conditions or using data assimilation schemes, and a minor or possibly opposite impact on the accuracy of the simulations.Given the peculiar features of the analysed event, it makes sense to focus on the area surrounding the Corigliano gauge station.For each simulation, the graph in Fig. 7a merges intensity, location, and time correlation information of the closest rainfall peaks (with a threshold of at least 40 mm) to that station, whereas Fig. 7b explicitly shows the time evolution of accumulated rainfall for each of the locations identified (the points are highlighted using small stars in the panels of Fig. 6).Given that all simulations strongly underestimate the observed rainfall value of 246.4 mm (the highest simulated value of about 100 mm is given by IFS-DA-M), there is no configuration clearly outperforming the others.Both GFS peaks are located to the south (about 20 km) and delay the rain event by 8 (GFS-M) to 11 (GFS-O) hours.IFS-O and IFS-DA-O peaks are lower than IFS-M and IFS-DA-M, but they are generally closer to the Corigliano station (about 13 and 22 km respectively).Furthermore, Fig. 7b shows that ingesting Medspiration fields moves the rainfall events up for both IFS-O and IFS-DA-O.This suggests that removing the unrealistic low SST values along the coastline near Corigliano station (i.e.considering IFS-M and IFS-DA-M in place of IFS-O and IFS-DA-O respectively) has the twofold effect of increasing rainfall amounts and accelerating flow dynamics.Such effects are more easily recognizable looking at the 3D-Var simulations, which provide more water vapour and precipitation.Moving from IFS-O to IFS-DA-O to IFS-DA-M, the 850 hPa wind speed on 12 August at 00:00 UTC generally increases in D02 and specifically off the northern Ionian coast of Calabria (Fig. S3).As a result, Fig. 8 shows that when moving from IFS-O to IFS-DA-O to IFS-DA-M, the integrated water vapour (IWV) cluster off the Ionian Sea simulated 3 h later (03:00 UTC) is both larger and closer to the coast.
Differences between IFS-O and IFS-DA-O are due to the assimilation of 14 vertical profiles of pressure, wind speed and direction, absolute and dew point temperature, and relative humidity in D01, with 14 point measurements pro- vided by aircrafts at a fixed pressure level (corresponding to about 12 km).In contrast, differences between IFS-DA-O and IFS-DA-M are mainly due to different skin SST values.Specifically, higher SST values given by ingesting Medspiration fields enhance water vapour concentration in the atmosphere (the average upward moisture flux from the sea surface in domain D01 increases by about 5 %), while they concurrently affect the stability of the atmospheric boundary layer, providing more energy to the system and accelerating the flow dynamics (as reported by e.g. by Stocchi and Davolio, 2017).The early arrival of the moist air mass in the Corigliano-Rossano area using Medspiration SST fields is highlighted by the time series of the hourly averaged water vapour flux through section A-A' (Fig. 9; section A-A' is shown in Fig. 3b).Local flow peaks are moved up from 2 to 4 h in advance using IFS-DA-M compared with IFS-DA-O, and similar behaviour, even though less evident, is observed with IFS-M compared with IFS-O.
Concerning the assessment of the hydrological impact of the forecast event, notwithstanding the detailed analysis performed, case study 1 does not provide relevant results (Table 4).The centre of the Citrea catchment is located approximately 8 km south-east of the Corigliano gauge station, has a maximum length of about 7 km in the south-north direction, and a maximum width of only 2.5 km.The level of accuracy achieved by all simulations performed is not yet sufficient to correctly forecast the hydrological impact for such small catchments in areas with very complex topography, such as those areas analysed in this study.The maximum rainfall accumulation value over the catchment is forecast by IFS-O, with 16 mm in 3 h.However, the accuracy of the models is already high enough to make them very useful (it is worthwhile recalling that the starting time of the simulation is more than 24 h before the event).In fact, if model forecasts are used to infer information about wider "warning areas" than single small catchments (as carried out by the Italian Civil Protection system), they provide essential inputs for civil protection activities.

Case study 2
Case study 2 embraces a longer period than case study 1.In this section, forecasting skills are assessed considering both the whole 4 d length of the event (i.e. from 30 October 2015) and a 3 d forecast starting on 31 October 2015, in order to reduce the uncertainties attributable to the longer lead time forecast.
Such as in the previous case study, the first analysis is devoted to skin SST fields.The comparison with the SST measurements available from the CORA database shows much better behaviour of the Medspiration SST fields in the analysed region of the external domain (Fig. S1e-g).Focusing on the innermost domain, Fig. S4 highlights (besides the abovementioned IFS-related problem along coastlines) that, in this case, Medspiration fields for the whole period overestimate both GFS and IFS native SST fields.Specifically, average differences with respect to GFS SST vary from about 0.6 to 0.8 K, whereas differences with respect to IFS SST fields are higher than 0.8 K (the average difference increases to about 1.5 K if the values along coastlines are also considered).It is noteworthy that GFS also underestimates skin SST particularly near coastlines, whereas there is an overestimation off the Tyrrhenian Sea, such as that seen in the previous test case.Focusing on points 1 and 2 (Fig. 10), the following is shown: (1) both points replicate similar general behaviour, with Medspiration fields' values being higher than values from GFS, which, in turn, are higher than values from IFS; (2) differences are more marked in point 1 (average values of +1.0 and +0.6 K for IFS and GFS respectively) than in point 2 (+0.9 and +0.3 K for IFS and GFS respectively); (3) as in case study 1, a sudden reduction of about 0.5 K can also be observed for Medspiration in the graph related to point 1, moving from 1 to 2 November (Fig. 10a).Nevertheless, a similar abrupt change, although less marked (about 0.2 K), is observed also for GFS on 31 October at 06:00 UTC.In summary, this case study shows an evident skin SST increase from IFS to GFS to Medspiration.Figure 11 shows the accumulated rainfall fields in the 4 d simulation period from the six WRF configurations compared with a rainfall map of Calabria achieved by merging ground measurements with radar observations (the merging procedure followed Sinclair and Pegram, 2005; distinct rain gauge and radar data are available in Fig. 2a-d).It clearly highlights, in agreement with the previous case study, that the main impact on rainfall output is given by the choice of the GCM providing the boundary conditions.Average accumulated precipitation in D02 is equal to 80 mm with GFS-O, 71 mm with IFS-O, and 68 mm with IFS-DA-O.Interestingly, the introduction of the 3D-Var scheme this time leads to reduced precipitation (in D01, 22 vertical profiles and 16 point measurements are assimilated).Higher skin SST values with Medspiration result in increased average precipitation in D02 for all three cases, from +8 % (IFS-DA) to +11 % (GFS).Concerning the precipitation patterns, for the  aims of this study, it is interesting to focus on the biggest cluster in the south-east corner of the domain (i.e. the direction from which the humid air mass originates).Moving from GFS to IFS to IFS-DA, quite independently of the SST fields' change, a shift of this cluster can be observed from north-east to south-west.
The main change produced by the 3 d forecast compared with the 4 d forecast is the higher correspondence of the GFSbased simulations to the IFS-based simulations (Fig. S5).The GFS-based rainfall footprints located in the south-east of D02 meet the Calabrian Ionian coast further south with respect to the 4 d simulation, which is in agreement with the IFS-based simulations.Overall, the simulated rainfall fields are rather similar to each other and seem to reproduce the observations in the southern part of the region reasonably well (i.e. the area most affected by the event), while the overforecast found in the 4 d simulation in the central zone is confirmed.3D-Var forecasts starting on 31 October assimilate 15 vertical profiles and 12 point measurements.Although the IFS-based simulations forecast higher rainfall peaks off the southern Ionian coast (up to 1000 mm), the average accumulated precipitation in D02 is almost identical for all simulations (51 mm with GFS-O, 52 mm with IFS-O, and 53 mm with IFS-DA-O).Precipitation increase caused by the higher skin SST Medspiration fields varies from +9 % (IFS-DA) to +12 % (GFS), which is in agreement with the upward moisture flux increase in D01 (+7 % with GFS, and +8 % with IFS and IFS-DA).
With the aim of objectively assessing the performance of each WRF configuration, a detailed analysis using categorical scores is carried out considering ground-based observations in the civil protection warning areas more affected by the event (grey areas in the reproduction of the Calabria region in Fig. 12).Specifically, 30, 19, and 22 rain gauges are considered for the Cala4, Cala7, and Cala8 zones respectively.Among the numerous scores available in the literature (for a review see e.g.Wilks, 2006), for each zone Fig. 12 shows the results with respect to the frequency bias index (FBI), FBI = hits + false alarms hits + misses , and the equitable threat score (ETS), ETS = hits − hits r hits + misses + false alarms − hits r . (2) Here hits r = (hits + misses) (hits + false alarms) hits + misses + false alarms + correct negatives . (3) In the previous equations, the terms hits, misses, false alarms, and correct negatives refer to a typical 2 × 2 contingency table.The FBI indicates if the forecast system has a tendency to underestimate (FBI < 1) or overestimate (FBI > 1) the event frequency, whereas ETS measures the fraction of correctly predicted events, adjusted for hits associated with random forecasts, and ranges from −1/3 to 1 (perfect score).Both scores are used for consecutive 6 h time intervals for the analysed rainy period, utilizing precipitation thresholds with a step of 0.2 mm from 0.2 to 1 mm, a step of 1 mm up to 10 mm, a step of 2 mm up to 20 mm, and a step of 5 mm for higher rates.
Focusing on the 4 d simulations, ETS graphs show the generally better performance of IFS-DA-M, especially for higher thresholds.Other models have conflicting levels of accuracy: e.g.IFS-DA-O is the best in Cala4, but the worst in Cala7.Nevertheless, ingesting high-resolution SST generally provides better scores in all cases.Complementary information provided by FBI highlights a significant under-forecast of GFS-based simulations in both Cala4 and Cala8 and an overforecast in Cala7.Other simulations behave better, but FBI also points out that the 3D-Var scheme alone does not necessarily improve IFS-based forecasts (e.g. in Cala7 IFS-O is more accurate than IFS-DA-O), unless a high-resolution SST representation is also considered (IFS-DA-M always shows FBI values around 1).The ETS values of the 3 d simulations are generally higher, but, in this case, the GFS-based simulations are the worst, and introducing the Medspiration fields further reduces their performance.Conversely, a more detailed SST resolution increases the ETS values of the IFSbased simulations in zones Cala4 and Cala8 (but not in zone Cala7).Concerning bias, FBI graphs show substantial underforecasts in the Cala4 and Cala8 zones and an over-forecast in the Cala7 zone.However, in this case, the GFS-based simulations provide better results, especially in Cala7 and for high thresholds.Results achieved with ETS and FBI are generally also confirmed by other scores (not shown), such as the probability of detection (POD) score or the false alarm rate (FAR).
As previously stated, higher skin SST Medspiration values affect precipitation magnitude.This outcome agrees with the average increase of upward moisture flux from the sea surface in D01 (+8 % with GFS, +13 % with IFS, and IFS-DA in the 4 d time period).Vice versa (and contrary to what was found in the previous case study), the simulations do not show relevant differences in the timing of the event.If the accumulated values of average precipitation in each of the warning areas are considered, all simulations are very highly correlated (≥ 0.98, graph not shown) with observations.Figure 13, showing the time series of hourly averaged water vapour flux through section B-B', highlights that there are no relevant forward nor backward time deviations between the simulations with original skin SST fields and the corresponding simulations with Medspiration fields.The main effect observed in Fig. 13 is the lower flux of the GFS-based simulations because the main flow of soil moisture is shifted towards the north-east with respect to section B-B' (in agreement with the precipitation maps in Fig. 11).The average flux increase with IFS-M and IFS-DA-M is about 3 %-4 %  compared with IFS-O and IFS-DA-O respectively.Figure 14, showing a snapshot of the IWV distribution in D01 during the event (31 October at 21:00 UTC), confirms the similar timing of the simulations.Moving from IFS-O to IFS-DA-O to IFS-DA-M, the size of the cluster of humid air south of Calabria increases, but its position is basically the same.Similar conclusions are inferred from Fig. S6, which provides additional information about 850 hPa wind fields in D02 at the same time as in Fig. 14.
All simulations performed for this case study show that the greater energy supplied to the system by the higher skin SST Medspiration fields affects lower layers' flow dynamics, allowing more transport but not accelerating it.This behaviour can be attributed to the long-lasting characteristics of the event that, developing at a wider scale than case study 1 and providing humid air continuously, smooths potential differences in terms of timing.
Assessing the hydrological impact in the two selected catchments is more interesting in this case study, because all simulations forecast heavy rain over the catchment areas of the Ancinale River and Bonamico Creek, yet it is still challenging because reliable hydrological forecasts require accurate QPFs at the catchment scale.A QPF performance analysis is carried out for the catchment areas, considering the average values of the interpolated precipitation fields.The simulated average precipitation over the Ancinale River catchment is strongly overestimated by all of the IFS-based simulations in the 4 d forecasts (from +53 % to +72 %).Such over-forecasts are only partially reduced to about +40 % (except for an increase to +80 % with IFS-O) in the 3 d forecasts.GFS-based simulations provide much more reasonable biases in the 4 d forecasts (+12 % and −1 % for GFS-O and GFS-M respectively), which are only partially confirmed in the 3 d forecasts, where GFS-M provides a nearly unbiased estimate (−3 %) but the GFS-O over-forecast worsens to +44 %.Concerning the Bonamico Creek catchment, in the 4 d forecasts the IFS-based biases The 3 d forecasts do not provide a substantial improvement.The hydrological simulations over the Ancinale River (Fig. 15f) are still affected by precipitation over-forecasts.Furthermore, all simulations forecast the peak flows in advance compared with observations.Nevertheless, IFS-M, IFS-DA-O, and IFS-DA-M show r values of around 0.6.In particular, IFS-DA-O (highest r value of 0.65) forecasts the peak flow only 4 h in advance.Despite the good performance with precipitation forecasts, the GFS-M hydrograph is not well correlated with observations and simulates the observed peak flow about 9 h in advance.For Bonamico Creek, IFS-DA-O results are even better (Fig. 15h).The simulated peak flow, according to precipitation forecasts, underestimates the observed peak flow by about 20 %, but the correlation between the simulated and observed hydrographs is high (0.89) and the observed peak flow time (1 November at 16:00 UTC) is delayed by only 2 h.Generally, all of the IFS-based simulations are well correlated (r values always higher than 0.6) even though the peak flow time is always delayed (by up to 12 h).GFS-based simulations are poorly correlated and show significant overestimation and early forecast of the peak flow.

Discussions and conclusions
Table 5 aims to support the discussion summarizing the main outcomes concerning (1) the representation of the skin SST fields; (2) the accumulated precipitation values in the internal domain and the related spatial distribution; (3) the time distribution of precipitation; and (4) hydrological impact (hydrograph shape, total discharge, peak flow times), depending on the GCM choice for determining the boundary conditions, the use of the 3D-Var scheme, and the use of the high-resolution Medspiration fields.Skin SST fields Generally small differences (slightly higher values with GFS and lower with Medsp), but strong IFS underestimation along coastlines.
Strong IFS underestimation along coastlines.Average Medsp values higher than GFS (from about 0.6 to 0.8 K) and IFS (> 0.8 K, even not considering the IFS underestimation along coastlines).Also, with GFS underestimation along coastlines, but overestimation in the Tyrrhenian Sea.

Precipitation amount and spatial pattern
Average rainfall increases in D02 moving from IFS to GFS to IFS-DA.GFS rainfall centred to the southeast, and IFS and DA show more elongated shapes in the southnorth direction.The Medsp effect is minor with respect to varying GCM or including DA.
More rainfall in D02 with GFS.Moving from GFS to IFS to IFS-DA, a shift of the biggest rainfall cluster over the sea is observed from the north-east to the south-west direction.Medsp fields increase average precipitation (by about 10 %) but do not affect spatial patterns significantly.
GFS-based simulations are closer to the IFS-based.Medsp fields increase average precipitation (by about 10 %) but do not affect spatial patterns significantly.

Precipitation timing and scores
Close to the Corigliano rain gauge, GFS-based simulations delay the event.Ingesting Medsp fields accelerates flow dynamics, especially in IFS-based simulations.
Globally, better performances with IFS-DA-M in the three CP warning areas analysed.Relevant over-or underforecasts with GFS-based simulations.Medsp fields are especially useful for improving the 3D-Var scheme, but do not change the timing of the event.
Scores of GFS-based simulations are still worse, even with Medsp fields.Also, for IFS-based simulations, the Medsp effect is less relevant and is not always positive.Substantial over-or under-forecasts with almost all simulations.

Hydrological impact
Not feasible, because no simulation can forecast reliable precipitation values for the Citrea catchment.
QPF The most evident outcome across the case studies, yet far from surprising, is that the choice of the GCM providing boundary conditions is, comparatively, the most relevant factor affecting the simulations.Specifically, for the case studies analysed, GFS-based simulations generally do not perform as well as IFS-based simulations (this difference is emphasized if the forecast time window is increased, as demonstrated in case study 2).Of course, this is not a generalizable result given the few events involved and the lack of further analyses (e.g. the evaluation of different parameterizations).For example, for case study 2, Avolio and Federico (2018) found (via detailed sensitivity tests) that simulations forced by GFS have better performance than those forced by ECMWF data.Nevertheless, for the purpose of this study it is shown that the various features differentiating the two GCMs (including the spatial resolution, which is improved with IFS) can considerably affect the precipitation fields calculated via dynamical downscaling, comparatively more than using 3D-Var data assimilation methods or imposing specific (high-resolution) skin SST boundary conditions.
The use of the 3D-Var scheme in this study has to be primarily considered as a strategy for improving ini-tial conditions.Several studies have adopted data assimilation approaches to achieve improvements for forecast periods shorter than the 48 to 96 h used in this study (e.g.Sun et al., 2016;Gustafsson et al., 2018;Thiruvengadam et al., 2019), unless specific strategies were used (e.g.cycling 3D-Var runs; Liu et al., 2018).Here, we focus on 2 to 4 d periods (depending on the case studies) for the sake of simplicity and clarity, testing the capability of different model configurations to reproduce the overall development of the hydrometeorological events, from their beginning to their end, and also checking their usefulness with respect to providing a proper warning lead time.Therefore, extensively testing the 3D-Var scheme in order to get the highest benefit goes beyond the scope of this study.
Even though they are used with the outlined limitations, 3D-Var simulations provide some worthy outcomes.For case study 1, applying the 3D-Var scheme with IFS boundary conditions results in a substantial increase in the average rainfall in the innermost domain (up to 25 %), but this change does not provide clear advances in forecasting skill.This is consistent with previous studies which have demonstrated that the effects of data assimilation do not lead to an effective improvement for highly convective events (Liu et al., 2013).In case study 2, it is noteworthy that IFS-O is capable of providing better ETS values than IFS-DA-O in some warning areas, especially in the 4 d forecasts, meaning that sources of uncertainty other than initial conditions can strongly affect forecasting skill.Among those uncertainties, representation of SST conditions can be important, given that, in general, IFS-DA-M (i.e. the simulation including both data assimilation and high-resolution SST representation) provides better performance.
Unlike the 3D-Var scheme, the effects of high-resolution SST representation on forecasts are emphasized to the maximum in this study; this is due to the fact that observed rather than forecast SST fields are replaced as lower boundary conditions in the simulations, which provides a kind of "upper limit" to the effects provided by well forecast SST fields.The foundation SST fields used (defined as the temperature of the water column, free of diurnal temperature variability; Donlon et al., 2007) are produced by the Medspiration project once every 24 h, but the diurnal cycles are ensured by the sst_skin option.They especially improve the SST fields provided by IFS boundary conditions that, although they allow better forecasts than GFS, show very evident problems along the coastlines.The forecast periods analysed in this study allow users to largely overcome the problem highlighted by Cassola et al. (2016), who found that the forced ingestion of high-resolution SST fields can be counterproductive for forecasting ranges shorter than 36-48 h, due to the relatively slow adjustment of initial atmospheric fields.
High-resolution SST fields often provide (but not always, and not always significantly) enhanced forecast performance with respect to the corresponding simulations with native SST fields.Especially in case study 1, the effects close to the Corigliano rain gauge seem to be somehow linked to generally chaotic behaviour.As discussed in the case of improved initial conditions (i.e. the 3D-Var scheme), these outcomes are related to the fact that sources of uncertainties other than SST representation hinder enhanced forecast skill.Furthermore, the average impact of high-resolution SST on the simulated precipitation in D02 is lower than expected with comprehensive approaches used to represent simulation uncertainty (e.g.ensemble forecast systems applied at convection-permitting resolutions; Evans et al., 2014).A preliminary analysis performed by the authors on case study 1 (not shown) using a convection-permitting ensemble system based on the ECMWF ensemble prediction system (EPS) highlights a mean absolute percentage deviation of about 24 % for the perturbed simulations (against that found here, related to the SST fields resolution, of 4 %).
Nevertheless, using more realistic SST fields leads to enough clear changes in the simulation of the atmospheric boundary layer dynamics in both case studies, especially with respect to the configurations with clearly unrealistic fields (i.e.IFS lower boundary conditions).Specifically, in the (shorter, convective, highly localized) summer event higher SST values along the coastlines accelerate flow dynamics, moving faster humid air towards the coast and moving precipitation up (thus agreeing with the results achieved by Stocchi and Davolio, 2017).Conversely, in the (longer, caused by a frontal system, widespread) autumn event the higher energy supplied to the system by a continuously warmer sea surface leads to a generalized increase of the precipitation amount; however, this does not substantially change either the spatial pattern or the timing of the event.The missing change in timing is most probably due to the fact that the stability of the atmospheric boundary layer and the related flow dynamics in case study 2 depend more on large-scale (synoptic) conditions than on local factors (which is also possibly the same reason that the 3D-Var scheme is capable of influencing case study 2 more than case study 1).Such large-scale conditions are capable of leading to much stronger winds than seen in case study 1 (which is evident when comparing Figs.S3 and S6).
Exploring the hydrological impact of case study 2 in detail, the analysis must be related to the resolution of the smallscale catchments, where the experiments show that results achieved on larger scales (i.e. at the resolution of the warning areas) can be "doubly" reversed.For example, both bias analyses and Taylor diagrams related to the Ancinale River basin (Fig. 15a, b) highlight better QPF performance for GFS-based simulations, which is not as obvious (or even not found) in the analysis of models skills on a larger scale.Nevertheless, IFS-based hydrographs are better correlated with those calculated using observed rainfall and peak flow times that are closer to observed values (it is worth recalling that a quantitative discharge analysis is less significant in this case, given that only water level observations are available).Contrary to what was found by Yucel et al. (2015), streamflow Hydrol. Earth Syst. Sci., 24, 269-291, 2020 www.hydrol-earth-syst-sci.net/24/269/2020/ simulations are not particularly improved by initial data assimilation.This result is most likely due to the relatively long forecast periods (from 72 to 96 h).Indeed, in the 3 d forecasts, the benefits of the improved initial conditions partially come to light in both catchments even though, interestingly, the best simulation (even if only slightly) is IFS-DA-O, i.e. that using the 3D-Var scheme but not the Medspiration SST fields.Overall, the impact of a reduced lead time from 4 to 3 d provides only slight enhancements (e.g.better performance of the GFS-based simulations or slightly higher ETS values); however, these enhancements do not considerably affect the performance level of the hydrological forecasts.Summarizing, the results achieved in this study show that none of the different versions of the forecasting chain adopted are capable of achieving quantitative precipitation and (consequently) streamflow forecast in all of the cases analysed, yet several interesting clues are provided.Specifically, similar to past studies, it is shown that the highresolution representation of SST fields can significantly change the simulation of the atmospheric boundary layer processes, modifying flow dynamics and/or the amount of precipitated water.Nevertheless, the potentially positive impact of high-resolution SST fields can be easily hidden by several other sources of uncertainty (mainly, the relevance of the choice of the GCM providing boundary conditions).Further improvements in both GCMs (e.g. the higherresolution IFS cycle since March 2016) and RCMs will reduce uncertainties, which clearly highlights the need for high-resolution SST representation in regional modelling.The topic of higher temporal frequency updating of lateral boundary conditions is also being actively investigated (Termonia et al., 2009;Matte et al., 2017;Keresturi et al., 2019).Furthermore, emerging approaches like regional-scale fully coupled ocean-atmospheric (e.g.within the Baltic Sea Experiment -BALTEX, Gustafsson et al., 1998;Pullen et al., 2003;Ren et al., 2004;Loglisci et al., 2004;Ricchi et al., 2019;Lewis et al., 2019b) or ocean-atmospheric-hydrologic (Ruti et al.;2016;Somot et al., 2018) modelling aim to directly calculate SST fields dynamics.Meanwhile, with the current generation of operational models, a reasonable (yet computationally demanding) solution is to adequately take the uncertainty of SST in forecasting chains into account by also adopting ensemble approaches for this variable.

Figure 2 .
Figure 2. Panels (a) to (d) represent the respective daily rainfall (mm) amounts observed from 30 October to 2 November 2015; the points in these panels represent the weather stations, and the spatially distributed values represent the radar estimation.(e) Surface pressure and weather fronts at 00:00 UTC on 1 November 2015 from http://www1.wetter3.de/,© Met Office.(f) Elevation maps of the Ancinale River catchment and (g) the Bonamico Creek catchment.

Figure 3 .
Figure 3. (a) The outer domain (D01) with a spatial resolution of 10 km; (b) the inner domain (D02) with a spatial resolution of 2 km.Points 1 and 2 are considered when evaluating SSTSK (skin SST) evolution locally during the events according to different configurations (Figs. 5 and 10 respectively).Vertically integrated water vapour fluxes are calculated across A-A' and B-B' (Figs. 9 and 13 respectively).

Figure 4 .
Figure 4. (a) Observed hourly rainfall (mm) averaged over the Ancinale River catchment; (b) as in panel (a), but for the Bonamico Creek catchment.(c) A comparison between observed hydrometric levels (m) with respect to uncalibrated and calibrated simulated flow (m 3 s −1 ) over the Ancinale River catchment; (d) as in panel (c), but for the Bonamico Creek catchment.

Figure 6 .
Figure 6.The 24 h accumulated precipitation (mm) from 18:00 UTC on 11 August 2015 to 18:00 UTC on 12 August 2015: (a) merged ground measurements and radar observations simulated (b-g) using different configurations.The small blue (b-c) or white (d-g) stars highlight the accumulated rainfall peaks near Corigliano, which are analysed in detail in Fig. 7.

Table 4 .Figure 7 .
Figure 7. Circles in panel (a) are located at the peaks highlighted in Fig. 6 for each of the different configurations; the colours indicate the time correlation, whereas the size refers to the percentage rain amount with respect to Corigliano observations.(b) Temporal accumulated rainfall (mm) observed at Corigliano and simulated by the different peaks.

Figure 11 .
Figure 11.Accumulated precipitation (mm) over the whole period of 96 h starting from 00:00 UTC on 30 October 2015: (a) merged ground measurements and radar observations; (b-g) simulated fields with the different configurations.

Figure 12 .
Figure12.Categorical scores ETS (equitable threat score) (a-f) and FBI (frequency bias index) (g-l) calculated on the rain gauges located in the three civil protection warning areas more affected by the event from case study 2 (highlighted as grey areas on the map in the top left) for both the 4 and the 3 d period.

Figure 15
Figure 15.(a-d) Taylor diagrams related to the averaged hourly precipitation series over the Ancinale River catchment and the Bonamico Creek catchment simulated by the different configurations forecasting both 4 and 3 d, compared with observations.(e-h) The resulting hydrographs (m s −3 ) obtained by the different WRF-Hydro simulations compared with observations.

Table 3 .
Calibrated parameters of the offline WRF-Hydro model for the Ancinale River and the Bonamico Creek.

Table 5 .
Synoptic table summarizing the main findings for the different case studies."Medsp" refers to Medspiration, "BC" refers to boundary conditions, "CP" refers to civil protection, and "DA" refers to data assimilation.