The relative importance of antecedent soil moisture and precipitation in ﬂood generation in the middle and lower Yangtze River basin

. Floods have caused severe environmental and so-cioeconomic losses worldwide in human history and are projected to exacerbate due to climate change. Many ﬂoods are caused by heavy rainfall with highly saturated soil; how-ever, the relative importance of rainfall and antecedent soil moisture and how it changes from place to place has not been fully understood. Here we examined annual ﬂoods from more than 200 hydrological stations in the middle and lower Yangtze River basin. Our results indicate that the dominant factor in ﬂood generation shifts from rainfall to antecedent soil moisture with the increase in watershed area. The ratio of the relative importance of antecedent soil moisture and daily rainfall (SPR) is positively correlated with topographic wetness index and has a negative correlation with the magnitude of annual ﬂoods. This linkage between watershed characteristics that are easy to measure and the dominant ﬂood-generation mechanism provides a framework to quantitatively estimate potential ﬂood risk in ungauged watersheds in the middle and lower Yangtze River basin.


Introduction
Flooding is one of the most destructive and costly natural hazards in the world, resulting in considerable fatalities and property losses (Suresh et al., 2013).River floods have affected nearly 2.5 billion people between 1994 and 2013 worldwide (CRED, 2015) and caused USD 104 billion in losses every year (Desai et al., 2015).The damages may be further exacerbated by the increasing frequency and intensity of extreme rainfall events, according to climate change projections (IPCC, 2012;Ohmura and Wild, 2002).Flood control infrastructure and more accurate predictions are needed to reduce flood damages, which requires a better understanding of the underlying mechanism of flood generation and the drivers of change (Villarini and Wasko, 2021).
Numerous studies have been conducted to investigate the cause of floods across the world (Blöschl et al., 2013;Munoz et al., 2018;Zhang et al., 2018).Many studies focused on examining the environmental and social characteristics that lead to specific catastrophic flood events (Blöschl et al, 2013;Liu et al., 2020;Zhang et al., 2018).Others concentrated on single locations, usually catchment outlets, to explore the influential factors of floods and the future trends (Brunner et al., 2016;Munoz et al., 2018).Yet, given the amount of data and time required, it is not practical to apply these detailed studies to hundreds of catchments to generate an overview of the flood-generation mechanism at a large scale.
Recently, researchers started to investigate the dominant flood-generation mechanisms at regional scales (Berghuijs et al., 2019b;Do et al., 2020;Garg and Mishra, 2019;Smith et al., 2018;Tramblay et al., 2021;S. Ye et al., 2017).Most of these studies are conducted in North America and Europe with well-documented long-term records (Berghuijs et al., 2016;Blöschl et al., 2019;Do et al., 2020;Musselman et al., 2018;Rottler et al., 2020).Some research was conducted in China recently (Yang et al, 2019;Yang et al, 2020), though this kind of work is still limited, and further investigations Published by Copernicus Publications on behalf of the European Geosciences Union.
are needed given the considerable spatial heterogeneity and complexity in flood generation.
As the largest river in China, the Yangtze River basin has long suffered from floods.In summer 2020, 378 tributaries of the Yangtze River had floods exceeding the alarm level, causing billions of U.S. dollars in damage (Xia and Chen, 2021).With increasing public awareness, more accurate prediction is needed, which relies on better understanding.However, due to the limitation of observations, there are only a few regional studies of the flood-generation mechanism in China, with a few in the Yangtze River basin (Zhang et al., 2018;Yang et al., 2019Yang et al., , 2020)).The large number of dams and reservoirs built along the river further complicated the situation (Feng et al., 2017;Li et al., 2011;Yang et al., 2019).
Because of the relatively warm temperature, snowmelt has little impact on flood generation in the Yangtze River basin (Yang et al., 2020).Floods in the Yangtze River basin usually occur during summer with relatively wet soil and high rainfall (Wang et al., 2022).Heavy rainfall with high antecedent soil moisture has also been identified as a dominant driver of floods across the world (Beighuijs et al., 2019b;Garg and Mishra, 2019;Tramblay et al., 2021;Wasko et al., 2020).Recently, studies started to examine the relative importance of rainfall and antecedent soil moisture in flood generation (Brunner et al., 2021;Wasko et al., 2021;Bennett et al., 2018;Bertola et al., 2021).Quantitative evaluation of the relative contribution of rainfall and antecedent soil moisture and its change across watersheds is still limited and currently unavailable in China (Y.Liu et al., 2021;Wu et al., 2018).
Based on the watersheds in the middle and lower Yangtze River basin, this study attempts to explore the following questions: (1) is there a way to quantitatively describe the relative importance of antecedent soil moisture and rainfall on flood generation?(2) How would this combination of floodgeneration rainfall and soil moisture vary across watersheds, and what are the influential factors?Based on the observations and model estimation (Sect.2), the spatial distribution patterns of antecedent soil moisture and rainfall were obtained and analyzed to investigate their individual contribution to flood generation and the influential factors (Sect.3).This allows for a further examination of the relative importance of antecedent soil moisture and rainfall on flood generation and its linkage to watershed characteristics and its implications for flood prediction (Sect.4).All of the results are summarized in Sect. 5.

Study area
The Yangtze River is the largest river in China, with a total length of 6300 km and annual discharge of 920 km 3 at the outlet (Yang et al., 2018).It drains through an area of 1.8 × 10 6 km 2 , lying between 90 • 33 and 122 • 25 E and 24 • 30 and 35 • 45 N and is home to over 400 million people, most of which live in the middle and lower Yangtze River basin (YZRB; Cai, 2020).The elevation of the YZRB declines from west to east, from over 3000 m in Qinghai-Tibet Plateau and around 1000 m in the central mountain region to 100 m in the East China Plain (Wang et al., 2013).The vegetation types in the YZRB are forests, shrubs, grassland, and agricultural land, accounting for 11.85 %, 12.65 %, 32.26 %, and 42.88 %, respectively.Grassland and shrubs are the dominant vegetation in the middle and upper YZRB, while the downstream YZRB is dominated by forests and agricultural land (Miao et al., 2015).There are more than 51 000 reservoirs of different sizes in the whole basin, including 280 large ones (Peng et al., 2020).
Most of the YZRB is semi-humid and humid, with a typical subtropical monsoon climate.The mean annual temperature is approximately 13.0 • C, varying from −4 to 18 • C downstream.The mean annual precipitation of the whole basin is about 1200 mm, increasing from 300 mm in the western headwaters to 2400 mm downstream (Li et al., 2021).Most of the precipitation comes between June and September.The premise of the persistent heavy rain in the Yangtze River basin is the frequent activity of weak cold air in the north and the intersection of mid-latitude air masses and monsoon air masses (Kato, 1985).Studies have found that both annual precipitation and the frequency of extreme precipitation events have increased in the middle and lower reaches of the Yangtze River (Qian and Xu, 2020;Fu et al., 2013).As a result, floods have occurred frequently in the middle and lower reaches of the Yangtze River where most of the population in the YZRB live (N.Liu et al., 2018).

Data
In this work, we focus on the middle and lower reaches of the Yangtze River for the high population density and increasing flood risk.A 30 m digital elevation model (DEM) was downloaded from Geospatial Data Cloud (http://www.gscloud.cn/,last access: 1 November 2020), from which the drainage area corresponding to the hydrological station was extracted by ArcGIS.Daily precipitation data and temperature data between 1970 and 2016 from 247 meteorological stations within and near the YZRB were downloaded from China Meteorological Data Network (https://data.cma.cn/, last access: 1 November 2020; Fig. 1).The temperature data were used to estimate potential evaporation.The observed precipitation and estimated potential evaporation were interpolated into the whole YZRB using the Thiessen polygon method (Meena et al., 2013).The interpolated precipitation and potential evaporation were then averaged for the drainage area corresponding to each hydrological station.
The daily streamflow data were collected from 267 hydrological stations from the Annual Hydrological Report of the People's Republic of China, among which 224 stations with at least 20 years of records from both the period from 1970 to 1990 and the period from 2007 to 2016 were selected; the data from 1990 to 2007 were not found in the online repository (see Fig. S1 in the Supplement for data availability).Information on 361 reservoirs in the middle and lower YZRB, including capacity and controlling area, was downloaded and extracted from the Global Reservoir and Dam database (GRanD;Lehner et al., 2011).A previous study showed that this database provides reliable information on medium-sized and large reservoirs in China (Yang et al., 2021).Watersheds with more than 80 % of the drainage area under control reservoirs, according to the GRanD database, and/or located right downstream of reservoirs and water gates were considered to be watersheds under strong regulation (regulated watersheds).

Potential evaporation estimation
The temperature data were used to estimate potential evaporation following the Hargreaves method (Allen et al., 1998;Vicente-Serrano et al., 2014;Berti et al., 2014).
where ET 0 is potential evaporation (mm d −1 ), T max is the highest temperature ( • C), T min is the lowest temperature ( • C), T mean is the mean temperature ( • C), and Ra is the outer space radiation [MJ m −2 d −1 ], which can be calculated as follows: where d r is the reciprocal of the relative distance between the Sun and the Earth, ω s is the angle of sunshine hours, δ is the inclination of the Sun (rad), and ϕ is geographic latitude (rad).d r , δ, and ω s can be calculated by the following formula: where J is the daily ordinal number (1 January is 1).

Soil water storage estimation
The soil water storage was estimated based on the daily water balance, as follows (Berhuijs et al., 2016(Berhuijs et al., , 2019b)): where S is the soil water storage (mm), which is initially set to 0. Due to the long term of simulation, the change in the initial value would not significantly affect the results.P is precipitation (mm d −1 ), Q is discharge normalized by area (mm d −1 ), and ET is evaporation (mm d −1 ), which can be calculated from potential evapotranspiration (ET 0 ), where the soil water storage (S) is used as the upper limit of daily ET, as follows: The estimation of soil water storage and ET is highly simplified and is not used for prediction but to capture the first order of the temporal variation in and the relative wetness of soil in the study time period, which helps develop a framework that differentiates the relative contribution of precipitation and soil moisture in flood generation.

Topographic wetness index estimation
Topographic wetness index was calculated to represent the combined impacts of drainage area and topographic gradient, as follows (Alfonso et al., 2011;Grabs et al., 2009): where A d is drainage area, and α is topographic gradient estimated from DEM. TWI represents the propensity of subsurface flow accumulation and frequency of saturated conditions and thus can be used to predict relative surface wetness and hydrological responses (Meles et al., 2020).It is widely used to quantify topographic impact on hydrological processes (spatial-scale effects, hydrological flow path, etc.) and in land surface models for hydrological, biogeochemical, and ecological processes (Sørensen et al., 2006).The maximum daily discharge of each year was selected as the annual flood, which was then averaged across years as the mean annual maximum flood (AMF).The observed rainfall on that day and the estimated soil water storage on the day before AMF in each year were also averaged across years as daily rainfall (P ) and antecedent soil moisture (S 0 ).Since almost all the AMFs in our study region come during rainy season when rainfall occurs on most of the days, it could be difficult to isolate the events of AMFs among consecutive flow events.To avoid the bias that may be caused in event separation, the soil moisture on the day before the AMF was used as antecedent soil moisture instead of the day before the AMF event.To examine the impacts from a long-lasting rainfall event, especially for the large watersheds with longer concentration time, we also calculated the mean accumulated rainfall from 2 d (rainfall on the flood day and the day before; P 2 ) to 7 d before (weekly rainfall; P 7 ).
The percentile of antecedent soil moisture (S 0 ) was calculated to represent the relative saturation of soil moisture in the time series, while the percentile of daily rainfall (P ) was estimated to show the relative intensity (P ), thus representing the relative magnitude of rainfall events across time.The percentile of accumulated rainfall was also calculated for the 2 to 7 d of rainfall.
To quantify the relative importance of antecedent soil moisture and rainfall in flood generation, ratio of the relative importance of antecedent soil moisture and daily rainfall (SPR) at the AMFs was derived, i.e., SPR = S /P .When SPR is large, the antecedent soil moisture is much closer to the maximum, while, when the daily rainfall is less extreme, floods are more affected by the antecedent soil moisture.On the other hand, a smaller SPR indicates a relatively larger magnitude of rainfall compared with antecedent soil moisture; that is, rainfall is more extreme and influential in flood generation.

Spatial patterns of antecedent soil moisture and precipitation during floods
Figure 2 shows the spatial distribution of the percentile of antecedent soil moisture and daily rainfall during the annual maximum floods (AMFs) in the middle and lower reaches of the Yangtze River.As we can see from Fig. 2a, in the middle and lower reaches of YZRB, when AMFs occurred, the percentile of antecedent soil saturation was generally high, and most of them are larger than 0.6; the farther away it was from the main stream, the more saturated the soil was.On the other hand, along and near the main stream and the delta, the antecedent soil saturation rate could be much smaller, with values even less than 0.4.Figure 2b shows the daily rainfall during the AMFs.As we can see, the percentile of daily rainfall is relatively high (> 0.8) at more than half of the study sites, while it is small (< 0.5) for the sites along the main stream and in the delta (Fig. 2b).Comparison between Fig. 2a and b suggests that, except for the sites on the main stream and in the delta, sites with relatively high antecedent soil saturation rate (i.e., > 0.8; the blue dots) during AMFs are also the ones with relatively small daily rainfall contribution (i.e., < 0.8; the light blue and cyan dots).That is, for these sites, the AMFs usually occur under much wetter conditions, while extreme rainfall on the flood day is not necessary, suggesting the relative importance of soil wetness.For the sites with both the percentile of soil moisture and rainfall between 0.6 and 1, both the antecedent soil moisture and rainfall play important roles in flood generation.As for the sites on the main stream and in the delta, both antecedent soil moisture and rainfall are low during AMFs; this is likely due to the regulations from large reservoirs and water gates.

The scaling effect in the contribution of antecedent soil moisture and rainfall
To further investigate the relative importance of antecedent soil moisture and rainfall in flood generation and the potential influential factors, we examined their correlation with the catchment area (Fig. 3).Given the complicated environmen- tal and social impacts, the regulated watersheds and sites on the main stream are presented separately (the green dots and cyan dots in Fig. 3, respectively).Our study will focus on the sites that are not dominated by regulations (the blue dots in Fig. 3).For simplicity, we will refer them as natural watersheds.
As we can see from Fig. 3, during the occurrence of AMFs, the percentile of antecedent soil wetness increases with watershed area (p value < 0.001), while the percentile of daily rainfall decreases with watershed area (p value < 0.001).That is, with the increase in watershed size, antecedent soil moisture becomes more and more saturated, while the precipitation is less and less extreme during AMFs, suggesting the rising contribution of antecedent soil moisture and declining importance of daily precipitation in flood generation.As for the regulated watersheds (green dots in Fig. 3), there is no clear correlation between the drainage area and the percentile of antecedent soil moisture or rainfall, which is understandable.Meanwhile, both the percentile of antecedent soil moisture and rainfall decreases with watershed area for main stream sites.

The scaling impacts on accumulated rainfall
The saturation of soil before floods could be due to previous rainfall events and could also be caused by accumulated rainfall in long-lasting rainfall events that eventually generate floods (Xie et al., 2018).Figure 4 presents the correlation between the percentile of accumulated rainfall and drainage area.When a single day of rainfall is considered, it is negatively correlated with drainage area (Fig. 3a); when accumulated rainfall is considered, the correlation gradually shifts from a negative to a positive correlation (Fig. 4).For example, when 2 d of rainfall were examined, the correlation between accumulated rainfall and drainage area shifts from negative to positive at 10 000 km 2 ; the negative correlation in Fig. 3a is only valid for watersheds larger than 10 000 km 2 (Fig. 4a).This transition area increases from 10 000 km 2 for 2 d of rainfall to 100 000 km 2 for 4 d of rainfall (Fig. 4c).The number of watersheds with negative correlation also decreases.Eventually, the weekly rainfall has a similar positive correlation with drainage area to that of antecedent soil moisture (Fig. 4f).The increase in the transition area may be explained by the increasing response time and confluence time in large watersheds; it takes days to generate flow events by heavy rainfall and for them to reach outlets where they can be observed in large watersheds.This is also consistent with the conclusion in the Yellow River basin (Ran et al., 2020b) and our previous findings of the dominant flood-generation mechanism in the middle and lower YZRB because weekly rainfall is the dominant flood driver for sites on the main streams and the major tributaries (Wang et al., 2022).The regulated watersheds do not show a significant correlation, which is understandable for the strong human intervention.For the negative correlation between accumulated rainfall and drainage area at main stream sites, it is difficult to decide whether it is due to the scaling effect or human intervention.

3.4
The interlink of watershed characteristics, flood, antecedent soil moisture, and rainfall Figure 5 presents the percentile of antecedent soil moisture and rainfall during the AMFs at the study watersheds.The circles are scaled by watershed size and colored with topographic gradient.Except for the watersheds with strong human intervention (regulated ones and the ones on the main stream), there is a negative correlation between the contribution of rainfall and antecedent soil moisture.The lower right of the scatter is mostly big blue dots which are large watersheds with a gentle topographic gradient.That is, AMFs usually occur when soil moisture is close to saturation, while extreme rainfall is not necessary for AMFs in these watersheds.On top of the scatter are relatively small yellow and green dots.Those are medium to small watersheds with a steep topographic gradient; that is, AMFs are usually generated with extreme rainfall, while the saturation of soil moisture is not necessary.shift in dominance in AMF generation from extreme rainfall to antecedent soil wetness from small steep watersheds to large flat ones.
Figure 6 shows the relative importance of antecedent soil moisture and rainfall.For the natural watersheds (the circles), SPR increases with drainage area and declines with topographic gradient.That is, the larger the drainage area, the more essential the contribution of antecedent soil moisture to floods, and the less influential the rainfall is in flood generation.For watersheds with a similar drainage area (i.e., the green or light blue dots in Fig. 6b), the topographic gradient also impacts on SPR, i.e., SPR decreases with slope.That is, the relative importance of rainfall increases at steeper watersheds.This may be attributed to the shortened hydrological response time due to the steep topography which facilitates rainfall-induced floods generation.As a combination of both drainage area and topographic gradient, TWI is positively correlated with SPR at natural watersheds, with less scatter than the correlation between SPR and drainage area or topographic gradient alone.That is, watersheds with a larger area and gentler topographic gradient that are easier to wet tend to have larger SPR; thus, soil wetness is more important in flood generation.There is no significant correlation between SPR and TWI for the regulated watersheds along tributaries (black triangles).However, the sites on the main stream show opposite pattern because the SPR at these sites decreases with TWI and drainage area.It is difficult to determine whether this is because of reservoir regulation or not.More data about watersheds larger than 10 000 km 2 but with limited human intervention are needed to examine this hypothesis.
Besides TWI, SPR is also correlated with the magnitude of AMF (Fig. 7).As Fig. 7 shows, the area-normalized flood peak declines with flood-generation SPR.Watersheds with a large flood peak are mostly the ones with a steep topographic gradient and small SPR (i.e., SPR < 1) and vice versa.Catch-  ments with more extreme floods are the ones with relatively smaller influence of soil moisture on flood generation.Similar correlation was also found at the event scale in our experimental mountainous watershed, which is located at a headwater of the Yangtze River (L. Liu et al., 2021).

The relative importance of antecedent soil moisture and rainfall in flood generation
While soil moisture and rainfall are the two main drivers of floods in the middle and lower Yangtze River basin, the dominance of each factor varies across the relatively natural watersheds.Floods in large watersheds are usually generated when soil is almost saturated, despite the relatively small rainfall amount, while extreme rainfall is usually observed during floods in small to medium-sized watersheds (blue dots in Fig. 3).The rising contribution of antecedent soil moisture in large watersheds was consistent with the findings in Australian watersheds (Wasko and Nathan, 2019), and the declining influence of rainfall at larger watersheds was also found in Indian watersheds (Garg and Mishra, 2019).This contrast correlation with watershed size indicates a shift in dominance in AMF generation, which may be attributed to the longer confluence time in the large watersheds and less heterogeneity in small watersheds.This shift in dominance can be observed more straightforwardly from the negative correlation between the percentile of rainfall and antecedent soil moisture in Fig. 5.The natural watersheds in Fig. 5 could be grouped into three classes based on their drainage area and topographic gradient.When a watershed is large and flat, the flood occurrence is mainly determined by soil wetness (i.e., the big blue dots at the lower right of the scatter); on the other hand, when a watershed is small and steep, heavy rainfall dominates (i.e., the small yellow and green dots at the upper left of the scatter).Between these two groups are relatively small watersheds with a gentle topographic gradient for which the occurrence of AMF requires both highly saturated soil and relatively heavy rainfall.That is, the dominant influential factor(s) in AMFs generation across watersheds is correlated with the topographic characteristics (i.e., watershed size and topographic gradient).This helps quantify the relative importance of soil moisture and rainfall in flood generation in the existing work.
This shift in dominance is not observed in the main stream sites (i.e., cyan dots in Fig. 3), where the percentile of both antecedent soil moisture and precipitation declines with drainage area.This may be attributed to the more complicated flood-generation mechanism at a large scale and the strong human intervention on the main stream (reservoirs, water gate regulation, etc.; Gao et al., 2018;Long et al., 2020;Zhang et al., 2017).The major responsibilities of reservoirs on the main stream are to reduce peak flow and postpone the time to flood peak (Volpi et al., 2018).As a result, the original https://doi.org/10.5194/hess-26-4919-2022 Hydrol.Earth Syst.Sci., 26, 4919-4931, 2022 flood peak would be delayed by the regulation, and the actual flood peak would occur when rainfall declines/stops and soil water drains.Another possibility is that, when watershed size is larger than 100 000 km 2 , the impact of antecedent soil moisture declines as well.To examine this hypothesis, more data from watersheds larger than 100 000 km 2 and with limited human intervention are needed.However, this is beyond the scope of this work and requires future studies.

Linkage between topographic characteristics, SPR, and floods
The correlation between TWI and SPR (Fig. 6c) demonstrates that the relative importance of soil moisture and rainfall could be inferred from topographic characteristics quantitatively.We could derive the relative dominance of soil moisture and rainfall in flood generation in specific watershed from its TWI for the natural watersheds without significant human intervention.Rainfall and soil moisture level have been identified as being dominant drivers of floods, individually or together, in watersheds worldwide (Berghuijs et al., 2016(Berghuijs et al., , 2019b;;Garg and Mishra, 2019;Tramblay et al., 2021;S. Ye et al., 2017).Our findings provide a framework to quantify the relative importance of rainfall and soil moisture and to further identify the influential factors of their importance based on topographic characteristics that are easy to measure.Meanwhile, the SPR also presents a negative correlation with the magnitude of AMFs (Fig. 7).That is, we could infer the mean annual AMF based on the SPR for each watershed.Since the characteristic SPR could be estimated from TWI, we could derive a quantitative estimation of the mean AMFs from topographic characteristics that are easy to measure, even in watersheds with limited hydrologic records.There is also similar negative correlation between TWI and AMFs (Fig. S2).This would be helpful for flood control management in ungauged watersheds, especially in the mountainous watersheds with risks of flash floods.Similar correlation was also found in the observations from our experimental watershed, which is a headwater of Yangtze River (L. Liu et al., 2021).The ratio of observed antecedent soil moisture and event precipitation also presents a similar declining trend with discharge at the event scale.However, the correlation between SPR and discharge at the event scale is preliminary, and more data with a higher resolution and more detailed analysis are needed for validation at the event scale.For this study, our goal is to present the framework to derive floodgeneration SPR that could be estimated from topographic characteristics and to provide information on mean AMFs.
In conclusion, based on the topographic characteristics, we could derive the relative importance of soil moisture and rainfall in flood generation (SPR), and from this relative importance ratio, we could further infer the average flood magnitude at these watersheds.As a result, we could link the topo-graphic characteristics and annual floods through the characteristic SPR during the AMFs.

Implications
Our findings could be helpful for potential flood risk evaluation in ungauged basins, e.g., headwaters in a mountainous region.With the construction of large reservoirs, the capability of flood risk control has improved substantially along the main stream (Zou et al., 2011;Zhang et al., 2015).However, it is still difficult for a quantitative evaluation of flood risk in upstream mountainous watersheds, which are vulnerable to floods but difficult for hydrological modeling and prediction due to limited hydrologic records.
Our findings suggest that we could derive the floodgeneration SPR of each watershed from the drainage area and topographic gradient that are easy to measure.The correlation between SPR and the flood peak provides information of the mean annual floods in ungauged watersheds.Therefore, in regions without observation data, by building flood control infrastructure such as dams and gates, the mean annual flood peak obtained by SPR based on the topographic characteristics can be used to provide quantitative information for flood control and disaster management.Flood control infrastructure could be designed based on the estimated mean annual flood peak and the demographic information.With a further validation of this framework at the event scale, by using the observed soil moisture from remote sensing data and precipitation forecast to generate real-time prediction of SPR values, we could further provide an early warning of floods in these ungauged watersheds.This would be helpful, given the increasing possibility of extreme rainfall events due to climate change; however, more data and examination are needed in future studies.

Limitations
Previous works usually identified the dominant floodgeneration mechanism based on the comparison of the timing of events (Berghuijs et al, 2016;2019b;Blöschl et al., 2017;S. Ye et al., 2017).Similar work has been implemented in our study watersheds, suggesting the importance of soil moisture and rainfall (Wang et al., 2022).Based on that, we further looked into the records to quantitatively evaluate the relative importance of soil moisture and rainfall in flood generation.However, there are limitations to our methods.
The precipitation data we used were averaged for the study watersheds from 247 meteorological stations.Given the large area and considerable spatial heterogeneity, the precipitation data we used may not always be representative of the actual precipitation events.The daily data could also average the rainfall intensity at an hourly scale, which could be influential in small mountainous watersheds.ET was scaled as 0.75 × ET 0 to ensure that it is smaller than the potential evaporation.This is a simplified estimation of ET, and a more so-phisticated method is needed for further analysis on specific catchments at the event scale.
The estimation of soil moisture is also highly simplified, which cannot be considered to be a precise estimation at the event scale.To reduce the influence from this simplification, we used the percentile of soil moisture to represent the relative wetness of soil moisture and the seasonal trend of soil moisture, which was then compared with the percentile of rainfall (see the Supplement and Figs.S3 and S4).While more sophisticated models can be used for soil moisture estimation, there could still be substantial uncertainties (Ran et al., 2020a), yet the seasonal trend and the relative magnitude, after averaging through long-term records, would be less impacted by the simplification in the estimation (Berghuijs et al., 2019b;Zhang et al., 2019).
Our findings may appear different from those in Yang et al. (2020), which attributed the dominant flood-generation mechanism in the Yangtze River basin to rainfall.This may be explained by different classification criteria.Yang et al. (2020) considered both short rain and long rain as rainfall impacts, while here we only considered the daily rainfall.Thus, the importance of antecedent soil moisture may be considered to be long-rain impacts in Yang et al. (2020).It is possible that soil moisture on the day before the AMFs may not be the soil moisture before the event in large catchments due to the long concentration time.We estimated the concentration time for 10 sites with the largest drainage areas (larger than 100 000 km 2 ).The ones on the main stream and at the outlets of major tributaries following the United States Bureau of Reclamation (USBR) method (USBR, 1973;Ockert and Jeff, 2014).The concentration time is mostly within 2 d for main stream sites and is less than 24 h for sites at the outlets of major tributaries (Table S1 in the Supplement).Since the rest of the sites are all smaller than these, so too would be the concentration time.That is, for the natural watersheds we focused on, the concentration time is likely to be within 1 d.Thus, the soil moisture on the day before the AMFs would contribute to the generation of AMFs and should be applicable for this study.
Besides, the exchange with groundwater was not considered in the soil moisture estimation.The exchange with groundwater is more complicated and heterogenous (i.e., rivers could receive groundwater recharge in hilly areas and recharge groundwater in lower-lying land; Che et al., 2021).According to Huang et al. (2021), the variation in groundwater level in the Yangtze River basin is relatively small.Since the goal of this study is to capture the first-order seasonal variation in soil moisture and develop a framework that differentiates the relative importance of precipitation and soil moisture in flood generation, in this study, we estimated the soil moisture following Berhuijs et al. (2016Berhuijs et al. ( , 2019b) ) with a simple water balance equation.
Moreover, this work is focused on the relative importance of soil moisture and rainfall, so the impact of snowmelt is not considered due to the warm and humid climate in the study watersheds.To apply our findings to cold watersheds with significant snow impact, the snowmelt component needs to be incorporated.In addition, our method is based on the average values from many years.While previous work indicated that the occurrence of floods in our study watersheds is highly concentrated (Wang et al., 2021), there could be strong interannual variability in other watersheds.In future studies, an annual-scale and event-scale analysis are needed to examine and improve our findings before it can be applied to watersheds with more diverse climate and landscape conditions.There could be uncertainties embedded in the estimation of soil moisture due to the uncertainties in the inputs and model structures.Comprehensive evaluation of the performance and uncertainty is beyond the scope of our study.More sophisticated models with a groundwater component, remote sensing data, and reanalysis product with a higher spatiotemporal resolution are needed to provide a more accurate estimation and further validation of soil moisture, ET, and advance our understanding of flood-generation SPR.

Conclusions
Heavy rainfall on highly saturated soil was identified as the dominant flood-generation mechanism across world (Berghuijs et al., 2019b;Wang et al., 2021;Wasko et al., 2020).This study aims to further evaluate the relative importance of antecedent soil moisture and rainfall on flood generation and the controlling factors.Climate and hydrological data from 224 hydrological stations and 247 meteorological stations in the middle and lower reaches of the Yangtze River basin were analyzed, along with the modeled soil moisture.Except for the regulated watersheds, the relative importance of antecedent soil moisture and daily rainfall present a significant correlation with the drainage area; the larger the watershed, and the more essential the antecedent soil saturation rate is in flood generation, the less important the daily rainfall will be.
Using the percentile of antecedent soil moisture and rainfall as coordinates, the flood-generation mechanism(s) of study watersheds could be grouped into three classes, namely antecedent soil-moisture-dominated large flat watersheds, heavy rainfall-dominated steep and small to middle-sized watersheds, and small to middle-sized watersheds with a gentle topographic gradient where flood occurrence requires both highly saturated soil and heavy rainfall.Our analysis further shows that the ratio of relative importance between antecedent soil moisture and rainfall (SPR) can be predicted by a topographic wetness index.When the topographic wetness index is large, the dominance of the antecedent soil moisture for extreme floods is stronger, and vice versa.The SPR also presents a negative correlation with areanormalized flood peak.
Q. Ran et al.: The relative importance of antecedent soil moisture and precipitation in flood generation With the potential increase in extreme rainfall events (Gao and Xie, 2016;Chen and Zhai, 2016), upstream mountainous watersheds in the middle and lower Yangtze River basin are facing a higher risk of extreme floods.The lack of hydrological records further increases the vulnerability of people in these watersheds.The flood risks could be reduced by the construction of flood control facilities, but it is difficult to set flood control standards in these ungauged watersheds.Our findings provide a framework to quantitatively estimate the possible flood risk for these ungauged watersheds.Based on measurable watershed characteristics (i.e., drainage area and topographic gradient), the flood-generation SPR could be derived, which could then be used to estimate the mean annual flood.This information can provide scientific support for flood control management and infrastructure construction.
Future analysis at the event scale could help generate the flood-generation curve between SPR and discharge at the event scale to further improve flood risk predictions in these small, ungauged watersheds.With more data from other regions and the improved estimation or observation of soil moisture, we could expand our analysis to watersheds with more diverse climate and topographic characteristics to examine and refine our findings and to enhance our understandings of flood generation.Comparison between different time periods (i.e., before and after 2000) could also reveal temporal changes in flood generation, which may be linked to climate change, yet longer data records are needed to generate representative patterns.

Figure 1 .
Figure 1.Map of the Yangtze River basin, the meteorological stations, and hydrological stations.The blue line is the main stream of the Yangtze River.

Figure 2 .
Figure 2. The spatial distribution of (a) the percentile of antecedent soil moisture during annual maximum flood and (b) the percentile of daily precipitation during annual maximum flood.

Figure 3 .
Figure 3. Scatterplot of the drainage area and (a) the percentile of antecedent soil moisture of AMF events (the linear regression for blue dots is R 2 = 0.46, and p value < 0.001) and (b) the percentile of precipitation on the day of AMF events (the linear regression for blue dots is R 2 = 0.61, and p value < 0.001).The green dots represent the regulated watershed, the cyan dots represent the sites on the main stream, and the rest of the sites are shown in blue.

Figure 5 .
Figure 5. Scatterplot of the percentile of precipitation and antecedent soil moisture.The color represents the topographic gradient, and the size of circles is scaled by drainage area.

Figure 6 .
Figure 6.Scatterplots between the ratio of antecedent soil moisture and precipitation (SPR) and (a) drainage area, (b) topographic gradient, and (c) topographic wetness index (TWI).

Figure 7 .
Figure 7. Scatterplot between the ratio of antecedent soil moisture and precipitation (SPR) and area-weighted annual maximum discharge (Q P ).The color represents topographic gradient.