Explaining changes in rainfall–runoff relationships during and after Australia’s Millennium Drought: a community perspective

. The Millennium Drought lasted more than a decade and is notable for causing persistent shifts in the relationship between rainfall and runoff in many southeastern Australian catchments. Research to date has successfully characterised where and when shifts occurred and explored relationships with potential drivers, but a convincing physical explanation for observed changes in catchment behaviour is still lacking. Originating from a large multi-disciplinary workshop, this paper presents and evaluates a range of hypothesised process explanations of ﬂow response to the Millennium Drought. The hypotheses consider climatic forcing, vegetation, soil moisture dynamics, groundwater, and anthropogenic inﬂuence. The hypotheses are assessed against evidence both temporally (e.g. why was the Millennium

Drought different to previous droughts?)and spatially (e.g.why did rainfall-runoff relationships shift in some catchments but not in others?).Thus, the strength of this work is a large-scale assessment of hydrologic changes and potential drivers.Of 24 hypotheses, 3 are considered plausible, 10 are considered inconsistent with evidence, and 11 are in a category in between, whereby they are plausible yet with reservations (e.g.applicable in some catchments but not others).The results point to the unprecedented length of the drought as the primary climatic driver, paired with interrelated groundwater processes, including declines in groundwater storage, altered recharge associated with vadose zone expansion, and reduced connection between subsurface and surface water processes.Other causes include increased evaporative demand and harvesting of runoff by small private dams.Finally, we discuss the need for long-term field monitoring, particularly targeting internal catchment processes and subsurface dynamics.We recommend continued investment in the understanding of hydrological shifts, particularly given their relevance to water planning under climate variability and change.

Importance of understanding hydrological response to multi-year drought
Change and variability are ubiquitous in environmental and human systems (e.g.Milly et al., 2008;Wagener et al., 2010;Montanari et al., 2013).In recent times, much effort has been spent on understanding and modelling dynamic behaviour, notably under the change in hydrology and society decade of the International Association of Hydrological Sciences for 2013-2022 (Montanari et al., 2013).While many short-term hydrological phenomena (e.g.flooding and seasonal transitions) are relatively well studied, long-term (multi-year) hydrological dynamics have been receiving increased attention over the last decade or so (e.g.Jaramillo and Destouni, 2014;Van der Velde et al., 2014).This paper is concerned with hydrological response to multi-annual droughts, a topic recently explored on multiple continents, including North America, South America, Europe, Asia, and Australia (e.g.Avanzi et al., 2020;Fowler et al., 2020a;Peterson et al., 2021;Alvarez-Garreton et al., 2021;Roodari et al., 2021;Massari et al., 2022).With climate change, many parts of the world will experience long-term drying with more frequent and severe droughts (e.g.Cook et al., 2018).Therefore, it is particularly important to study historic multi-year droughts as the best available analogue for such conditions.However, few informative examples of multi-year drought are available because of short streamflow records which do not include historic droughts (e.g.Verdon-Kidd and Kiem, 2009) and often are affected by lengthy gaps in monitoring data.Lack of concur-rent observations of different components of the catchment water balance (e.g.shallow and deep groundwater heads, soil moisture, and actual evapotranspiration) also limits our ability to explore and test alternative explanations for observed behaviour.Likewise, these issues impede studies on drought recovery (e.g.Peterson et al., 2021) and limits distinguishing long-term changes from shorter-term variability, particularly in highly variable climates (Morin, 2011).
For hydrologic modellers, it is difficult to be confident that modelling techniques are robust without past examples of sustained changes in climate to test them on.There are many aspects of catchment behaviour that are not explicitly simulated but which may prove salient to determine the hydrologic response, such as transient ecosystem dynamics related to water stress and regrowth, in addition to anthropogenic factors (e.g.Van Loon et al., 2016;Speich et al., 2020;Stephens et al., 2021;Bouaziz et al., 2022).Non-linear behaviour and/or tipping points may be part of the response of a system to climate change (Rodriguez-Iturbe et al., 1999;Peterson et al., 2009Peterson et al., , 2021;;Tauro, 2021).In acknowledging these issues, the ability for models to extrapolate to changing conditions was recognised as one of 23 key challenges (unsolved problems) in hydrology by Blöschl et al. (2019).Even models successfully tested against past hydroclimate data may still be unable to extrapolate predictions under enhanced atmospheric CO 2 concentrations, higher temperatures, and potentially more frequent, severe, and longer dry spells (Saft et al., 2016a;Fowler et al., 2018;Stephens et al., 2020).Nonetheless, testing against a range of historic conditions remains an important first step towards robust modelling.
Thus, examples of multi-year drought are valuable to hydrology, and Australia's Millennium Drought is one such example (Van Dijk et al., 2013;Chiew et al., 2014).This paper focuses on the Millennium Drought as a case study, and it is hoped that lessons learnt from this example are transferable and useful for understanding long-term hydrologic change in other places around the world.As explained in Sect.1.2, this drought was notable for causing persistent shifts in the relationship between rainfall and runoff in many catchments (Saft et al., 2015), and globally, other locations have since seen similar behaviour, such as the United States (California; Avanzi et al., 2020), Chile (Garreaud et al., 2017;Alvarez-Garreton et al., 2021), China (Tian et al., 2020), Europe (Massari et al., 2022), and central Asia (Roodari et al., 2021).Future research should examine the underlying causes for all such cases, seeking lessons that are transferable in time and space to other regions that are yet to experience such shifts.With many regions of the world likely to see long-term drying, more frequent droughts, and water scarcity (Schewe et al., 2014;Cook et al., 2018;Kirono et al., 2020), it is critical to understand the causes of apparent hydrological non-stationarity and, ideally, incorporate this into operational models used by water planners (Fowler et al., 2022).

The Millennium Drought
The Millennium Drought occurred in southeastern Australia, affecting an area of more than 500 000 km 2 .The drought commenced in the mid-to-late 1990s (1997 is taken as the start date for this paper) and is assumed to have ended with widespread flooding in 2010-2011(starting in September 2010)).It was primarily a rainfall deficit drought (Van Loon and Van Lanen, 2012), although it is noted that, in some places, average rainfall years occurred mid-way.Post-2011, climatic conditions have been similar to pre-drought averages in some places and below average in others (Argent, 2017;Pepler et al., 2021).In many affected areas, the Millennium Drought was the longest drought on record (with records typically going back 100-150 years), and it caused significant social and economic impacts.The severe hydrological drought was estimated to have a return period exceeding 300 years (Potter et al., 2010;Freund et al., 2017).Water shortages induced low water use allocations for irrigators, water restrictions in urban areas, and significant investment in infrastructure such as desalination plants and pipelines (Van Dijk et al., 2013).The economic costs of the drought have been estimated to be as much as 1.6 % of Australia's gross domestic product (GDP; Horridge et al., 2005).Concern for environmental degradation led to significant water reforms, notably the Water Act (Connell and Grafton, 2011;Skinner and Langford, 2013), including a cap-and-trade-type system to restrict total water use in the Murray-Darling basin.This drought, and possibly projections of a drier future, accelerated major water reforms, such as the return of almost 20 % of irrigation water entitlements to the environment to be managed by new agencies called Environmental Water Holders (Skinner and Langford, 2013;Connell, 2011;Gawne et al., 2020).
Many catchments showed unexpected hydrological behaviour during the Millennium Drought, with streamflow being surprisingly low even when the low rainfall was taken into account.This behaviour, referred to as a shift (Saft et al., 2015), or a change in rainfall-runoff relationship (Petheram et al., 2011), occurred in some catchments but not others (see Sect. 2), and many streams also became more intermittent and/or less saline as part of the transition.Despite the end of the meteorological drought in 2010, many catchments have not recovered, meaning that they have not shifted back to pre-drought behaviour and appear to be persisting within a lower rainfall-runoff relationship (rather than simply being slow to recover; Peterson et al., 2021).In shifted catchments that have not recovered, an average rainfall year produces less streamflow today than it did pre-drought.
The physical mechanisms underlying these shifts are not well understood.As discussed in Sect.2, research to date has successfully characterised where and when shifts occurred (Petheram et al., 2011;Saft et al., 2015;Peterson et al., 2021) and explored statistical relationships with potential drivers (Saft et al., 2016b;Peterson et al., 2021).However, the sci-entific and management communities still lack a convincing and widely accepted physical explanation for observed changes in catchment behaviour, which is the subject of this paper.

Context and aims of this article
This article arose out of a virtual workshop in late 2020, where a multi-disciplinary group of scientists gathered to discuss the hydrological consequences of the Millennium Drought.Integrating perspectives from hydrogeology, plant ecophysiology, remote sensing, hydroclimatology, experimental hydrology, and hydrological modelling, the workshop aimed to share and discuss hypotheses about catchment behaviour and associated flow response that could explain the Millennium Drought streamflow observations, considering both the spatial and temporal patterns of hydrological shifts.
Synthesising and building upon the discussions in the workshop, this article aims to achieve the following: discuss relevant processes and how they could be responsible for observed shifts, review available evidence and discuss the strengths and limitations of that evidence, consider whether a single holistic perceptual model of catchment response can be formulated by integrating the process explanations which are consistent with evidence, and recommend future research and/or monitoring activities which may confirm, distinguish between, and/or improve the process explanations.
Figure 1 shows the structure of the remainder of this article.Section 2 describes the hydrological shifts in greater detail, including a summary of key research to date.Section 3 considers a variety of processes relevant to streamflow generation, reviews the current state of knowledge in each case, and hypothesises process explanations based on this knowledge.Section 4 presents diagnostic evidence, tests the hypotheses against the evidence, and seeks to harmonise them into an integrated description of hydrologic response.Section 5 discusses future monitoring and research priorities required to further develop and/or distinguish between the plausible process explanations.
Underlying this structure is the implicit assumption that all considered processes and hypotheses are potentially relevant to the scientific community, even in cases which turn out to be inconsistent with empirical data.An alternative would be to only present the final set of process explanations along with associated evidence, but we feel this would remove much of the richness of the process of discovery and evaluation.Recent articles by Beven and Chappell (2021) and Wagener et al. (2021) affirm the value of documenting changes in the qualitative understanding of catchment behaviour and https://doi.org/10.5194/hess-26-6073-2022 Hydrol.Earth Syst.Sci., 26, 6073-6120, 2022 the perceptual (mental) models we use to describe catchments (see also McGlynn et al., 2002).Sharing perceptual information (and opening it to wider evaluation) is particularly important when pushing the limits of hydrological knowledge (Wagener et al., 2021) and working in multidisciplinary contexts (e.g.Staudinger et al., 2019).In any case, process explanations inconsistent with empirical data may correctly explain the hydrology of drought events in other locations or at other times.Thus, we feel there is value in providing a full account of the hypotheses considered.
2 Current understanding of drought-induced hydrological shifts The purpose of this section is to describe the current understanding of the Millennium-Drought-induced hydrological shifts.To provide a concise summary, we focus on two papers which most directly examine the issue (namely Saft et al., 2015, andPeterson et al., 2021), while acknowledging that other papers exploring related themes exist (e.g.Petheram et al., 2011;Westra et al., 2014;Saft et al., 2016a, b;Fowler et al., 2016Fowler et al., , 2020a;;Ajami et al., 2017;Deb et al., 2019), including elsewhere in Australia (Petrone et al., 2010;Hughes et al., 2012) and globally (see introduction).Both papers examine the rainfall-runoff relationships using the annual time step, which reduces the impact of seasonal storage differences and smooths out within-year extremes.For this section only, we focus on the annual time step as a way of introducing the shifts in rainfall-runoff relationships, including the temporal and spatial scale of the changes.However, as seasonal and event dynamics may be a crucial element of process explanations, they are considered at length in Sect.3, and plots of the seasonal precipitation, streamflow, and runoff coefficient are provided in Appendix B.
The most important observation is that the hydrological response during the drought was out of proportion with the change in rainfall over the same period -in other words, the streamflow during the Millennium Drought was lower than expected, even after the lower rainfall is taken into account (Fig. 2a-b).For example, Saft et al. (2015) demonstrated shifts in the annual rainfall-runoff relationship when considered separately over the Millennium Drought period and the pre-drought period.The degree of change necessary to be considered a shift varied by catchment, since the key criterion was a statistical test designed to identify statistically significant differences in relationships (α = 0.05), and the results of this test are sensitive to background temporal variability.For catchments showing shifts, the implication is that a given annual rainfall led to a smaller annual streamflow during the drought than before.Approximately half of the studied catchments shifted (Fig. 2c), especially drier and flatter catchments with low forest cover.The shifts were often associated with increases in the number of zero-flow days.
The shifted behaviour has continued post-drought.Using more recent data, Peterson et al. (2021) showed that more than half of the shifted catchments had not yet recovered by 2017 (Fig. 2d), despite a return to wetter conditions particularly in 2010, 2011, and 2016.In shifted catchments that have not yet recovered, years with average rainfall produce less streamflow today than they did pre-1997.Peterson et al. (2021) found that the catchments appear to be persisting within lower rainfall-runoff relationships, and that nonrecovery is not explained by changes in seasonal rainfall, land cover change, or the rating curve or gauge method.While this study was over a smaller extent (state of Victoria; Fig. 2c-d), Peterson et al. (2021) suggested the non-recovery was best explained by increased actual evapotranspiration per unit of precipitation, rather than alternatives such as increased recharge.
The shifted catchments had experienced drought previously during the period of streamflow record, as reflected by the low pre-1997 rainfall years in Fig. 2b.However, there was something different about the Millennium Drought that caused streamflow response to diminish relative to previous droughts.Having experienced prior years of low rainfall, this set up an expectation of how much streamflow could be expected in a dry year of given severity (measured by rainfall), and this expectation was subsequently broken during (and after) the Millennium Drought.

Possible process explanations
In this presentation of process explanations, several categories of processes are considered, each in a separate subsection, and explanations involving more than one category are marked accordingly and grouped with the most important category for the mechanism in focus.For clarity, hypothesised process explanations are labelled with the acronym HPE, followed by an identification number and a one-sentence summary, starting with "streamflow was lower than expected because . . .". Table 1 summarises all the HPEs.

Changes in meteorological conditions
The problem framing in Sect. 2 already accounts for changes in precipitation, but this includes considered annual values only and not within-year changes, such as sequencing and seasonality, or changes in variables other than precipitation.These changes are the focus below.For reference, the maps in Appendix B show annual and seasonal precipitation, streamflow and runoff coefficient over the pre-drought, drought, and post-drought periods.

Changes in rainfall seasonality
Earlier studies such as Potter and Chiew (2011) suggested the seasonality of rainfall as being one causative factor for lower Millennium Drought streamflows.As seen in Fig. 3a, precipitation reduced across the seasons, and to different degrees in different areas, but reductions in autumn (March-May) were the most severe in many areas.There were also reductions in winter rainfall (see also Pepler et al., 2021), although the percentage of reductions was not as great as those in autumn.
In this study area, autumn is the time when many catchments wet up ahead of the main flow-generating period in winter (June-August) and early spring.In terms of hydrological processes, tightly bound soil water, once seasonally depleted, likely receives priority in the seasonal re-wetting process (e.g.Brooks et al., 2010), since the mobility of water moving to the stream is restricted until after the soil pores have been replenished and the soil reaches field capacity.
During the Millennium Drought, lower autumn rainfall may have extended the soil-pore-filling period later into the year, diminishing the period of flow generation, a concept supported by Wasko et al. (2020).The hypothesised process explanation (HPE) is thus summarised as follows: HPE01: streamflow was lower than expected because lower autumn rainfall led to delayed wetting up and muted flow response during winter (when runoff coefficients are usually highest).

Increased time spent in spells of little or no rainfall
As shown in Fig. 3b, during the Millennium Drought there was an increased proportion of time spent in spells with little or no rainfall in many locations.Although HPE01 and HPE02 are similar, HPE01 is focussed on the seasonal dynamics of refilling the soil moisture deficit accrued over the entire dry season, whereas HPE02 is concerned with shorter timescales and occurred in all seasons, albeit to different degrees (Fig. 3b).With longer periods between significant rainfall events, soil moisture deficits likely arose from continued evaporation and/or transpiration, which then needed to be filled prior to the resumption of streamflow (similar to HPE01).Because of this, a greater proportion of event rainfall would be used to fill this deficit rather than converted to streamflow.Although Verdon-Kidd and Kiem (2009) note that the increase in dry spells was not as high as during two other protracted droughts pre-1950, it is retained as a hypothesis for now as HPE02: streamflow was lower than expected because more time spent in spells without rain led to higher soil moisture deficits between rainfall events and thus less conversion to streamflow.

Decreases in frequency of high-rainfall events
Generally, a small number of large rainfall events each year contribute disproportionately to annual flow volumes, and there is recent evidence for a reduction in the frequency of large rainfall events in many parts of Australia (Wasko et al., 2019;Dey et al., 2020).Heavier rainfall events are more conducive to preferential flow, but smaller or less intense events may be more readily absorbed by the soil and thus available to plants (e.g.Elliott et al., 2015).A reduction in the frequency (Fig. 3c) of high rainfall events (i.e.those above thresholds that produce runoff1 ; Saffarpour et al., 2016) could produce a downward shift in streamflow generation, even if annual reductions in precipitation are relatively moderate.Furthermore, we might expect that flatter catchments require higher rainfall thresholds to produce runoff than steep catchments; it is thus possible that we might see differences in response even within a homogenous rainfall region.Sup-  porting this hypothesis, Verdon-Kidd and Kiem (2009) reported that the Millennium Drought had less intense rainfall, on average, than two other protracted droughts pre-1950.Thus, HPE03: streamflow was lower than expected because fewer rainfall events exceeded thresholds to produce runoff.

Increases in evaporative demand
Stephens et al. (2018) reported recent increases in pan evaporation within the area affected by the Millennium Drought (despite earlier declines), while Potter and Chiew (2011) cited higher evaporative demand as a potential driver of hydrological behaviour.Although southeastern Australia is typically water limited rather than energy limited, anomalies in evaporative demand may still be important seasonally and in wetter parts of the landscape.Increases in potential evapotranspiration (PET) when water is available (e.g.winter and spring are the wettest times in the southern part of the affected area) are more likely to impact the water balance.Figure 3d shows that evaporative demand -here quantified as Morton's (1986) wet environment evaporation -increased significantly at precisely this time of year and more so in the drier south and south west of the study area (in central and western Victoria; cf.Fig. 2).Conspicuously, this is also an area where almost all catchments exhibit hydrologi-cal shifts (Fig. 2c-d).The question of which environmental variable drives this increase is held over to Sect. 4 (see also HPE07).In summary, HPE04: streamflow was lower than expected because evaporative demand increased at a time of year when catchments are relatively less water limited and relatively more energy limited.

Length of drought
The Millennium Drought lasted 10, 13, or 16 years, depending on the location and choice of definition.By comparison, many earlier droughts, such as those in 1965-1968 and 1982-1983, were shorter.It is possible that the unexpectedly severe streamflow reductions during the Millennium Drought were caused by the activation of long-term or long-memory processes that were absent from these earlier droughts.The pre-1950 period included relatively longer droughts such as the Federation Drought (1895-1903) and the World War II Drought (1939)(1940)(1941)(1942)(1943)(1944)(1945), and while comparison of meteorological conditions among these droughts is insightful (e.g.Verdon-Kidd and Kiem, 2009), few streamflow gauges were active during these periods (see also Sect.4.1), which makes intercomparison of flow difficult.Thus HPE05: streamflow was lower than expected because expectations were conditioned by shorter droughts for which long-term or long- memory process responses were less apparent.This broad, overarching HPE is incomplete without further detail about which processes are in view, but this is held off to later HPEs (specifically HPEs 9, 15, 16, 17, and 18).

Comments on climatic drivers
The climate drivers presented above -namely PET, rainfall seasonality, dry spells, and intensity -all relate to the Millennium Drought period and not post-drought.As many catchments did not recover post-drought, a test of these HPEs is whether the stated climatic driver continued post-drought or went back to pre-drought conditions.For example, did the changes in seasonality cease with the drought or continue post-drought?We leave such questions to Sect. 4 as part of the evaluation of the HPEs.

Vegetation dynamics
In southeastern Australia, high evaporative demand relative to rainfall means that evapotranspiration dominates the water balance, and rainfall-runoff ratios are low (typically < 10 %).Therefore, vegetation behaviour has the potential to significantly impact streamflow, as a small percentage change in plant water availability or actual evapotranspiration (AET) may translate into a large percentage change in streamflow.Recent findings suggesting an increase in AET per unit of precipitation (Peterson et al., 2021; see also Sawada and Koike, 2016) underscore the need for investigation.

Greening induced by increased atmospheric CO 2
Atmospheric CO 2 concentrations have been increasing over the instrumental period (IPCC, 2021), with their effect on vegetation water use and impact on catchment or regional water resources being the subject of considerable research (e.g.Peel et al., 2009;Cheng et al., 2017).Higher atmospheric CO 2 concentrations generally increase plant water use efficiency (e.g.Ainsworth and Long, 2005;Gedney et al., 2006;Liu et al., 2020), which may manifest as a reduction in transpiration (e.g. Warren et al., 2011).However, the increase in water use efficiency can also manifest as increased growth, and in some cases, this can lead to increased transpiration (e.g.Piao et al., 2007;Uddling et al., 2008;Jaramillo et al., 2018).Observational studies from eastern Australia supporting the latter view include Ukkola et al. (2016), Ajami et al. (2017), andTrancoso et al. (2017).This suggests a potential cause for the reduced streamflow, but it is noted that responses may depend on land cover, e.g.forests and grasslands (Morgan et al., 2004(Morgan et al., , 2011; see next subsection), and may also vary under drought conditions (Yang et al., 2016).It should also be noted that, because CO 2 fertilisation typically leads to a reduction in transpiration per unit leaf area, increased growth can often occur without any increase in transpiration (De Kauwe et al., 2021;Rifai et al., 2022).HPE06: streamflow was lower than expected because fertilisation by higher CO 2 concentrations caused higher vegetation water use (relative to previous droughts).

Atmospheric coupling, vapour pressure deficit, and evaporative demand
This HPE spans vegetation behaviour, climatic forcing (HPE04), and atmospheric physics but is listed under vegetation because it is the characteristics of the vegetation itself which are key to the process and lead to spatial differences in this effect.Atmospheric coupling refers to the degree of mixing of the air between the canopy and the atmosphere above, which is related to the turbulent flow over and within the canopy (McNaughton and Jarvis, 1991).Small-stature canopies (e.g.grasslands) are more decoupled due to lower turbulence associated with homogeneous canopies (e.g.De Kauwe et al., 2017).This lower turbulence in grasslands allows a near-surface boundary layer to build up, which decouples the vapour pressure deficit (VPD) experienced by plants from the atmospheric VPD.Thus, transpiration in grasslands is relatively less responsive to atmospheric VPD and relatively more responsive to other factors such as solar radiation and temperature.Thus HPE07: streamflow was lower than expected because higher radiation and/or air temperature increased transpiration, particularly in grassland catchments, due to differences in atmospheric coupling between grasslands and forests.Discussion of whether increases in radiation actually occurred (and with a spatial distribution supporting this hypothesis) is held over to Sect. 4. Note that, in contrast to grasslands, transpiration from forests in southeastern Australia can be very sensitive to atmospheric VPD, and this is discussed further in the next subsection.

Moisture stress, delayed desiccation, and vegetation replacement
Under low moisture, plants may become water stressed, and leaves (or, more rarely, the whole plant) may die.In grasslands with pronounced dry seasons, loss of leaves due to desiccation may occur seasonally, leading to reduced water use, but the root system typically remains alive despite loss of foliage.Higher CO 2 allows more efficient use of available water in grasslands (e.g.Hovenden et al., 2019) and may delay seasonal desiccation in C 4 grasses under warmer conditions (Morgan et al., 2004; see also Morgan et al., 2011).Paradoxically, grasses that retain foliage for longer may transpire rainfall from mid-to-late summer storms (and/or cause greater evaporation via interception), thereby using more water than grass where leaves are already dead.This is a paradox because the higher water use efficiency, which enables the longevity of the C 4 grass leaves, could actually result in greater annual water use.HPE08: streamflow was lower than expected because higher CO 2 concentrations allowed C 4 grasslands to delay seasonal desiccation and transpire water for longer (relative to previous droughts).
If vegetation dies entirely (e.g.Allen and Breshears, 1998;Fensham et al., 2005;Semple et al., 2010), then this could increase water use if the vegetation is replaced by species that are more opportunistic in their water use.HPE09: streamflow was lower than expected because vegetation mortality led to species replacement with plant types that are more opportunistic and thus use more water.
The next HPE is one of three (namely HPEs 10, 11, and 12) which are phrased differently.As noted earlier, forested catchments rarely shifted, whereas grasslands commonly shifted.The most straightforward way to explain this spatial pattern is to posit a hypothesis or hypotheses that act in grasslands catchments and not forested catchments.A less straightforward, but still possible, explanation is to posit that the mechanism causing the shifts acted on all catchments regardless of land cover, and then a second mechanism, specific to forested catchments, counteracted the first to produce the observed spatial pattern.HPEs 10, 11, and 12 are in this latter category, and this is why the HPE wording is preceded by the words "assuming some other mechanism for lower streamflow . . .".
As noted, transpiration from forests can be very sensitive to atmospheric VPD (e.g.Gharun et al., 2013), with plants exercising strong stomatal control to limit moisture loss during periods of high VPD and low soil moisture (e.g.Renchon et al., 2018;Grossiord et al., 2020).Thus, if some other mechanism was acting to cause hydrological shifts across grasslands and forests alike, then stomatal control in forests may have counteracted this mechanism (in which case forest catchments would appear less shifted than grasslands).HPE10: assuming some other mechanism for lower streamflow, stomatal control by trees (responding to high VPD) made extra water available for streamflow, counteracting this mechanism in forests.Discussion of whether increases in VPD actually occurred (and with a spatial distribution supporting this hypothesis) is held over to Sect. 4.
Trees demonstrate additional water-saving strategies (e.g.McDowell et al., 2008) not related to stomatal control, with eucalypts being able to reduce leaf area (Nolan et al., 2021) or undergo xylem cavitation (sometimes temporarily), thus impairing water use (Blackman et al., 2019;Nolan et al., 2021).These are listed separately to the previous HPE because they may persist for longer than rapid stomatal dynamics and be less dependent on VPD.HPE11: assuming some other mechanism for lower streamflow, water use controls such as xylem cavitation or reduced leaf area made extra water available for streamflow, counteracting this mechanism in forests.

Wildfire
Wildfire recovery is known to affect water use, with impacts subject to competing factors.Tree mortality and morbidity reduce water use, whereas re-growth increases it, and this is species specific (Heath et al., 2014).Due to this combination of factors, yield impacts reported by Australian studies vary widely and include significant increases (Lane et al., 2006;Guo et al., 2021), decreases in the short-medium term (Nolan et al., 2015) and longer term (Kuczera, 1987), and negligible change in catchment yield (Feikema et al., 2013;Heath et al., 2014).Many eucalypts have post-fire resprouting mechanisms (e.g.epicormic buds and lignotubers) which can be activated, provided the fire does not kill the tree (e.g.Nolan et al., 2014).Thus, fire severity is an important factor, as is species mix (Nolan et al., 2015).A further consideration is changes to soil hydraulic properties and water repellency following fire (e.g.Nyman et al., 2010).Southeastern Australia was subject to multiple fire events during the Millennium Drought.For example, the 2003 bushfires were the first widespread fire events since the early 1980s, and subsequent major fires occurred in the summers of 2006-2007 and 2008-2009 and over different areas in each case.In general, the areas affected by wildfire were those that did not shift.The hypothesis is as follows: assuming a dip in vegetation water use after the wildfire (e.g.Lane et al., 2006;Guo et al., 2021), possibly coupled with runoff-induced changes in soil properties (e.g.Nyman et al., 2010), the wildfire caused an increase in streamflow that masked the underlying shift in behaviour.In other words, if not for wildfire, more catchments would have appeared to have shifted.HPE12: assuming some other mechanism for lower streamflow, the occurrence of forest wildfires made extra water available for streamflow, counteracting this mechanism in forests.

Vegetation recovery from waterlogging and salinity
This HPE spans vegetation and groundwater processes.Prior to the Millennium Drought, multi-decadal wet conditions combined with historic deforestation led to high water tables and waterlogging in many areas, with naturally occurring soil salt brought to the surface (e.g.Cartwright et al., 2004).
The waterlogged and saline topsoil reduced the health, and thus water use, of vegetation (Lambers, 2003).Waterlogged and salt-affected areas were planted with tolerant grasses or shrubs, while trees were added to the landscape in groundwater recharge areas or upslope of discharge areas (Schofield, 1992;Marcar, 2016).These revegetation efforts were focussed on the late 1980s and 1990s, and thus the timing of maturation of much of this vegetation would have coincided with the onset of the Millennium Drought (commencing in the late 1990s).Furthermore, the drought onset lowered water tables, removing waterlogging as a stressor of vegetation (although it is noted that legacy salinity may persist for many years and even intensify during drought due to evapoconcentration; see, e.g., Hrachowitz et al., 2015).HPE13: streamflow was lower than expected because vegetation recovered from prior waterlogging and because of the maturation of vegetation planted earlier to combat salinity.

Increased interception due to decomposing material
The drier conditions during the Millennium Drought, together with potential greening, may have led to a greater accumulated dead plant matter in grasslands.Litter in grasslands has been shown to increase interception of precipitation (e.g.Naeth et al., 1991).HPE14: streamflow was lower than expected because accumulated decomposing litter in grasslands intercepted more precipitation, causing higher AET.

Biophysical adaptation to climatic disturbance associated with multiple stable states of catchment behaviour
Peterson et al. ( 2021) suggest vegetation behaviour may explain the apparent appearance of hydrological shifts.Furthermore, they suggest that their findings (as summarised in Sect.2) are "consistent with . . .watersheds having multiple stable states [of behaviour] and a finite resilience" (Peterson et al., 2021).In this view, the hydrological shift towards a lower rainfall-runoff relationship corresponds to a transition from one stable state to another (see Peterson et al., 2009).
It is noted that the word "state" here has a different meaning to its common hydrological usage (e.g.state variable), as it refers to the behaviour of a system being organised into discrete preferential states (e.  (Peterson et al., 2021:4-5).The exact nature of this biophysical adaptation is yet to be determined and could be related to one or more of the preceding HPEs.However, it should be noted that none of the preceding HPEs inherently gives rise to multiple stable states, and thus such a hypothesis must be stated separately, which is why it is given its own HPE here.In any case, given that AET is the largest element (after rainfall) of the water balance in southeastern Australia, it seems reasonable to explain the hydrological shifts by hypothesising a direct change to the dynamics of AET by some cause.If it is accepted that the vegetation is the primary cause, then this explains both the streamflow shifts (since less water is available for runoff) and the water table decline described in the next subsection (since a higher fraction of drought rainfall is transpired, which dries out the soil, reducing recharge).HPE15: streamflow was lower than expected because biophysical adaptation, possibly associated with multiple stable states of catchment behaviour, caused an increase in the fraction of precipitation going to transpiration.

Groundwater and subsurface processes
As discussed above (HPE05), it has been hypothesised that the long duration of the Millennium Drought increased the role of long-term (long-memory) processes in determining hydrological behaviour.Given that groundwater is a relatively large store of water that can exhibit multi-year trends and long memory, it seems reasonable to assume that some of the streamflow declines were related to changes to groundwater inflows -a hypothesis strengthened by the observed regional-scale decline in the water table in southeastern Australia throughout the Millennium Drought (Fig. 4; LeBlanc et al., 2009;Van Dijk et al., 2013;Fowler et al., 2020a).
For this hypothesis to work, it is first necessary to establish that groundwater influences streamflow generation, as briefly summarised here.In headwater catchments, inflows of both shallow and deep groundwater contribute to streamflow and may have different response times to disturbance.For example, streamflow may arise from relatively shallow interactions such as perched saturation along hydraulic gradient fronts (e.g. at the transition from A to B horizons) -behaviour which is relatively transient, with short (sub-annual; days-weeks) response times (e.g.Hughes et al., 2007).In-teractions with unconfined aquifers include groundwater discharge directly into streams or via springs (e.g.O' Grady et al., 2010;Cartwright and Gilfedder, 2015;Zhu et al., 2021).Also, partial saturated areas and areas with shallow water table are primed for event runoff (e.g.Dunne and Black, 1970).These varied processes -referred to hereafter as groundwater-surface water (GW-SW) interaction -mean that unconfined aquifers can interact and contribute to short timescale (event-to-seasonal) hydrological processes in headwater catchments.However, unconfined aquifers also exhibit slower dynamics such as multi-year trends.These dynamics depend on conditions.For high rainfall headwater catchments with steep slopes, lag times will be less than a few years, while for medium rainfall lower slope catchments the timescale can be decades, depending on geology (Cook, 2003).The observed drought-induced multi-year water table decline (Fig. 4a) likely caused discharge areas to dry up and seasonal wet areas to contract, thus reducing flow in headwater streams.In addition, groundwater-induced contractions in stream network length and extent increased subsurface flow path lengths to the nearest stream, potentially increasing the probability of water being diverted to transpiration (Jensen et al., 2017(Jensen et al., , 2018) ) rather than contributing to streamflow.Together, the above points establish the connection between groundwater and streamflow generation in headwater streams, leaving HPEs 16, 17, 19, and 20 to focus on the cause of the groundwater decline.
Note that some of the vegetation-themed HPEs presented above (specifically HPEs 6-9 and 13-15) could also reduce recharge and are thus also possible causes or contributors to groundwater decline.Even in the absence of these HPEs, it is reasonable to assume reduced recharge during the Millennium Drought since it is highly correlated with rainfall (e.g.Ferdowsian et al., 2001;Peterson and Western, 2014).Furthermore, note that the effect of groundwater on streamflow can involve extensive groundwater systems (Sect.3.3.2) and is not limited to the headwater streams in focus here, since the fall in the water table also impacts middle and lower reaches of rivers in catchments where the SW and GW boundaries coincide (see also Sect.3.3.3).

Sustained local groundwater drainage from hillslopes
Multi-year water table declines in response to the Millennium Drought imply that the depleting fluxes of the groundwater water balance (see Fig. 4b) are less sensitive to the multi-year dry conditions than additive fluxes.This HPE explains the depletion of groundwater by sustained underground drainage from hillslopes to local low points in the landscape (e.g.base of slope and valley bottoms).These low points often contain deposits of alluvium (river deposits) and/or colluvium (unconsolidated sediments deposited at the base of hillslopes by gravity).Separate treatment of hillslope and alluvial aquifers has precedent in local groundwater-surface water studies such as the salinity-focussed rainfallrunoff modelling by Stenson et al. (2011), in addition to modelling in Africa undertaken by Hulsman et al. (2021a), albeit at a larger scale.According to this HPE, the alluvial/colluvial deposits receive the drained water, and then evaporation and transpiration within these deposits keep the water table below the surface and ensure that much of this drainage does not contribute to streamflow.This drainage is hypothesised to occur continuously throughout the pre-drought, drought, and post-drought periods.During pre-drought, the drainage was balanced by higher recharge, meaning that the water table in hillslopes was closer to the surface.This earlier shallow groundwater facilitated runoff processes in hillslopes that were subsequently diminished when the water table dropped, contributing to the hydrological shifts.During the drought, recharge declined, while, according to this HPE, the drainage was assumed to be more sustained, resulting in lower groundwater heads particularly in hillslopes.The plausibility of this HPE assumes that lateral aquifer transmissivity does not decline so much with lower head as to shut off the lateral drainage from the hillslopes.Thus, this HPE depends on the presence of (i) geological strata that support the sustained drainage (e.g.deeply fractured rock; Rempe and Dietrich, 2018) and (ii) colluvial/alluvial deposits (flat valley bottoms; not erosional/v-shaped).HPE16: streamflow was lower than expected because water slowly drained from hillslopes (contributing to water table decline and reduced GW-SW interaction) to nearby alluvial areas and was subsequently lost via evapotranspiration.

Groundwater export across catchment boundaries
Whereas the previous HPE assumed drainage within a catchment, we can also hypothesise drainage crossing boundaries of surface water catchments as part of a regional flow system (e.g.Bouaziz et al., 2018;Condon et al., 2020;Hulsman et al., 2021b;Fowler et al., 2020a) and flowing inland or, alternatively, discharging to the ocean.Such regional systems are also known as mountain block recharge but in this case are associated with hills rather than mountains.
Where such systems exist, it is likely that shallower and deeper groundwater flow paths co-exist together (e.g.Frisbee et al., 2012;Condon et al., 2020), associated with local and intermediate/regional groundwater systems, respectively.According to this HPE, regional drainage is hypothesised to occur continuously throughout the pre-drought, drought, and post-drought periods, with the drought bringing changes in the balance with other elements of the water balance in Fig. 4b.HPE17: streamflow was lower than expected because sustained drainage of groundwater across catchment boundaries contributed to declines in the water table and reduced GW-SW interaction.We note that, in the context of headwater catchments, the distinction between local and regional drainage is important because (i) regional discharge cannot support within-catchment wetting up, which would otherwise amplify event runoff, and (ii) at the point of discharge, groundwater discharge may be an important control on AET, so the distinction affects the appearance of a headwater catchment in remotely sensed data.Both (i) and (ii) affect how the catchment might be represented in simulation models.
A further distinction is that groundwater systems vary in their response times to disturbance.In southeastern Australia, groundwater systems which cover larger areas tend to have longer response times (Walker et al., 2003).Locations that are influenced by a combination of two or more systems may be subject to a mixture of dominant timescales -say, a 1-5-year timescale associated with a local system mixed with a 10-30-year timescale associated with a larger system.Likewise, drought recovery may be a question of the relative influence of shorter and longer timescale groundwater systems.

Gaining and losing streams
Even if rainfall-runoff relationships remained unchanged (i.e.no shift) within lower-order catchments, flow within higher-order stream reaches can be lost as it travels downstream, via seepage into the surrounding substrate.This is called a losing stream reach, in contrast to a gaining reach in which the stream receives water.The losing/gaining dynamic is driven by the stream level relative to the groundwater level in the surrounding substrate.This dynamic may be important in cases where the river flows over extensive alluvial deposits, since (i) these deposits can store significant amounts of water, and (ii) they commonly have relatively high hydraulic conductivity, which may assist in dispersing the water horizontally over the extent of the deposit, creating opportunities for transpiration or evaporation (e.g.Salama et al., 1993).Reaches can change from gaining to losing state, and vice versa (often associated with seasonal cycles), and they experience a net gain or loss while in a given state -a key point of difference with bank storage, which functions as a delaying process (instead of a source/sink term) as water flows into and then back out of stream banks as gradients switch back and forth episodically.Losing/gaining dynamics could have contributed to the observed shifts at stream gauges.The regional groundwater declines could have affected the spatiotemporal extent of losing reaches (i.e.greater total length losing; greater time spent losing rather than gaining), amplified water loss rates from streams, and increased the number of zero flow days.HPE18: streamflow was lower than expected because stream reaches lost more water (or gained less water) due to water table decline.Note that, while losses are sensitive to the water table, the reverse is also true, with losses from the stream causing the water table to decline less slowly than it would otherwise (subject to other fluxes such as evaporation and transpiration from the alluvium).The strength of this feedback/coupling increases for smaller alluvial stores.

Thickened vadose zone: effect on recharge dynamics and transpiration
Regardless of the cause of the water table decline, the result was a thicker vadose (unsaturated) zone as the groundwater subsided, leaving behind newly unsaturated layers.This HPE is concerned with the rate at which these layers drain from saturation to field capacity.Well-drained near-surface soils are generally considered to drain quickly, of the order of days (e.g.Cassel and Nielsen, 1986).However, drainage may be slower for deeper layers (particularly those composed primarily of bedrock), possibly associated with spatially discontinuous preferential flow mechanisms including funnelled flow (e.g.Nimmo, 2021) and unstable flow (e.g.Jury et al., 2003).These deeper unsaturated layers might temporarily absorb some recharge, delaying its passage downwards to saturated zones and spreading the recharge signal over time.Whereas a rapid recharge event might trigger rapid groundwater discharge to the stream, the dampening of the recharge signal might increase the probability of water being diverted to transpiration (particularly by vegetation close to water courses and drainage lines) that would have otherwise contributed to streamflow, thus increasing transpiration per unit precipitation.HPE19: streamflow was lower than expected because delaying of recharge by the enlarged unsaturated zone increased opportunities for transpiration.

Changes in soil properties and/or structure with time
Soils may change properties with time, and some droughtrelated changes may decrease streamflow.While many soil properties and temporal changes are driven by humans (discussed below in HPE21), there are other mechanisms that are not explicitly human-driven, although they may interact with land management.For example, soil cracking may lead to increased infiltration, more evaporation, and less surface runoff (e.g.Arnold et al., 2005), provided the infiltrated water remains in the soil rather than recharging groundwater.There is also evidence that soil porosity may increase, as the climate becomes drier, on relatively short timescales of years to decades (Hirmas et al., 2018;Caplan et al., 2019).HPE20: streamflow was lower than expected because drought-induced changes in soil properties such as higher porosity and/or cracking led to higher infiltration and evaporation.On the other hand, soil water repellency can build up during dry periods (e.g.Filipović, 2018), which would tend to increase streamflow, which is counter to observations; likewise, soil cracking may provide an avenue for preferential flow, thus increasing runoff (e.g.Beven and Germann, 1982).

Changes in agricultural practices
The last 50 years have seen changes in land management practices, and the hydrological impacts of these changes may be important.Soil loss around Australia in the 1960s and 1970s prompted a nationwide evaluation of land degradation and soil conservation strategies in the 1970s, in addition to a National Soil Conservation Program, established in 1983 (Hannam, 2003).As a result, soil conservation measures such as no-till farming (e.g.Cornish et al., 2020) were more prevalent by the 1990s than in previous decades, which may have caused hydrological differences (see e.g.Huang and Zhang, 2004) between the Millennium Drought and previous droughts.In general, poor management practices lead to degraded soils that may amplify event runoff, with partial soil recovery (towards, e.g., higher soil carbon) possibly causing trends towards higher infiltration.Techniques such as contour banks may also retain water in the landscape.Last, soil properties were impacted by trends in livestock stocking rates driven by both market forces and government policy.HPE21: streamflow was lower than expected because better management of agricultural land, including soil conservation techniques, caused differences in soil hydraulic properties (compared to previous droughts).

Land use change and plantation forestry
Land use change is commonly reported to alter hydrology (e.g.Zhang et al., 2001;Brown et al., 2005;Zhang and Schilling, 2006;Farley et al., 2005).In southeastern Australia, a key example is new plantation forestry, with the higher water demand of plantation trees impacting groundwater (Benyon and Doody, 2004;Lane et al., 2005;Webb and Kathuria, 2012;Brown et al., 2015;Dresel et al., 2018) and presumably streamflow.Plantation data (Legg et al., 2021) show a peak in nationwide plantations in the late 1990s, with approximately 100 000 ha yr −1 added for 1998-2000 and much of it within the area impacted by the Millennium Drought (Legg et al., 2021; Table 1).HPE22: streamflow was lower than expected because increased plantation forestry led to increased water use.The 1970s-1990s also saw planting for salinity mitigation (see HPE13).

Farm dams
Farm dams, also known as farm ponds, are small private dams located on hillsides, drainage lines, or small streams.Individual dams typically harvest only small volumes, but collectively, the impact can be large (Habets et al., 2018;Morden et al., 2021).Thus, the total water storage capacity of all dams in a watershed may be an important index, particularly when relativised by precipitation or streamflow to account for aridity; that is, a given storage size may have a higher relative impact in drier areas (e.g.Liu et al., 2014).
https://doi.org/10.5194/hess-26-6073-2022Hydrol.Earth Syst.Sci., 26, 6073-6120, 2022 Volumetrically, water harvesting by farm dams is greatest during moderate and high flows, but if expressed in relative terms, impacts are greatest during the dry season and during dry years (Morden et al., 2021).Flow impacts occur due to both extracted water (for stock, domestic, or irrigation use) and evaporated water being unavailable to flow downstream.Farm dam development has increased with time (with a higher rate of increase before 2000; Malerba et al., 2021), so there were more dams during the Millennium Drought than previous droughts.HPE23: streamflow was lower than expected because farm dam harvesting of water increased due to trends in farm dam development (i.e. more dams than prior droughts).

Extractions
Regionally, the bulk of agricultural water use occurs in broad, flat plains using water captured in large reservoirs.This activity occurs downstream of the headwater catchments in focus here.However, minor modes of extraction are relevant in the headwater catchments, including extractions from groundwater and diversions from streams by individual landholders.For the set of catchments used by Peterson et al. (2021) and shown in Fig. 2, individual diversions from streams are uncommon because they were considered to be a criterion in catchment selection, and where they do exist, they are strictly controlled.Extractions from groundwater can be relatively larger than those from streams within some of these catchments and are of interest, given the strong groundwater trends reported above.Most extraction bores are for stock and domestic use.However, licenses for irrigation, while rarer, typically correspond to larger volumes and thus may be significant.HPE24: streamflow was lower than expected because groundwater extraction caused declines in groundwater levels and reduced GW-SW interaction.

Interaction between process explanations
The set of HPEs are mostly complementary, meaning there is no inherent reason precluding co-occurrence of multiple HPEs simultaneously in the same catchment.Some HPEs exhibit dependence, such as HPE15 (gaining to losing), which assumes a regional groundwater decline a priori and thus is dependent on a separate explanation for the decline via one of the other HPEs.Likewise, HPE19 amplifies effects of other mechanisms.HPEs 10 and 11 (relating to tree regulation of water use) and 12 (relating to wildfire) are in a unique category; they posit mechanisms that may have masked underlying shifts, rather than mechanisms of the shifts themselves.Such HPEs may be important to explain spatial patterns in hydrological shifts.Overall, assuming that the evidence tests in the next section do not eliminate too many HPEs, the complementary nature of the HPEs seems to indicate that we should expect a combination of process explanations, rather than a single smoking gun.

Evidence-based critique of hypotheses
As per Fig. 1, the purpose of this section is to decide which hypothesised process explanations are plausible or more likely, based on consideration of diagnostic evidence.This section is subdivided as follows: Sect.4.1 explains the logic of evaluation, Sect.4.2 presents the diagnostic evidence, assesses the implications for HPEs, and compares findings with previous literature, Sect.4.3 attempts to harmonise the plausible HPEs to provide an integrated picture of catchment behaviour, and Sect.4.4 discusses the limitations of the deductive method used.

Logic of evaluation
There are four ways in which HPEs are assessed for consistency with the evidence, and these are denoted with symbols (∇; ; ; ), as indicated below.While in principle it would be preferable that any HPE declared plausible would be assessed in all of these ways, in practice it is often impossible to do so due to lack of data, particularly when going back in time.
1. Temporal consistency (∇).The focus is on comparing droughts across time, focussing on the Millennium Drought and prior droughts.Since prior droughts set our expectations for what flow to expect in a dry year, there must be something different about the Millennium Drought that caused the streamflow to be lower than expected (in those catchments that shifted).Those droughts that occurred before streamflow gauging commenced have not contributed to our expectations of what flow is normal in a drought year.Thus, the prior droughts under consideration are those that occurred after a subjectively selected date (1 January 1950) that nominally represents the start of streamflow gauging.The year 1950 is chosen because less than 20 % of streamflow gauges were operational at that time (Fig. 6); hence, results from studies such as Saft et al. (2015) and Peterson et al. (2021) were dominated by post-1950 data.
2. Spatial consistency ( ).The focus is on the spatial distribution of the hypothesised driving factor(s).To support the hypothesis, the spatial distribution of the driving factor should be consistent with the spatial distribution of shifted versus unshifted catchments.For example, the spatial distribution of radiation increases during the Millennium Drought (Fig. C1) across the southern state of Victoria broadly matches the spatial distribution of shifted catchments, lending credence to HPE07.
3. Water balance considerations ( ).The focus is on whether the hypothesised driving factor is sufficient to reduce flow volumes by the observed amount.This criterion is less critical because the observed flow reductions may have been the combined result of multiple HPEs (in which case no single HPE would cause sufficient reductions on its own).
4. Drought recovery ( ).The focus is on whether the hypothesis can explain the lack of post-drought recovery in most of the catchments that shifted (Fig. 2d).As per Peterson et al. (2021), Across all 161 watersheds, we found that 8 years into the drought, 51 % of watersheds switched into a low (or very low) runoff state.
When the drought ended in 2010, predominantly only the eastern watersheds shifted back to a normal-runoff state.Notably, 7 years after the drought, 37 % (n = 55) of watersheds remained within a low-runoff state.
Two different spatial scales are considered in the following section.When assessing changes in climatic variables, we consider the entire area affected by the Millennium Drought, consistent with earlier consideration of climatic variables (Fig. 3).However, for all other assessments, we concentrate on the southern state of Victoria (2.3 × 10 5 km 2 ; Fig. 2d), for the following three reasons: 1. Existing analyses of hydrological shifts.At the scale of Victoria, there are two recent studies (Peterson et al., 2021;Saft et al., 2019) that examine hydrological shifts using every suitable streamflow gauge and using data up to 2017.In contrast, at the wider scale, the only available study (Saft et al., 2015) examined a dataset with a lower density of spatial coverage and using data only up to 2008, which is sufficient to assess impacts during the drought but not afterwards.
2. Data availability.Certain datasets, for example, farm dams (Fig. 8d), are available in a homogenous dataset across Victoria, but such data were not available in New South Wales and Queensland at the time of analysis.
3. Spatial patterns of impact.As shown in Fig. 2, Victoria has the clearest spatial groupings of shifted versus nonshifted catchments, with the latter dominating in the east and the former dominating in the west.These groupings arise despite considerable similarities between east and west.For example, the streamflow regime is winter dominant in both cases.Starting the process of explaining the shift in Victoria, where we at least see some spatial organisation of shifted/non-shifted catchments, makes sense.Later work needs to extend this to the other states.

Evaluation of diagnostic evidence
The evaluation of individual HPEs is undertaken in Table 2, based on the variety of diagnostic evidence presented in Figs.5-8.Of the 24 HPEs, 3 are considered plausible, 10 are considered inconsistent with evidence, and 11 are in a category in-between, denoting HPEs that may be important, but their widespread plausibility or applicability is questioned by factors such as (i) limited spatial extent, (ii) limited water balance impact, and (iii) limited ability to explain post-drought non-recovery in rainfall-runoff relationship.
The 11 HPEs in the in-between category (amber in Table 2), along with the factors comprising their widespread plausibility or applicability, are listed below.

Harmonising plausible process explanations
This subsection seeks to combine the plausible process explanations into an integrated picture of landscape-scale hydrological response.The following description, depicted visually in Figs. 9 and 10, incorporates process explanations marked with green or yellow in Table 2. Given the focus of Sect.4.2 on evidence from the southern state of Victoria, the following applies to the same area, although many elements may apply to other affected areas.
Concerning climatic conditions, the following two things distinguish the Millennium Drought from prior droughts post-1950: 1. Longer drought.The drought lasted more than 10 years (Fig. 6; HPE05), allowing long-memory aspects of catchment behaviour to become important, as discussed below.
2. Evaporative demand increases.Over a significant area in central-west Victoria, the evaporative demand increased, particularly during spring (Fig. 5c; HPE04).This increase was higher than increases seen in prior Prior droughts refer to a set of years chosen separately for each catchment from the period 1950-1996.Years are selected if their rainfall was less than the average over the Millennium Drought (1997Drought ( -2009)).If this selects more than one-quarter of years in the period 1950-1996, then the selection is limited to the driest quartile (12 years).This selection process chooses a different number of years for each catchment, as shown in Fig. E1.Line α-α' is mentioned in Table 2.
droughts (post-1950) and was driven by higher radiation.The affected area matches very closely with regional patterns of shifting versus not shifting.Given that the area is majority pasture, the nature of the increase contributed to its hydrologic impact, since grasslands are relatively less sensitive to atmospheric VPD and relatively more sensitive to radiation (see HPE07).Last, the time of year (spring, when soils are wettest) likely maximised the impact on the water balance.
Other aspects of climatic conditions, such as autumn rainfall reductions, fewer large rainfall events, and longer dry spells (HPEs 1, 2, and 3, respectively), likely contributed to the lower streamflow but are considered less explanatory of the shifting behaviour.This is because these aspects were no worse, and in some cases considerably less severe, than other drought years post-1950.However, the effect of these aspects may have been amplified due to (i) the unusual duration of the drought and (ii) the fact they were happening in concert with (and contributing to) an interannual drop in water storage (negative water balance).Thus, while none of HPE1-3 are marked as explanatory in their own right, they may still be important pieces of the water balance puzzle.
In the combined perceptual model presented in Figs. 9 and 10, the lower water table is a key driver of lower streamflow.This is because groundwater is integral to flow generation in headwater catchments via numerous mechanisms (Fig. 10).On hillslopes, groundwater declines reduce episodic saturation of high conductivity near-surface layers and/or transient perched water tables.In broad colluvial and alluvial valley bottoms, water table declines reduce flow in springs https://doi.org/10.5194/hess-26-6073-2022 Hydrol.Earth Syst.Sci., 26, 6073-6120, 2022 and diffuse-area discharge.Shallow groundwater creates wet areas in the landscape primed for event runoff, but such areas contract under water table decline.Thus, it is hypothesised that the unexpectedly low streamflow was in large part due to the declining water table, which was itself caused by the following: 1. Recharge reductions.Recharge reduced partly due to rainfall declines and also due to increased evapotranspiration resulting from higher evaporative demand (HPE04) and possibly other changes associated with vegetation (HPEs 7, 13, and 15).
2. Sustained removal of groundwater via multiple fluxes.
The fluxes removing groundwater were less sensitive to the climatic conditions, and quite sustained (Fig. 4b).These fluxes included the following: a. Local (within-catchment) drainage (HPE16) from hillslopes to the base of slopes and valley bottoms  A1.
(Fig. 9b) via deep fractures in the bedrock.Having drained via the subsurface to colluvial or alluvial valley floors where the water table is shallower (Fig. 9b), this water can then be removed by transpiration or direct soil evaporation, which also helps to explain the increase in AET per unit of precipitation emphasised by Peterson et al. (2021).According to this HPE, most of the groundwater de-cline takes place in the hillslopes, which are capable of slow but significant drainage due to, for example, fractured rock aquifers, which are widespread in those parts of Victoria that contain shifted catchments.Likewise, colluvial and alluvial deposits (refer to HPE16 in   A1. ments tend to be erosional with v-shaped valleys (Gallant and Dowling, 2003;CSIRO, 2016).
b. Drainage into regional groundwater systems (HPE17; Fig. 9a; commonly known as mountain recharge) which may export the groundwater outside of the boundary of the surface water catchment.The spatial applicability of this export mechanism is lower because approximately one-third of shifted catchments are located away from regional systems (see HPE17 in Table 2 and Fig. 7d).
It is important to recognise that drainage in (a) and (b) above were not activated by the drought, but rather are hypothesised to be continuous, and the drought acted to change the balance that previously existed between these combined fluxes and recharge.
As groundwater drained, the thicker vadose zone potentially acted to delay recharge and spread it over time, in- creasing opportunities for transpiration, particularly near water courses and drainage lines (HPE16).
Furthermore, streamflow generated at the microcatchment scale was likely susceptible to losses within downstream reaches.Lowland reaches which previously seasonally gained groundwater may now lose groundwater year-round (HPE18), with the potential for subsequent removal from the alluvial store via AET.Water stored in river banks, which is gradually released on timescales up to a few years and thus capable of sustaining low flows over an isolated dry year, became more permanently depleted.
Last, anthropogenic factors also likely impacted the hydrological shifts.The increase in farm dam development with time meant that the Millennium Drought saw greater farm dam harvesting of water than previous droughts (HPE23); however, the magnitude of this impact is uncertain.Furthermore, the period prior to the drought saw improvements in land management that may have resulted in less degraded soils (e.g.greater soil carbon) that were better able to retain water, even with less rainfall partitioned towards event runoff or recharge.All of the above mechanisms, except those related to climate, display some degree of persistence (see items labelled with an asterisk in Fig. 10), and this may explain the persistent non-recovery that has been reported (Peterson et al., 2021) in some areas.In particular, groundwater-related mechanisms would persist until the recovery of the water table to pre-drought levels.This is likely to be spatially uneven, as different groundwater systems have different lag times and dynamics.Thus, a possible research avenue might be to compare perceived lag times (based on hydrogeological knowledge) with observed recovery of streamflow response.
Most mountainous study catchments in Victoria did not exhibit hydrologic shifts, and this may be explained by the absence of the above mechanisms in those catchments, as follows: -Such catchments tend to have erosional v-shaped valleys; flat valleys arising from colluvial or alluvial deposits are relatively rarer (CSIRO, 2016), as are aquifers large enough to receive significant seepage from a losing reach.
https://doi.org/10.5194/hess-26-6073-2022Hydrol.Earth Syst.Sci., 26, 6073-6120, 2022 Figure 10.Chain of causality from cause (top) to effect (bottom) for plausible hypothesised process explanations.Pure causes are at the top and pure effects are at the bottom, while those in-between are a mixture (i.e.effects of some things and causes of others).
-Regional groundwater flow systems are typically less common in such catchments (Fig. 7d).
-Farm dam development is very low (or absent) in such catchments (Fig. 8d).
-Spring PET increases neither affected such catchments nor the catchments further north in New South Wales and Queensland (Fig. 5d).
In addition, mountainous catchments are mostly forested (Fig. 7a and c), and some trees have mechanisms for reducing water use in times of water stress, such as temporary xylem cavitation and reducing leaf area (see Sect. 3.2.3).Tree water use is very sensitive to VPD and may steeply decline in times of high VPD.Such mechanisms, which alter partitioning within the water balance, may have contributed to the lack of shifting behaviour in forested catchments.Postdrought, uniformly high VPD across southeastern Australia (Fig. D7) may likewise be contributing to a lack of hydrological shifts in forested catchments, as trees regulate water use via stomatal control.Last, while some research has examined the impact of rising CO 2 on streamflow (Ukkola et al., 2016), this prior work focussed exclusively on forested catchments, whereas the hydrological shifts were associated with grasslands.It is possible that greening contributed to reduced streamflow in such catchments (HPE06), but historic greening effects in southeastern Australian grasslands are currently a knowledge gap.

Limitations of deductive procedure used
Despite the varied methods of assessment and wide variety of evidence considered, there are nonetheless clear limitations in the deductive procedures adopted here.First, it is possible that mechanisms for changes in the rainfall-runoff relationships have been missed by the workshop participants, or that we are not aware of the importance of certain factors or their interactions (Hunt and Welter, 2010).While this study focuses on identifying individual common or frequently applicable mechanisms, in reality, different causes can act together and in different proportions in different landscapes, possibly leading to unexpected outcomes or to the same mechanism/interaction producing different outcomes in different landscapes.In cases of two competing processes, merely conceptualising interactions (as here) is often not sufficient, but it is necessary to also quantify the interactions (e.g. via simulation) to be able to guess at what the outcome might be.Such quantification is a key recommendation as a next step to this work (see the recommendations in Sect.5).
The assessment methods are each rather blunt because they rely on generalisations and the assessment of largescale patterns in either space, time, or both.This contrasts with common field methods for deducing hydrological processes at-site, which may rely on techniques such as hydrogeochemistry, tracers, etc.Such techniques provide individualised insight for a given catchment, in line with the concept of uniqueness of place (Beven, 2000), i.e., that each catchment is subject to distinctive hydrological processes.Furthermore, the results of such individual studies may be more directly relatable to hydrological processes.Although these benefits are not present in the methods used here, the strength of this work is a large-scale assessment of hydrologic changes and potential drivers.The landscape-scale patterns of hydrological shifts (e.g.Fig. 2; Saft et al., 2015;Peterson et al., 2021) require corresponding landscape-scale methods of evaluation.Experimental studies are difficult to conduct over large areas and/or large samples of catchments, and it is difficult to generalise regional impacts from individual field studies.In any case, the Millennium Drought is now in the past, so it is not possible to conduct experimental studies to directly confirm or refute hydrological hypotheses related to this event.
Nonetheless, the current study may be used as a prelude to planning more extensive field studies (and/or modelling studies) to test predictions of hypotheses in the present time.It may be possible via new experimental studies to gain insights that are transferable backwards in time to past droughts, and this is discussed further in Sect. 5.In summary, although the methods used here are subject to clear limitations, they are arguably suited to the present context.

Future research and monitoring needs
As per Fig. 1, the key question of this section is what future research and/or monitoring is required to decide between and/or harmonise the plausible process explanations?Given the dominance of complementary HPEs (Sect.3.5), it seems unnecessary to decide between those remaining (since they can occur together), although it is still desirable to investigate and confirm their relevance.The set of plausible process explanations is dominated by subsurface processes, so an overarching theme is that we need to better understand the role of groundwater directly and indirectly in streamflow generation, including indirect interactions with plant water use and the effect of multi-year declines in groundwater level.It follows that the main monitoring need is for long-term (10+year) monitoring programmes of groundwater and the vadose zone.Given the potential for interaction among processes, it is important to have coordinated monitoring of groundwater, vadose zone, ET fluxes, streamflow, and vegetation conditions to accelerate a holistic understanding of process interactions.
Most bores in southeastern Australia have been targeted where groundwater is used, whereas this work affirms the need for investment in bores for hydrological understanding rather than impact assessment.By considering the range of hypotheses being tested, and the range of data sources currently available (including their spatial distributions and uncertainties), it should be possible to pinpoint locations where new bores would yield the greatest advances in understanding.In general, the largest spatial gaps in bore coverage are in upland areas, so this should be considered when planning new monitoring.
For specific HPEs, future research and/or monitoring needs to provide additional evidence and improve confidence include the following: -HPE18 (gaining to losing).Priority should be placed on retaining and commissioning bores adjacent to streams and rivers.There would likely be considerable value in reviving, even for a short time, decommissioned bores with historic pre-drought data.As with new bores, the focus for recommissioning should be on objectively identifying bores that might provide the most information, complementing existing data and considering uncertainties.
-HPE16 (sustained hillslope drainage).be possible to design experiments to detect this using remote sensing or high-resolution products.
-HPE17 (sustained regional drainage).Further research into the volumetric rates of flow through regional systems is required, for example through analysing water age (e.g.Markovich et al., 2019).Most current monitoring of regional systems is done via analysis of levels; quantification of flow rates is relatively rare and would provide valuable insights (even if estimating temporal changes in this rate proves infeasible).
-HPE19 (interactions between recharge and thickened vadose zone).Simulation of these interactions may be the most effective test to increase confidence in this hypothesis.If a temporally variable vadose zone thickness together with associated recharge attenuation and enhanced opportunities for transpiration were to improve the simulation of the Millennium Drought, then this could be interpreted as support for the HPE.Few widely used models currently support such dynamics.
Moving away from groundwater to consider other perceptual models, we require the following: -HPE23 (farm dams).Better integration of datasets and modelling is required.Estimates of farm dam growth have recently been produced (Malerba et al., 2021), and the next step is to use such estimates in modelling (e.g.Fowler et al., 2015;Morden et al., 2021) to quantify the additional impact during the Millennium Drought relative to earlier droughts.
-HPE13 (salinity recovery).Further confirmation may be difficult, since it is difficult to retrospectively identify areas formerly affected by salinity.The most promising avenue may be to collate and standardise existing information from the 1980s and 1990s (some of which may exist non-electronically).
-HPE04 (evaporative demand) and HPE07 (coupling and radiation/temperature).Simulation experiments may be used to estimate the water balance impacts of mechanisms.This would require a framework capable of representing the coupling effects (e.g.Clark et al., 2015).
-HPE21 (agricultural practices).Knowledge gaps exist regarding the prevalence of land management trends.These gaps can be filled via a mixture of a literature review (including grey literature) and holding workshops with land managers.
The Millennium Drought was longer than other droughts post-1950 (HPE05 -long drought), and this point underlies many findings in this paper.However, two pre-1950s droughts -the Federation and World War II droughts -are more comparable to the Millennium Drought than droughts in the 1950-1990 period (see Verdon-Kidd and Kiem, 2009, for more information on these droughts).Thus, a potential research direction might be to collate historical information, both qualitative and quantitative, about the hydrological impacts of these droughts, to provide new perspectives on whether the Millennium Drought was unprecedented from a hydrological perspective.Last, efforts to gather, homogenise, and publicly release historical experimental data relevant to the Millennium Drought may also be of considerable value, and we feel this should be considered a priority for Australian hydrologists.This could include identifying previously unpublished data held by individual academics or research institutions and/or making connections between previously disparate datasets.

Conclusion
The Millennium Drought is an important historic case study of a multi-annual event resulting in persistent shifts in the relationship between rainfall and runoff.This paper, the culmination of a multi-disciplinary eWorkshop, has provided new perspectives on the processes responsible for these hydrological changes, while at the same time demonstrating the value of using shared perceptual models and evaluation as a strategy for community research.
Through community collaboration, we have presented a broad range of hypotheses across categories including climatic forcing, vegetation, soil moisture dynamics, groundwater, and anthropogenic influence.Assessment of these hypotheses against a large body of evidence reveals that many of the hypotheses are not supported by available data.Most remaining hypotheses point to the central role of subsurface processes either directly causing the flow reductions or mediating them via long-memory processes (e.g.multi-annual groundwater storage trends).Recommendations for future research focus on developing monitoring and modelling to further confirm these hypotheses and assess their interactions.
The resulting contribution is 3-fold.
1. We have a systematic catalogue of possible mechanisms explaining flow reductions which may be relevant to droughts in other times or places and/or for future changing climate.This is the case even for process explanations subsequently determined to be inconsistent with evidence for this case study.
with the means to anticipate future water crises globally (Fowler et al., 2022).

Figure 1 .
Figure 1.Key questions for each section in this article.

Figure 2 .
Figure 2. Hydrological behaviour prior to and during the Millennium Drought (a-c) and post-drought recovery (d), based on prior studies.(a) annual time series of precipitation (P ; bars) and streamflow (Q; lines and crosses) for four selected catchments which exhibited hydrological shifts.(b) Scatterplots corresponding to panel (a).(c) Location map of shifted catchments from Saft et al. (2015).(d) Drought recovery with time and mapped across the state of Victoria for selected time slices from Peterson et al. (2021).Elapsed time is measured from the end of the meteorological drought.

Figure 3 .
Figure 3. Change in selected climatic statistics during the Millennium Drought (taken as April 1997 to August 2010, inclusive) compared to the long-term average (taken as calendar years 1950 through 2017, inclusive) for each season.Each dot corresponds to a single catchment; values plotted are areal averages sourced from Fowler et al. (2021).

-
HPE04 -evaporative demand (factors are limited spatial extent and lack of continuity post-drought) -HPE06 -greening (factors are uncertainty regarding direction/magnitude of greening impact on transpiration) -HPE07 -coupling and radiation/temperature (factors are limited spatial extent and lack of continuity postdrought) -HPE10 -tree regulation of water use (stomatal control; factors are drought VPD anomalies that are not exceptional) -HPE11 -tree regulation of water use (other; factors are uncertainty regarding persistence of effects postdrought) -HPE13 -salinity recovery (factors are spatial extent not sufficient within study catchments) -HPE15 -biophysical adaptation (factors are only evidence that is hydrological; ecophysiological evidence is lacking) -HPE17 -sustained regional drainage (factors are spatial mismatch between groundwater systems and shifted catchments) -HPE18 -gaining to losing (factors are spatial mismatch between groundwater systems and shifted catchments and uncertainty regarding persistence of effects postdrought) -HPE20 -soil property changes (factors are lack of local evidence) -HPE21 -agricultural practices (factors are little or no data available regarding hydrological impacts) -HPE23 -farm dams (factors are dams are not sufficiently numerous or voluminous to cause observed hydrological impacts).https://doi.org/10.5194/hess-26-6073-2022Hydrol.Earth Syst.Sci., 26, 6073-6120, 2022

Figure 5 .
Figure 5. Changes compared to the long-term average for (i) the Millennium Drought and (ii) prior droughts (selected from the period 1950-1996, inclusive), allowing HPEs to be assessed by temporal consistency.Variables and definitions are the same as in Fig.3.Prior droughts refer to a set of years chosen separately for each catchment from the period 1950-1996.Years are selected if their rainfall was less than the average over the MillenniumDrought (1997Drought ( -2009)).If this selects more than one-quarter of years in the period 1950-1996, then the selection is limited to the driest quartile (12 years).This selection process chooses a different number of years for each catchment, as shown in Fig.E1.Line α-α' is mentioned in Table2.

Figure 6 .
Figure 6.(a, c) Visualisation of the longer duration of the Millennium Drought and World War II Drought, compared to other droughts in the 1950s-1980s, by showing anomalies in annual precipitation.(b) Flow gauging coverage through time.Precipitation and flow data are sourced from Fowler et al. (2020a).

Figure 7 .
Figure 7. Maps of the state of Victoria, showing various attributes used to assess HPEs by their spatial consistency.Each map is shown with 156 catchments categorised by Saft et al. (2019).Note that a given location in space can be marked with more than one Saft et al. (2019) category due to nestedness (i.e. if it is upstream of two gauges in different categories).Data sources are listed in TableA1.

Figure 8 .
Figure 8. Similar to Fig. 7 but mapping different variables.Relativisation by annual flow for panels (c) and (d) uses areal flow estimates from Weligamage et al. (2021).Other data sources are listed in TableA1.

Figure 9 .
Figure9.Visual synthesis of selected process explanations explaining why the Millennium Drought had less streamflow than expected, relative to prior droughts (post-1950; these droughts were too short for significant groundwater decline).The situation depicted describes a typical catchment in central or western Victoria.

Figure B3 .
Figure B3.Same as Fig. B2 but for autumn (March, April, and May).

Figure C2 .
Figure C2.Repeat of Fig. C1 but showing changes in percentage terms rather than actual values.

Figure D4 .
Figure D4.Same as Fig. D1 except for PET (see Fig. 3 for PET definition).Based on data from Fowler et al. (2021).

Figure D5 .
Figure D5.Same as Fig. D1 except for radiation.Based on data from Fowler et al. (2021).

Figure D6 .
Figure D6.Same as Fig. D1 except for t min (minimum daily temperature).Based on data from Fowler et al. (2021).

Figure E1 .
Figure E1.Number of years that met the definition of prior droughts used to create Fig. 5, for each catchment.Based on data from Fowler et al. (2021).

Figure E2 .
Figure E2.Fire history (most recent fire) over eastern Victoria, mapped with study catchments from Peterson et al. (2021).

Figure E3 .
Figure E3.Spatial distribution of plantation forestry in Victoria in 2016.Processed from raw data from ABARES (2016).

Table 1 .
Continued.HPE Short nameStreamflow was lower than expected because . ..Water slowly drained from hillslopes (contributing to water table decline and reduced GW-SW interaction) to nearby alluvial areas and was subsequently lost via evapotranspiration.
23Farm dams Farm dam water harvesting increased due to trends in dam development (i.e. more dams than prior droughts).24Groundwaterextraction Groundwater extraction caused declines in groundwater levels and reduced GW-SW interaction.
Peterson et al. (2021) 2004), with intermediate conditions having low probability of occurrence.The Millennium Drought was an extreme disturbance that might have pushed several catchments from a wetter to a drier state, with the drier state otherwise seldom being apparent in a catchment's behaviour.Regarding vegetation,Peterson et al. (2021)say that "Evidence suggests that the vegetation responded to the drought by increasing the fraction of precipitation going to transpiration. . . .Like other natural systems with multiple stable states, this persistence may be caused by a biophysical adaptation to disturbances . . .that results in a positive feedback" g. the wet and dry states of https://doi.org/10.5194/hess-26-6073-2022Hydrol.Earth Syst.Sci., 26, 6073-6120, 2022 D'