the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Nonlinear hydro-climatic controls on an arid-region lake: evidence from 40 years of remote sensing
Rui Zou
Xiaojun Wang
Jianyun Zhang
Wentai Pang
Jianfeng Liu
Accurate delineation of lake surface area is fundamental for understanding eco-hydrological processes in arid regions, yet long-term lake records are often constrained by cloud contamination, seasonal ice cover, and data gaps. In this study, we develop an optimized lake-area extraction framework that integrates seasonal water-index selection, adaptive threshold segmentation, maximum connected-component analysis, and mutual-information-based image gap filling to construct a continuous monthly lake-area time series for Bahannao Lake from 1984 to 2024. This framework substantially improves the temporal continuity and robustness of long-term monitoring for small lakes in arid environments, and its regional applicability is further validated through comparative analyses with Hongjiannao Lake and Wuliangsuhai Lake. Based on the reconstructed time series, we quantitatively assess the multi-climatic controls on lake-area variability by combining correlation analysis with an XGBoost model. The results reveal pronounced seasonal differences and distinct stage-dependent evolution in lake dynamics, with the dominance alternating between precipitation-driven water input and evaporative demand across different temporal scales. Our findings highlight the nonlinear hydro-climatic responses of arid-region lakes to climate variability and provide both technical support and scientific insight for long-term lake monitoring and water-resource management in dryland regions.
- Article
(12668 KB) - Full-text XML
- BibTeX
- EndNote
Over the past century, with the intensification of global climate change and the increasing human ability to modify nature, the impact of climate change on lake systems and the surrounding water environment has become more pronounced. The formation and disappearance, expansion and contraction of lakes, as well as changes in water and ecological environments, are the result of interactions among global, regional, and local tectonic activities, climate events, and human activities. Within these systems, a series of complex interactions drive the evolution of lake systems (Ma et al., 2020).
Lakes are vital natural resources that are highly sensitive to climate change (Adrian et al., 2009; Schmid et al., 2014). Globally, there are over 100 million lakes, which store 87 % of the Earth's liquid surface freshwater. Climate change is one of the most severe threats to global lake ecosystems. As observed in recent decades, lake surface conditions – such as ice cover, surface temperature, evaporation, and water levels – have responded significantly to this threat (Woolway et al., 2020; Tong et al., 2023). Approximately 53 % of the world's lakes have experienced a decline in water storage, with a reduction of about 22 billion tons per year. Climate change and human water use have primarily driven the net decrease in water volume in approximately 100 large natural lakes worldwide. Lakes in both arid and humid regions are experiencing water loss, with drying trends being more widespread than previously understood. Despite the shrinking of most lakes globally, 24 % of lakes and reservoirs have shown a significant increase in water storage. These lakes and reservoirs are mostly located in sparsely populated regions, such as the Tibetan Plateau and the northern Great Plains of North America, as well as areas with newly constructed reservoirs, including the Yangtze River, Mekong River, and Nile River basins (Pickens et al., 2020).
China has a vast territory with an extensive network of rivers and lakes. There are 2693 lakes with an area greater than 1 km2, among which 2557 lakes (95 % of the total) have an area between 1 and 100 km2. Additionally, there are 10 exceptionally large lakes with an area exceeding 1000 km2. The total lake area in China has shown a significant increasing trend, expanding by approximately 7858.53 km2 (11.41 %) over the past 30 years (Ma et al., 2010, 2011). However, the spatial and temporal imbalance of water resources has intensified, with notable differences in trends across various lake regions. The lake areas in the Tibetan Plateau and Xinjiang regions have increased significantly, contributing 111.55 % and 28.41 % of the national lake area growth, respectively. In contrast, the lake areas in the Eastern Plain, Inner Mongolia Plateau, Northeast Plain and Mountainous Region, and Yunnan-Guizhou Plateau have declined significantly, with reductions of 24.53 %, 9.30 %, 6.06 %, and 0.54 %, respectively. Among these, the Mongolian-Xinjiang Plateau experienced the largest decline in lake numbers, with a loss of 111 lakes. Some lakes in this region have shown signs of shrinkage and salinization (Yang et al., 2010). However, despite increasing attention to global lake changes, small and medium-sized closed-basin lakes in arid and semi-arid regions remain poorly characterized in long-term observations. These lakes are highly sensitive to climate variability but are often underrepresented in existing global or regional datasets, highlighting an urgent need for improved long-term monitoring.
Scientists have discovered that the abrupt change timing of river and lake systems varies significantly across different latitudes and altitudes (Råman Vinnå et al., 2021; Zhou et al., 2021). Mountain and polar lakes tend to experience abrupt changes earlier than temperate and tropical river-lake systems (Jeppesen et al., 2014). Additionally, under varying levels of human impact, the timing of abrupt changes in lakes also differs. Lakes in regions with low human impact generally experience abrupt changes earlier than those in areas with strong human influence (Preston et al., 2016). Analysis of the driving factors of lake abrupt changes indicates that the causes vary. Before the 1950s, climate change was the primary factor controlling abrupt changes in lake ecosystems. However, after the 1950s, both climate change and human disturbances became dominant factors. In temperate and tropical regions with strong human influence, lake changes are mainly driven by nutrient enrichment and pollution. In contrast, lakes located in high-altitude and high-latitude regions, which are less affected by human activities, are more vulnerable to climate change. Furthermore, the interaction of multiple drivers increases the likelihood of abrupt changes in lakes, with climate change being the most frequently interacting factor leading to transformations in river-lake ecosystems (Vincent, 2009). Li et al. (2025) pointed out that seasonality is the dominant driver of lake-surface-extent variations globally.
For example, Plug et al. (2008) investigated lake area changes in the Tuktoyaktuk Peninsula in northwest Canada. They found that from 1978 to 1992, the total lake area increased, while from 1992 to 2001, the total lake area decreased. Their study identified precipitation as the main factor driving these changes. Similarly, Carroll et al. (2011) studied the lake area changes in high-latitude northern Canada and discovered that lake areas showed a significant decline, exhibiting regional clustering characteristics, with climate factors driving these changes. Laba et al. (2017) explored the expansion of Tangra Yum Co from 1977 to 2014. Their results indicated that, under the background of climate warming, the combined effects of glacier melt, precipitation increase, and evaporation changes contributed to the lake's expansion. Likewise, Li et al. (2017) examined the changes in the water surface area and water storage of Nam Co from 1976 to 2015. Their findings showed that the water surface area and water storage of Nam Co continued to increase, with the fastest growth in water storage occurring between 1997 and 2009. The study concluded that the primary factor driving the increase in Nam Co's water volume was glacier melt, followed by increased precipitation and reduced evaporation.
However, the precise measurement of lake area remains a major constraint for analyzing lake changes. With advancements in science and technology, remote sensing has provided a unique and effective method for monitoring the spatiotemporal variations in surface water areas on broad geographic scales (Liu et al., 2020).
Currently, water extraction methods using optical sensors have been widely applied (McFeeters 1996; Yao et al., 2015; Donchyts et al., 2016). However, existing water body area products often fail to meet ideal spatial or temporal resolution requirements (Cooley et al., 2017; Huang et al., 2018). For example, the 2016 Global Climate Observing System (GCOS) Implementation Plan recommended a resolution of 20 m and a daily monitoring frequency (Secretariat, 2009). High-temporal-resolution sensors, such as the Moderate Resolution Imaging Spectroradiometer (MODIS) onboard Terra and Aqua satellites, have been used to assess water body areas at time scales ranging from daily to 16 d intervals (Bergé-Nguyen and Crétaux, 2015; Wang et al., 2018a). However, many small water bodies (e.g., 10–50 km2 or smaller) and irregularly shaped larger water bodies may not be accurately distinguished using coarse-resolution MODIS images (250–500 m in the visible and near-infrared bands) (Tao et al., 2015). Compared with MODIS, Landsat images (e.g., Landsat 5 Thematic Mapper (TM), Landsat 7 Enhanced Thematic Mapper Plus (ETM+), and Landsat 8 Operational Land Imager (OLI)) offer higher spatial resolution (30 m) and a temporal resolution of 16 d (or better when combining multiple Landsat sensors). However, due to cloud contamination (Rossow and Schiffer, 1999), the actual temporal frequency of water body mapping based on Landsat is often much lower than the nominal resolution and may extend to a year for lakes with persistent ice cover (Yao et al., 2018). The recently launched Sentinel-2A and 2B satellites, equipped with Multispectral Instruments (MSI), provide a resolution of 10 m in the visible and near-infrared bands, with a revisit period of 5–10 d. However, their observations currently cover only the past few years (since 2015) and are not yet suitable for long-term decadal monitoring.
Beyond the trade-offs between spatial and temporal resolution, several other factors challenge high-resolution monitoring of long-term global surface water area changes (Klein et al., 2017). These include the inherent spectral heterogeneity of water, atmospheric influences (clouds and aerosols), topographic shadows, aquatic vegetation, and spectral contamination from ice/snow cover. In such complex conditions, integrating multiple techniques is often necessary to achieve robust water body extraction.
Recently, Pekel et al. (2014) utilized a large training dataset, combined with expert systems and visual analysis, to identify the presence or absence of water on a monthly basis for each pixel in archival Landsat images from 1984 to 2015. This product was named the Joint Research Centre (JRC) Global Surface Water dataset (hereinafter referred to as GSW). Despite its significant achievements, GSW is based on cloud-free pixels, meaning that the mapped extent of specific water bodies is only complete when monthly composite images have minimal cloud cover. A follow-up study by Busker et al. (2019) used a subset of the GSW dataset, selecting images with cloud cover below 5 %, to extract the monthly area of 137 lakes/reservoirs. For nearly half of these lakes/reservoirs, the correlation between area and radar altimetry-measured water levels exceeded 0.8. However, the temporal frequency of the resulting area time series was still constrained by the availability of cloud-free images, and due to the current availability of GSW, the time series was interrupted after October 2015.One potential method to increase the temporal frequency of lake mapping based on Landsat data is to estimate water surface area from contaminated images (e.g., those affected by clouds or observation gaps). Although these images are of relatively lower quality, the exposed portions of lakes within them may provide useful information for inferring the complete extent. For instance, Zhao and Gao (2018) 41 applied the monthly water mapping data from the GSW dataset to generate area time series for 6,817 reservoirs worldwide from 1984 to 2015. Their method involved recovering complete reservoir extents from cloud-contaminated images by segmenting pixels based on the water occurrence probability provided in the GSW dataset. Compared to the results of Busker et al. (2019), their generated area time series increased the number of observations by approximately 80 %. However, the reliance on the existing GSW dataset restricted their reservoir area records to the 1984–2015 period, and the validation of their recovery method was limited to only nine reservoirs with significant water level variations. These studies demonstrate the feasibility of large-scale lake monitoring, but also highlight persistent limitations related to temporal continuity, cloud dependence, and the applicability of existing products to small lakes. As a result, many existing lake-area studies rely on annual or seasonal snapshots derived from a limited number of cloud-free images, which may obscure important intra-annual variability, abrupt changes, and short-term climate responses, particularly for small lakes with strong seasonal dynamics.
To address this limitation, we construct a continuous 40-year monthly lake-area time series for Bahannao Lake by integrating multi-source Landsat imagery and applying a tailored image-processing workflow. This higher-temporal-resolution dataset enables a more detailed assessment of seasonal and interannual lake dynamics.
Bahannao Lake is a small closed-basin lake located in a semi-arid desert region of northern China. Owing to its remote location and the long-term absence of systematic in situ observations, continuous records of lake area are lacking. Nevertheless, as a water body embedded in a fragile desert ecosystem, variations in lake area are highly sensitive to hydro-climatic changes and play an important role in regional eco-hydrological stability.
In recent decades, intensified warming and drying have caused pronounced lake shrinkage, characterized by strong interannual variability and multiple abrupt changes. However, compared with larger or well-monitored lakes, the dynamic behavior and driving mechanisms of Bahannao Lake remain poorly understood due to the lack of long-term, high-temporal-resolution observations. As a typical but underrepresented small lake in arid regions, Bahannao Lake provides an ideal case for testing robust remote-sensing monitoring methods and investigating hydro-climatic controls on dryland lake dynamics.
Despite substantial progress in global lake monitoring, significant gaps remain for lakes in arid and semi-arid regions. Long-term and continuous lake area records are often interrupted by cloud contamination, seasonal ice cover, and striping artifacts, while the role of hydro-climatic drivers – particularly their nonlinear interactions – remains insufficiently understood.
To address these challenges, this study develops an optimized lake area extraction framework that integrates seasonal index selection, adaptive thresholding, connectivity analysis, and mutual information–based gap filling to construct a continuous monthly lake-area record for Bahannao Lake from 1984 to 2024. By coupling this reconstructed time series with multi-factor analysis using the XGBoost model, we quantify the relative importance and nonlinear effects of key hydro-climatic drivers on lake dynamics. This framework not only improves the reliability of long-term lake monitoring under complex conditions, but also provides new insights into seasonal and interannual climate controls on small lakes in arid and semi-arid regions.
2.1 Study area and data
Closed-basin lakes of various sizes are widely distributed across the Ordos Plateau, formed since the late Quaternary through combined aeolian and fluvial erosion processes. Bahannao Lake is the terminal basin of a chain of seven bead-like erosional lake depressions that developed along an ancient river valley. Bahannao Lake (109°16′ E, 39°19′ N) is located in the central Ordos Plateau at an elevation of 1278 m, with a lake-basin area of 26.50 km2. The basin is underlain by a continuous and intact Lower Cretaceous sandstone formation, which provides a closed geomorphic setting primarily recharged by atmospheric precipitation. The sandstone contains abundant sodium- and calcium-rich carbonates, serving as the major source of dissolved salts in Bahannao Lake. Administratively, the study area belongs to Wushen Banner of the Ordos region in Inner Mongolia (Fig. 1).
The zonal vegetation is dominated by arid to semi-arid desert steppe. The region is controlled for most of the year by the northwesterly monsoon, resulting in a cold and dry climate, while the southeasterly monsoon occasionally influences the area and plays a decisive role in seasonal precipitation. The mean annual temperature ranges from 6 to 9 °C, and the mean annual precipitation is only 200–300 mm, concentrated mainly from June to September with short-duration high-intensity rainfall events. In contrast, the annual potential evaporation reaches 2500–3000 mm, approximately ten times the precipitation amount, and the regional aridity index ranges from 3.5 to 4.0.
Because of the extremely fragile water balance and rapid hydrological response to climatic anomalies, Bahannao Lake and other nearby lakes are widely recognized as important natural indicators of climate variability, drought intensification, and land–atmosphere interactions in the arid and semi-arid regions of northern China.
This study utilizes remote sensing imagery from the Landsat 5 TM, Landsat 7 TM, and Landsat 8 OLI sensors, specifically using atmospherically corrected reflectance data (Tier 1 TOA Reflectance). Tier 1 data is selected due to its highest quality, making it suitable for time-series analysis and studies on global surface water extent and dynamics. The Landsat 5 TM imagery covers the period from 1984 to 2011, while Landsat 8 imagery spans from 2013 to 2023. Since imagery for 2012 is missing in both datasets, Landsat 7 TM is used as a supplement. However, Landsat 7 TM imagery exhibits significant striping artifacts, which were avoided as much as possible during data selection.
For hydro-climatic elements, this study employs the fifth-generation atmospheric reanalysis dataset from ECMWF (European Centre for Medium-Range Weather Forecasts), covering global climate data from January 1950 to the present. The dataset has a temporal resolution of daily and a spatial resolution of 0.1° × 0.1°. The hydro-climatic variables used in this study include precipitation (P, mm), air temperature at 2 m (T, °C), 2 m dew point temperature (Td, °C), relative humidity (RH, %), potential evapotranspiration (PET, mm), net shortwave radiation at the surface (msnswrf, W m−2), net longwave radiation at the surface (msnlwrf, W m−2), surface latent heat flux (mslhf, W m−2), and surface sensible heat flux (msshf, W m−2). These variables jointly characterize atmospheric moisture conditions, energy balance, and evaporative demand in the study region.
2.2 Methods
2.2.1 Optimized lake area extraction method
Although water-index-based lake extraction from Landsat imagery is well established, long-term monthly monitoring of small lakes in arid regions poses specific challenges, including frequent cloud contamination, striping artifacts in ETM+ data, and strong intra-annual variability. To address these issues, we developed an optimized processing workflow tailored to long-term monthly lake monitoring.
This study employs 30 m full-atmosphere imagery from the Landsat 5 Thematic Mapper (TM), Landsat 7 Enhanced Thematic Mapper Plus (ETM+), and Landsat 8 Operational Land Imager (OLI) satellites to derive monthly lake area estimates for the study region from January 1984 to December 2024.
Different lake remote sensing indices were selected for non-freezing and freezing periods, respectively. For non-freezing periods, remote sensing indices were processed to remove cloud and snow interference. Images were filtered based on cloud cover percentage (C), and monthly composite images were generated. The Otsu thresholding method was then applied to automatically determine segmentation thresholds. To distinguish between lakes and mountainous areas, a digital elevation model (DEM) was used, setting the slope (θ) and aspect (ϕ) thresholds to 0.
Considering that most lakes exhibit connectivity, this study adopts the maximum connected component analysis algorithm from the OpenCV computer vision library to delineate lake boundaries. Images were categorized based on cloud cover information (“CLOUD_COVER”): those with cloud cover ≤30 % were classified as cloud-free images, while the remaining images were considered cloudy. For cloudy images, the MI (Mutual Information) algorithm was used to match them with the most similar cloud-free images. The most similar image was then merged with the original cloudy image to generate a filled version.
For images with striping artifacts, the same filling method was applied as for cloudy images. Clear lake boundaries from historical cloud-free images were used, and the MI algorithm was employed to find the most similar historical cloud-free images for filling missing water pixels in striped areas, ultimately obtaining the final lake water extent. The specific process is shown in Fig. 2.
2.2.2 Aridity index (AI)
The aridity index (AI) was used to quantify regional drought conditions. AI is defined as the ratio of precipitation (P) to potential evapotranspiration (PET), expressed as:
where P represents precipitation and PET denotes potential evapotranspiration. AI reflects the balance between atmospheric water supply and evaporative demand.
In this study, PET was calculated using the FAO Penman–Monteith method, which is widely recognized as a physically based and robust approach for estimating atmospheric evaporative demand. PET was computed as:
Where Rn is the net radiation at the surface (MJ m−2 d−1), G is the soil heat flux (MJ m−2 d−1), T is the mean air temperature at 2 m height (°C), u2 is the wind speed at 2 m height (m s−1), es is the saturation vapor pressure (kPa), ea is the actual vapor pressure (kPa), Δ is the slope of the saturation vapor pressure–temperature curve (kPa °C−1), and γ is the psychrometric constant (kPa °C−1).
All meteorological variables required for PET estimation were obtained from the ERA5 reanalysis dataset and spatially averaged over the study area to ensure consistency with basin-scale analysis.
Based on AI values, climatic conditions were classified following the United Nations Environment Programme (UNEP) scheme: AI <0.05 indicates hyper-arid conditions, represents arid conditions, corresponds to semi-arid conditions, indicates dry sub-humid conditions, and AI ≥0.65 represents humid conditions. This classification allows a quantitative interpretation of regional aridity and facilitates comparison with previous studies in arid and semi-arid regions.
2.2.3 XGBoost Model
In this study, the XGBoost model is employed primarily as an interpretative tool. The objective is to quantify the relative importance of different hydro-climatic factors and to explore potential nonlinear relationships between lake-area variability and climatic drivers. Given the limited sample size, strong interannual variability, and high nonlinearity characteristic of arid-region lake systems, model performance metrics (e.g., R2) are used as auxiliary indicators, while greater emphasis is placed on feature-importance rankings for mechanism interpretation.
Compared with linear correlation analysis, the XGBoost results highlight the importance of nonlinear and season-dependent controls, particularly during transitional seasons when linear correlations are weak. This demonstrates the added value of XGBoost in revealing climatic influences that cannot be fully captured by linear statistical methods alone.
The objective function of the XGBoost model is:
Where L(θ) represents the objective function, which measures the model's performance in prediction and consists of two parts: l(yi,f(xi)) is the loss function, indicating the difference between the true value yi and the predicted value f(xi), while Ω(fk) is the regularization term used to control the model complexity.
The input factors include precipitation (P), air temperature (T), relative humidity (RH), and potential evapotranspiration (PET), which represent the primary components of the lake water balance in arid and semi-arid regions. These variables directly or indirectly regulate lake-area changes through their influence on water input and evaporative loss. Energy-related variables (e.g., radiation and heat fluxes) are included as background indicators of atmospheric conditions and are not interpreted as direct driving forces of lake-area change.
Here, FI(xj) represents the feature importance of factor xj, while I(t,xj) denotes the contribution of factor xj when used as a splitting point in tree t, with T being the total number of trees. The generated feature importance ranking chart illustrates the contribution of various input factors (such as temperature, precipitation, and humidity) to lake area changes. This ranking chart provides an intuitive way to identify the most influential factors.
To improve model performance, hyperparameters can be optimized using Grid Search or Random Search. Common hyperparameters include Learning rate, Max depth of trees and Number of trees. Adjusting these parameters affects the model's fitting ability and generalization performance.
Data Splitting. Divide the dataset into a training set and a test set (e.g., 80 % for training, 20 % for testing).
Train the XGBoost model on the training set. XGBoost uses the Gradient Boosting Algorithm, which iteratively improves the model by building multiple weak learners to reduce prediction errors. Each iteration refines the model by fitting the residuals (i.e., prediction errors).
Model Validation. Evaluate model performance using metrics such as Mean Squared Error (MSE) and Coefficient of Determination (R2) to assess accuracy and stability.
The formula for Mean Squared Error (MSE) is:
The formula for the coefficient of determination R2 is:
Where represents the mean of the samples.
The lake area model is based on model training, the predicted lake area can be expressed as a nonlinear combination of input factors xi:
Where: ωk is the weight of the k tree, and hk(xi) is the prediction function of the tree, represented as a set of decision rules.
The feature importance derived from the XGBoost model reflects the relative contribution of each climatic variable in reducing prediction error across all decision trees. It should be noted that this importance ranking does not imply direct causality, but rather indicates the sensitivity of lake-area variability to different climatic factors under nonlinear interactions. Therefore, feature importance is interpreted in conjunction with linear correlation analysis to provide a more robust understanding of hydro-climatic controls.
3.1 Remote sensing interpretation and monthly lake image synthesis
3.1.1 Selection of water indices and image preprocessing
The study area is located in a high-altitude region, where lake surfaces freeze between November and March. Since the NDWI index is less effective for frozen lakes, different indices are used for different seasons. During the non-freezing period (May–November), the NDWI index is applied for conventional water body extraction. During the freezing period (December–April), the Modified Normalized Difference Snow Index (MNDSI) is used to evaluate water surface area.
The NDWI index utilizes the strong absorption of water bodies in the near-infrared band and their high reflectance in the green band to enhance the distinction between water and other land cover types. However, this index may misidentify bright white buildings, clouds, snow, and mountain shadows as water bodies. Therefore, additional data quality bands and methods are integrated to remove these interferences and improve the accuracy of water body extraction.
Where: Green band typically refers to the green portion of the visible spectrum, generally ranging from 500–570 nm. NIR band refers to the near-infrared spectrum, generally ranging from 800–900 nm.
The Modified Normalized Difference Snow Index (MNDSI) is an index calculated using the reflectance of the near-infrared (NIR) and short-wave infrared (SWIR) bands. It is an effective method for distinguishing ice surfaces from water bodies. This index is particularly suitable for regions with frozen water surfaces, such as lakes and rivers, where seasonal changes are significant. Ice surfaces and water bodies have different reflectance characteristics in various bands. Ice has higher reflectance in the SWIR band, while water has lower reflectance. By calculating the difference between the NIR and SWIR bands, MNDSI can effectively distinguish between ice surfaces and water bodies, thus improving the accuracy of ice extraction. By combining these two bands, MNDSI highlights the differences between water bodies and ice surfaces, making it easier to differentiate between them. Similar to NDWI, MNDSI enhances the contrast between ice and water by utilizing reflectance values from different bands.
MNDSI (Modified Normalized Difference Snow Index) is calculated by combining the reflectance of the near-infrared (NIR) and short-wave infrared (SWIR) bands. The typical formula for MNDSI is as follows:
Where NIR is the reflectance in the near-infrared band (typically 800–900 nm), SWIR is the reflectance in the short-wave infrared band (typically 1500–1700 nm).
Figure 3Lake extraction from Landsat imagery during non-freezing and freezing periods. (a) Original Landsat image during the non-freezing period; (b) Lake area identified using NDWI; (c) Original Landsat image during the freezing period; (d) Lake area identified using MNDSI. Source: Landsat imagery courtesy of the U.S. Geological Survey (USGS), processed and interpreted by the authors.
The cloud and snow interference removal is only applied to the NDWI of the non-freezing period from May to November. The Landsat series satellites provide their own pixel-scale data quality band (QA_PIXEL), which can be used to eliminate noise pixels in the image.
The QA_PIXEL band in the Landsat dataset provides information on various quality types, where different bits (Bit) correspond to different types of quality information. For example, Bit 3 corresponds to clouds, Bit 5 corresponds to snow, and Bit 7 corresponds to water bodies. Within the same bit, values of 0 and 1 represent different data qualities. For example, a 0 in Bit 7 indicates that the pixel has poor water body information, being land or covered by clouds, while a 1 indicates that the pixel represents water.
Using this pixel quality information, we selected Bit 3 (cloud), Bit 5 (snow), and Bit 7 (water body). By performing bitwise AND and OR operations, we generated a water body mask file with good data quality after cloud and snow removal. This mask file is then overlaid with the actual image to remove pixels affected by cloud or snow interference. The effect of cloud and snow removal is shown in Fig. 4.
Figure 4Illustration of cloud and snow removal and its effect on NDWI-based water extraction. (a) True-color Landsat image before cloud and snow removal. (b) True-color image after cloud and snow masking, where contaminated pixels are excluded. (c) NDWI image derived from the original true-color image, shown in grayscale, with brighter values indicating higher likelihood of water presence. (d) NDWI image after cloud and snow removal; blue areas indicate pixels affected by cloud or snow that were excluded from water-body extraction. Source: Landsat imagery courtesy of the U.S. Geological Survey (USGS), processed and interpreted by the authors.
The NDWI, MNDSI index calculation, and cloud/snow interference removal are performed directly on the GEE platform, followed by monthly composite image downloads. Based on the cloud cover information (“CLOUD_COVER”), which represents the cloud amount (range from 0 to 100, with larger values indicating more cloud coverage), the data is classified into three levels: 0–30, 30–60, and 60–100. If data is available in Level 1, Level 2 is not executed, and if Level 2 contains data, Level 3 is processed. All images from each year and month within the cloud cover level are selected, and the median pixel value is calculated to generate the composite monthly NDWI (for 5–11 months) and MNDSI (for December to the following April) grayscale images.
Data is filtered based on the cloud cover proportion C, where .
Composite image =Med(S(C)), where C=CLOUDCOVER
Where I(C) is a set of image data filtered by cloud cover.
3.1.2 Threshold-based water segmentation and noise removal
Lake water pixels were first identified using threshold-based segmentation. To reduce false positives caused by mountain shadows and built-up areas, topographic and ancillary data were applied to remove non-water pixels.
-
Threshold segmentation. This step applies the Otsu threshold algorithm to the downloaded NDWI and MNDSI monthly composite grayscale images, automatically generating a segmentation threshold. Pixels below the threshold are classified as water, and those above the threshold are classified as other areas.
The core of the Otsu thresholding method is to divide the image into two classes (foreground and background) by maximizing the between-class variance, thereby achieving the optimal threshold segmentation. Specifically, it involves iterating through all possible thresholds, and the optimal threshold is determined when the between-class variance is maximized while the variance within both the foreground and background is minimized. Compared to other methods, this algorithm maximizes the inclusion of the target feature while excluding other interfering factors.
The Otsu thresholding method is used to automatically generate the segmentation threshold, dividing the image into water and other regions:
Where, is the between-class variance, defined as:
Where ω1(τ) and ω2(τ) are the weights of the foreground and background at the threshold τ, and μ1(τ) and μ2(τ) are the mean gray values of the foreground and background, respectively.
The portion smaller than the threshold T is classified as water, symbolized as water pixels, while the portion greater than the threshold is classified as other categories.
-
Mountain shadow and buildings removal. Since the lake surface typically exhibits a flat state without significant slope and aspect features, digital elevation models (DEM) can be used to distinguish lakes from mountainous regions by utilizing slope and aspect information. By setting threshold values of 0 for slope and aspect, the distinction between lakes and mountainous areas can be made. However, the current frequency of elevation data updates does not align with real-time imagery, leading to an inability to accurately reflect seasonal changes in lake water levels within the elevation data. This limitation affects the precision of water body area extraction using the data. Given that most lakes are interconnected, this study employs the maximum connected component analysis algorithm from the Open-CV vision field to define the boundaries of lakes and extract their areas.
By setting the thresholds for slope θ and aspect φ to 0 in the digital elevation model (DEM), lakes are distinguished from mountainous areas:
Where θ(x,y) and ϕ(x,y) represent the slope and aspect values at a given point (x,y), respectively. By setting and as threshold conditions for the lake area, the lake region is defined as the area where both the slope and aspect are equal to 0.
Where L represents the total number of pixels in the largest lake area, Ci represents the ith connected component in the image, the function ∑ denotes the summation of pixel points, and indicates the selection of the largest connected component as the lake area.
The construction of the building index currently mainly relies on the fact that the surface temperature of buildings is usually higher than that of surrounding land cover, and the mid-infrared band can effectively reflect surface temperature differences. However, in previous land cover classification studies, the extraction results using this algorithm were not ideal. Considering that most buildings in the study area are not distributed along lakes, the maximum connected component algorithm can effectively exclude parts where buildings are misidentified as water bodies.
Based on the NDWI (Normalized Difference Water Index), a threshold T is used to binarize the image, separating water bodies from non-water bodies.
Connected Component Calculation: In the binarized image, the Connected Components Labeling (CCL) algorithm is used to identify all connected regions. A connected component is determined by scanning the neighboring pixels in the image (up, down, left, right, or diagonally). The formula is expressed as:
Where R represents the connected regions in the image, and Cidenotes the connected components.
To eliminate interference from buildings, a threshold condition τ is set, retaining only connected components with an area greater than τ. Since buildings typically have smaller areas, while lakes exhibit larger connected components, the lake regions can be filtered using the following condition:
The lake boundary is extracted using a boundary detection algorithm (e.g., the Canny edge detection algorithm) applied to the selected largest connected region.
As shown in the Fig. 5, the white areas in the original image include both lakes and buildings. When using threshold segmentation to extract water bodies, buildings may also be mistakenly identified as water. By applying the maximum connected component method, buildings can be effectively separated.
Figure 5Illustration of water extraction and building removal processes. (a) Original Landsat image, in which white areas include both lake water and built-up surfaces; (b) water bodies extracted using threshold-based segmentation; (c) buildings separated from water bodies using the maximum connected component method. Source: Landsat imagery courtesy of the U.S. Geological Survey (USGS), processed and interpreted by the authors.
3.1.3 Cloudy and striped image reconstruction
-
Cloudy image filling processing. For cloud-free images, the subsequent water-extraction steps are applied directly. For cloudy images, cloud-free images are first used to reconstruct missing pixels, after which the same processing steps are executed.
The filling approach is as follows: Based on the cloud coverage information (CLOUD_COVER), images with cloud cover less than or equal to 30 % (CLOUD_COVER ≤30 %) are classified as cloud-free images, while others are considered cloudy images. The formula is as follows:
Then, the Mutual Information (MI) algorithm is used to perform the most similar matching between the cloudy image and all cloud-free images. Next, the most similar image is combined with the original cloudy image through a union operation to obtain the filled cloudy image. Finally, the reconstructed images are processed using the same lake-extraction workflow as applied to cloud-free images, producing the final water-body area estimates:
-
Candidate Cloud-Free Image Set. In the time periods before and after the cloudy image, select images with low cloud coverage (CLOUD_COVER ≤30 %) as the candidate image set.
-
Mutual Information Algorithm. Use the MI algorithm to calculate the similarity between the cloudy image and the candidate cloud-free images. The formula is as follows:
Where Icloudy represents the cloudy image, Iclear represents the candidate cloud-free image, and p is the joint probability distribution of the pixel grayscale values. III denotes mutual information, which measures the correlation between the cloudy image and the cloud-free image.
Selecting the Most Similar Image: Based on the mutual information value, the cloud-free image most similar to the cloudy image is selected.
-
-
Striped image filling. The previously mentioned dataset indicates that Landsat 7 TM images have significant striping interference. Additionally, Landsat 5 TM and Landsat 8 OLI images also experience striping interference in certain months, such as Landsat 5 TM from 2001 to 2003 and Landsat 8 in 2008. To more accurately obtain the temporal changes in lake area, it is necessary to fill the missing portions of striped images. The method is the same as for cloud-filled images. By utilizing the clear contours of historical cloud-free images and applying the MI algorithm, the most similar historical cloud-free images are searched to fill the water pixels in the striped regions. The method for filling striped images is the same as that for cloud-filled images.
Figure 6Filling processing of cloudy and striped-interference lake images using similar cloud-free references. (a, d) Cloud-free reference images identified as most similar to the cloudy images; (b, e) Original cloudy images; (c, f) Cloud-filled results after processing; (g) Cloud-free image most similar to the striped-interference image; (h) Original striped-interference image; (i) Result after stripe-filling processing. Source: Landsat imagery courtesy of the U.S. Geological Survey (USGS), processed and interpreted by the authors.
3.1.4 Monthly synthesis and time-series construction
After applying the maximum connectivity component processing to the image, the number of water pixels is counted. Then, based on the spatial resolution of the pixels (30 m × 30 m), the actual area is calculated.
Collect all known lake area data for specific time points, where ti dots represent time points with available data. For each missing data point tmissing, use the known data points tmissing−1 and tmissing+1, and apply the selected interpolation method to calculate the lake area A(tmissing) at time.
3.2 Lake area time series construction
3.2.1 Monthly lake-area time series and seasonal variability of Bahannao Lake
The interannual variation of Bahannao is quite drastic, but the overall trend is declining (Fig. 7a), linear regression analysis based on the year index reveals a significant declining trend in the lake area from 1984 to 2024, with a decrease rate of −0.29 km2 yr−1 (p=0.001). The coefficient of determination (R2=0.25) indicates substantial interannual variability, suggesting that although the long-term trend is statistically robust, short-term fluctuations and nonlinear processes play an important role in shaping lake-area dynamics. Before 1999, the changes were relatively stable. In 2000, the lake area shrank severely, decreasing by 82.98 % compared to 1999, leaving only 3.12 km2. Since then, the lake has exhibited a cyclical fluctuation pattern with a period of approximately 5–6 years. In 2021, the lake area reached its minimum value of just 0.71 km2, followed by a rapid increase, reaching its maximum of 23.38 km2 in 2023.
Figure 7Interannual and intra-annual variation of Bahannao Lake area. (a) Interannual variations of lake area; (b) multi-year mean monthly variations of lake area.
Due to its location in the Mu Us Desert and the lack of long-term observational data, this study references the lake area interpreted via remote sensing in the Comprehensive Lake Water Ecological Management Plan of Uxin Banner. This report provides remote sensing imagery data for 24 years from 1988 to 2018 (with six years lacking clear images suitable for analysis).
A comparison of the data (Fig. 8a) shows that the lake area interpreted in this study aligns with the trend reported in the management plan. Over the 23 years of overlapping interpretation, the error remains within 15 % for 12 years. However, in years when the lake area was smaller, the error was relatively larger, such as in 2000, 2001, 2009, 2010, 2011, and 2015. According to records, Bahannao Lake shrank significantly during these years but did not completely dry up until 2021, which is consistent with the results of this study.
The interpreted lake area in this study also indicates (Fig. 8b) that the annual average area of Bahannao Lake in 2021 was only 0.71 km2. The lake area was at its smallest in August, September, and October, reaching only 0.2 km2, while the largest area was recorded in March at 3.5 km2. The rapid expansion of the lake area observed in the spring of 2021 may be related to short-term hydrological inputs. Although the precipitation in this region during winter and early spring is generally limited, occasional short-term precipitation events or snowmelt may temporarily increase surface runoff and inflow to the lake. In addition, as temperatures rise in late spring, enhanced evaporation and reduced inflow may lead to a rapid decrease in lake area. Therefore, the rapid increase and subsequent decrease in lake area in spring 2021 may reflect the combined effects of short-term hydrological inputs and seasonal evaporation processes.
Figure 8Validation of lake area estimates and intra-annual variability in a typical year (2021). (a) Comparison between lake area derived in this study and reference datasets; (b) monthly variations of lake area in 2021, selected as a representative year to illustrate intra-annual dynamics.
From the perspective of seasonal (Fig. 9) and monthly (Fig. 7b) variation characteristics, Bahannao exhibits significant seasonal differences. The lake area in summer, autumn, and winter is noticeably larger than in spring, with autumn having the largest lake area, averaging 16.21 km2 and reaching a peak of 16.24 km2 in September. In contrast, spring has the smallest lake area, averaging only 13.57 km2, with the lowest value of 12.48 km2 occurring in April.
3.2.2 Method validation using representative lakes in arid regions
To further evaluate the robustness and regional applicability of the proposed lake-area extraction method, we applied the same remote-sensing workflow to two representative lakes in arid and semi-arid northern China: Hongjiannao Lake and Wuliangsuhai Lake. These lakes differ markedly in size, hydrological conditions, and degree of human influence, and have been widely investigated in previous remote-sensing studies, providing independent reference datasets for method validation.
Using the identical image-processing procedures and water-body extraction criteria as those employed for Bahannao Lake, we constructed annual lake-area time series for both Hongjiannao Lake and Wuliangsuhai Lake (Fig. 10). The derived time series capture the major interannual fluctuations and long-term trends of lake-area variability for both lakes.
To quantitatively assess consistency with existing studies, the lake-area estimates obtained in this study were compared with previously published lake-area datasets (Fig. 11). For both lakes, the temporal evolution and long-term trends derived in this study show good agreement with reference datasets reported in the literature.
For Hongjiannao Lake, quantitative comparison indicates that the relative differences between lake-area estimates derived in this study and published datasets generally remain within a reasonable range. Specifically, the maximum and minimum relative differences are 14.65 % and 9.12 % when compared with Ji et al. (2023), 18.70 % and 9.57 % with Xie et al. (2021), 11.82 % and 8.29 % with Ma et al. (2020), 11.30 % and 7.94 % with Wang et al. (2018b), and 10.57 % and 3.15 % with Liu and Yue (2016).
For Wuliangsuhai Lake, the relative differences are generally smaller, with maximum differences of 8.50 % (minimum 1.74 %) compared with Guan (2022), 8.02 % (minimum 1.79 %) compared with Li et al. (2023), and 18.12 % (minimum 1.09 %) compared with Tan et al. (2021). These results indicate a high level of consistency between the lake-area estimates derived in this study and those reported in previous literature.
Although minor discrepancies in absolute lake-area values are observed, these differences can be attributed to variations in image selection, water-index thresholds, temporal coverage, and post-processing strategies among different studies. An additional source of discrepancy arises from differences in temporal aggregation strategies. In this study, annual lake area is calculated as the mean of monthly lake-area estimates derived from all available images within a year, which reduces the influence of short-term fluctuations and image-specific noise. In contrast, many previous studies report lake area based on a single image or a limited number of images selected for each year. Such differences in temporal representation can lead to systematic deviations in absolute lake-area values, particularly for lakes exhibiting strong intra-annual variability.
Overall, the consistency between our results and independent reference datasets supports the robustness and transferability of the proposed lake-area extraction method across different lake types in arid and semi-arid regions. This validation provides confidence that the method is suitable for long-term lake-area monitoring and comparative analysis in data-sparse dryland environments.
Figure 11Comparison of lake area estimates derived in this study with published reference datasets. (a) Comparison of Hongjiannao Lake area with lake-area estimates reported in previous studies; (b) Comparison of Wuliangsuhai Lake area with lake-area estimates reported in previous studies.
Figure 12Temporal and seasonal variations in air temperature and 2 m dew point temperature over the study area during the period of 1984–2024. (a) Interannual variations in air temperature; (b) Multi-year mean seasonal cycle of air temperature; (c) Interannual variations in 2 m dew point temperature; (d) Multi-year mean seasonal cycle of 2 m dew point temperature.
3.3 Impact of climate change
3.3.1 Changes of hydro-climate series
- 1.
Temperature and moisture conditions
- a.
Temperature and 2m dew point temperature. The rise in air temperature directly affects the evaporation rate of the lake. The warming rate is 0.043 °C yr−1 (Fig. 12a), leading to an increase in the lake surface temperature and, consequently, higher evaporation. High temperatures intensify water evaporation, reducing the lake's water volume and causing a gradual decrease in lake area over the years.
The increase in air temperature enhances heat input into the water body, accelerating evaporation. As more heat is absorbed, surface water transforms more easily into water vapor, leading to a decline in lake water levels. Although the influence of temperature on lake area varies across different time periods, its continuous upward trend has a long-term impact on the reduction of lake area.
The 2 m dew point temperature increases at a rate of 0.0095 °C yr−1 (Fig. 12c), indicating changes in atmospheric humidity. A rising dew point temperature suggests an increase in water vapor content in the air, typically associated with higher humidity. However, humidity changes do not always directly impact lake area; instead, they influence lake water volume indirectly by affecting evaporation and precipitation. While an increase in dew point temperature usually indicates higher humidity, if precipitation is insufficient or evaporation rates are too high, this increase in humidity may not effectively replenish lake water. Instead, it could contribute to lake shrinkage. The varying influence of the 2 m dew point temperature over different periods suggests a complex relationship with lake area changes, requiring a comprehensive analysis alongside other climatic factors.
- b.
Precipitation and relative humidity. The total precipitation is decreasing at a rate of 1.736 mm yr−1 (Fig. 13a). Precipitation is one of the primary sources of lake water. A reduction in precipitation leads to insufficient water replenishment for the lake, resulting in a decline in water levels and a reduction in lake area.
The relative humidity decreases at a rate of 0.099 yr−1 (Fig. 13c). A decrease in humidity typically accelerates evaporation from the lake, leading to a reduction in lake area. The decrease in humidity means that the air becomes drier, and the evaporation rate increases. This accelerates the evaporation of lake water, resulting in a decline in both lake water levels and area, intensifying the process of lake desiccation.
- a.
Figure 13Temporal and seasonal variations in precipitation and relative humidity over the study area during the period of 1984–2024. (a) Interannual variations in precipitation; (b) Multi-year mean seasonal cycle of precipitation; (c) Interannual variations in relative humidity; (d) Multi-year mean seasonal cycle of relative humidity.
- 2.
Surface radiation and heat flux components
- a.
Net longwave radiation and net shortwave radiation at the surface. Net longwave radiation at the surface decreases by 0.084 W m−2 yr−1 (Fig. 14a). The reduction in longwave radiation means that the lake receives less radiative heat, which theoretically could reduce evaporation. However, this effect is overshadowed by other factors such as reduced precipitation and rising temperatures. While the decrease in longwave radiation could reduce heat loss from the lake, in conditions of drought and high evaporation, the impact of this reduction is likely limited.
Net shortwave radiation at the surface increases by 0.065 W m−2 yr−1 (Fig. 14a). The increase in shortwave radiation enhances the evaporation process, thereby reducing the lake's surface area. The rise in shortwave radiation leads to an increase in surface temperature, which accelerates evaporation. The intensified evaporation exacerbates the loss of water from the lake. The effect of increased shortwave radiation on the lake's area is significant during all periods, especially under drought and high-temperature conditions, where its impact is particularly pronounced.
- b.
Mean surface latent heat flux and sensible heat flux. The latent heat flux decreases at a rate of 0.081 W m−2 yr−1 (Fig. 14c). The decrease in latent heat flux indicates a reduction in the moisture carried by the air, possibly as a result of decreased humidity, which further intensifies evaporation from the water.
The sensible heat flux increases by 0.095 W m−2 yr−1 (Fig. 14c), meaning that the heat exchange between the surface and the atmosphere is enhanced. This leads to more evaporation, particularly during the summer when temperatures are higher.
- a.
Figure 14Regional variations of surface net radiation and surface heat flux during 1984–2024. (a) Interannual variations of mean surface net radiation; (b) multi-year mean seasonal variations of surface net radiation; (c) interannual variations of mean surface heat flux; (d) multi-year mean seasonal variations of surface heat flux. Abbreviations: msnlwf denotes mean surface net longwave radiation; msnswf denotes mean surface net shortwave radiation; mslhf denotes mean surface latent heat flux; msshf denotes mean surface sensible heat flux. The suffixes Spr, Sum, Aut, and Win represent spring, summer, autumn, and winter, respectively.
- 3.
Evaporative demand and aridity conditions
- a.
Potential evapotranspiration. Potential evapotranspiration increases at a rate of 1.937 mm yr−1 (Fig. 15a). The increase in evapotranspiration directly leads to the loss of water from the lake, making it an important factor contributing to the reduction in lake area. The rise in potential evapotranspiration indicates that both evaporation and plant transpiration in the lake area are increasing, further reducing the water volume of the lake. The increase in potential evapotranspiration has a significant impact on the lake area in all time periods, especially under drought and high-temperature conditions, where its effect is even more pronounced.
- b.
Drought. The aridity index (AI) exhibits pronounced interannual variability over the study period, with values generally fluctuating between approximately 0.18 and 0.40 (Fig. 15c), indicating persistently dry climatic conditions in the study area. Although a decreasing trend is observed (−0.0015 yr−1), the trend is not statistically significant at the 0.05 level (p=0.07), suggesting that long-term aridity intensification is moderate rather than abrupt.
Despite the weak linear trend, the consistently low AI values (<0.5) confirm that Bahannao Lake is located within a semi-arid to arid climatic regime, where water availability is inherently limited and highly sensitive to changes in hydro-climatic forcing. The wide prediction band further reflects strong year-to-year variability in regional moisture conditions, likely driven by fluctuations in precipitation and evaporative demand.
Importantly, the combination of a marginally decreasing AI trend and a significant increase in potential evapotranspiration implies a gradual shift toward enhanced atmospheric water demand, even in the absence of a statistically significant drying trend in AI alone. This suggests that lake-area dynamics are more strongly controlled by evaporative processes than by precipitation-driven moisture supply, particularly in recent decades.
- a.
In addition to interannual variability, the aridity index (AI) exhibits pronounced seasonal contrasts (Fig. 15d). Summer shows the highest AI values, with a relatively wide distribution and higher median, indicating comparatively wetter conditions driven by concentrated precipitation during the warm season. Autumn presents intermediate AI values, reflecting a transition from moisture input to increasing evaporative demand.
In contrast, spring and winter are characterized by distinctly lower AI values. Spring exhibits low median AI and limited dispersion, indicating persistent moisture deficit during the lake recharge period. This seasonal dryness coincides with rising temperatures and increasing evaporative demand, which constrains lake expansion despite episodic precipitation events. Winter shows the lowest AI values overall, reflecting extremely dry atmospheric conditions dominated by minimal precipitation and suppressed moisture availability.
The seasonal pattern of AI highlights that Bahannao Lake is subject to strong intra-annual asymmetry in hydro-climatic conditions, with relatively favorable moisture supply confined to summer, while prolonged dry conditions prevail during spring and winter. Such seasonal dryness amplifies the sensitivity of lake area to evaporative processes.
Figure 15Regional variations in evaporation and drought conditions during 1984–2024. (a) Interannual variations of potential evapotranspiration; (b) multi-year mean seasonal variations of potential evapotranspiration; (c) interannual variations of the AI; (d) multi-year mean seasonal variations of the AI.
3.3.2 Impacts of hydro-climate elements on lake area
-
Linear relationships between lake area and climatic variables. A sliding T test on the lake area (Fig. 16) reveals two turning points in the lake's area change, specifically in 2000 and 2015. Therefore, we divide the study period into three time segments: the first period from January 1984 to December 1999, the second period from January 2000 to December 2014, and the third period from January 2015 to July 2024, to investigate the causes of the changes in lake area.
The seasonal correlation analysis reveals pronounced differences in lake–climate relationships across seasons (Fig. 17a). In spring, lake area exhibits a significant positive correlation with relative humidity (RH) (r=0.403, p<0.01) and a significant negative correlation with temperature (T) (, p<0.05), indicating that spring lake-area variability is sensitive to atmospheric moisture conditions and warming processes. In contrast, correlations with precipitation (P) and potential evapotranspiration (PET) are weak and not statistically significant.
During summer, the lake–climate relationships are strongest. Lake area shows significant negative correlations with temperature (, p<0.01) and PET (, p<0.05), and significant positive correlations with precipitation (r=0.437, p<0.01) and RH (r=0.468, p<0.01). These results indicate that summer lake-area variability is jointly controlled by moisture supply and enhanced evaporative demand.
In autumn, lake area is significantly negatively correlated only with temperature (, p<0.05), whereas correlations with precipitation, RH, and PET are not significant, suggesting that autumn lake-area variations may reflect cumulative effects of antecedent hydro-climatic conditions. In winter, lake area shows a significant positive correlation with RH (r=0.315, p<0.05), while correlations with other climatic variables remain weak, reflecting reduced hydrological activity during the cold season.
At the interdecadal scale, lake–climate correlations exhibit clear stage-dependent characteristics (Fig. 17b). During the period 1984–1999, lake area shows no significant correlation with temperature, precipitation, PET, or RH, indicating a relatively weak response to individual climatic factors.
During 2000–2014, lake area becomes significantly positively correlated with precipitation (p<0.05) and RH (p<0.01), suggesting an enhanced sensitivity of lake-area variability to moisture conditions during this period. In the most recent period (2015–2024), lake area maintains a significant positive correlation only with RH (p<0.05), while correlations with other climatic variables weaken, implying a dominant role of atmospheric moisture conditions in regulating recent lake-area changes.
-
Nonlinear hydro-climatic controls revealed by XGBoost. To further quantify the relative importance of climatic variables and explore potential nonlinear effects beyond linear correlations, an XGBoost model was applied using precipitation, temperature, relative humidity, and potential evapotranspiration as predictors.
Model evaluation indicates that training performance generally exceeds testing performance, and testing R2 values are relatively low or even negative in some cases. This behavior reflects the limited sample size, strong interannual variability, and inherent nonlinearity of lake-area dynamics in arid regions, rather than model inadequacy. Therefore, in this study, XGBoost is primarily used as an interpretative tool to assess the relative importance of climatic drivers rather than as a predictive model.
XGBoost-derived feature importance exhibits clear seasonal contrasts that broadly agree with the correlation analysis while providing additional insights into nonlinear controls (Fig. 18a). In spring, XGBoost feature importance indicates that air temperature is the most influential predictor (T, 0.31), followed by relative humidity (RH, 0.28), whereas precipitation (P, 0.20) and potential evapotranspiration (PET, 0.20) play secondary roles. This finding is consistent with the correlation analysis, which shows a significant positive correlation between lake area and RH (r=0.403, p<0.01) and a significant negative correlation with air temperature (, p<0.05). Together, these results highlight the sensitivity of springtime lake dynamics to atmospheric moisture conditions and evaporative demand. Long-term trend analysis further indicates a significant increase in air temperature at a rate of 0.043 °C yr−1 (p<0.001) and a significant decline in RH (−0.099 yr−1, p=0.009), reinforcing the role of enhanced evaporation and atmospheric drying in shaping spring lake-area changes.
Figure 17Seasonal and interdecadal differences in correlations between lake area and climatic drivers. (a) Seasonal correlations between lake area and RH, PET, T, and P. (b) Correlations between lake area and climatic variables across three sub-periods (1984–1999, 2000–2014, and 2015–2024).
In summer, air temperature (T, 0.35) and relative humidity (RH, 0.26) dominate the feature-importance rankings, with precipitation (P, 0.18) and potential evapotranspiration (PET, 0.21) also contributing substantially. This aligns well with the correlation results, which indicate that summer lake area is positively correlated with precipitation (r=0.437, p<0.01) and RH (r=0.468, p<0.01), and negatively correlated with temperature (, p<0.01) and PET (, p<0.05). Trend analysis shows that although precipitation exhibits a decreasing tendency (−1.736 mm yr−1, p=0.135, not significant), PET increases significantly at a rate of 1.937 mm yr−1 (p=0.01). This suggests that summer lake-area variability is increasingly constrained by enhanced evaporative demand, with the balance between water input and evaporation losses playing a dominant role.
In autumn, linear correlations between lake area and most climatic variables are weak and statistically insignificant. However, XGBoost results still indicate relatively high importance for relative humidity (RH, 0.34) and potential evapotranspiration (PET, 0.31), suggesting that autumn lake dynamics may be governed by nonlinear processes or threshold effects that are not adequately captured by linear methods alone. Considering the significant upward trend in PET and the declining tendency of the aridity index (AI; −0.015 yr−1, p=0.07), autumn lake systems appear to be transitioning toward evaporation-dominated control.
In winter, overall feature importance values are relatively low due to reduced hydrological activity, yet relative humidity (RH, 0.34) remains the most influential variable in the XGBoost model. This is consistent with the correlation analysis showing a significant positive relationship between winter lake area and RH (r=0.315, p<0.05), indicating that background atmospheric moisture conditions still serve as an important indicator of lake variability during the frozen period.
At the decadal scale, XGBoost results reveal a clear temporal shift in the dominant climatic controls on lake-area variability, as shown in Fig. 18b. During 1984–1999, the importance of individual climatic variables is generally low and dispersed, consistent with the weak correlations observed during this period. This suggests that the lake system exhibited relatively low sensitivity to climatic fluctuations in the early stage.
During 2000–2014, precipitation (P, 0.22), potential evapotranspiration (PET, 0.23) and relative humidity (RH, 0.34) show markedly higher importance in the XGBoost model, in agreement with correlation results indicating significant positive relationships between lake area and precipitation (r=0.179, p<0.05) and RH (r=0.388, p<0.01). This period is therefore characterized by a precipitation- and moisture-dominated control regime.
In the most recent period (2015–2024), the importance of temperature and PET increases noticeably, while the contribution of precipitation weakens. Combined with the observed warming trend and enhanced evaporative demand, these results indicate a transition toward an evaporation-dominated climatic control on lake-area dynamics in recent years.
By integrating long-term trend analysis, linear correlation analysis, and XGBoost-based nonlinear feature importance, this study demonstrates that lake-area variability in arid regions is not governed by a single climatic factor, but rather by the interplay between water supply and evaporative demand across different seasons and time scales. Linear correlation analysis effectively captures the summer lake–climate relationship dominated by water balance, whereas the nonlinear XGBoost approach provides complementary insights into more complex control mechanisms during transitional seasons such as spring and autumn. Overall, the results indicate that with continued regional warming, increasing PET, and intensifying aridity, evaporative processes are playing an increasingly important role in controlling lake-area variability, offering important implications for understanding the response of arid-region lakes to future climate change.
Figure 18Weight of influencing factors by season. Abbreviations: RH denotes relative humidity (%); PET denotes potential evapotranspiration (mm); T denotes air temperature (°C); P denotes precipitation (mm).
To further investigate the nonlinear relationships between lake area and climatic factors, partial dependence plots (PDPs) were generated for each season and period. The seasonal partial dependence plots (Fig. 19a–p) reveal pronounced seasonal differences and nonlinear responses of lake area to hydro-climatic variables.
In spring, lake area exhibits a clear threshold response to relative humidity (RH). When RH remains below approximately 34 %–35 %, the lake area shows only minor variation. However, when RH increases to around 38 %–40 %, the lake area rises rapidly, indicating that enhanced atmospheric moisture conditions can significantly support lake water storage. In contrast, PET shows a weak negative influence on lake area, while temperature exhibits a marked decline in lake area when it approaches approximately 10–11 °C, suggesting that intensified evaporation may suppress lake expansion. Precipitation shows only a limited influence, implying that spring lake dynamics are primarily controlled by atmospheric moisture conditions and evaporative demand.
During summer, precipitation demonstrates a pronounced nonlinear positive effect on lake area. When precipitation increases to approximately 200 mm, lake area expands rapidly, indicating that rainfall represents a major water input supporting lake expansion in the warm season. Relative humidity also shows a generally positive relationship with lake area, whereas increasing temperature leads to a clear decline in lake area, reflecting the strong evaporative effects under high-temperature conditions.
In autumn, lake area responds strongly to variations in PET, displaying a distinct nonlinear pattern. When ET0 approaches approximately 300 mm, lake area decreases sharply, whereas at higher PET levels the lake area shows partial recovery, suggesting a complex regulatory role of evaporative demand on lake water balance. Meanwhile, increasing temperature generally leads to a decline in lake area, highlighting the importance of evaporation processes in controlling autumn lake dynamics.
In winter, RH exhibits a strong positive effect on lake area. When RH increases from approximately 40 % to 50 %, lake area increases markedly, indicating that atmospheric moisture conditions play a key role in regulating winter lake variability. Precipitation exerts a relatively weak influence, while temperature variations mainly affect lake dynamics indirectly through their influence on evaporation processes.
Overall, these results demonstrate that the response of lake area to hydro-climatic variables is highly nonlinear and strongly season-dependent, with atmospheric moisture dominating in spring and winter, precipitation controlling summer expansion, and evaporative demand exerting a stronger influence during autumn.
The partial dependence plots across the three periods (Fig. 20a–l) further reveal a clear temporal evolution in the hydro-climatic controls on lake area. During the first period, lake area shows a strong positive response to relative humidity (RH). When RH increases from approximately 30 % to around 40 %, lake area expands rapidly, indicating that atmospheric moisture conditions play a key role in sustaining lake water storage. Precipitation also exhibits a positive influence on lake area, particularly when precipitation exceeds approximately 80–100 mm, suggesting that water supply conditions were an important driver of lake expansion during this stage. In contrast, PET and temperature show relatively weaker influences, indicating that evaporative demand played a less dominant role during the early period.
In the second period, the nonlinear responses of lake area become more complex. RH still exerts a noticeable influence on lake area, but the relationship becomes less monotonic compared with the earlier stage. Meanwhile, the influence of PET becomes more evident, suggesting that evaporative demand began to exert a stronger regulatory effect on lake dynamics. Precipitation continues to show a positive relationship with lake area, although the magnitude of the response is slightly reduced, indicating a gradual shift from water-supply dominance toward a combined influence of water supply and evaporation processes.
During the most recent period, the PDPs indicate that temperature and PET exert stronger impacts on lake area variability. Increasing temperature generally leads to a decline in lake area, reflecting enhanced evaporation under warming climatic conditions. Similarly, higher PET values correspond to reduced lake area, indicating that evaporative demand has become an increasingly important control on lake dynamics. In contrast, the influence of precipitation becomes relatively weaker compared with earlier periods, suggesting that the hydrological sensitivity of the lake has gradually shifted toward stronger evaporation-driven regulation.
Overall, the PDP analysis across the three periods suggests a gradual transition in the dominant hydro-climatic controls on Bahannao Lake, shifting from moisture-supply-dominated processes in the early period toward stronger regulation by evaporative demand under recent warming conditions.
These results highlight that lake dynamics in Bahannao Lake are governed by complex nonlinear hydro-climatic interactions, and that the dominant climatic controls have evolved over time.
This study constructed a continuous monthly lake-area time series for Bahannao Lake spanning 1984–2024 using an optimized lake-area extraction framework that integrates seasonal water-index selection, adaptive thresholding, maximum connectivity analysis, and mutual information–based gap filling. Compared with widely used long-term products such as the JRC Global Surface Water dataset, which are often constrained by cloud contamination, seasonal ice cover, and temporal discontinuities, the proposed framework substantially improves temporal continuity and robustness under complex environmental conditions. This improvement is particularly important for small lakes in arid and semi-arid regions, where data gaps and seasonal disturbances are pervasive in existing datasets.
At the methodological level, this study introduces targeted improvements at several critical steps relative to previous approaches. First, the seasonal application of NDWI and MNDSI for non-freezing and freezing periods, respectively, enhances the stability of water-body identification under varying surface conditions, outperforming traditional single-index methods (McFeeters, 1996; Yao et al., 2015). Second, the combination of Otsu thresholding with DEM-based terrain constraints effectively reduces misclassification caused by topographic shadows and complex terrain, which is a common challenge for inland lakes in arid environments. Third, the mutual information–based image-filling strategy reconstructs cloud- and stripe-contaminated pixels by matching historically most similar cloud-free images, thereby extending the usability of long-term Landsat archives. Compared with approaches relying solely on interpolation (Zhao and Gao, 2018), this strategy substantially improves the completeness and reliability of multi-decadal lake-area records. Collectively, these methodological enhancements systematically address key challenges repeatedly identified in previous studies, including cloud contamination, seasonal variability, topographic interference, and spectral complexity of inland waters (Mouw et al., 2015; Palmer et al., 2015; Shen et al., 2017; Cao et al., 2019), and establish a transferable framework suitable for lake monitoring in arid and data-scarce regions.
From a hydro-climatic perspective, the reconstructed long-term record provides important insights into the mechanisms controlling lake dynamics in arid environments. Consistent with previous studies, precipitation and evaporation emerge as the primary factors regulating lake-area variability, particularly during the warm season when both water inputs and evaporative losses are enhanced (Tao et al., 2015; Li et al., 2017). The correlation analysis indicates that lake area is significantly positively correlated with precipitation and relative humidity in summer, whereas atmospheric moisture conditions exert a more pronounced influence during spring and winter. These findings reinforce the view that lake dynamics in arid regions are governed by the seasonal balance between water supply and evaporative demand.
However, compared with many existing studies that rely primarily on annual-scale analyses, the monthly lake-area time series developed here reveals pronounced seasonal heterogeneity and transitional behavior. In spring and autumn, linear correlations between lake area and individual climatic variables are generally weak, whereas XGBoost feature-importance analysis consistently identifies relative humidity and potential evapotranspiration as influential factors. This discrepancy suggests that lake responses during transitional seasons may be governed by nonlinear processes or threshold effects that cannot be fully captured by linear statistical methods alone. The combined use of correlation analysis and XGBoost therefore provides complementary perspectives on lake–climate relationships across different temporal scales.
At the decadal scale, both correlation analysis and XGBoost results indicate a clear evolution in dominant climatic controls on lake-area variability. During 2000–2014, precipitation and relative humidity exhibit increased importance and significant positive associations with lake area, indicating a moisture-dominated control regime. In contrast, during 2015–2024, the importance of air temperature and potential evapotranspiration increases markedly, while the contribution of precipitation weakens. This shift reflects a transition toward evaporation-dominated control under sustained warming conditions and highlights a dynamic reorganization of hydro-climatic drivers. Such temporal evolution extends existing understanding by explicitly demonstrating how dominant controls on arid-region lakes can shift under intensified climate variability.
These results have broader implications for studies of lakes in arid and semi-arid regions. The fragile water balance and limited buffering capacity of dryland lakes render them highly sensitive to even modest changes in precipitation, atmospheric moisture, and evaporative demand. The observed transition from precipitation-dominated to evaporation-dominated control suggests increasing vulnerability of arid-region lakes under ongoing climate warming. Even in the absence of a pronounced decline in precipitation, enhanced evaporation and atmospheric drying may offset or exceed water inputs, thereby accelerating lake shrinkage. This finding underscores the necessity of considering multiple hydro-climatic factors simultaneously when assessing future lake trajectories in arid environments.
From a water-resources management perspective, the results indicate that lake conservation and management strategies in arid regions should not focus solely on precipitation trends but must also account for changes in evaporative demand, drought intensity, and atmospheric moisture conditions. The lake-area extraction framework and the insights into evolving climatic controls presented here provide a robust technical foundation for long-term lake monitoring, risk assessment, and adaptive water-management strategies in data-sparse dryland regions.
Several limitations of this study should be acknowledged. First, while remote sensing reliably captures surface-area dynamics, subsurface processes such as groundwater inflow and outflow were not explicitly quantified and may influence lake water balance. The relatively weak correlation between lake area and precipitation observed in this study further suggests that additional hydrological processes, such as groundwater exchange or delayed runoff responses, may contribute to lake-area variability in this semi-arid environment. However, due to the lack of long-term groundwater observations in the study area, these processes could not be directly evaluated in the present study. Second, the 30 m spatial resolution of Landsat data limits detection of fine-scale shoreline changes, and future studies could benefit from integrating higher-resolution sensors such as Sentinel-2. Third, although XGBoost effectively captures nonlinear relationships, its data-driven nature limits physical interpretability relative to process-based hydrological models. Future research could integrate remote sensing, machine learning, ecohydrological modeling, and socioeconomic data to further advance understanding of lake dynamics in arid regions.
This study developed an optimized remote-sensing framework to construct a continuous monthly lake-area time series for Bahannao Lake from 1984 to 2024. By integrating seasonal water-index selection, adaptive thresholding, connectivity analysis, and mutual information-based image reconstruction, the proposed method effectively overcomes common limitations associated with cloud contamination, seasonal ice cover, and data gaps in long-term Landsat archives. Validation using Hongjiannao Lake and Wuliangsuhai Lake further demonstrates the robustness and regional applicability of the framework for lake monitoring in arid and semi-arid environments.
The reconstructed time series reveals pronounced interannual variability and strong seasonal contrasts in lake-area dynamics. Linear correlation analysis indicates that lake-area variations are primarily associated with precipitation and atmospheric moisture conditions during the warm season, whereas evaporative demand plays an increasingly important role during cold and transitional seasons. The combined use of correlation analysis and XGBoost modeling further reveals a clear stage-dependent evolution of climatic controls. During 1984–1999, lake dynamics were mainly associated with humidity and precipitation variability; during 2000–2014, the influence of moisture-related conditions remained important under enhanced climate variability; and during 2015–2024, rising air temperature and potential evapotranspiration emerged as dominant contributors, indicating a transition toward evaporation-dominated control under sustained warming.
Overall, the results demonstrate that lake-area changes in arid regions are governed by nonlinear and evolving interactions between water supply and evaporative demand rather than by any single climatic factor. By combining high-temporal-resolution lake-area reconstruction with both linear and nonlinear analytical approaches, this study provides new insights into the mechanisms underlying arid-region lake dynamics. The proposed framework offers a valuable tool for long-term lake monitoring and contributes to improved understanding of eco-hydrological responses and water-resource vulnerability under ongoing climate change in dryland regions.
The original datasets used in this study are publicly available from Landsat 5 TM, Landsat 7 TM, and Landsat 8 OLI imagery and from the fifth-generation ECMWF atmospheric reanalysis dataset (ERA5), as described in the Methods section. The processed data and code that support the findings of this study are available from the corresponding author upon reasonable request. These materials are not yet publicly archived because they require further curation and documentation before formal public release.
RZ and XW conceived and designed the study, developed the methodology, curated the data, and performed the formal analysis. RZ was responsible for visualization. RZ and XW prepared the original draft of the manuscript, and all authors contributed to reviewing and editing the paper. XW provided overall supervision.
The contact author has declared that none of the authors has any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
We are grateful to the National Key R&D Program of China (No. 2023YFC3206504), National Natural Science Foundation of China (No. 52121006, 41961124006), Postgraduate Thesis Fund of Nanjing Hydraulic Research Institute(Yy524010), Young Top-Notch Talent Support Program of National High-level Talents Special Support Plan, and Research Project of Ministry of Natural Resources (No. 20210103), Research Project of Inner Mongolia Academy of Science and Technology (No. 2024RCYJ05003) for providing financial support for this research. We are also thankful key authors and their agencies. We are also thankful to anonymous reviewers and editors for their helpful comments and suggestions.
This research has been supported by the National Key R&D Program of China (grant no. 2023YFC3206504), National Natural Science Foundation of China (grant nos. 52121006 and 41961124006), Postgraduate Thesis Fund of Nanjing Hydraulic Research Institute (grant no. Yy524010), Young Top-Notch Talent Support Program of National High-level Talents Special Support Plan, the Research Project of Ministry of Natural Resources (grant no. 20210103), and the Research Project of Inner Mongolia Academy of Science and Technology (grant no. 2024RCYJ05003).
This paper was edited by Xing Yuan and reviewed by two anonymous referees.
Adrian, R., O'Reilly, C. M., Zagarese, H., Baines, S. B., Hessen, D. O., Keller, W., Livingstone, D. M., Sommaruga, R., Straile, D., Van Donk, E., Weyhenmeyer, G. A., and Winder, M.: Lakes as sentinels of climate change, Limnol. Oceanogr., 54, 2283–2297, https://doi.org/10.4319/lo.2009.54.6_part_2.2283, 2009.
Bergé-Nguyen, M. and Crétaux, J. F.: Inundations in the Inner Niger Delta: Monitoring and analysis using MODIS and global precipitation datasets, Remote Sens., 7, 2127–2151, https://doi.org/10.3390/rs70202127, 2015.
Busker, T., de Roo, A., Gelati, E., Schwatke, C., Adamovic, M., Bisselink, B., Pekel, J.-F., and Cottam, A.: A global lake and reservoir volume analysis using a surface water dataset and satellite altimetry, Hydrol. Earth Syst. Sci., 23, 669–690, https://doi.org/10.5194/hess-23-669-2019, 2019.
Cao, Z. G., Ma, R. H., Duan, H. T., and Xue, K.: Effects of broad bandwidth on the remote sensing of inland waters: implications for high spatial resolution satellite data applications, ISPRS J. Photogramm. Remote Sens., 153, 110–122, https://doi.org/10.1016/j.isprsjprs.2019.05.001, 2019.
Carroll, M. L., Townshend, J. R. G., DiMiceli, C. M., and Loboda, T., and Sohlberg, R. A.: Shrinking lakes of the Arctic: Spatial relationships and trajectory of change, Geophys. Res. Lett., 38, https://doi.org/10.1029/2011GL049427, 2011.
Cooley, S. W., Smith, L. C., Stepan, L., and Mascaro, J.: Tracking dynamic northern surface water changes with high-frequency Planet CubeSat imagery, Remote Sens., 9, 1306, https://doi.org/10.3390/rs9121306, 2017.
Donchyts, G., Baart, F., Winsemius, H., Gorelick, N., Kwadijk, J., and Van de Giesen, N.: Earth's surface water change over the past 30 years, Nat. Clim. Change, 6, 810–813, https://doi.org/10.3390/rs9121306, 2016.
Guan, L. G.: Morphological evolution and ecological function positioning of Ulansuhai Lake under the influence of human activities, Inner Mongolia Water Resour., 8, 8–10, 2022.
Huang, C., Chen, Y., Zhang, S., and Wu, J.: Detecting, extracting, and monitoring surface water from space using optical sensors: A review, Rev. Geophys., 56, 333–360, https://doi.org/10.1029/2018RG000598, 2018.
Jeppesen, E., Meerhoff, M., Davidson, T., Trolle, D., Sondergaard, M., Lauridsen, T., Bekliouglu, M., Brucet, S., Volta, P., Bergonzoni, I. G., and Nielsen, A.: Climate change impacts on lakes: an integrated ecological perspective based on a multi-faceted approach, with special focus on shallow lakes, J. Limnol., 73, https://doi.org/10.4081/jlimnol.2014.844, 2014.
Ji, L. W., Wang, Z. W., and Cui, L. L.: Remote sensing monitoring of spatiotemporal dynamics of Hongjiannao Lake over the past 30 years, Shaanxi Meteorol., 1, 41–49, 2023.
Klein, I., Gessner, U., Dietz, A. J., and Kuenzer, C.: Global WaterPack – A 250 m resolution dataset revealing the daily dynamics of global inland water bodies, Remote Sens. Environ., 198, 345–362, https://doi.org/10.1016/j.rse.2017.06.045, 2017.
Laba, Z., Deji, Y., La, B., Chen, T., Ci, Z., Qiu, Y. B., and Pubu, C.: Remote sensing analysis on the area variations of Tangra Yutso in the Tibetan Plateau over the past 40 years, J. Lake Sci., 29, 480–489, https://doi.org/10.18307/2017.0224, 2017.
Li, L., Long, D., Wang, Y., and Woolway, R, I.: Global dominance of seasonality in shaping lake-surface-extent dynamics, Nature, 642, 361–368, https://doi.org/10.1038/s41586-025-09046-3, 2025.
Li, M., Yan, D. H., and Liu, S. H.: Variation characteristics of water surface area and water storage capacity of Namucuo Lake in recent 40 years, Water Resour. Power, 35, 41–43, 2017.
Li, S., Qu, W., and Zhang, T. T.: Study on changes in water surface area and water volume of Ulansuhai Lake based on remote sensing imagery, Yellow River, 45, 22–23, 2023.
Liu, Y. and Yue, H.: Analysis of Hongjiannao Lake area based on SMMI, Sci. Technol. Eng., 16, 122–127, https://doi.org/10.5194/isprs-archives-XLII-3-2201-2018, 2016.
Liu, Y. B., Wu, G. P., Zhao, X. S., Fang, X. W., Pan, X., Gan, G. J., Liu, Y. W., Guo, R. F., Zhou, H., Wang, Y., Wang, R. N., and Cui, Y. F.: Remote sensing for watershed hydrology: issues and challenges, Adv. Earth Sci., 35, 488–496, https://doi.org/10.11867/j.issn.1001-8166.2020.045, 2020.
Ma, H. L., Bai, M., and Guo, Y.: Analysis on the spatiotemporal evolvement process and cause of Hongjiannao during 1957–2019, Geomatics Spat. Inf. Technol., 43, 143–146, 2020.
Ma, R. H., Duan, H. T., Hu, C. M., Feng, X. Z., Li, A. N., Ju, W. M., Jiang, J. H., and Yang, G. S.: A half-century of changes in China's lakes: global warming or human influence?, Geophys. Res. Lett., 37, L24106, https://doi.org/10.1029/2010GL045514, 2010.
Ma, R. H., Yang, G. S., Duan, H. T., jiang, J. H., Wang, S. M., Feng, X. Z., Li, A. N., Kong, F. X., Xue, B., Wu, J. L., and Li, S. J.: China's lakes at present: number, area and spatial distribution, Sci. China Earth Sci., 54, 283–289, https://doi.org/10.1007/s11430-010-4052-6, 2011.
Ma, Y., Xu, N., Zhang, W. H., Sun, J. Y., and Feng, X. J.: Increasing water levels of global lakes between 2003 and 2009, IEEE Geosci. Remote Sens. Lett., 17, 187–191, https://doi.org/10.1109/LGRS.2019.2920387, 2020.
McFeeters, S. K.: The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features, Int. J. Remote Sens., 17, 1425–1432, https://doi.org/10.1080/01431169608948714, 1996.
Mouw, C. B., Greb, S., Aurin, D., DiGiacomi, P. M., Lee, Z. P., Twardowski, M., Binding,C., Hu, C. M., Ma, R. H., Moore, T., Moses, W., and Craig, S. E.: Aquatic color radiometry remote sensing of coastal and inland waters: challenges and recommendations for future satellite missions, Remote Sens. Environ., 160, 15–30, https://doi.org/10.1016/j.rse.2015.02.001, 2015.
Palmer, S. C. J., Kutser, T., and Hunter, P. D.: Remote sensing of inland waters: challenges, progress and future directions, Remote Sens. Environ., 157, 1–8, https://doi.org/10.1016/j.rse.2014.09.021, 2015.
Pekel, J. F., Vancutsem, C., Bastin, L., Clerici, M., Vanbogaert, E., Bartholomé, E., and Defourny, P.: A near real-time water surface detection method based on HSV transformation of MODIS multi-spectral time series data, Remote Sens. Environ., 140, 704–716, https://doi.org/10.1016/j.rse.2013.10.008, 2014.
Pickens, A. H., Hansen, M. C., Hancher, M., Stehman, S. V., Tyukavina, A., Potapov, P., Marroquin, B., and Sherani, Z.: Mapping and sampling to characterize global inland water dynamics from 1999 to 2018 with full Landsat time-series, Remote Sens. Environ., 243, 111792, https://doi.org/10.1016/j.rse.2020.111792, 2020.
Plug, L. J., Walls, C., and Scott, B. M.: Tundra lake changes from 1978 to 2001 on the Tuktoyaktuk Peninsula, western Canadian Arctic, Geophys. Res. Lett., 35, https://doi.org/10.1029/2007GL032303, 2008.
Preston, D. L., Caine, N., McKnight, D. M., Williams, M. W., Hell, K., Miller, M. P., Hart, S. J., and Johnson, P. T. J.: Climate regulates alpine lake ice cover phenology and aquatic ecosystem structure, Geophys. Res. Lett., 43, 5353–5360, https://doi.org/10.1002/2016GL069036, 2016.
Råman Vinnå, L., Medhaug, I., Schmid, M., and Bouffard, D.: The vulnerability of lakes to climate change along an altitudinal gradient, Commun. Earth Environ., 2, 35, https://doi.org/10.1038/s43247-021-00106-w, 2021.
Rossow, W. B. and Schiffer, R. A.: Advances in understanding clouds from ISCCP, B. Am. Meteorol. Soc., 80, 2261–2288, https://doi.org/10.1175/1520-0477(1999)080<2261:AIUCFI>2.0.CO;2, 1999.
Schmid, M., Hunziker, S., and Wüest, A.: Lake surface temperatures in a changing climate: a global sensitivity analysis, Clim. Change, 124, 301–315, https://doi.org/10.1007/s10584-014-1087-2, 2014.
Secretariat, G.: Implementation plan for the global observing system for climate in support of the UNFCCC (2010 update), in: Proceedings of the Conference of the Parties (COP), Copenhagen, Denmark, 7–18, 2009.
Shen, M., Duan, H. T., Cao, Z. G., Xue, K., Loiselle, S., and Yesou, H.: Determination of the downwelling diffuse attenuation coefficient of lake water with the Sentinel-3A OLCI, Remote Sens., 9, 1246, https://doi.org/10.3390/rs9121246, 2017.
Tan, R. J., Ma, Q., and Wang, R.: Remote sensing monitoring of long-term time-series dynamics of water area and aquatic vegetation, Yellow River, 43, 77–79, 2021.
Tao, S., Fang, J., Zhao, X., and Guo, Q. H.: Rapid loss of lakes on the Mongolian Plateau, P. Natl. Acad. Sci. USA, 112, 2281–2286, https://doi.org/10.1073/pnas.1411748112, 2015.
Tong, Y., Feng, L., Wang, X., Pi, X. H., Xu, W., and Woolway, I.: Global lakes are warming slower than surface air temperature due to accelerated evaporation, Nat. Water, 1, 929–940, https://doi.org/10.1038/s44221-023-00148-8, 2023.
Vincent, W. F.: Effects of climate change on lakes, in: The Impact of Climate Change on European Lakes, edited by: George, D. G., Springer, Dordrecht, 55–60, https://doi.org/10.1007/978-90-481-2945-4, 2009.
Wang, J., Song, C., Reager, J. T., Yao, F. F., Famiglietti, J. S., Sheng, Y. W., MacDonald, G. M., Brun, F., Schmided, H. M., Marston, R, A., and Wada, Y.: Recent global decline in endorheic basin water storages, Nat. Geosci., 11, 926–932, https://doi.org/10.1038/s41561-018-0265-7, 2018a.
Wang, Y., Yan, Z. L., and Gao, F.: Monitoring spatio-temporal changes of water area in Hongjiannao Lake from 1957 to 2015 and its driving forces analysis, Trans. Chin. Soc. Agric. Eng., 34, 265–271, 2018b.
Woolway, R. I., Kraemer, B. M., Lenters, J. D., Merchant, C. J., O'Reilly, C. M., and Sharma, S.: Global lake responses to climate change, Nat. Rev. Earth Environ., 1, 388–403, https://doi.org/10.1038/s43017-020-0067-5, 2020.
Xie, Z. G., Lian, Y. X., and Wu, H. P.: Water area change of Hongjiannao wetland from 2000 to 2018 and related policies, Shaanxi For. Sci. Technol., 49, 33–38, 2021.
Yang, G. S., Ma, R. H., Zhang, L., Jiang, J. H., Yao, S. C., Zhang, M., and Zeng, H. A.: Lake status, major problems and protection strategy in China, J. Lake Sci., 22, 799–810, https://doi.org/10.18307/2010.0601, 2010.
Yao, F., Wang, C., Dong, D., Luo, J. C., Shen, Z. F., and Yang, K. H.: High-resolution map of urban surface water using ZY-3 multi-spectral imagery, Remote Sens., 7, 12336–12355, https://doi.org/10.3390/rs70912336, 2015.
Yao, F., Wang, J., Yang, K., Wang, C., Walter, B. A., and Crétaux, J. F.: Lake storage variation on the endorheic Tibetan Plateau and its attribution to climate change since the new millennium, Environ. Res. Lett., 13, 064011, https://doi.org/10.1088/1748-9326/aab5d3, 2018.
Zhao, G. and Gao, H.: Automatic correction of contaminated images for assessment of reservoir surface area dynamics, Geophys. Res. Lett., 45, 6092–6099, https://doi.org/10.1029/2018GL078343, 2018.
Zhou, W., Wang, L., Li, D., and Leung, L. R.: Spatial pattern of lake evaporation increases under global warming linked to regional hydroclimate change, Commun. Earth Environ., 2, 255, https://doi.org/10.1038/s43247-021-00327-z, 2021.