https://doi.org/10.5194/hess-2024-172
14 Aug 2024
Status: this preprint is currently under review for the journal HESS.

Technical Note: An illustrative introduction to the domain dependence of spatial Principal Component patterns

Christian Lehr and Tobias Ludwig Hohenbrink

Abstract. Principal Component Analysis (PCA) of synchronous time series of one variable, e.g. water level or discharge, measured at multiple locations has been applied in a wide spectrum of hydrological analyses. Principal Components (PCs) have been used in regionalisation and to identify dominant modes, signals, processes or other hydrological properties of the analysed system. The possibility that the PCs of such an analysis can exhibit domain dependence (DD) has so far received little recognition in the hydrological PCA literature. DD describes the situation in which the spatial PC patterns are mainly determined by the size and shape of the analysed spatial domain. Domain size means the spatial extent of the analysed data set; domain shape means the spatial arrangement of the data set's locations. Thus, instead of the hydrological functioning of the analysed system, the spatial PC patterns reflect the functioning of the PCA within the context of the data set's spatial domain. The effect is caused by homogeneous spatial autocorrelation in the analysed series, a common feature of hydrological data sets. DD patterns are distinct, with strong gradients and contrasts, and can be accompanied by a substantial accumulation of variance in the leading PCs. In addition, DD can cause effectively degenerate multiplets, i.e. PCs which are not well separable. All these features are highly suggestive and easily lead to wrong hydrological interpretations. Consequently, DD should be considered in any application in which the PCs are used to draw conclusions about spatially distinct properties of the analysed system. DD patterns calculated for the analysed spatial domain can be used as a reference to test whether spatial PC patterns differ significantly from pure DD patterns. We present two methods, one stochastic and one analytic, to calculate DD reference patterns for defined spatial correlation properties and arbitrary spatial domains.
With a series of synthetic examples, we explore the DD effect with respect to (a) domain shape, (b) domain size and spatial correlation length, and (c) effectively degenerate multiplets. Particular focus is given to the effect of DD on the explained variance of the PCs and the contrasts of their spatial patterns. Finally, we discuss how DD can be taken into account. Accompanying this technical note, R-scripts to (i) demonstrate and explore the DD effect and (ii) perform the presented DD reference methods are provided.
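
The idea behind an analytic DD reference can be illustrated with a minimal sketch. It is not the authors' R implementation. It assumes an exponential correlation model, a hypothetical transect of eight equally spaced sites and a correlation length of three site spacings; the function names are likewise illustrative. The correlation matrix is built from the site distances alone, so any pattern in its eigenvectors stems purely from the domain geometry and the correlation length, not from hydrological processes.

```python
import math


def exp_correlation(coords, corr_length):
    """Correlation matrix under an assumed exponential model r(d) = exp(-d / corr_length)."""
    return [[math.exp(-math.dist(p, q) / corr_length) for q in coords]
            for p in coords]


def matvec(a, v):
    """Product of a square matrix (list of rows) with a vector."""
    return [sum(a_rc * v_c for a_rc, v_c in zip(row, v)) for row in a]


def leading_eigenpairs(matrix, k, iters=500):
    """First k (eigenvalue, eigenvector) pairs of a symmetric positive
    semi-definite matrix, by power iteration with deflation."""
    n = len(matrix)
    a = [row[:] for row in matrix]
    pairs = []
    for _ in range(k):
        # Deterministic, non-symmetric start vector so that it overlaps
        # with symmetric and antisymmetric eigenvectors alike.
        v = [1.0 + 0.1 * i for i in range(n)]
        for _step in range(iters):
            w = matvec(a, v)
            norm = math.sqrt(sum(x * x for x in w))
            v = [x / norm for x in w]
        lam = sum(x * y for x, y in zip(v, matvec(a, v)))  # Rayleigh quotient
        pairs.append((lam, v))
        # Deflation: remove the component just found before the next pass.
        a = [[a[r][c] - lam * v[r] * v[c] for c in range(n)] for r in range(n)]
    return pairs


# Hypothetical domain: eight sites on a straight transect, unit spacing.
sites = [(float(i), 0.0) for i in range(8)]
corr = exp_correlation(sites, corr_length=3.0)
(lam1, pc1), (lam2, pc2) = leading_eigenpairs(corr, k=2)

# The trace of a correlation matrix equals the number of sites, so
# eigenvalue / n is the share of variance explained by that PC.
share1, share2 = lam1 / len(sites), lam2 / len(sites)
print("PC1 loadings:", [round(x, 2) for x in pc1], "share:", round(share1, 2))
print("PC2 loadings:", [round(x, 2) for x in pc2], "share:", round(share2, 2))
```

In this setting the first pattern has loadings of a single sign across the whole transect and the second is a dipole with one sign change at the centre, although the assumed correlation structure is identical everywhere. The same functions accept two-dimensional coordinates, so other hypothetical domain shapes can be tried by changing `sites`.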


Status: open (extended)

  • RC1: 'Comment on hess-2024-172', Anonymous Referee #1, 29 Sep 2024

Data sets

R-scripts to (i) explore the domain dependence (DD) of spatial Principal Components and (ii) calculate DD reference patterns. Christian Lehr. https://doi.org/10.5281/zenodo.11213430


Latest update: 20 Nov 2024
Short summary
In hydrology, domain dependence (DD) of spatial Principal Component patterns is a little-known feature of the widely applied Principal Component Analysis. It easily leads to wrong hydrological interpretations. DD reference patterns make it possible to distinguish spatial patterns that carry hydrological information from those caused by this effect. Here, we (1) explore the DD effect, (2) present two methods to calculate DD reference patterns and (3) discuss how DD can be taken into account. Scripts with an introduction to the DD effect and an implementation of both methods are provided.