Preprints
https://doi.org/10.5194/hess-2021-621
https://doi.org/10.5194/hess-2021-621
 
04 Jan 2022
04 Jan 2022
Status: a revised version of this preprint was accepted for the journal HESS and is expected to appear here in due course.

Seasonal forecasting of lake water quality and algal bloom risk using a continuous Gaussian Bayesian network

Leah Jackson-Blake, François Clayer, Sigrid Haande, James Sample, and Jannicke Moe Leah Jackson-Blake et al.
  • Norwegian Institute for Water Research (NIVA), 0349 Oslo, Norway

Abstract. Freshwater management is challenging, and advance warning that poor water quality was likely, a season ahead, could allow for preventative measures to be put in place. To this end, we developed a Bayesian network (BN) for seasonal lake water quality prediction. BNs have become popular in recent years, but the vast majority are discrete. Here we developed a Gaussian Bayesian network (GBN), a simple class of continuous BN. The aim was to forecast, in spring, total phosphorus (TP), chlorophyll-a (chl-a), cyanobacteria biovolume and water colour for the coming growing season (May–October) in lake Vansjø in southeast Norway. To develop the model, we first identified controls on inter-annual variability in water quality using correlations, scatterplots, regression tree based feature importance analysis and process knowledge. Key predictors identified were lake conditions the previous summer, a TP control on algal variables, a colour-cyanobacteria relationship, and weaker relationships between precipitation and colour and between wind and chl-a. These variables were then included in the GBN and conditional probability densities were fitted using observations (≤ 39 years). GBN predictions had R2 values of 0.37 (cyanobacteria) to 0.75 (colour) and classification errors of 32 % (TP) to 13 % (cyanobacteria). For all but lake colour, including weather nodes did not improve predictive performance (assessed through cross validation). Overall, we found the GBN approach to be well-suited to seasonal water quality forecasting. It was straightforward to produce probabilistic predictions, including the probability of exceeding management-relevant thresholds. The GBN could be purely parameterised using observed data, despite the small dataset. This wasn’t possible using a discrete BN, highlighting a particular advantage of using GBNs when sample sizes are small. Although low interannual variability and high temporal autocorrelation in the study lake meant the GBN performed similarly to a seasonal naïve forecast, we believe the forecasting approach presented could be useful in areas with higher sensitivity to catchment nutrient delivery and seasonal climate, and for forecasting at shorter time scales (e.g. daily to monthly). Despite the parametric constraints of GBNs, their simplicity, together with the relative accessibility of BN software with GBN handling, means they are a good first choice for BN development, particularly when datasets for model training are small.

Leah Jackson-Blake et al.

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on hess-2021-621', Anonymous Referee #1, 21 Jan 2022
    • AC1: 'Reply on RC1', Leah Jackson-Blake, 09 Feb 2022
      • RC6: 'Reply on AC1', Anonymous Referee #1, 28 Feb 2022
        • AC6: 'Reply on RC6', Leah Jackson-Blake, 28 Feb 2022
  • RC2: 'Comment on hess-2021-621', Anonymous Referee #2, 16 Feb 2022
    • AC2: 'Reply to RC2', Leah Jackson-Blake, 24 Feb 2022
      • RC3: 'Reply on AC2', Anonymous Referee #2, 24 Feb 2022
        • AC3: 'Reply on RC3', Leah Jackson-Blake, 24 Feb 2022
          • RC4: 'Reply on AC3', Anonymous Referee #2, 24 Feb 2022
            • AC4: 'Reply on RC4', Leah Jackson-Blake, 25 Feb 2022
              • RC5: 'Reply on AC4', Anonymous Referee #2, 25 Feb 2022
                • AC5: 'Reply on RC5', Leah Jackson-Blake, 28 Feb 2022
                  • RC7: 'Reply on AC5', Anonymous Referee #2, 28 Feb 2022
                    • AC7: 'Reply on RC7', Leah Jackson-Blake, 02 Mar 2022

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on hess-2021-621', Anonymous Referee #1, 21 Jan 2022
    • AC1: 'Reply on RC1', Leah Jackson-Blake, 09 Feb 2022
      • RC6: 'Reply on AC1', Anonymous Referee #1, 28 Feb 2022
        • AC6: 'Reply on RC6', Leah Jackson-Blake, 28 Feb 2022
  • RC2: 'Comment on hess-2021-621', Anonymous Referee #2, 16 Feb 2022
    • AC2: 'Reply to RC2', Leah Jackson-Blake, 24 Feb 2022
      • RC3: 'Reply on AC2', Anonymous Referee #2, 24 Feb 2022
        • AC3: 'Reply on RC3', Leah Jackson-Blake, 24 Feb 2022
          • RC4: 'Reply on AC3', Anonymous Referee #2, 24 Feb 2022
            • AC4: 'Reply on RC4', Leah Jackson-Blake, 25 Feb 2022
              • RC5: 'Reply on AC4', Anonymous Referee #2, 25 Feb 2022
                • AC5: 'Reply on RC5', Leah Jackson-Blake, 28 Feb 2022
                  • RC7: 'Reply on AC5', Anonymous Referee #2, 28 Feb 2022
                    • AC7: 'Reply on RC7', Leah Jackson-Blake, 02 Mar 2022

Leah Jackson-Blake et al.

Leah Jackson-Blake et al.

Viewed

Total article views: 879 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
648 201 30 879 7 7
  • HTML: 648
  • PDF: 201
  • XML: 30
  • Total: 879
  • BibTeX: 7
  • EndNote: 7
Views and downloads (calculated since 04 Jan 2022)
Cumulative views and downloads (calculated since 04 Jan 2022)

Viewed (geographical distribution)

Total article views: 760 (including HTML, PDF, and XML) Thereof 760 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 25 May 2022
Download
Short summary
We developed a Gaussian Bayesian Network (GBN) for seasonal forecasting of lake water quality, including algal bloom risk, in a shallow nutrient-impacted lake in southern Norway. Discrete Bayesian networks are commonly used in environmental modelling, but this is one of very few applications of a continuous BN. The GBN approach proved to be promising, particularly where training datasets are small.