Water quality parameters
For a general overview of the comparison between STOM and grab sampling, the mean of all yearly Performances (according to
Figure 4) were calculated and concluded as a heat map. Results for single years can be found in the supporting material. The parameters were chosen from the 6 x 21 matrix (1-6 months return interval and 1-21 days of sensor application duration) and selected to favor a lower interval of return interval over the duration of the sensor application. For example, if a similar Performance of zero was achieved for either a 3 month return interval and 20 days of sensor application or for 2 month return interval and 2 days of sensor application the first parameter set was chosen (as a longer application is easier feasibly than a regular installation of the sensor). The predicted intercepts, or break even points, where STOM becomes more accurate than grab sampling, are marked in the following graphs with a black frame. Differences between empirical (number in the box) and fitted intercepts of the Performance index are caused by deviation between the mixed linear regression model and resampling data. The results of the five monitoring stations are sorted by parameter and catchment area:
Figure 5.
Comparison of STOM and grab sampling for nitrogen-nitrate, values were calculated according to
Figure 4, break even points from regression model are highlighted with black frames.
Figure 5.
Comparison of STOM and grab sampling for nitrogen-nitrate, values were calculated according to
Figure 4, break even points from regression model are highlighted with black frames.
For nitrogen-nitrate monthly grab sampling lead to 3.0% mean absolute deviation or assessment error from the complete data set. Mean grab sampling errors were similar among all catchments ranging from 2.5 to 3.5%, with a tendency to get reduced by increasing watershed size.
As
Figure 5 concludes, STOM outperformed grab sampling at similar duration-interval combinations in all catchments. Return intervals were monthly or bi-monthly, whereas a duration of one day sufficed in four out of five catchments. For all catchments, the resampling-based Performance yielded similar or better Performance with monthly one-day STOM.
The coefficient for STOM sampling duration were extracted from the mixed linear regression model and ranged from 0.007 to 0.01, with lowest value at Mulde river and highest at Neiße river, the two intermediate size rivers in the study. This indicates a systematic improvement of the relative STOM Performance with increasing sampling duration. Notably, the Performance improvement is more pronounced at shorter return intervals. The coefficient of log-transformed sampling interval ranged from -0.44 to -0.52, again with Mulde (lowest) and Neiße (highest) defining the range. Hence, for Mulde river STOM Performance appeared more sensitive to return interval while for Neiße sampling duration was more decisive.
Results of STOM for single years did also not exceed a return period of two months, only in 2013 Neiße STOM showed the best outcome, that is indicated by the earliest break-even point among all stations with a return interval of three months and an application duration of 15 days.
Figure 6.
Comparison of STOM and grab sampling for chloride, values were calculated according to
Figure 4, break even points from regression model are highlighted with black frames.
Figure 6.
Comparison of STOM and grab sampling for chloride, values were calculated according to
Figure 4, break even points from regression model are highlighted with black frames.
The overall mean values for chloride did exhibit similar patterns as nitrogen-nitrate. Monthly grab sampling lead to 3.6% mean absolute deviation or assessment error from the complete data set. While the smallest catchment (Lockwitzbach, Neiße) showed the highest mean absolute errors (4.2% & 4.3%) the error margin got reduced towards bigger rivers to 3.5% at Mulde and 2.0% at Elbe.
For all catchments a return-intervals of two months provided better Performance of STOM than grab sampling. Application durations at the break-even point, ranged between 12 to 21 days (
Figure 6). Combinations of monthly one-day sampling or bi-monthly 15-day sampling always outperformed grab sampling.
The coefficient for STOM sampling duration ranged from 0.009 to 0.011, with lowest value at Mulde river and highest at Lockwitzbach / MS6 river. Indicating a slightly stronger improvement of the relative STOM Performance with increasing sampling duration, as compared to nitrate results. The coefficient of log-transformed sampling interval ranged from -0.35 at Lockwitzbach / MS6 to -0.41 at Elbe, this corresponds to the smallest and largest catchments in the study. Hence, for chloride sampling, return-interval is more decisive in large than in small catchments, despite larger summer-winter differences at Lockwitzbach (compare
Table 2).
STOM showed the best Performance compared to grab sampling among all stations and all years with a return interval of three months and 17 days in 2014 at the monitoring station in Görlitz (Neiße) and at MS6 (2018). However, the worst scenarios were found at Elbe/Schmilka for several years with a similar Performance than nitrogen-nitrate (one month return interval and one day of application) in 2014, 2015, 2016, 2017, 2018, 2020 and in Mulde/Bad Düben in 2009, 2013, 2014, 2016 and 2017.
Figure 7.
Comparison of STOM and grab sampling for dissolved oxygen, values were calculated according to
Figure 4, break even points from regression model are highlighted with black frames.
Figure 7.
Comparison of STOM and grab sampling for dissolved oxygen, values were calculated according to
Figure 4, break even points from regression model are highlighted with black frames.
Monthly grab sampling lead to 80.4% mean absolute deviation or assessment error from the complete data set. Lockwitzbach showed both, the highest and smallest mean absolute errors (MS6: 20% & 246%). No effect of catchment size on the error could be identified, the Neiße in Görlitz showed an error of 42.6%, the Elbe in Schmilka 31.2% and at Mulde in Bad Düben 61.8%.
DO sampling Performance underlines the potential of STOM, for all catchments a return-interval of half a year was sufficient with an application duration between 1 to 19 days, depending on the stream, to be as good as monthly grab sampling (
Figure 7). In all cases a STOM regime of 3-monthly sampling during one day or five-monthly sampling during twelve days outperforms monthly grab sampling.
The coefficient for STOM sampling duration ranged from 0.014 to 0.022, with lowest value at Lockwitzbach / MS 6 and very similar values at the larger water bodies. Hence, of all three water constituents, dissolved oxygen sampling accuracy benefits most from longer STOM sampling duration. The higher coefficients at both Lockwitzbach stations coincide with more pronounced day-night differences there. The coefficients of log-transformed sampling interval ranged from -0.21 at Lockwitzbach / MS4 to -0.48 at Elbe, suggesting that oxygen sampling at the larger rivers benefits more from a reduced return-interval than sampling at the smaller stream.
For several years the rarest option (six month return interval and one day of monitoring) were reached at Mulde/Bad Düben (2018) and MS4 (2018,2019,2020). For Elbe/Schmilka and Neiße/Görlitz five months return intervals and 18/21 days of sensor application were the worst Performances in 2009 and 2012 respectively. According to the OGewV, the yearly minimum DO concentrations or the mean of max. three consecutive yearly minima need to be selected for the classification. Results from STOM with frequent return intervals and long application duration frequently detected the “real” yearly minimum value according to the OGewV regulation. These values were omitted for the calculation of the presented mean values, as well as in the uncertainty assessment, since they would yield “infinite” Performance (Δ
STOM = 0, division by zero, see
Figure 4).