1. Introduction
In enclosed water bodies such as lakes, bays, and reservoirs, the water exchange is gradual, so that the nutrients of nitrogen and phosphorus flowing in from the catchment area tend to accumulate [
1]. When particulate phosphorus is deposited deep down, it takes on complex dynamics due to the stratification of the water body. When nutrients eluted by internal flow are delivered to a field with suitable water temperature and illuminance conditions, the bloom of phytoplankton occurs in a field that is spatiotemporally different from the direct supply of nutrients ([
2,
3,
4]). The predominance of certain species of phytoplankton can cause taste and odor problems that contribute to the degradation of drinking water supplies, inhibit recreational uses of surface waters [
5], the reduction of suitable fish habitats, and some assemblages of cyanobacteria produce toxins that are harmful to humans and animals [
6,
7]. In eutrophic lakes, the species of the Microcystis, which have gas vesicles, often dominate and spread on the whole water surface. Harmful Algal Bloom (HAB) are raised as social problems that pose a major hindrance to the environment and human activities [
8,
9,
10,
11].
A radical treatment for this problem is to identify the toxic species and find the environmental variation patterns in which the species occurs [
12,
13], and to control the supply of the nutrients so that they do not exceed thresholds that could lead the algal bloom because it is difficult to control the other factors of the blooming; insolation and water temperature in water bodies. There are many attempts to predict the abnormal occurrence of phytoplankton blooming based on machine learning from various water quality parameters [
14,
15,
16]. However, as mentioned above, the spatiotemporal behavior of the nutrients within a water body is complex. Even if the species identification and water quality analysis are performed from the water sampled at a limited number of locations in the water body, achieving the goal is difficult, and a correspondingly large-scale effort and cost-intensive investigation is required [
17,
18,
19,
20,
21].
On the other hand, the sophistication of the simulations in hydrodynamics and that in the associated changes in water quality have progressed rapidly in recent years. The reproducibility and the spatiotemporal resolution of those distributions within a water body become reliable and high precision [
22,
23,
24]. In proportion to this, the development of observation technology with high spatiotemporal resolution for HABs is expected. This is because the distribution of HABs can change rapidly due to variations in population dynamics and environmental conditions. Normal water quality analysis and microscopic observation require a large amount of time and labor even for just one sample, so it is difficult to understand the entire dynamics of HABs. Except for the above example, there are very few cases where a sufficient number of observations have been made to enable discussion about the transportation of nutrients by currents within the water body.
Remote sensing enables the spatiotemporally high-density observations required for HAB observation in a wide area of water at a relatively low cost. This is because phytoplankton generally proliferates only in shallow layers where sunlight reaches from the surface of the water. In addition, the species of Microcystis, which is one of the groups of HAB, is floating with gas vesicles. Therefore, it is highly compatible with remote sensing from the sky, which measures the intensity of reflection on the water surface, especially in large lakes and coastal areas. There are many examples of such observations. Li et.al. [
27] used multiple satellite image analyses to observe the green tide, which occurs in the southern Yellow Sea and moves northward, severely impacting the coastal ecological environment. Hu et. al. [
28] developed an objective method to estimate the biomass of
Ulva prolifera using satellite image analysis and clarified the transition of its blooming area. In general, the satellite data such as MODIS that comes in every day has a coarse spatial resolution, while the satellites with high spatial resolution such as Sentinel come in once every 1-2 weeks, and the images can only be captured if the sky is clear on that day. In satellite image analysis, it is often necessary to choose either spatial or temporal resolution.
Chen et. al. [
29] calculated the Floating Algae Index (FAI) based on the Advanced Himawari Imager (AHI) data acquired by the geostationary meteorological satellite for the detection of the diurnal algae dynamics in Lake Tai, China. The observations by the geostationary satellites guarantee exceptionally high resolution and imaging frequency. However, such satellite data is not obtained in every region. In analyses based on data from global observation satellites, which can be applied anywhere, it is necessary to select either spatial resolution or inbound frequency depending on the characteristics of the research objects. To supplement this, there are examples of combining various types of satellite images and discussing the short-term decline of HABs [
30,
31,
32,
33].
On the other hand, HAB observation based on the analysis of aerial images taken from Unmanned Aerial Vehicles (UAV) is superior to satellite images in terms of spatiotemporal resolution and in that it is not affected by clouds [
34]. Using a small Unmanned Aerial Vehicle system equipped with a consumer-grade camera, Qu et al. [
35] determined surface-floating cyanobacteria at a maximum detection altitude of 80 m. The small UAV can cover up to 1 km
2 per flight mission, and the short time lag between sampling and flight allows for follow-up monitoring and treatment. Guimarães et al. [
36] photographed a small reservoir with an NGB (near-infrared (N), green (G), blue (B)) camera connected to a UAV, and extracted normalized differential vegetation index (NDVImod) from orthorectified images. The results of multivariate analysis using each spectrum data as an explanatory variable and NDVImod showed some correlation with chlorophyll-a (Chl-a) concentration. Su et al. [
37] tried the estimation of the concentration of Chl-a, Total phosphate and using the average method and pixel-by-pixel matching (MPP) method to search for the optimal regression model from the brightness values of multispectral images obtained by UAV. In each of these research cases, the study site is narrower than in the research based on satellite image analysis, and the observation could be completed with a limited number of images.
Cheng et al. [
38] developed the estimation model for Chl-a concentration from the images taken by the digital camera mounted on the UAV and validated the estimated result against the observed data over a year. As the photography duration increases, the insolation conditions change in a day, and the insolation conditions also differ if the photography takes place on different days and in different seasons, so the brightness value of the photograph changes even for the same color on the water surface. In that study, the brightness values of each spectrum were corrected based on the amount of insolation. The need to correct insolation conditions is an issue not only for aerial images taken by UAVs but also when comparing images taken on different dates in satellite image analysis [
32,
39,
40].
In addition to the insolation and Chl-a, turbidity is a factor that greatly affects the color of water surfaces. Kishino et al. [
41] proposed using a neural network to estimate the concentration of turbidity and Chl-a at once from the images taken by the Aster satellite. Sakuno et.al [
42] developed an integrated algorithm for the remote sensing of Chl-a and turbidity in eutrophic and hyper-turbid waters such as Lake Shinji and Nakaumi in Japan and Vaal Dam Reservoir in South Africa. Palmer et. Al. [
43] investigated indicators for evaluating the distribution of Chl-a concentration in shallow lakes with extremely high turbidity.
Most of the studies on the wide-area observation of HABs using satellite images or UAVs have mainly focused on natural lakes, rivers, and seawater, and only a few studies have been conducted on reservoirs though the water quality management of a reservoir is extremely important for drinking water. The reason is due to the characteristics of each observation method mentioned above. The reservoirs in mountainous areas are smaller than those in the ocean, and satellites such as MODIS, which frequently take images, have insufficient resolution. Alternatively, if a satellite has a certain high spatial resolution, the imaging interval will be weekly, making it impossible to track the dynamics of phytoplankton. To compensate for this drawback, it is preferable to analyze aerial images taken from UAVs and collect spatiotemporally high-resolution information, but fluctuations in insolation strongly affect the analysis results. In addition, because reservoirs are generally narrow and laterally long, there is a spatial distribution of turbidity depending on the distance from the river inflow and the season, which affects the color tone. This tends to cause errors in determining the Chl-a concentration. In addition, in such a narrow and closed water body that has some inflow river, to determine the source of nutrients, it is necessary to be able to observe the spatial distribution of low-ranging chlorophyll in the pre-occurrence stage of HAB bloom.
This research aims to conduct observations using photographs taken from a UAV and to be able to conduct spatially high-resolution analysis freely without restrictions due to photographing date or cloud cover. After calibrating the reflectance of the images based on insolation, we investigated a method for estimating Chl-a concentration using machine learning based on the calibrated reflectance values and turbidity. Not only was the presence or absence of the bloom of HAB identified, but the lower concentrations of Chlorophyll were also assessed. The information on the distribution is expected to support reservoir management, such as identifying sources of nutrients.
5. Conclusions
In this study, we tried to estimate the distribution of chlorophyll-a by analyzing the aerial images taken from UAVs, which have a high spatial resolution and a high flexibility of shooting schedule. The bloom of harmful algae is the most important event to control in reservoir management. It is necessary to detect environmental factors such as oversupply of nutrients at an early stage and take countermeasures. However, in closed water bodies such as reservoirs with limited surface area, the lake water is often completely eutrophic during the bloom stage. Furthermore, since floating harmful algae are blown by wind, it is considered difficult to identify the source of nutrients from their spatial distribution.
It was expected that the hue of the lake surface was determined by the turbidity discharged after flooding, in addition to the chlorophyll concentration. We calculated WTI from satellite images, and used it as one of the explanatory variables, together with the RGB values of aerial photographs to regression the chlorophyll concentration. The proposed method obtained good reproducibility. Turbid water behaves more slowly than chlorophyll, so even if there is a slight temporal gap between the date of aerial photography by UAV and the day of shooting from satellites, the turbidity distribution does not change significantly. It will be possible to synchronize the evaluation of turbidity and spatial distribution of chlorophyll by equipping a UAV with a near-infrared camera [
36], instead of using satellite images to evaluate WTI. Since, likely, appropriate satellite images cannot be obtained due to weather conditions, it is desirable to obtain near-infrared reflectance using a UAV in the future to determine WTI. In this study, turbidity was recognized as a factor that could affect the hue of the lake surface after a considerable number of observations had been completed, so the monitoring with the UAV mounting a near-infrared sensor has not been done.
The chlorophyll-a concentrations evaluated by the proposed method were plotted on maps, which visually represented the temporal fluctuations of chlorophyll-a, and it was also possible to see that the chlorophyll-a concentrations differed depending on the inflowing rivers, even on the same day. However, the reflectance characteristics of chlorophyll and turbid water depend on the type of plankton that dominates and the geological characteristics of the basin. So, to regress chlorophyll-a concentration generally for any reservoir using a machine learning model, it is necessary to prepare learning data for each reservoir and build a model or to build a single model using images obtained from multiple reservoirs as training data. The results presented in this study were only based on the observations from the reservoir. For further study, it is necessary to improve the accuracy of the model by photographing and observing water quality in a lot of reservoirs.
Author Contributions
Conceptualization, M.I.; methodology, M.I., Y.M. and M.Y.; software, Y.M. and M.Y.Y.M.; validation, M.I., Y.M. and M.Y.; formal analysis, M.I., Y.M. and M.Y.; investigation, M.I., Y.M. and M.Y.; resources, M.I.; data curation, Y.M. and M.Y.; writing—original draft preparation, M.I.; writing—review and editing, M.I.; visualization, Y.M. and M.Y.; supervision, M.I.; project administration, M.I.; funding acquisition, M.I.