1. Introduction
The catchment areas are important ecological components that provide a variety of ecosystem services that benefit society and biodiversity [1,2]. These include the availability of freshwater for consumption, support the societal needs such as food production, sport, and recreation, and providing a habitat for both [
3,
4,
5]. Such essential services can be sustained when the biophysical environment of the catchment areas is sustainably managed. However, natural land cover and freshwater resources in the water catchment areas are threatened by Land use/Land cover (LULC) changes in response to societal demands [
6]. It is essential to map and monitor the water catchment landscape dynamics to provide valuable information on the distribution of the land use activities in the catchment, which serve as the tool for initiatives whose mandates aim to strike a balance between developments and the natural environment in the catchment areas. In addition, this information will help water resource managers and other environmental health practitioners establish environmental management plans and policies based on available information [
7,
8]. Moreover, the availability of such information is important as it can be used to troubleshoot water quality issues and prompt the identification of land resource degradation [
9].
Mapping the water catchment land use activities can be achieved by either traditional field observation or remote sensing methods. Remote sensing emerged as the convenient and cost-effective way to provide the spatiotemporal data required to analyze heterogeneous landscapes at a large scale [
10]. In addition, the remote sensing data can be systematically stored, maintained, and openly shared to end users. A variety of remote sensing sensors (optical and active sensors) have been utilized to map the LULC in the catchment areas. [
11] assessed the influence of the contextual data features and various dimensionality filters, including Discrete Wavelet Transform (DWT), in mapping LULC using Headwall Hyperspec data in Bergama, Turkey. The random forest (RF) and support vector machine (SVM) were employed, and RF and DWT data obtained higher overall accuracy and kappa scores of 88.13% and 0.88, respectively. In comparison, SVM recorded lower accuracy than RF. [
12] have used Environmental Mapping and Analysis Program (EnMAP) imagery data to map the LULC using pixel-based machine learning algorithms (MLAs), such as RF and SVM image classification. Their results showed that the SVM image classifier returned an overall accuracy of 92.6%, while the RF model obtained an overall accuracy of 88.1%. [
13] used PRISMA hyperspectral data with machine learning methods—such as RF, artificial neural network (ANN), and convolutional neural network (CNN), to classify the land use activities in Naples, Italy. The CNN achieved the best results at 0.973% and 0.968 overall accuracy and Kappa, respectively. As the second-best model, ANN recorded 0.963% and 0.956 and RF showed the lowest classification accuracy at 0.887% and 0.867, respectively.
Other studies have demonstrated the potential of very high spatial resolution (VHR) multispectral data in providing the same service as hyperspectral data in different terrestrial settings and achieving higher classification accuracies. For instance, [
14], utilized Pleiades-1 imagery to map the LULC in Côte d’Emeraude, France. Maximum likelihood (ML) and SVM algorithms were used to implement the classification. ML algorithm yielded a higher overall accuracy of 84.64% compared to SVM, which only produced a map with an overall accuracy of 76.13%. In Fayoum City, Egypt, [
15] evaluated a variety of classifiers in mapping the city land cover classes using PlanetScope imagery. They found the ANN model more accurate with a 0.97 kappa compared to Naïve Bayes (NB), SVM, RF, and decision trees (DT), which recorded 0.93, 0.90, 0.86, and 0.87, respectively.
Some studies explored the capabilities of moderate to low-resolution sensors' capabilities in classifying spatial land use classes. For example, in a comparative study, [
7] employed Landsat-8 (L8-OLI) data and SVM and RF algorithms to map Nkandla Forest cover classes in South Africa. L8-OLI was found efficient in mapping the forest cover, and SVM was found more accurate with overall accuracy and kappa records of 95.83 % and 0.94, compared to 95.24 0.93 achieved by RF, respectively. On the west coast of India, a study by [
16] assessed the efficiency of the MLAs in mapping LULC using Sentinel-2 (S2) imagery. NB, Classification And Regression Trees (CART), gradient tree boost (GTB), and RF MLAs were assessed, and the models achieved an overall accuracy of 81.34%, 90.18%, 91.06%, and 92.14%, respectively. A study by [
10] also used L8-OLI data to evaluate the capabilities of MLAs in land use analysis in Casablanca, Morocco. The RF, SVM, CART, and GTB algorithms were among the tested methods, and RF showed robust performance with an overall accuracy of 95.42%, compared to other classifiers with an overall accuracy of 83%, 91.50%, and 93.46%, respectively. A study by [
17] compared SVM and RF algorithms using S2 datasets without and with spectral indices to analyze the LULC hilly terrain of Gopeshwar, India. The RF models provided more accurate results in both datasets, with overall accuracy and kappa of 86% and 0.83 for a dataset with only spectral bands and 88% and 0.85 for a dataset with incorporated spectral indices, respectively. While SVM reported the least accuracy, with overall accuracy and kappa of 82% and 0.79 for spectral bands and 87% and 0.83 for incorporated indices, respectively. In the Munneru River Basin, India, [
18] explored the performance of the MLAs viz. SVM, RF, and CART are used to map the LULC based on L8-OLI and S2 data. They found S2 and RF to dominate their counterparts. The average overall accuracy for RF, SVM, and CART in L8-OLI was 94.85%, 90.88%, and 82.88%. In comparison, S2 average overall accuracy for RF, SVM, and CART was 95.84%, 93.65%, and 86.48%, respectively. [
19] evaluated the MLAs in classifying the land use dynamics using L8-OLI data in the Lake Haramaya catchment, Ethiopia. Object-based image analysis (OBIA), SVM, RF, and ANN were tested, and the SVM dominated other classifiers with an overall accuracy of 94%, while RF and ANN achieved 92% and 89%, respectively. While OBIA reported the least overall accuracy at 75%. [
20], compared the parametric and non-parametric classifiers in mapping LULC based on L8-OLI data in the Big Sunflower River watershed, United States of America (USA). RF, SVM, and the ML were factored in, and their results showed SVM showed a robust performance with an overall accuracy (kappa) of 93.5% (0.88), 88.8% (0.82), and RF with the least performance at 84.6%, (0.72).
The success of land cover mapping is dependent on the classification algorithm utilized, not only the remote sensing data. As discussed above, various machine-learning algorithms have been used in the remote sensing field to explain the distribution of spatial features in catchment areas, with varying degrees of accuracy reported. Their performance varies with the biophysical features of the landscapes [
16]. Hence, there is no generic approach for LULC classification for all terrestrial landscapes [
19]. Therefore, it is vital to compare the MLA models to determine their suitability and sensitivity to different settings [
19]. In view of that, if any, little has been done to assess the suitability of the MLAs in the analysis of LULC in the case of South African river catchment landscapes. The current study employed L8-OLI data to map the thematic land use classes of the uMngeni river catchment in KwaZulu Natal, South Africa. Further, the performance of four MLAs, such as NB, RF, SVM, and ANN, was assessed in the uMngeni River Catchment terrestrial landscape. The current study aims to (i) map the LULC that defines the uMngeni River catchment area and (ii) compare the performance of four non-parametric pixel-based MLAs in a supervised classification based on L8-OLI data. The study is expected to improve the understanding of LULC configuration over the uMngeni River catchment, a key element in building a LULC change and projection module within the catchment, to generate information for water resource management strategies.
4. Discussion
The four non-parametric MLAs (NB, RF, SVM, and ANN) were complemented in GEE to classify the nine land use classes. To unpack the status of the LULC and the spatial extent of individual classes in the uMngeni River catchment. The outcomes of this work will help to identify tools suitable for planning toward safeguarding freshwater resources. Also provides an insight into the application of MLAs and their respective abilities to explain the LULC in the undulating river catchment area.
The thematic classes were classified based on the use of the spectral properties from the training sites for each class. The grassland class was identified as the most dominant class. It is well distributed with an area extent of 40.99% of the total catchment area. The grassland class is made up of small shrubs, herbaceous, and different types of grass species. It forms the carpet-like layer on the surface of the soil with the roots embedded in the soil, which strengthens the topsoil from stripping during the fluvial period, increases the water infiltration rate, and plays a significant role in transpiration and photosynthesis. Thus, plays a crucial role in water quality and groundwater recharge.
The forest provides similar ecological functions as the grassland; it covers an average of 23.85% of the total catchment area, which makes it the second largest class after grassland. It is abundant in the middle regions, and partly scattered patches are observed in the lower and the upper regions. The areas covered by the forest class are characterized by woody plants for both open and closed canopies. Most importantly, it serves as carbon storage, indirectly saving water from degradation by mitigating climate change [
7]. Therefore, the attempt to maintain the existing natural vegetation cover should made and possibly promote initiatives that seek to expand the area cover of these classes. For example, the South African government targeting planting ten million trees in the next five years to rehabilitate the degraded green areas.
The cropland is the fifth largest class, which covers an area of 10.06% of the study area. The cropland includes commercial and subsistence farming. It is evenly distributed around the catchment, but homogeneous patches are observed in the north of the middle regions towards the tip of the upper regions (
Figure 5). In the same region, about 9.92% of the plantation area cover was observed extending towards the south of the upper region. The plantations are characterized by planted wood trees for commercial reasons. The areas dominated by cropland fields and plantations are characterized by poor water quality due to the inputs applied to either maximize the yield or to control the insects [
9]. The residuals tend to remain in the soil for a longer time as non-point pollutant. During rain periods pollutants get activated and splashed into the waterbodies via run-off and ultimately change the chemical composition of the water with implications for the water quality degradation. With the realization of the impact of agricultural inputs on waterbodies. The environmental managers will have to ensure control measures are in place to promote sustainable agriculture and timber.
Among the classes that account for the bigger share of the catchment area, the built-up class is the last at 10.27% distributed around the catchment, with dense patches in the lower regions and the southern part of the middle region. It is also necessary to highlight the visibility of the built-up class around or along waterbodies. This class comprises a range of infrastructure types that serve different purposes such as residential areas, industrial, and economic corridors, towns, and public service infrastructures. The built-up class creates impervious surfaces with low water infiltration, increasing the water run-off, soil degradation, hiking of the non-point pollutants to the rivers, and ultimately increasing the stream rate flow with the implications for water quality degradation and hydrological structures. The increase of built-up structures goes together with the package of services such as wastewater drainage facilities which eventually discharge into the fresh waterbodies. Considering the detriment effects that rise from the built-up land use class, practical measures to control the urban sprawling must be developed and a balance between industrial activities and fresh waterbodies must leveled.
Barren and mining pose similar threats to freshwater resources as built-up, they cover a relatively small portion of the catchment, 3.03%, and 0.08 % respectively, as shown in
Table 6. Barren includes areas characterized by poor vegetation or completely bare soil. It is one of the main sources of water turbidity, it allows the topsoil particles to splash into the nearby rivers reducing the transparency of the water, which causes other aquatic species to struggle to survive. The mining close is the smallest land use class with snail trail-like structures and pits mining partly distributed randomly in the study area.
Water and wetland classes cover 1.69% and 0.12% of the uMngeni River catchment area, respectively. These two classes include dams, complex river networks, ponds, and perennial and non-perennial wetlands, member of the wetland class are likely found in the tip of the upper region and in the banks of the dams. They play an important role in promoting the functionality of ecological processes by naturally purifying the water providing the medium for aquatic organisms and supporting day-to-day human needs.
The quantitative analysis of the LULC in the catchment unpacked the scientific information and distribution of the land use types. This information will serve as a reference to the freshwater resource managers and environmental health specialists in developing management plans and policies for freshwater resources based on informed decisions.
It is crucial to assess the performance of the MLAs at the class level, as models produce different comes in different classes [
30]. In a view of that, only the ANN model confidently delineated the river path as part of the water class but could not pick up the small water bodies. In contrast, RF, SVM, and NB struggled to distinguish between small river paths and built-up class. The RF and SVM show a strong ability to map small water bodies. Similar results have been reported by [
62]. highlighting the efficiency of SVM in delineating small waterbodies. However, it performed poorly in the identification of the mining class members. NB and ANN on the other hand significantly demonstrated the swapping of the pixels between built-up and mining classes, while RF succeeded in separating these classes with a high user accuracy score. The spectral overlap among the set of classes that share similar optical properties and the sensitivity of the models to the training dataset can account for this error [
29] found similar results, claiming that RF possesses a strong generalization technique for LULC analysis which makes it less sensitive to the small variations in input vector data. It is worth reporting the challenge encountered by all MLAs to delineate the wetland areas except the RF model. NB and ANN tend to overfit the wetland class, while SVM commissioned some of the wetland pixels to grassland. These findings were also observed in other studies, [
20] suggested that this can be associated with composite signatures from the mixed pixels caused by the L8-OLI 30 m resolution and sparse vegetation within the wetland. In addition, this can also be influenced by the proportion of the wetland class and their relatively small training sample allocation. The accurate assessment at the class level provides the opportunity to put together the overall performance of the MLAs.
Overall, all the models performed well and produced more realistic LULC maps that resemble the reality on the ground. The findings of this study indicate that RF outperformed all the MLAs with slightly higher OA and Kappa values from those recorded by the SVM model as the second-best classifier, which produced higher accurate results compared to ANN and NB. The comparison of the RF and SVM has been debated in many previous studies. [
18] found RF to be robust in assessing LULC using multiple MLAs, as did [
64] in mapping the Cocco forest classes using various MLAs. They attributed the performance of the model to its less sensitivity to training data size and the advantage of the random multi-decision trees in generalization. In contrast, [
7] found SVM to outperform the RF class in mapping natural forest cover classes, highlighting that the performance of the model can be credited to its default parameters. [
19] in evaluating MLAs for dynamic land use, found SVM to be superior to RF, they credited the performance of the models to the use of the kernel functions to optimize the hyperplanes to find the optimum margin to classify the complex land use classes.
An emerging MLA in the remote sensing field, Nave Bayes was found not suitable for analysis of the complex land use landscapes. Similar findings were also reported in other studies, [
29] in the performance of multi-MLAs in multi-data sources, which found NB to be not suited for complex LULC analysis. A similar study by [
30] also found similar results, highlighting the limitations in the inherent design of the model to deal with high dimensional data. [
47], associated NB performance with the limitations of its model to limited training data. The NB classifier can be effective in large areas with more linear land cover pattern.
The results of our study proved the excellence of the RF model, which produced the higher accurate LULC analysis of the uMngeni River catchment. These results will fill the gap in the literature by identifying the suitable MLA for quantitative analysis of dimensional LULC in undulating watershed areas. In addition, this can be a useful tool for water resource managers and environmental health stakeholders to facilitate the development of freshwater resource management strategies by unpacking the visual and qualitative LULC information for the watershed areas.
The L8-OLI multi-spectral bands with incorporated indices provided a good performance in classification of uMngeni River Catchment. For such reasons, L8-OLI data was found to be reliable for analysis of the complex LULC in the undulating landscape of the catchment areas. However, it was found to be challenging to classify the pixels in the edges of the big parcels of class and build-up class members in the core of barren and/or grassland or partially masked by trees. These findings are consistent with findings reported by [
20], who found that pixels in the transition zone between two parcels have been misclassified due to mixed-pixel composite signatures, as explained by [
65,
66]. [
18] also reported that it is challenging to discriminate classes from others in L8-OLI 30 m resolution images due to mixed pixels. Other studies factored in the limitations of pixel-based algorithms, [
25] associated mixed pixel effects with the restrictions of pixel-based models to go beyond the pixel level.