Scale Effect of Land Cover Classification from Multi-Resolution Satellite Remote Sensing Data

Preprint

Essay

Scale Effect of Land Cover Classification from Multi-Resolution Satellite Remote Sensing Data

Altmetrics

Downloads

174

Views

Comments

A peer-reviewed article of this preprint also exists.

This version is not peer-reviewed

Submitted:

05 May 2023

Posted:

06 May 2023

You are already at the latest version

Alerts

Abstract

Land cover data are important basic data for earth system science and other fields. Multi-source remote sensing images have become the main data source for land cover classification. There are still many uncertainties in the scale effect of image spatial resolution on land cover classification. Since it is difficult to obtain multiple spatial resolution remote sensing images of the same area at the same time, the main current method to study the scale effect of land cover classification is to use the same image resampled to different resolutions, however errors in the resampling process lead to uncertainty in the accuracy of land cover classification. To study the land cover classification scale effect of different spatial resolutions of multi-source remote sensing data, we selected 1 m and 4 m of GF-2, 6 m of SPOT-6, 10 m of Sentinel-2 and 30 m of Landsat-8 multi-sensor data, and explored the scale effect of image spatial resolution on land cover classification from two aspects of mixed image element decomposition and spatial heterogeneity. For the study area, we compared the classification obtained from GF-2, SPOT-6, Sentinel-2, and Landsat-8 images at different spatial resolutions based on GBDT and RF. The results show that (1) GF-2 and SPOT-6 had the best classification results, and the optimal scale based on this classification accuracy was 4–6 m; (2) the optimal scale based on linear decomposition depended on the study area; (3) the optimal scale of land cover was related to spatial heterogeneity, i.e., the more fragmented and complex was the space, the smaller the scale needed; and (4) the resampled images were not sensitive to scale and increase the uncertainty of the classification. These findings have implications for land cover classification and optimal scale selection, scale effects and landscape ecology uncertainty studies.

Keywords:

Subject: Environmental and Earth Sciences - Remote Sensing

1 Introduction

Remote sensing provides data on a large scale and around the clock for use in various industries. Land cover classification data obtained by remote sensing are essential data for studying surface processes and for climate model simulations [1,2,3,4]. In recent decades, with the development of space science and multi-platform remote sensing, multi-sensor and multi-angle technologies, the spatial resolution, spectral resolution and temporal resolution of remote sensing images have been improving. Spatial resolution is one of the basic characteristics of remote sensing images, and scale effect in remote sensing is a key research problem. Woodcock [5] considered that spatial resolution should be similar to the scale of observation. Multi-source satellite remote sensing images have become the basic data source for regional, national and global mapping. Studies have shown that land cover mapping is influenced by the spatial resolution of remote sensing images, which has an obvious spatial scale effect [6,7,8]. The scale effect of the information acquired by remote sensing is the key to obtain optimal scale land cover mapping based on the optimal resolution for a particular study area. Han et al [9] used the method of information entropy to solve the average entropy of category differentiability of image data at each scale and calculate the optimal scale. They showed that the optimal scale has a relationship with the spatial distribution characteristics of the features. Treitz [10] used the variance function to calculate the optimal spatial resolution based on the theory of spatial autocorrelation analysis of spatial statistics, and concluded that the optimal scale was related to the ground scene and sensor parameters. Dongping Ming et al [11] proposed an improved local variance method based on variable window and variable resolution to determine the optimal resolution using local variance as a measure. Their results are not applicable to large area and complex environments. Feng et al [12] used a Triangular Prism Method and Double Blanket Method to determine the resolution of images with three fractal dimensions, and the results had uncertainties. Ming et al [13] studied the optimal spatial resolution of different features in remote sensing images by using the improved method of average local variance, and concluded that variance increases with increasingly complex feature information. The above studies are based on the optimal spatial resolution calculated by geostatistical and classical statistical methods, and there is a large uncertainty in the research conclusions. Research on the scale effect of remote sensing images still lacks clear conclusions.

Generally speaking, features have inherent scales, and the expression at the inherent scale is the most realistic representation of the features [1,14]. However, it is difficult to obtain images with different spatial resolutions from multiple satellites covering the same area due to weather conditions, satellite transit times, and other factors, so most current studies of scale effects often use the same image resampled to obtain data with different spatial resolutions [6,15,16,17,18,19,20,21,22]. However, due to the existence of spatial heterogeneity, resampling can cause distortion of features and loss of spectral information, and the resampled images are different from the real satellite images at a specific spatial scale. Therefore, the results of these studies are still somewhat questionable [23,24].

Markham and Townshend [25] argued that remote sensing classification accuracy is mainly influenced by two factors. The first factor is the image elements at the edge between categories in the classification results, i.e., the hybrid image elements. When the spatial resolution of the image increases, the number of hybrid image elements at the edge between different ground feature categories decreases, and the classification accuracy increases. The second factor is spatial heterogeneity. When the spatial resolution increases, the variability of spectral features within the same feature category increases, which causes the inter-category separability to decrease, thus leading to a decrease in classification accuracy. On the surface, the accuracy effects of spatial resolution variation in relation to mixed image elements and spatial heterogeneity are contradictory. However, the variation in classification accuracy ultimately still depends on the relative relationship between the spatial resolution of the image and the size of the target within the scene. For larger homogeneous targets, the reduction of spatial resolution only increases the number of hybrid pixels at the edges, but does not cause a change of spectral variability between pixels within the target, so the classification accuracy is reduced. In contrast, for targets with large spectral spatial heterogeneity, the reduction of spatial resolution increases the number of hybrid pixels at the edges, but the smoothing effect of the reduced spatial resolution may improve the accuracy of the final classification results, which may reduce the intra-class spectral variation and increase the distinguishability between classes. Woodcock and Strahler [5] argued that the net effect of these two conflicting factors is a function of the environment of the image scene. Therefore, it is necessary to analyze the effect of spatial resolution variation on land cover classification accuracy based on multi-source remote sensing data in terms of both mixed image element decomposition and spatial heterogeneity, and to study the optimal scale effect of land cover classification at different spatial resolutions from multi-source remote sensing data.

In this study, we selected 1 m and 4 m data from GF-2 satellite, 6 m data from SPOT-6, 10 m data from Sentinel-2, and 30 m data from Landsat-8 OLI to quantitatively investigate the relationship between land cover classification results and different spatial resolution from multiple satellite remote sensing data , and explore how the classification accuracy varies with spatial resolution, and to investigate whether the resampled remote sensing data have any influence on the scale analysis comparing with real remote image data, and which scale can most accurately represent the ground truth distribution characteristics of land cover. The results can provide a reference for selecting the optimal scale for land cover classification and a basis for scale conversion.

2 Study Area and Data Pre-Processing

2.1. Overview of the Study Area

The Huangshui River is an important first-class tributary of the upper reaches of the Yellow River, and the Huangshui basin is located in the northeast of Qinghai province, between 36°02´-37°28´N, 100°42´–103°04´E. The basin area covers 16120 km². The main cities in the basin include Xining city and Haidong city, the main population gathering areas in Qinghai province. The land cover is greatly affected by human activities, with diverse feature types and fragmented feature patches. Xining is the capital city, as well as the political, economic, transportation and cultural center of Qinghai province. Its administrative area includes four districts and three counties (Huangyuan County, Datong County and Huanzhong County). Haidong City includes two municipal districts (Ledu District and Pingan District), as well as Minhe Hui and Tu Mutual Autonomous County. The topography of the whole watershed is undulating and diverse, dominated by hills and medium-high mountains. Two typical areas in two important cities in the Huangshui basin were selected for our study. One area is located in Duoba New Area of Xining, which is a key development and construction area of Xining with complex topography and typical land cover type; and another area is located in the Ping'an District of Haidong, which is an important transportation hub of the Qinghai-Tibet Plateau and the main foreign port of Qinghai Province. Xining Caojiabao International Airport is located in this area. The topography is relatively flat and the land cover type is typical.

Figure 1. Location map of the study area.

2.2. Data Sources

The satellite images used in this study are from Chinese GF-2, French SPOT-6, ESA Sentinel-2 and U.S. Landsat-8. Among them, GF-2 is the first batch of satellites launched by the major project of China's high-resolution earth observation system. It is the civil remote sensing satellite with the highest spatial resolution and the largest observation width developed by China. It is equipped with two high-resolution 1 m panchromatic and 4 m multispectral cameras. SPOT-6 was successfully launched by the French Space Center on September 22, 2012. It has an orbital altitude of 695 km and a spatial resolution of 6 m. It records images in multi-spectral blue, green, red and near-infrared bands and 1.5 m panchromatic bands with a standard image coverage of 60 km × 60 km. Sentinel-2A is the second satellite of the European Space Agency of the European Union's Copernicus Earth Observation Program. It was launched on June 23, 2015 for the Global Monitoring for Environment and Security program. Sentinel-2A carries a multispectral imager with 13 spectral bands, a strip width of 290 km, and a revisit period of 10 days. Landsat-8 is a U.S. Landsat program, which was successfully launched on February 11, 2013. Landsat-8 carries the Land Imager (OLI) and the Thermal Infrared Sensor (TIRS). The OLI Land Imager includes nine bands with a spatial resolution of 30 m. The satellite sensors and their parameters are listed in Table 1 below.

3. Research Methodology

The flow chart in Figure 2 shows the mapping and analysis methods applied in this study. Land use/cover classification ,analysis of scale effects are the main steps involved.The following sections describe the analysis scheme and several relevant steps in this study in detail.

3.1. Ensemble Classification Methods

Ensemble learning (EL) classification methods based on multiple classifiers have been shown to be some of the most effective methods for remote sensing image classification [26,27,28]. EL trains various base classifiers separately and then combines them with related combination methods (e.g., bagging, augmentation, or voting) to produce the final classification results. Bagging ensemble methods use the same training algorithm to train several subsets, and each classifier randomly selects the training data, which means that different subsets of the same sample can be selected [29]. Then, the output of each classifier is used for voting decisions. The Random Forest (RF) algorithm is based on the Bagging ensemble method, with a small adjustment so that the correlation between individual trees is reduced [30]. The Boosting ensemble method is an improvement on RF. The classification principle is to iteratively train a series of weak classifiers. A higher weighted attention is used to correctly classify in the next learning round, and the final result is determined by the maximum number of votes classified by the weak classifier [31]. The Gradient Boosting Decision Tree (GBDT), an algorithm among Boosting ensemble methods has been proved to be one of the most effective algorithms. It is known for its excellent performance, and recent results in many research areas have shown that it outperforms various other classifiers [32,33,34]. In this paper, we study the application of EL method to investigate the spatially optimal scale of land cover.

3.1.1. RF

RF belongs to the classification and prediction method that integrates a set of Classification and Regression Tree (CART) decision trees. RF is the most representative Bagging ensemble learning algorithm that combines Bagging ensemble learning and random subspace methods, to reduce overfitting [35]. In the classification process, data sets with different subsamples are randomly selected. Several decision trees are trained using different feature subsamples, and the results of the subsample decision trees are voted on to output the final classification results. RF input data does not require magnitude processing and can automatically handle missing values. It is one of the most commonly used machine learning algorithms.

3.1.2 GBDT

GBDT is a Boosting ensemble machine learning method that combines multiple decision trees. GBDT is a residual model in the direction of gradient descent. It is based on the process of upgrading weak classifiers to strong classifiers. Each iteration reduces the residuals of the last iteration and constantly adjust the weights of misclassified samples to improve the accuracy of classification. GBDT can fit the true distribution of data and has a strong generalization ability. GBDT has good overall performance due to the complementary strengths of the weak classifiers [36,37].

3.2. Linear Decomposition Method

The lower is the spatial resolution of an image, the higher the probability that an image element contains two or more features. In ensemble learning classification, the mixed image elements are assigned to the category with the highest probability. If each hybrid image element can be decomposed and the percentage of overlay type components to the image element can be solved, the uncertainty of classification results can be quantified, resulting in multiple hybrid image element decomposition models. The same idea can be used to calculate the percentage of various features in an image element for the classified results of low-resolution images, and the classification uncertainty of low-resolution images can be evaluated.

The principle of the hybrid image decomposition model is to decompose each hybrid image element and solve for the percentage of the overlay type components in the decomposed image elements. The hybrid image decomposition model allows the uncertainty of classification results to be quantified. For the classified results, the same method can be used to evaluate the classification uncertainty of low-resolution images by calculating the percentage of each feature type in each image element of the low-resolution image classification results.

In this study, the area corresponding to the image element size of 1 m × 1 m after the fusion of 1 m panchromatic and 4 m multispectral images of GF-2 image is used as a sliding window. The land cover types contained in each window and the percentage of each type are counted on the classification results of 4 m of GF-2, 6 m of SPOT-6, 10 m of Sentinel-2, and 30 m of Landsat-8, respectively. The classification results of GF-2–4 m, SPOT-6, Sentinel-2, and Landsat-8 are only one category of cultivated land, forest land, grassland, water, built-up land, and bareland, while the GF-2 classification results of 1 m after fusion are a linear combination of each category expressed as:

a_{1} c_{c u l t i v a t e l a n d} + a_{2} c_{f o r e s t l a n d} + a_{3} c_{g l a s s l a n d} + a_{4} c_{w a t e r l a n d} + a_{5} c_{b u i l t - u p l a n d} + a_{6} c_{b a r e l a n d} = 1

W h e r e : a_{1} + a_{2} + a_{3} + a_{4} + a_{5} + a_{6} = 1 ； a_{1} ， a_{2} ， a_{3} ， a_{4} ， a_{5} ， a_{6} \in [0, 1]

c_{c u l t i v a t e l a n d} ， c_{f o r e s t l a n d} ， c_{g l a s s l a n d} ， c_{w a t e r l a n d} ， c_{b u i l t - u p l a n d} ， c_{b a r e l a n d}

represent the land cover types: cultivated land, forest land, grassland, water, built-up land, and bareland, respectively.

4. Results and Analysis

4.1. Classification Results and Accuracy Analysis

The classification results based on RF and GBDT, two ensemble methods to classify land cover in two respective study areas, are shown in Figure 3 and Figure 4. The classification results were evaluated using producer’s accuracy, user’s accuracy, f1-score, overall accuracy, and Kappa coefficient, as shown in Figure 5–7 and Table 2.

The comprehensive analysis of the classification results of the five different spatial resolution images shows that higher spatial resolution of the images allow to better extract features with smaller areas or sizes. For instance, small features such as reservoir pits are difficult to be extracted on images of 10 m and 30 m resolution, but can be extracted on high resolution images such as 1m and 4 m image of GF-2. Cultivated land is extracted more completely and clearly outlined on mesoscale 6 m and 10 m images, while the area of cultivated land extracted on 30 m upper images is large. The classification results are better for the small-scale resolution of feature types with larger areas and single features, for instance grassland is better classified on 10 m and 30 m images with less pepper and better consistency.

It can be seen from Table 2 that the spatial resolution of the images largely affects the classification results of the images. For the five resolution images included in this study, the overall accuracy exceeds 84.54%. As a whole, the classification accuracy is consistent with related studies [38,39]. The effect of different classification methods on the same spatial resolution is not very significant. The classification accuracy varies by about 6% with a change of spatial resolution from 1–30 m, while the accuracy of different classification methods of the same resolution varies about 1%, indicating that land cover classification accuracy is closely related to the spatial resolution of remote sensing images. The classification accuracy of GBDT is higher than that of RF, which indicates that the effect of Boosting ensemble classification is better than that of Bagging ensemble classification. The classification accuracy of the two ensemble classification methods was the highest at the spatial resolution of 4 m and 6 m, and decreased when the images resolution exceeded or smaller than 4 m. In terms of classification accuracy analyses, the optimal spatial scale of land cover for both study areas is 4–6 m, which is consistent with related studies [11,40].

Figure 4. RF and GBDT land cover classification maps for region B.

Figure 5. Producer’s accuracy Chart.

Figure 6. User’s accuracy Chart.

Figure 7. f1-score accuracy Chart.

The overall producer’s accuracy of GBDT is higher than that of RF. Among them, the average producer’s accuracy of GF-2 (4m) on Region A was the highest; the average producer’s accuracy of SPOT-6 and Sentinel-2 was higher on Region B. The producer’s accuracy of water areas is generally higher, and the producer’s accuracy of bareland is generally lower. The extreme differences between the maximum and minimum values of the producer’s accuracies of different land cover types on Sentinel-2 are larger, specifically: the extreme differences are larger in Region A than in Region B. This indicates that the distribution of the producer’s accuracies of different land cover types in Region B is more concentrated than that in Region A, and the uncertainties of different feature types in Region A are higher.

The user’s accuracy of GBDT is generally higher than that of RF. Among them, the average user’s accuracy of GF-2 (4m) on Region A is the highest; the average user’s accuracy of SPOT-6 and Sentinel-2 is higher on Region B. The user’s accuracy of forest land on Region A is generally higher; the user’s accuracy of unutilized land on Region B is lower. The larger extreme difference between Region A and Region B indicates that the distribution of user’s accuracy for different land cover types in Region B is more concentrated than that in Region A. The uncertainty of different feature types in Region A is higher.

The f1-score is the average of producer’s accuracy and user’s accuracy. The average f1-score accuracy of GF-2 (4m) on region A is the highest; the average f1-score accuracy of SPOT-6 and Sentinel-2 is higher on region B; the f1-score of Landsat-8 is lower on both regions. Based on the classification accuracy, it can be concluded that the spatial resolution dependence of watershed on the image is low, while the dependence of woodland and cropland is high. The accuracy GF-2 (4m) is better in region A, and the accuracy of SPOT-6 and Sentinel-2 is better in region B. Therefore, the uncertainty of region A is higher than that of region B, and the optimal scale accuracy of region B corresponds to a lower spatial resolution than the optimal scale accuracy of region A.

4.2. Optimal Scale Analysis Based on Linear Decomposition

Based on the linear decomposition method, the 1 m classification results of GF-2 fusion were used as a reference for Landsat-8, Sentinel-2 and SPOT-6 classification results. The fitted equations and related parameters were established using the curve fitting method, and are summarized in Table 3. The coefficients of determination are greater than 0.85 (p < 0.01) in both study areas. The fitted curves are shown in Figure 8.

Figure 8. Upper scale fit of region A and region B.

The scale effect of the fitted curves based on the linear decomposition method in region A and region B shows that the mean value of the decomposition tends to increase and then decrease as the scale increases (Figure 7). The optimal classification scale of region A is 5 m, and the optimal scale of region B is 13 m. The optimal scale of decomposition is different for different study areas, mainly because of the different distribution patterns of topographic and geomorphological complexity and land cover types in the two regions, and therefore the spatial heterogeneity is different. Region A has more complex topography and more fragmented patches. Region B is located in Xining Caojiabao Airport, with relatively flat topography and regular feature patches, boundaries and shapes. Therefore, the optimal land cover scale of Region B is larger than that of Region A. That is, the spatial resolution of remote sensing images used for the classification of Region B is larger than that of Region A.

To compare the scale effect of image resampling on land cover, resampling was performed based on the same scene image. To compare data sources and maintain maximum spectral information, the sampling method chosen used the nearest neighbor sampling method with the least information loss. This has less impact on the amount of image information and is especially suitable for resampling before classification [41]. Therefore, the GF-1 fused 1 m image was resampled respectively into 4, 6, 10, and 30 m by the nearest neighbor sampling method. The results of the linear decomposition based on resampling are shown in Figure 9.

It can be seen that the linear decomposition result based on the classification after resampling is not sensitive to the scale. The linear decomposition gradually decreases as the scale rises. The linear decomposition value and scale of the classification after resampling do not show a trend of first increasing and then decreasing, probably because resampling introduces errors.

4.3. Spatial Heterogeneity Analysis

Spatial heterogeneity refers to the heterogeneity and complexity of the spatial distribution of ecological processes and patterns [42]. Spatial heterogeneity can generally be understood as the sum of spatial patchiness and gradient. Spatial pattern, heterogeneity, and patchiness are characteristics dependent on scale. We can define landscape indices to quantitatively describe landscape characteristics, landscape pattern information, and spatial heterogeneity, reflecting the structural composition characteristics and spatial configuration relationships of the landscape [43].

Land cover classification of remote sensing images is mainly influenced by two factors: mixed image elements and spatial heterogeneity. Mixed image element decomposition shows that the choice of the optimal scale for land cover classification is related to the study area. The more complex the study area, the higher the resolution or the larger the scale of the image for classification. To analyze the spatial heterogeneity of different study areas, five landscape indices, namely average patch area, maximum patch to landscape area ratio, aggregation index, separation index, and shape index, were selected. The results are shown in Table 4, Table 5, Table 6 and Table 7.

From the above tables, the average patch area of Region B is larger than that of Region A. The maximum patch area of Region B is twice as large as that of Region A. The proportion of the largest patches to the landscape area in Region B is larger than that in Region A, and the aggregation is better and the shape is more regular. Region A is more spatially heterogeneous than Region B. Therefore, the classification of land cover types on region B corresponds to a larger scale, consistent with the results of linear decomposition.

According to our results, the optimal classification result is not necessarily the optimal land cover scale. The classification results are the expression of macroscopic feature patterns, and factors such as shape, patchiness, and aggregation separability associated with feature characteristics determine the optimal scale of land cover. The more fragmented and complex the features are, the smaller the optimal land cover scale needs to be to finely represent the feature distribution.

5. Discussion

5.1. Discussion of Classification Methods

RF and GBDT are the most popular Bagging and Boosting ensemble methods based on decision trees. In our study the GBDT model outperforms the RF model due to the different ensemble of trees in the ensemble methods. RF uses Bagging (Bootstrap resampling) method to construct different training sets, and determines the final classification result using the maximum number of votes for the results of different datasets. GBDT uses gradient boosting to create a tree based on the residuals of the previously created tree. Both RF and GBDT have advantages such as dealing with nonlinearities and limiting overfitting. Dietterich [44] compared Bagging and Boosting ensembles and found that the noise in the Boosting dataset is less than that in the Bagging dataset, which means that Boosting is sensitive to the noise in the data. The signal-to-noise ratios of data at different scales are different, but in this study RF and GBDT are better in GF-2, SPOT-6, Sentinel-2, and Landsat-8 classification. The ensemble classifier has complementary advantages over single classifier, and can improve the classification accuracy, in accordance with Feng et al [45].

Yang et al [46] used three methods, ISODATA, MLC and SVM, to classify land cover in the Poyang Lake region, a region with strong spatial heterogeneity during the dry period. Their results indicate that different land cover classification methods may lead to different classification results for the same remote sensing data. In the same geographical area, the classification results may be different when using the same classification methods for land cover classification of remote sensing data with different resolutions. Even if the same classification method is used to classify land cover data with different resolutions in the same geographical area, the classification results may be different. By contrast, in our study, the classification results from RF and GBDT show more consistent results on different remote sensing data, and both SPOT-6 and GF-2 (4m) have higher classification accuracy on two different regions. Therefore, the advantages of the ensemble method are better for multi-scale data usability.

5.2. Uncertainty Analysis of Classification

Determining the optimal classification system is the premise and foundation of land cover classification. A suitable classification system should consider both the actual situation of the study area and the spectral and spatial resolution of image data. At present, the National Land Use/Cover Classification System for Remote Sensing Monitoring gives six major categories: cultivated land, forest land, grassland, water area, urban and rural industrial and mining residential construction land and bareland. Our study area is located in a high altitude hilly middle and high mountainous terrain area, and considers only six major categories of land cover classification. A more detailed classification may cause more complicated linear decomposition and spatial heterogeneity. Therefore, the conclusions drawn in this paper are applicable to the six major categories of land cover classification, and more detailed land classification may require further study.

The overall classification accuracy and the f1-score of each category after resampling for each resolution of the real images in study areas A and B are shown in Figure 10. From the figure, it can be seen that the overall classification accuracy and the consistency between each category are not very good. The consistency on study area B is better than that on study area A. The greater the regional heterogeneity, the greater the effect on the images after resampling. Combining the analysis of area A and area B, the consistency was higher on urban and rural industrial and mining residential construction land and forest land, and the uncertainty was larger on other categories, which is basically similar to the findings of Xu et al [47]. It further indicates that resampling increases the uncertainty of the classification.

5.3. Uncertainty Analysis Based on Linear Model

The statistics of feature categories on region A and region B for the two ensemble learning classification methods are shown in Figure 11, Figure 12, Figure 13 and Figure 14. The table refers to the mean and standard deviation of the image elements of each category on the SPOT-6, Sentinel-2, and Landsat-8 classification images. This is similar to the Misclassification Cost (MCC) introduced by Defries et al [48] for land cover classification accuracy evaluation, where the degree of misclassification is different between categories.

In the two study areas, the proportions of cropland, grassland, urban and rural industrial and mining residential construction land, and bareland is larger, and their consistency of classification results on SPOT-6, Sentinel-2, and Landsat-8 images is higher. There is less uncertainty in the attribution of image categories for these four features. The mean value of water is lower because the areas under water are smaller and distributed in strips, corresponding to a smaller scale. The mean value of GF-2 (4m) is the largest in both study areas, 0.5957 and 0.6945, indicating 40% uncertainty in area A and 30% uncertainty in area B. The mean standard deviation is around 0.3, indicating less fluctuations and more reliable mean decomposition.

6. Conclusion

The spatial resolution of remote sensing images is one of the key factors to extract valuable information from images. Multi-scale validation of remote sensing data, remote sensing inversion based on ground truth data, and classification of remote sensing data are highly dependent on scale. In this paper, we explored the scale effect of land cover classification from mixed image element decomposition and spatial heterogeneity using multi-source and multi-scale satellite remote sensing data, and came to the following conclusions.

GF-2, SPOT-6, Sentinel-2, and Landsat-8 images with different spatial resolutions based on GBDT and RF were used for classification studies, and GF-2 and SPOT-6 had the best classification results. Therefore, the optimal scale based on classification accuracy is 4–6 m.
The optimal scale based on linear decomposition is related to the study area, and the optimal scale is different for different study areas.
The optimal scale of land cover classification is related to the spatial heterogeneity. The more fragmented and complex the space, the smaller the scale needed.
Images based on resampling do not reflect the characteristics of the actual scale images well, are insensitive to scale effects, and increase the uncertainty of classification.
The best classification result is not necessarily the optimal land cover scale. The classification result is only a representation of the macroscopic feature pattern. Factors such as shape, patchiness, and aggregation separability associated with feature characteristics determine the optimal land cover scale. The more fragmented and complex the features are, the smaller the optimal scale of land cover needs to be to represent the distribution of features more finely; the more spatially consistent and regular the distribution of features is, the larger the corresponding optimal scale is.

Author Contributions

Conceptualization, R.L. and X.G.; methodology, R.L.; software, R.L and H.Z.; validation, S.F.; formal analysis, X.G; investigation, R.L. and F S.; resources, X.G. and X.M; data curation, S.F.; writing—original draft preparation, R.L.; writing—review and editing, R.L.; supervision, X.G.; project administration, X.G and X.M; funding acquisition X.G. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the Natural Science Foundation of Qinghai Science and Technology Department（Grant No. 2021-ZJ-913）.

Data Availability Statement

The data presented in this study are available upon request from thefirst author.

Acknowledgments

We sincerely thank the Natural Resources Remote Sensing Center of Qinghai Province for providing the GF-2 satellite data.

Conflicts of Interest

The authors declare no conflict of interest.

References

DeFries, R. Terrestrial Vegetation in the Coupled Human-Earth System: Contributions of Remote Sensing. Annu. Rev. Environ. Resour. 2008, 33, 369–390. [Google Scholar] [CrossRef]
Turner, W.; Spector, S.; Gardiner, N.; Fladeland, M.; Sterling, E.; Steininger, M. Remote sensing for biodiversity science and conservation. Trends Ecol. Evol. 2003, 18, 306–314. [Google Scholar] [CrossRef]
Cihlar, J. Land cover mapping of large areas from satellites: Status and research priorities. Int. J. Remote. Sens. 2000, 21, 1093–1114. [Google Scholar] [CrossRef]
Gong, P.; Wang, J.; Yu, L.; Zhao, Y.; Zhao, Y.; Liang, L.; Niu, Z.; Huang, X.; Fu, H.; Liu, S.; et al. Finer resolution observation and monitoring of global land cover: First mapping results with Landsat TM and ETM+ data. Int. J. Remote Sens. 2013, 34, 2607–2654. [Google Scholar] [CrossRef]
Woodcock, C.E.; Strahler, A.H. The factor of scale in remote sensing. Remote. Sens. Environ. 1987, 21, 311–332. [Google Scholar] [CrossRef]
Chen, D.; Stow, D.A.; Gong, P. Examining the effect of spatial resolution and texture window size on classification accuracy: an urban environment case. Int. J. Remote Sens. 2004, 25, 2177–2192. [Google Scholar] [CrossRef]
Lu, D.; Weng, Q. A survey of image classification methods and techniques for improving classification performance. Int. J. Remote. Sens. 2007, 28, 823–870. [Google Scholar] [CrossRef]
Lv, Z.; He, H.; Benediktsson, J.A.; Huang, H. A Generalized Image Scene Decomposition-Based System for Supervised Classification of Very High Resolution Remote Sensing Imagery. Remote. Sens. 2016, 8, 814. [Google Scholar] [CrossRef]
Han, P.; Gong, J.Y.; Li, Z.L.; Bo, Y.C.; Cheng, L. Selection of optimal scale in remotely sensed image classification. J. Remote Sens. 2010, 14(3), 507–518. [Google Scholar]
Treitz, P. Variogram analysis of high spatial resolution remote sensing data: An examination of boreal forest ecosystems. Int. J. Remote. Sens. 2001, 22, 3895–3900. [Google Scholar] [CrossRef]
Ming, D.P.; Wang, Q.; Yang, J.Y. Spatial scale of remote sensing image and selection of optimal spatial resolution. J. Remote Sens. 2008, 12(4), 529–537. [Google Scholar]
Feng, G.X.; Ming, D.P. Fractal based method on selecting the optimal spatial resolution for remote sensing images. J. Geogr. Inf. Sci. 2015, 17(4), 478–485. [Google Scholar]
Ming, D.; Yang, J.; Li, L.; Song, Z. Modified ALV for selecting the optimal spatial resolution and its scale effect on image classification accuracy. Math. Comput. Model. 2011, 54, 1061–1068. [Google Scholar] [CrossRef]
Marceau, D.; Hay, G. Remote Sensing Contributions to the Scale Issue. Can. J. Remote. Sens. 1999, 25, 357–366. [Google Scholar] [CrossRef]
Li, X.; Cheng, G.; Liu, S.; Xiao, Q.; Ma, M.; Jin, R.; Che, T.; Liu, Q.; Wang, W.; Qi, Y.; et al. Heihe Watershed Allied Telemetry Experimental Research (HiWATER): Scientific Objectives and Experimental Design. Bull. Am. Meteorol. Soc. 2013, 94, 1145–1160. [Google Scholar] [CrossRef]
Pu, R.; Bell, S. Mapping seagrass coverage and spatial patterns with high spatial resolution IKONOS imagery. Int. J. Appl. Earth Obs. Geoinformation 2017, 54, 145–158. [Google Scholar] [CrossRef]
Ling, Y.; Ehlers, M.; Usery, E.L.; Madden, M. Effects of spatial resolution ratio in image fusion. Int. J. Remote. Sens. 2008, 29, 2157–2167. [Google Scholar] [CrossRef]
Hsieh, P.-F.; Lee, L.; Chen, N.-Y. Effect of spatial resolution on classification errors of pure and mixed pixels in remote sensing. IEEE Trans. Geosci. Remote. Sens. 2001, 39, 2657–2663. [Google Scholar] [CrossRef]
Bo, Y.; Wang, J.; Li, X. Exploring the scale effect in land cover mapping from remotely sensed data: The statistical separability-based method. Geoscience and Remote Sensing Symposium, Seoul, South Korea (IGARSS), Proceedings 2005 IEEE International IEEE 2005; 3884–3887. [CrossRef]
Goward, S.N.; E Davis, P.; Fleming, D.; Miller, L.; Townshend, J.R. Empirical comparison of Landsat 7 and IKONOS multispectral measurements for selected Earth Observation System (EOS) validation sites. Remote. Sens. Environ. 2003, 88, 80–99. [Google Scholar] [CrossRef]
Meddens, A.J.; Hicke, J.A.; Vierling, L.A. Evaluating the potential of multispectral imagery to map multiple stages of tree mortality. Remote. Sens. Environ. 2011, 115, 1632–1642. [Google Scholar] [CrossRef]
Schaaf, A.N.; Dennison, P.E.; Fryer, G.K.; Roth, K.L.; Roberts, D.A. Mapping Plant Functional Types at Multiple Spatial Resolutions Using Imaging Spectrometer Data. GIScience Remote. Sens. 2011, 48, 324–344. [Google Scholar] [CrossRef]
Moody, A.; Woodcock, C.E. Scale-dependent errors in the estimation of land-cover proportions: Implications for global land-cover datasets. Photogramm. Eng. Remote Sens. 1994, 60(5), 585–594. [Google Scholar]
Wang, G.; Gertner, G.; Anderson, A.B. Up-scaling methods based on variability-weighting and simulation for inferring spatial information across scales. Int. J. Remote. Sens. 2004, 25, 4961–4979. [Google Scholar] [CrossRef]
Markham, B.L.; Townshend, J. Land cover classification accuracy as a function of sensor spatial resolution. Proceedings of International Symposium on Remote Sensing of Environment; 1981. [Google Scholar]
Saini, R.; Ghosh, S.K. Ensemble classifiers in remote sensing: A review. In Proceedings of the International Conference on Computing, Communication and Automation (ICCCA), Greater Noida, India, India, 2017., 5–6 May 2017. IEEE: Greater Noida. [Google Scholar]
Ghimire, B.; Rogan, J.; Galiano, V.R.; Panday, P.; Neeti, N. An Evaluation of Bagging, Boosting, and Random Forests for Land-Cover Classification in Cape Cod, Massachusetts, USA. GISci. Remote Sens. 2012, 49, 623–643. [Google Scholar] [CrossRef]
Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning. Data Mining, Inference, and Prediction, 2nd ed.; Springer Series in Statistics; Verlag: New York, NY, USA, 2009. [Google Scholar]
Breiman, L. Bagging predictors. Mach. Learn. 1996, 24, 123–140. [Google Scholar] [CrossRef]
Ho, T.K. A Data Complexity Analysis of Comparative Advantages of Decision Forest Constructors. Pattern Anal. Appl. 2002, 5, 102–112. [Google Scholar] [CrossRef]
Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
Sun, R.; Wang, G.; Zhang, W.; Hsu, L.-T.; Ochieng, W.Y. A gradient boosting decision tree based GPS signal reception classification algorithm. Appl. Soft Comput. 2019, 86, 105942. [Google Scholar] [CrossRef]
Lombardo, L.; Cama, M.; Conoscenti, C.; Märker, M.; Rotigliano, E. Binary logistic regression versus stochastic gradient boosted decision trees in assessing landslide susceptibility for multiple-occurring landslide events: application to the 2009 storm event in Messina (Sicily, southern Italy). Nat. Hazards 2015, 79, 1621–1648. [Google Scholar] [CrossRef]
Tama, B.A.; Rhee, K.-H. An in-depth experimental study of anomaly detection using gradient boosted machine. Neural Comput. Appl. 2017, 31, 955–965. [Google Scholar] [CrossRef]
Chan, J.C.-W.; Paelinckx, D. Evaluation of Random Forest and Adaboost tree-based ensemble classification and spectral band selection for ecotope mapping using airborne hyperspectral imagery. Remote. Sens. Environ. 2008, 112, 2999–3011. [Google Scholar] [CrossRef]
Freund, Y.; Schapire, R.E. Experiments with a new boosting algorithm. Int. Conf. Mach. Learn. 1996, 96, 148–156. [Google Scholar]
Demarchi, L.; van de Bund, W.; Pistocchi, A. Object-Based Ensemble Learning for Pan-European Riverscape Units Mapping Based on Copernicus VHR and EU-DEM Data Fusion. Remote. Sens. 2020, 12, 1222. [Google Scholar] [CrossRef]
Ota, T.; Mizoue, N.; Yoshida, S. Influence of using texture information in remote sensed data on the accuracy of forest type classification at different levels of spatial resolution. J. For. Res. 2011, 16, 432–437. [Google Scholar] [CrossRef]
Atkinson, P.M.; Aplin, P. Spatial variation in land cover and choice of spatial resolution for remote sensing. Int. J. Remote. Sens. 2004, 25, 3687–3702. [Google Scholar] [CrossRef]
Chen, J.; Deng, M.; Xiao, P.F.; Yang, M.H.; Mei, X.M.; Liu, H.M. Optimal spatial scale choosing for high resolution imagery based on texture features frequency analysis. J. Remote Sens. 2011, 15(3), 492–511. [Google Scholar]
Dalponte, M.; Bruzzone, L.; Gianelle, D. Tree species classification in the Southern Alps based on the fusion of very high geometrical resolution multispectral/hyperspectral images and LiDAR data. Remote. Sens. Environ. 2012, 123, 258–270. [Google Scholar] [CrossRef]
Wu, J.G. Landscape Ecology Pattern, Process, Scale and Hierarchy, Higher Education Press: Beijing, China, 2007.
Fu, B.J. Principles and Applications of Landscape Ecology, 2nd ed.; Science Press: Beijing, China, 2011. [Google Scholar]
Dietterich, T.G. An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization. Mach. Learn. 2000, 40, 139–157. [Google Scholar] [CrossRef]
Feng, L.; Zhang, Z.; Ma, Y.; Du, Q.; Williams, P.; Drewry, J.; Luck, B. Alfalfa Yield Prediction Using UAV-Based Hyperspectral Imagery and Ensemble Learning. Remote. Sens. 2020, 12, 2028. [Google Scholar] [CrossRef]
Yang, Y.; Liu, Y.B.; Ruan, R.Z.; Ye, C.; Lu, P.P. Scale-induced uncertainty in MODIS-based land cover classification. J. Remote Sens. 2012, 16(4), 868–880. [Google Scholar]
Xu, K.; Tian, Q.; Yang, Y.; Yue, J.; Tang, S. How up-scaling of remote-sensing images affects land-cover classification by comparison with multiscale satellite images. Int. J. Remote. Sens. 2018, 40, 2784–2810. [Google Scholar] [CrossRef]
Defries RS, Wai Chan JC. Multiple criteria for evaluating machine learning algorithms for land cover classification from satellite data. Remote Sens. Environ. 2000, 74(3), 503–515.

Figure 2. Flow chart map of land cover mapping and analysis methods.

Figure 3. RF and GBDT land cover classification maps for region A.

Figure 9. Linear decomposition after resampling on region A and region B.

Figure 10. Scale fit after resampling on regions A and B.

Figure 11. Class mean statistics of classification results at different scales for region A.

Figure 12. Category standard deviation statistics of classification results at different scales for region A.

Figure 13. Class mean statistics of classification results at different scales for region B.

Figure 14. Category standard deviation statistics of classification results at different scales for region B.

Table 1. Satellite sensors and their parameters.

Satellite sensor	Country	Spatical resolution(m)	Wave band	Wavelength coverage(um)	Acquisiton data
GF-2	China	1,4	Blue,Green, Red,Near-infrared (R, G, B, NIR)	0.45~0.52/0.52~0.59 0.63~0.69/0.77~0.89	2017/7/21
SPOT-6	France	6		0.455~0.525/0.530~0.590 0.625~0.695/0.760~0.890	2017/7/19
Sentinel-2A	ESA	10		0.440~0.538/0.537~0.582 0.646~0.684/0.760~0.890	2017/7/27
Landsat-8	America	30		0.450~0.515/0.525~0.600 0.630~0.680/0.845~0.885	2017/9/11

Table 2. Classification overall accuracy and Kappa coefficient of the two classification methods.

Region	Satellite sensor	RF		GBDT
Region	Satellite sensor	OA(%)	Kappa	OA(%)	Kappa
A	GF-2 (1m)	88.03	0.85	88.43	0.86
	GF-2 (4m)	89.50	0.87	90.29	0.88
	SPOT-6	91.56	0.89	92.19	0.90
	Sentinel-2A	89.73	0.87	91.85	0.89
	Landsat-8	84.54	0.80	87.01	0.84
B	GF-2 (1m)	86.53	0.83	88.83	0.86
	GF-2 (4m)	91.01	0.89	92.81	0.91
	SPOT-6	90.57	0.90	92.05	0.90
	Sentinel-2A	91.17	0.89	90.39	0.88
	Landsat-8	85.57	0.82	86.96	0.83

Table 3. Fitting model and related parameters.

Region	Fitting equation	R²	p
A	$y = 0.4397 + 0.5906 * 10^{- 3} x - 0.6393 * 10^{- 4} x^{2}$	0.911	0.002
B	$y = 0.5088 + 7.5814 * 10^{- 3} x - 0.2888 * 10^{- 3} x^{2}$	0.877	0.005

Table 4. RF-based region A landscape index.

Satellite sensor	AREA_AM	LPI	AI	SPLIT	LSI
SPOT-6	693.96	10.93	84.47	26.50	176.83
Sentinel-2A	1488.94	16.36	83.58	12.33	112.55
Landsat-8	951.74	17.39	64.86	19.28	80.50
GF-2(4m)	605.55	12.30	81.13	30.32	321.02
GF-2(1m)	361.93	12.39	88.03	23.47	552.93

Table 5. GBDT-based region A landscape index.

Satellite sensor	AREA_AM	LPI	AI	SPLIT	LSI
SPOT-6	761.20	12.36	79.38	24.15	234.33
Sentinel-2A	1012.56	12.61	76.69	18.13	159.15
Landsat-8	694.40	9.27	62.58	26.42	85.63
GF-2(4m)	599.21	12.06	77.11	30.63	388.99
GF-2(1m)	331.02	11.64	81.82	25.66	839.41

Table 6. RF-based region B landscape index.

Satellite sensor	AREA_AM	LPI	AI	SPLIT	LSI
SPOT-6	1421.05	24.66	87.88	8.96	115.28
Sentinel-2A	1660.92	27.40	85.12	7.67	85.21
Landsat-8	1401.22	23.92	75.64	9.09	47.02
GF-2(4m)	1352.46	22.29	89.78	9.41	145.59
GF-2(1m)	533.92	15.88	89.60	13.86	534.97

Table 7. GBDT-based region B landscape index.

Satellite sensor	AREA_AM	LPI	AI	SPLIT	LSI
SPOT-6	1626.08	23.48	83.92	9.34	152.53
Sentinel-2A	1586.00	26.46	80.11	8.03	113.43
Landsat-8	1314.06	23.48	72.47	9.50	52.94
GF-2(4m)	1412.41	23.34	86.78	9.01	187.784
GF-2(1m)	531.19	15.92	83.43	13.93	713.88

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

MDPI Initiatives

Important Links

Choose an area of interest and we will send you notifications of new preprints at your preferred frequency.

Disclaimer

Scale Effect of Land Cover Classification from Multi-Resolution Satellite Remote Sensing Data

Abstract

1 Introduction

2 Study Area and Data Pre-Processing

2.1. Overview of the Study Area

2.2. Data Sources

3. Research Methodology

3.1. Ensemble Classification Methods

3.1.1. RF

3.1.2 GBDT

3.2. Linear Decomposition Method

4. Results and Analysis

4.1. Classification Results and Accuracy Analysis

4.2. Optimal Scale Analysis Based on Linear Decomposition

4.3. Spatial Heterogeneity Analysis

5. Discussion

5.1. Discussion of Classification Methods

5.2. Uncertainty Analysis of Classification

5.3. Uncertainty Analysis Based on Linear Model

6. Conclusion

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

MDPI Initiatives

Important Links

Subscribe