1. Introduction
The need to protect the ecological environment and restore ecosystems has grown in recent years due to the effects of human activity and global climate change [
1,
2,
3,
4]. As an important vegetation type, shrubs play a crucial role in ecosystems [
5,
6,
7]. They not only maintain soil moisture balance and prevent soil erosion but also provide habitats and fodder [
8,
9,
10], which are essential for preserving biodiversity and maintaining ecological balance [
11]. Among their attributes, shrub biomass is a crucial measurement indicator for reflecting shrub growth status and overall vegetation productivity [
12,
13]. Therefore, accurately estimating shrub biomass is a key task for understanding vegetation growth status, ecological functions, and ecosystem carbon storage [
14,
15,
16].
Conventional methods of field surveying have limitations in accurately estimating the biomass of shrubs [
17]. Firstly, these methods necessitate a substantial allocation of human resources and financial investment and are time-consuming, particularly in expansive areas with intricate topography [
18,
19]. Secondly, field surveys can only yield localized sample data, making it challenging to capture the spatial distribution and temporal patterns of shrubs fully. Conversely, the use of destructive sampling techniques to quantify shrub biomass may potentially impact the surrounding ecological system [
20]. Furthermore, the biomass of shrubs is influenced by various factors such as vegetation structure, growth environment, and climatic conditions [
21,
22]. Consequently, it is challenging to account for these factors using traditional methods comprehensively.
To address the limitations of conventional methods, remote sensing technology has emerged as an effective tool for estimating shrub biomass [
23,
24,
25]. This technology enables the acquisition of extensive and continuous data with high spatial and temporal resolution, facilitating large-scale estimation of shrub biomass [
26,
27]. Recent advancements in satellite remote sensing technology, along with the availability of data from satellites such as Landsat and Sentinel series, have expanded the possibilities in the field of biomass estimation [
28,
29]. However, utilizing only the spectral bands of remote sensing images in shrub biomass estimation has its constraints [
30]. For example, the visible and near-infrared bands have a narrow wavelength range [
31], making it challenging to capture the differences in factors closely related to shrub biomass, such as vegetation structure, leaf area index, and chlorophyll content. To overcome these limitations, many studies have used vegetation indices as important indicators for biomass estimation [
32]. Vegetation indices are numerical values calculated from spectral bands that can reflect key information about vegetation growth status, chlorophyll content, leaf area index, and other related factors [
33,
34]. However, existing vegetation indices may encounter saturation issues and struggle to differentiate higher biomass shrubs in densely vegetated areas [
35]. Therefore, it is imperative to evaluate which vegetation indices or spectral bands contribute most significantly to the estimation of shrub biomass, emphasizing the need to identify the optimal feature combination for accurate estimation.
Unmanned aerial vehicles (UAVs) are also essential in the estimation of shrub biomass due to their ability to provide detailed information and significant advantages [
36,
37]. Equipped with high-resolution sensors and advanced image processing techniques, UAVs can capture high-quality remote sensing imagery, offering comprehensive data on shrub vegetation [
38,
39]. They are capable of capturing extensive areas of shrub coverage from a high-altitude overhead perspective, enabling the rapid and accurate identification and extraction of shrub objects [
37,
40,
41,
42]. Additionally, UAVs have higher spatial resolution and flexibility compared to traditional aerial remote sensing techniques, allowing for the capture of finer details of shrub structure and characteristics [
43]. This capability is crucial for biomass estimation, as the spatial heterogeneity and subtle variations of shrubs can significantly impact the accuracy of estimations. Furthermore, UAVs can enhance workflow efficiency, reduce workforce, and save time and costs compared to traditional field surveys and plot measurements [
44]. However, there is limited research on integrating UAV-derived shrub biomass estimation at the satellite scale, highlighting the need for further investigation in this area.
Traditionally, biomass estimation models relied on statistical regression techniques to establish empirical relationships between field-measured biomass and various biophysical or remote sensing variables [
45]. Over time, several modeling approaches have been developed to enhance the precision of estimating shrub biomass [
37,
46,
47]. In recent times, machine learning methods have emerged as potent tools in this field, demonstrating superior performance compared to conventional modeling methods [
48]. However, these models had limitations in capturing complex nonlinear relationships and accounting for spatial heterogeneity [
49]. In contrast, machine learning methods, such as random forests, support vector machines, and neural networks, have gained traction due to their capability to handle intricate and nonlinear relationships [
50,
51]. Machine learning methods offer several advantages over traditional modeling techniques in shrub biomass estimation, particularly in effectively handling large volumes of remote sensing data, including multispectral and hyperspectral imagery, which provide rich spectral and spatial information for accurate biomass estimation [
48,
52,
53].
This work aimed to propose a framework to estimate shrubland biomass at 10m over an arid and semi-arid mountain region based on multi-scale data from field measurements, UAV, Sentinel-1, Sentinel-2 and Landsat observations. The Helan Mountains in Ningxia province, China, were selected as the study area. Firstly, a land cover classification was conducted to identify the shrublands and other land cover types. Secondly, the prediction model of shrub biomass was developed in a Random Forest Regression (RFR) approach driven by different predicted variable datasets based on field measurements, UAV, and satellite images. The field measurement data was used to establish the allometric growth equation between shrub biomass and shrub structure parameters. Using the allometric equation, the shrub biomass was determined in UAV data. Using the UAV-based shrub biomass as inputs, the optimal satellite-based biomass estimation model was developed by comparing different predictor variables from Landsat, Sentinel-1, and Sentinel-2 satellites. Thirdly, with the best model, we created a map of the biomass distribution of shrubland in the Helan Mountains. The accuracy of the resultant map was evaluated over various ranges of shrub biomass or shrub coverage. Finally, we assessed the spatial characteristics of shrubland biomass based on the resultant biomass map and the auxiliary datasets of climate and topography.
Figure 1.
(a,b) Locations of the Helan Mountain in China and Ningxia province, and (c) the distribution of ground truth samples from field measurements, UAV, and visual interpretation.
Figure 1.
(a,b) Locations of the Helan Mountain in China and Ningxia province, and (c) the distribution of ground truth samples from field measurements, UAV, and visual interpretation.
Figure 2.
The workflow for estimating biomass of shrubland in the Helan mountains, China.
Figure 2.
The workflow for estimating biomass of shrubland in the Helan mountains, China.
Figure 3.
(
a) depicts the original unmanned aerial vehicle (UAV) image. (
b) represents the classified map of shrublands. (
c) illustrates the fishnet constructed based on the UAV imagery. (
d–
g) represent the zoomed-in views of four sample points in
Figure 3b.
Figure 3.
(
a) depicts the original unmanned aerial vehicle (UAV) image. (
b) represents the classified map of shrublands. (
c) illustrates the fishnet constructed based on the UAV imagery. (
d–
g) represent the zoomed-in views of four sample points in
Figure 3b.
Figure 4.
(a) displays the shrublands and other land-cover types of Helan Mountain, China in 2023. (b–i) represents the zoom-in views of four example regions in the resultant map and the Google Earth images.
Figure 4.
(a) displays the shrublands and other land-cover types of Helan Mountain, China in 2023. (b–i) represents the zoom-in views of four example regions in the resultant map and the Google Earth images.
Figure 5.
The comparison of accuracy among the three models. The x-axis represents three models driven by the basic bands (SB), the vegetation indices (VI), and the combination of the basic bands and vegetation indices (SBVI). Their performance is evaluated using R2 and RMSE.
Figure 5.
The comparison of accuracy among the three models. The x-axis represents three models driven by the basic bands (SB), the vegetation indices (VI), and the combination of the basic bands and vegetation indices (SBVI). Their performance is evaluated using R2 and RMSE.
Figure 6.
(a) illustrates the distribution of R2 and EOPC within different ranges of shrub coverage. (b) displays the distribution of R2 and EOUB within different ranges of shrub biomass. (c) illustrates the sensitivity of the biomass model to each variable examined by R2 and RMSE. These analyses were conducted based on the ground samples. EOPC denotes the error of one percent coverage of shrub, calculated by RMSE/mean shrub coverage. EOUB denotes the error of one unit biomass, calculated by RMSE/mean shrub biomass.
Figure 6.
(a) illustrates the distribution of R2 and EOPC within different ranges of shrub coverage. (b) displays the distribution of R2 and EOUB within different ranges of shrub biomass. (c) illustrates the sensitivity of the biomass model to each variable examined by R2 and RMSE. These analyses were conducted based on the ground samples. EOPC denotes the error of one percent coverage of shrub, calculated by RMSE/mean shrub coverage. EOUB denotes the error of one unit biomass, calculated by RMSE/mean shrub biomass.
Figure 7.
(a) represents the estimated distribution map of shrub biomass in the Helan Mountains. (b) displays the corresponding map of standard deviation (SD). (c) illustrates the distribution of EOPC within different ranges of shrub coverage. (d) displays the distribution of EOUB within different ranges of shrub biomass. These analyses were conducted based on the estimated distribution map of shrub biomass and the corresponding map of standard deviation. EOPC denotes the error of one percent coverage of shrub, calculated by SD/mean shrub coverage. EOUB denotes the error of one unit biomass, calculated by SD/mean shrub biomass.
Figure 7.
(a) represents the estimated distribution map of shrub biomass in the Helan Mountains. (b) displays the corresponding map of standard deviation (SD). (c) illustrates the distribution of EOPC within different ranges of shrub coverage. (d) displays the distribution of EOUB within different ranges of shrub biomass. These analyses were conducted based on the estimated distribution map of shrub biomass and the corresponding map of standard deviation. EOPC denotes the error of one percent coverage of shrub, calculated by SD/mean shrub coverage. EOUB denotes the error of one unit biomass, calculated by SD/mean shrub biomass.
Figure 8.
(a) depicts the distribution of shrub biomass under precipitation gradients. (b) illustrates the distribution of shrub biomass under temperature gradients. (c) represents the distribution of shrub biomass within different ranges of aridity index. (d) displays the distribution of shrub biomass along elevation gradients.
Figure 8.
(a) depicts the distribution of shrub biomass under precipitation gradients. (b) illustrates the distribution of shrub biomass under temperature gradients. (c) represents the distribution of shrub biomass within different ranges of aridity index. (d) displays the distribution of shrub biomass along elevation gradients.
Table 1.
Widely used vegetation indices (VIs) for the estimates of shrub biomass. R = Red, G = Green, B = Blue, RE = RedEdge1, NIR = Near-infrared, SWIR1 = Shortwave infrared1 and SWIR2 = Shortwave infrared2. ρ represents the surface reflectance of MODIS bands 2 (841–876 nm), σ = 0.5 × (NIR + Red).
Table 1.
Widely used vegetation indices (VIs) for the estimates of shrub biomass. R = Red, G = Green, B = Blue, RE = RedEdge1, NIR = Near-infrared, SWIR1 = Shortwave infrared1 and SWIR2 = Shortwave infrared2. ρ represents the surface reflectance of MODIS bands 2 (841–876 nm), σ = 0.5 × (NIR + Red).
No. |
Index |
Formula |
1 |
Normalized Difference Vegetation Index [57] |
NDVI = |
2 |
Enhanced Vegetation Index 1 [58] |
EVI = |
3 |
Land Surface Water Index [59] |
LSWI = |
4 |
Difference Vegetation Index [60] |
DVI = NIR − R |
5 |
Green Normalized Difference Vegetation Index [61] |
GNDVI = |
6 |
Vegetation Index green [62] |
VIgreen = |
7 |
Infrared Simple Ratio [63] |
ISR = |
8 |
Moisture Stress Index [64] |
MSI = |
9 |
Ratio Vegetation Index [65] |
RVI = |
10 |
Simple Ratio [66] |
SR = |
11 |
Enhanced Vegetation Index 2 [67] |
EVI2 = |
12 |
Modified Simple Ratio [68] |
MSR = |
13 |
Optimized Soil-Adjusted Vegetation Index [69] |
OSAVI = (1+L) × , L was set to 0.16 |
14 |
Renormalized Difference Vegetation Index [70] |
RDVI = |
15 |
Soil Adjusted Vegetation Index [71] |
SAVI = (1+L) ×, L=0.5 |
16 |
Soil Adjusted Vegetation Index2 [72] |
SAVI2 = , b was set to 0.025 and a to 1.25 |
17 |
Stress-related Vegetation Index 1 [73] |
STVI1 = |
18 |
Stress-related Vegetation Index 2 [73] |
STVI2 = |
19 |
Stress-related Vegetation Index 3 [73] |
STVI3 = |
20 |
Red Edge Normalized Difference Vegetation Index [74] |
RENDVI = |
21 |
Anthocyanin Reflectance Index [75] |
ARI = - |
22 |
Vogelmann Red Edge Index [76] |
VREI = |
23 |
Radar ratio vegetation index |
Ratio = |
24 |
Radar Difference Vegetation Index |
Difference = VV - VH |
25 |
Radar normalized difference vegetation index RNDVI |
RNDVI = |
26 |
Near-infrared reflectance of vegetation [77] |
NIRv = (NDVI – C) × ρ (C = 0.08) |
27 |
kernel NDVI [78] |
kNDVI = tanh(()2) |
28 |
Normalized Difference Phenology Index [79] |
NDPI = |
Table 2.
Field samples, including structural parameters and biomass data of each shrub.
Table 2.
Field samples, including structural parameters and biomass data of each shrub.
ID |
CL (m) |
CW (m) |
CH (m) |
CA (m2) |
CV (m3) |
Number of branches |
Single branch biomass (g) |
Single plant biomass (g) |
1 |
1.62 |
1.02 |
1.34 |
1.3 |
1.74 |
6 |
487.81 |
2926.88 |
2 |
1.74 |
1.55 |
1.83 |
2.12 |
3.88 |
13 |
243.91 |
3170.79 |
3 |
1.26 |
1.02 |
0.73 |
1.01 |
0.74 |
33 |
40.95 |
1351.46 |
4 |
0.72 |
0.58 |
0.5 |
0.33 |
0.16 |
1 |
183.43 |
183.43 |
5 |
0.51 |
0.48 |
0.45 |
0.19 |
0.09 |
1 |
513.86 |
513.86 |
6 |
1.15 |
0.96 |
0.84 |
0.87 |
0.73 |
10 |
134.43 |
1344.3 |
7 |
0.42 |
0.42 |
0.6 |
0.14 |
0.08 |
1 |
185.91 |
185.91 |
8 |
1.63 |
1.58 |
0.74 |
2.02 |
1.5 |
9 |
226.38 |
2037.45 |
9 |
0.88 |
0.99 |
1 |
0.68 |
0.68 |
9 |
121.95 |
1097.58 |
10 |
1.96 |
2.08 |
1.32 |
3.2 |
4.23 |
40 |
38.48 |
1539.07 |
11 |
1.32 |
1.47 |
1.11 |
1.52 |
1.69 |
15 |
144.95 |
2174.3 |
12 |
1.05 |
0.98 |
0.59 |
0.81 |
0.48 |
3 |
495.86 |
1487.58 |
13 |
0.89 |
0.68 |
0.75 |
0.48 |
0.36 |
5 |
51.95 |
259.77 |
Table 3.
Auxiliary datasets were used in this study.
Table 3.
Auxiliary datasets were used in this study.
Influencing factor |
Name |
Spatial resolution |
Precipitation |
Global Precipitation Measurement (GPM) v6 |
11132m |
Air temperature |
ERA5-Land Daily Aggregated - ECMWF Climate Reanalysis |
11132m |
Aridity Index |
Global Aridity Index and Potential Evapotranspiration (ET0) Climate Database v2 |
1000m |
Elevation |
NASA SRTM Digital Elevation |
30m |
Table 4.
Shrubland and non-shrubland classification accuracy assessment results.
Table 4.
Shrubland and non-shrubland classification accuracy assessment results.
Accuracy index |
Accuracy |
Recall |
F1 score |
Value |
0.91 |
0.92 |
0.92 |
Table 5.
Model feature selection results.
Table 5.
Model feature selection results.
Model |
Features |
SB |
Blue, Red, NIR, SWIR2 |
VI |
EVI, DVI, GNDVI, SAVI2, Ratio, RNDVI |
SBVI |
DVI, SAVI2, VH, VV |