1. Introduction
Air temperature (T
air) is a climate variable describing the energy and thermal balance in a very special zone of the Earth-atmosphere system, namely the surface of the Earth which is at the same time the very bottom layer of the atmosphere [
1]. It is a useful factor in tracking the climate change associated with human activities, especially in urban areas, which represent the “peaks” of anthropogenic activities and where the climate change impact is expressed and sensed the most. T
air is usually recorded by weather stations, which measure T
air between 1.5 - 2 m above ground and are distributed sparsely, thus failing to provide synoptic spatial coverage [
2,
3]. Moreover, urban areas are more heterogeneous than rural areas, hence the effective coverage of weather stations providing long-term observational data tends to be narrow [
4,
5], leaving large swathes of urban are-as unobserved. This is a reason to use remote sensing data for T
air prediction; remote sensing offers a possibility to track its seasonal behavior and especially its spatial distribution. T
air cannot be observed directly from space but thermal infrared sensors enable derivation of land surface temperature (LST), which is a limiting condition for the energy balance and is widely used to assess the spatial distribution of T
air [
6,
7].
The methodologies and approaches to assess and predict T
air via remote sensing are different. They are mainly based on a hybrid methodology, which combines GIS and re-mote sensing data. For example, in 2008 Cristobel J et al. applied this methodology combining geographical variables (e.g. altitude, latitude, continentality and solar radiation) and remote sensing predictors related with T
air such as albedo and LST and vegetation index NDVI obtained from Landsat sensors NOAA and TERRA (MODIS) and used multiple regression analysis and spatial interpolation techniques for data processing. The au-thors support that this combined approach underpins the best T
air models and NDVI and LST are the most powerful RS predictors in T
air modeling [
8].
It is worth to recall that LST is a basic physical parameter to describe ecological, hydrological and atmospheric processes and has a strong relationship with near surface T
air. However, these two parameters have different responses to atmospheric conditions and their link becomes even more complex in mountainous complex terrain, and/or where weather stations are scarce. In 2020 Mutiibwa D et al analyzed the benefits and some limitations of using LST as a variable for predicting T
air in two case study regions of Nevada well known for complex mountainous topography. Though with some limitations and complexities, the relationship between LST and T
air was found to be consistent [
9].
In 2020 Nikoloudakis N et al. applied the hybrid methodology to predict T
air in urban areas without LST. They developed and used predictive models, which are based on urban morphological peculiarities such as land cover and terrain, and in situ T
air measurements from urban weather stations [
10].
The modeling of urban T
air is much more complicated because of the heterogeneity, where the scarcity of weather stations hinders accurate spatial representation of T
air [
11]. It is further complicated in mountainous urban areas [
12]. The use of Machine learning (ML) approaches increases the accuracy of the estimations of T
air [
13]. A wide variety of statistical and ML models have been used so far. Among them the most popular ones still remain multiple regression, ANN models [
1,
11,
12,
14,
15,
16], with differences in results mainly due to the selection of input variables.
A most important component of regression models is the number of variables, which can range from one to a few tens. For example, up to 7 variables (skin temperature (LST), elevation, the Normalized Difference Water Index (NDWI), Sky View Factor (SVF), incident solar radiation, distance from the ocean, and atmospheric water vapor) were used by Hung Chak [
17], 24 variables by Modeste Meliho (ACDC (Aqua day clear-sky coverage), ACNC (Aqua night clear-sky coverage), ADVA (Aqua day view angle), ADVT (Aqua day view time), AE31 (Aqua Band 31 Emissivity),AE32 Aqua Band 32 Emissivity, ALSTD (Aqua day land surface temperature), ALSTN (Aqua night land surface temperature), ANVA (Aqua night view angle), ANVT (Aqua night view time),TCDC (Terra day clear-sky coverage),TCNC (Terra night clear-sky coverage), TDVA (Terra day view angle), TDVT (Terra day view time),TE31 (Terra Band 31 Emissivity), TE32 (Terra Band 32 Emissivity),TLSTD (Terra day land surface temperature), TLSTN (Terra night land surface temperature), TNVA (Terra night view angle), TNVT(Terra night view time), Elevation (DEM), Hillshade (HSD), Sky-view, Slope) [
1], 10 variables by Hanna Meyer (LST, DEM, Slope, aspect, sky-view, month, season, time, sensor, ice) [
18], 11 variables by Yongming Xu (monthly daytime Ts, monthly nighttime Ts, monthly percent of cloudy days, monthly percent of cloudy nights, monthly NDVI, monthly NDSI, monthly albedo, monthly solar radiation, annual landcover, elevation, and TI) [
19], 6 variables by Phan Thanh Noi (AQUA daytime, AQUA nighttime, TERRA daytime, and TERRA nighttime) and two additional auxiliary datasets (elevation and Julian day)), 5 variables by Long Li (Daytime LST, Night-time LST, NDVI, Night-time light, DEM) [
20], 9 variables by Yongming Xu (land surface temperature (LST), normalized difference vegetation index (NDVI), modified normalized difference water index (MNDWI), latitude, longitude, distance to ocean, altitude, albedo, and solar radiation) [
19], up to 10 variables by Munkhdulam Otgonbayar (daytime and nighttime LST data (LSTd and LSTn), quality information (QCd and QCn), observation information (DvA, NvA, DvT, and NvT), emissivity data (Em31 and Em32), clear sky coverage (CsD and CsN), Elevation, Slope, Aspect, Latitude, Longitude) [
21], 7 variables by Chunling Wang (Digital elevation model (DEM), LST, Downward Shortwave Radiation (DSR), Normalized Difference Vegetation Index (NDVI), and Land Cover (LC), latitude (LAT) and declination of the sun) [
22].
In this study, a PLSR model was used to assess and predict urban Tair considering more than 30 variables. To the best of our knowledge, statistical regression models starting from 30 variables have not been presented elsewhere in existing literature.
One more difference with the urban study cases described above lies in the complexity of the mountain features with respect to the size of the considered area, and the sparse configuration of weather stations. This study focuses on the city of Yerevan, Armenia, which stands out for unique geographical parameters, in particular for the high variation range of absolute elevation, exceeding 500m (>1600ft) on a small area covering about 220 sq km. The area has a dry climate, and only 3 weather stations are operating, which limits the information on spatial variation of T
air. In other case studies from literature, like Athens and Heraklion (Greece), Los Angeles (USA), Seoul (South Korea), Vancouver (Canada), Erbil (Iraq), where T
air predictions were performed using remote sensing data [
10,
12,
16,
23] such a combination is never found.
Hence, the objectives of the current study are:
Considering the complexity of the terrain configuration in Yerevan, to assess for the first time the feasibility of estimating urban Tair based on remote sensing data alone
Estimate the Urban Tair of the city of Yerevan using the PLSR model with a high (30) number of input variables.
To the best of our knowledge, so far, no investigations have been performed that could answer these questions.
4. Results and discussions
Pearson values are reported in
Table 2. The reported figures suggest that LST-mean has the most significant influence (r=0.79; p<0.001) on T
air. All other components such as IBI-SAVI-mean (r=0.35; p<0.001), SWIR1-mean (r≈0.3; p<0.001), SWIR2I-mean (r≈0.3; p<0.001), Red-mean (r≈0.3; p<0.001) show significant positive correlation. Green-mean, Blue-means and Aspect_SD correspondingly (r≈0.2; p<0.001), (r≈0.14; p<0.01) and (r≈0.11; p<0.01) also show significant positive correlation with T
air. Some other components such as NDWI_mean (r= -0.35; p<0.001), NDVI_mean (r= -0.25; p<0.001), NDWI_SD (r= -0.24; p<0.001) NDVI_SD (r= -0.26; p<0.001), IBI SAVI_SD and SWIR2_SD (r= -0.13; p<0.001) show significant negative correlation with T
air. In fact, two components contribute the most information to the temperature prediction and one of them is the modified IBI-SAVI index. This will be compared with results of automated selection as explained in the following.
As mentioned above the spatial dependence of the parameter impact was investigated for the selected sized areas 30m, 100m, 200m, 300m, 400m, 500m, 600m, 700m, 800m, 900m, 1000m through PLSR estimation. The table below shows the estimation results for the all-sized areas. Prior to the PLSR run the estimation of variable impacts and the selection of the parameters (VIP) were conducted.
The estimated importance of the 30 predictor variables in the PLS regression model for all the sized buffer zones is shown in the
Figure 3.
As mentioned above the selection of the sizes were stopped on the 1000m because the further increase of the radius results in an overlap of the areas between two weather stations.
As seen in
Table 3 the number of VIP components varies over increasing the radius of the circles around the weather stations. The table x shows that the quantity of the components (predictor variables) stabilized.
The VIP scores for the 1000m buffer zone are shown in
Table 4. LST-mean has the highest VIP score (2.77). According to table X the following variables have VIP scores greater than 1 (one): SWIR2_mean (1.42), IBI SAVI-mean (1.29) NDWI-mean (1.23). Blue-mean, red-mean and SWIR1_mean show scores of approximately 1.1 and NDVI_SD has a VIP score of 1.0. All the others are close to or below 1.
The results of the PLSR model received for 1000m buffer zone are shown in
Figure 4. As it can be seen, the PLSR model provides satisfactory results both for calibration (R
2Cal = 0.72, RMSE
Cal = 1.67) and validation (R
2Val = 0.77, RMSE
Val = 1.58).
In this work, using PLSR driven by a wide range of predictor variables (30) values of T
air on an area with complex terrain features such as Yerevan (Armenia) have been predicted with high accuracy of RMSE
Val =1.58 C. In the process, it was noticed that 5 predictor variables of the selected 10 with the high VIP scores also feature comparatively high (LST-mean: r=0.79; p<0.001) correlation coefficients (IBI-SAVI-mean: r=0.35; p<0.001 and SWIR2-mean; SWIR1-mean & Red-mean: r≈0.3; p<0.001). However, this shows that among these 5 variables Landsat-derived land surface temperature plays a key role in modeling T
air, with all other variables having a significantly smaller impact. The studies of Otgonbayar et al, concluded that PLSR represents well even seasonal and spatial variations in T
air when time-series of LST were included as predictor variables [
21].
The results of the importance analysis highlighted a pool of parameters, which impact the most on Tair. The heterogeneity of the area makes it particularly difficult to venture guesses on the reasons for the composition of such pool of variables.
Previous studies on estimating urban T
air from remote sensing data were performed using more advanced ML models such as Random Forest, Cubist, Support Vector Machine (SVM), and neural networks to estimate urban air temperature [
12,
15,
43]. Though we saw the great potential of the remote sensing data to estimate the T
air on Yerevan’s territory there is still a strong need to continue the studies using above-mentioned advanced ML models.