1. Introduction
Global warming has accelerated the rate of melting of the Arctic sea ice [
1]. During 1970-2010, Arctic sea ice area decreased by an average of 4% per decade [
2]. The rate of sea ice area decline increased dramatically into the 21st century [
3,
4,
5]. The extent of the Arctic sea ice reached its smallest value in the recorded history of satellite data in 2012, at about 3.34 million
, while the second lowest value occurred in 2020, at about 3.74 million
[
6]. The reduced extent of the sea ice presents new opportunities for a number of industries, including Arctic shipping, tourism, fisheries, and oil and gas exploration [
7]. Predicting seasonal and daily scale changes in Arctic sea ice is of great practical significance for the safe operation of Arctic shipping lanes and the development and utilization of Arctic resources.
Physical interactions between the atmosphere, ocean, and sea ice are the basis for predicting sea ice. Several studies have already been conducted to predict Arctic sea ice on different spatial and temporal scales and to explore the predictability in different seasons. Blanchart et al. [
8] used the outputs of the Community Climate System Model, version 3 (CCSM3) to assess the mechanisms of sea ice persistence, and comparisons with actual observations demonstrated that the model can be used for seasonal to annual predictions of sea ice. Krikken et al. [
9] used 15 climate models of CMIP5 to analyze the natural variability in Arctic sea ice from an energy balance perspective and found a strong correlation between the energy balance and the reappearance of sea ice anomalies from the sea ice melting season to the growing season. Guemas et al. [
10] reviewed potential sources of Arctic sea ice predictability from months to years, including the persistence and advection of sea ice anomalies, interactions with the oceanic atmosphere, and changes in the radiative forcing. Mohammadi et al. [
11] determined the potential predictability of Arctic winter sea ice using a sea ice–ocean coupling model, noting high predictability of sea ice concentration (SIC) and sea ice edge position over a 10-day period. Cruz et al.[
12] investigated sea-ice predictability from seasonal to annual scales using a variety of climate models, emphasizing the importance of the reoccurring effects of sea ice anomalies, and observed that anomalies in SIC in the Barents Sea are highly negatively correlated with local sea surface temperature anomalies. Onarheim et al.[
13] showed that ocean heat transport (OHT) variability plays an important role in winter sea ice variability in the Barents Sea, and that the use of the OHT can lead to predictions two years in advance. These studies provide knowledge on the predictability of sea ice associated with a wide range of physical processes, while emphasizing the importance of selecting predictors that are relevant to the target location and time scale.
The main approaches to Arctic sea ice prediction are numerical simulations, statistical predictions and deep learning. Numerical simulation methods are based on physical links between temperature changes, humidity transport, wind field models, cloud cover and ocean heat fluxes. Major climate simulation centers around the world have released some atmospheric and oceanic simulation data. Their adoption of climate models relies on real-time inputs of observational conditions in the data assimilation process. At the same time, due to the limitations of a single climate model and the large differences in the results of different models, it is necessary to determine the assigned weights of each climate model based on its contribution to the simulation of the current climate mean, and then take a weighted average to improve the accuracy of sea ice prediction. In practice, this treatment does not eliminate the effect of model bias on sea ice prediction. In addition, many processes in the dynamic model of sea ice need parameterization, and the current model lacks modeling of rheology, ice thickness distribution, wave–ice interaction, landing ice, melting water and size distribution of floating ice[
14]. Statistical methods are used to predict the state of sea ice according to historical data. With the thinning of sea ice, the average state of sea ice has changed significantly. In addition, most statistical models are linear models, which cannot learn the nonlinear relationship between variables in the Arctic climate system. Because the nonlinear feedback mechanism plays an important role in the coupling system of Arctic atmosphere, ocean and sea ice, it is necessary to predict Arctic sea ice using a nonlinear model. Deep learning technology has a strong nonlinear learning ability. Chi et al. and Choi et al. input the monthly average SIC data of the National Snow and Ice Data Center (NSIDC) into the multilayer perceptron (MLP) and the long and short-term memory (LSTM) models to predict the monthly average of SIC, and found that the results are better than the traditional autoregressive (AR) model[
15,
16]. Kim et al.[
17] used the integrated data of a regional climate model (RCM) as input variables, and used the deep neural network (DNN) method to deal with the nonlinear relationship between SIC and climate variables. As a result, they predicted the SIC in the Kara Sea and Barents Sea in the next 10-20 years. Fritzner et al.[
18] compared the prediction accuracy of a high-resolution dynamic assimilation model, K-NN model and FCN model for the next 7 days, and pointed out that the FCN model can provide similar prediction results to the dynamic assimilation model. Kim et al.[
19] input eight predictors into the CNN model to predict the monthly average SIC in the next month, and the results were better than that of the RF model. Andersson et al.[
20] put forward a model of a probabilistic sea ice prediction system. The model used climate simulation and observation data as the input data to predict the monthly average SIC in the next six months. The results showed that the IceNet model has a high accuracy in predicting the sea ice range, and it is superior to the SEAS5 dynamic model in predicting extreme sea ice events in summer. Liu et al.[
21] used the convolutional LSTM (ConvLSTM) model to predict SIC in the Barents Sea over the next six weeks. They added ERA-Interim reanalysis data to the training dataset, and used the covariance between different variables and the spatiotemporal correlation to complete the prediction of regional SIC. The results were better than the linear regression model. At present, the deep learning method is mainly used to predict the sub-seasonal scale of regional sea ice. Because the daily short-term forecast of SIC is very important for maritime shipping decision-making, it is urgent to attain the accurate daily short-term forecast of SIC.
The ConvLSTM and predictive recurrent neural network (PredRNN) models exhibit the capacity to capture spatiotemporal correlations among diverse input parameters, enabling them to theoretically predict spatiotemporal sequence data. This study introduces these models into the realm of the high-precision daily scale short-term prediction of Arctic sea ice. Initially, we compare the predictive performance of the ConvLSTM and PredRNN models when only SIC is utilized as the input. Subsequently, we enhance the input data by incorporating meteorological parameters that influence both SIC and the sea boundary, leading to the formation of ConvLSTM-multi and PredRNN-multi models. Through an investigation of the spatiotemporal correlations between SIC and meteorological parameters, we observe a substantial enhancement in the model's predictive capability for the sea ice edge region. Furthermore, this paper conducts a quantitative analysis to discern the model's sensitivity to the input meteorological parameters, pinpointing the key meteorological variables that affect the prediction accuracy of SIC.
4. Conclusions
This study explores the integration of multiple predictors into the ConvLSTM and PredRNN models, thereby creating ConvLSTM-multi and PredRNN-multi models. These models are designed for the 10-day prediction of SIC across the entire Arctic Ocean. The findings underscore that the incorporation of meteorological reanalysis data within the ConvLSTM and PredRNN models leads to a significant enhancement in the daily scale prediction accuracy of SIC. Notably, the predictive accuracy of the model is most influenced by SST, followed by the date of SIC. The ConvLSTM-multi model demonstrates the lowest MAE, RMSE and IIEE, with the least increase with the extension in the prediction time. Moreover, the ConvLSTM-multi model exhibits commendable accuracy, particularly during extreme years, albeit with a slightly inferior performance in predicting the distribution of SIC values compared to the PredRNN-multi model. The evaluation of sea-ice edge displacement indicates that the ConvLSTM-multi model excels during summer and autumn, while the PredRNN-multi model performs optimally during spring and winter. Consequently, this comparative analysis highlights that the ConvLSTM-multi model is better suited for predicting SIC over the next 10 days.
Despite the progress made in this study, it is essential to acknowledge several limitations inherent to the model. The predictive capacity for SIC hinges on both the quantity of available SIC datasets and the diversity of input parameters. However, the quantity of SIC datasets and the spatial resolution offered by NSIDC are constrained, necessitating a higher volume of datasets to further refine the model's parameters. Additionally, the model exhibits limited predictive capability for summer sea ice due to its exclusive reliance on reanalysis data. Crucial parameters such as sea ice thickness and ice melting pools, which wield a substantial influence on sea ice dynamics, have not been incorporated into the model. This omission can be attributed to the limited availability and accessibility of these parameters. Lastly, the study falls short in comprehensively analyzing the impact of input parameters on prediction outcomes and discerning the intricate nonlinear relationships between them. Future research endeavors will focus on intensifying the analysis of physical parameters influencing sea ice variations and enhancing the model's capacity to assimilate input data, to enhance the prediction ability of the model.
Author Contributions
Conceptualization, J.L. and J.F.; methodology, J.F.; software, J.F.; validation, J.F., J.L. and W.Z.; formal analysis, J.F.; investigation, J.L. and J.F.; resources, J.L., J.F., W.Z. and Z.L.; data curation, J.F., J.W., Z.L., L.K. and L.G.; writing—original draft preparation, J.F., J.L. and L.G.; writing—review and editing, J.L.; visualization, J.F. and J.W.; supervision, J.L.; project administration, J.L.; funding acquisition, J.L. All authors have read and agreed to the published version of the manuscript.
Figure 1.
Distribution map of sea in the study area.
Figure 1.
Distribution map of sea in the study area.
Figure 2.
The convolutional LSTM (ConvLSTM) network framework.
Figure 2.
The convolutional LSTM (ConvLSTM) network framework.
Figure 3.
The predictive recurrent neural network (PredRNN) network framework.
Figure 3.
The predictive recurrent neural network (PredRNN) network framework.
Figure 4.
ConvLSTM, PredRNN, ConvLSTM-multi, and PredRNN-multi models' SIC prediction accuracies over prediction time in 2020 and 2021.
Figure 4.
ConvLSTM, PredRNN, ConvLSTM-multi, and PredRNN-multi models' SIC prediction accuracies over prediction time in 2020 and 2021.
Figure 5.
Histograms of NSIDC SIC versus four model predictions during the melt season (June-September 2020). (a), (b), (c), and (d) corresponding to ConvLSTM, PredRNN, ConvLSTM-multi, and PredRNN-multi, respectively.
Figure 5.
Histograms of NSIDC SIC versus four model predictions during the melt season (June-September 2020). (a), (b), (c), and (d) corresponding to ConvLSTM, PredRNN, ConvLSTM-multi, and PredRNN-multi, respectively.
Figure 6.
Histograms of NSIDC SIC versus four model predictions during the melt season (June-September 2021). (a), (b), (c), and (d) corresponding to ConvLSTM, PredRNN, ConvLSTM-multi, and PredRNN-multi, respectively.
Figure 6.
Histograms of NSIDC SIC versus four model predictions during the melt season (June-September 2021). (a), (b), (c), and (d) corresponding to ConvLSTM, PredRNN, ConvLSTM-multi, and PredRNN-multi, respectively.
Figure 7.
The change in RMSE of daily scale SIC forecast values of four models from 2020 to 2021 with the increase in forecast days. (a1-a10), (b1-b10), (c1-c10) and (d1-d10), respectively, correspond to the RMSE distributions of ConvLSTM, PredRNN, ConvLSTM-multi and PredRNN-multi models from the first day to the tenth day.
Figure 7.
The change in RMSE of daily scale SIC forecast values of four models from 2020 to 2021 with the increase in forecast days. (a1-a10), (b1-b10), (c1-c10) and (d1-d10), respectively, correspond to the RMSE distributions of ConvLSTM, PredRNN, ConvLSTM-multi and PredRNN-multi models from the first day to the tenth day.
Figure 8.
The change in ACC of daily scale SIC forecast values of four models from 2020 to 2021 with the increase in forecast days. (a1-a10), (b1-b10), (c1-c10) and (d1-d10), respectively, correspond to the ACC distributions of ConvLSTM, PredRNN, ConvLSTM-multi and PredRNN-multi models from the first day to the tenth day.
Figure 8.
The change in ACC of daily scale SIC forecast values of four models from 2020 to 2021 with the increase in forecast days. (a1-a10), (b1-b10), (c1-c10) and (d1-d10), respectively, correspond to the ACC distributions of ConvLSTM, PredRNN, ConvLSTM-multi and PredRNN-multi models from the first day to the tenth day.
Figure 9.
The change in the integrated ice edge error (IIEE) values of the four model predictions over the prediction time, (a) and (b) represent the years 2020 and 2021, with IIEE in
Figure 9.
The change in the integrated ice edge error (IIEE) values of the four model predictions over the prediction time, (a) and (b) represent the years 2020 and 2021, with IIEE in
Figure 10.
a-b, c-d, and e-f represent the average displacement of the ice edge (), the average displacement of the integrated ice edge error (), and the deviation in the integrated ice edge error () for the four models in 2021, respectively, and the dashed lines in e and f correspond to the of the four models.
Figure 10.
a-b, c-d, and e-f represent the average displacement of the ice edge (), the average displacement of the integrated ice edge error (), and the deviation in the integrated ice edge error () for the four models in 2021, respectively, and the dashed lines in e and f correspond to the of the four models.
Figure 11.
RMSE distribution of the prediction results corresponding to different input parameters of the ConvLSTM-multi model. The red line represents the prediction results with all the input parameters. (a) and (b) correspond to the model predictions for 2020 and 2021, respectively.
Figure 11.
RMSE distribution of the prediction results corresponding to different input parameters of the ConvLSTM-multi model. The red line represents the prediction results with all the input parameters. (a) and (b) correspond to the model predictions for 2020 and 2021, respectively.
Figure 12.
RMSE distribution of the prediction results corresponding to different input parameters of the PredRNN-multi model. The red line represents the prediction results of inputting all parameters. (a) and (b) correspond to the model predictions for 2020 and 2021, respectively.
Figure 12.
RMSE distribution of the prediction results corresponding to different input parameters of the PredRNN-multi model. The red line represents the prediction results of inputting all parameters. (a) and (b) correspond to the model predictions for 2020 and 2021, respectively.
Figure 13.
RMSE distributions of the first day SIC predictions for the ConvLSTM-multi model without adding any noise (a1), (b1), masking the reanalysis data as noise (a2), (b2) and masking the SIC as noise (c1), (c2). a1-a3 and b1-b3 correspond to the predicted values for 2020 and 2021, respectively.
Figure 13.
RMSE distributions of the first day SIC predictions for the ConvLSTM-multi model without adding any noise (a1), (b1), masking the reanalysis data as noise (a2), (b2) and masking the SIC as noise (c1), (c2). a1-a3 and b1-b3 correspond to the predicted values for 2020 and 2021, respectively.
Figure 14.
RMSE distributions of the first day SIC predictions for the PredRNN-multi model without adding any noise (a1), (b1), masking the reanalysis data as noise (a2), (b2) and masking the SIC as noise (c1), (c2). a1-a3 and b1-b3 correspond to the predicted values for 2020 and 2021, respectively.
Figure 14.
RMSE distributions of the first day SIC predictions for the PredRNN-multi model without adding any noise (a1), (b1), masking the reanalysis data as noise (a2), (b2) and masking the SIC as noise (c1), (c2). a1-a3 and b1-b3 correspond to the predicted values for 2020 and 2021, respectively.
Table 1.
The specifications of the eleven predictors used to predict short-term sea ice concentration (SIC) in the study.
Table 1.
The specifications of the eleven predictors used to predict short-term sea ice concentration (SIC) in the study.
Variable |
Source |
Unit |
Temporal resolution |
Spatial resolution |
Value range |
Sea ice concentration |
NSIDC |
% |
Daily |
25 km |
[0,1] |
Sea surface temperature |
ECMWF ERA5 |
K |
Hourly |
0.25° |
[0,1] |
2m temperature |
ECMWF ERA5 |
K |
Hourly |
0.25° |
[0,1] |
Skin temperature |
ECMWF ERA5 |
K |
Hourly |
0.25° |
[0,1] |
Surface solar radiation downwards |
ECMWF ERA5 |
J m-2 |
Hourly |
0.25° |
[0,1] |
Mean sea level pressure |
ECMWF ERA5 |
Pa |
Hourly |
0.25° |
[0,1] |
10m u-component of wind |
ECMWF ERA5 |
m s-1 |
Hourly |
0.25° |
[0,1] |
10m v-component of wind |
ECMWF ERA5 |
m s-1 |
Hourly |
0.25° |
[0,1] |
Land mask |
# |
# |
Daily |
25 km |
0/1 |
Cosine of initialization day index |
# |
# |
Daily |
25 km |
[–1,1] |
Sine of initialization day index |
# |
# |
Daily |
25 km |
[-1,1] |
Table 2.
The daily average prediction accuracy of SIC in 2020 and 2021 by the "selected models" in CMIP6 under three CO2 emission scenarios (SSP126, SSP245 and SSP585).
Table 2.
The daily average prediction accuracy of SIC in 2020 and 2021 by the "selected models" in CMIP6 under three CO2 emission scenarios (SSP126, SSP245 and SSP585).
Year |
Scenarios |
MAE |
RMSE |
nRMSE |
ACC |
NSE |
2020 |
SSP126 |
19.67% |
29.13% |
69% |
0.76 |
0.53 |
SSP245 |
23.57% |
32.94% |
78% |
0.74 |
0.47 |
SSP585 |
25.44% |
35.2% |
85% |
0.71 |
0.41 |
2021 |
SSP126 |
20.08% |
29.22% |
70% |
0.76 |
0.53 |
SSP245 |
23.32% |
32.11% |
76% |
0.75 |
0.49 |
SSP585 |
24.58% |
33.95% |
80% |
0.73 |
0.45 |
Table 3.
Average sea ice edge displacement (unit: km) in winter–spring and summer–autumn of 2021 under three future CO2 emission scenarios (SSP126, SSP245, and SSP585) for the four deep learning models in this study and the "selected models" in CMIP6.
Table 3.
Average sea ice edge displacement (unit: km) in winter–spring and summer–autumn of 2021 under three future CO2 emission scenarios (SSP126, SSP245, and SSP585) for the four deep learning models in this study and the "selected models" in CMIP6.
|
ConvLSTM |
PredRNN |
ConvLSTM-multi |
PredRNN-multi |
SSP126 |
SSP245 |
SSP585 |
|
5.44 |
14.97 |
6.28 |
18.10 |
5.45 |
15.51 |
5.05 |
16.86 |
30.50 |
127.46 |
28.74 |
203.26 |
30.85 |
217.10 |
|
5.03 |
12.51 |
5.15 |
13.72 |
4.87 |
11.74 |
4.79 |
13.52 |
25.18 |
90.62 |
23.32 |
124.80 |
23.36 |
132.34 |
|
1.42 |
4.70 |
1.68 |
5.68 |
1.55 |
4.39 |
1.36 |
7.73 |
6.92 |
66.94 |
2.62 |
111.80 |
3.54 |
118.79 |
|
1.07 |
1.22 |
1.19 |
1.27 |
1.10 |
1.31 |
1.05 |
1.23 |
1.22 |
1.4 |
1.24 |
1.54 |
1.33 |
1.56 |