1. Introduction
The retrieval of Significant Wave Height(SWH) is crucial in meteorology and oceanography as it serves as a fundamental parameter for assessing ocean wave conditions. Accurate SWH measurements are Vital for understanding sea state, which directly influences marine navigation, offshore operations, and coastal management. Utilizing remote sensing techniques to retrieval SWH allows for high-resolution, wide-coverage wave data collection, essential for improving weather forecasts, monitoring marine environments, and studying climate change impacts. Moreover, SWH data plays a significant role in validating and calibrating wave models, which are integral to predicting extreme weather events and managing coastal risks [
1,
2,
3,
4,
5]. Traditional methods for retrieving SWH, such as using altimeter satellites, buoys, and ship-based observations, face several challenges, including computational complexity, data acquisition difficulties, and platform-specific limitations. Altimeter satellites like TOPEX/Poseidon, Jason-1, Jason-2, and Sentinel-3 measure the time for radar pulses to travel to the ocean surface and back, requiring sophisticated algorithms to correct for atmospheric and instrumental effects, making data processing complex and resource-intensive [
6,
7]. Despite offering global coverage, these satellites have temporal and spatial gaps due to specific orbital paths, and adverse weather or technical issues can disrupt data collection [
8,
9]. Buoys, although highly accurate and capable of providing real-time data, are limited by geographic distribution and are susceptible to damage from severe weather, necessitating regular maintenance, which is costly and logistically challenging [
10,
11]. Ship-based observations, while valuable for direct measurements, are infrequent, geographically constrained, and costly, often restricted to specific routes or missions, and prone to human error [
12,
13]. Furthermore, the calibration and validation of SWH data require continuous efforts and integration of various datasets, adding to the overall complexity and resource requirements [
10,
11].
GNSS-Reflectometry (GNSS-R) is an innovative remote sensing technique that leverages reflected signals from Global Navigation Satellite Systems (GNSS) to extract various environmental parameters, including surface wind speeds, ice extent, soil moisture, and SWH [25,26,27,28,33,43,50]. By processing Delay-Doppler Maps (DDMs) to correlate surface roughness with wave height, GNSS-R offers a compelling alternative to traditional SWH retrieval methods. Compared to traditional methods, GNSS-R has several advantages in SWH retrieval. Unlike altimeter satellites, which have temporal and spatial gaps due to specific orbital paths, GNSS-R provides more frequent and comprehensive coverage. It also overcomes the limitations of buoys, which are restricted by geographic distribution and susceptibility to severe weather damage, and ship-based observations, which are infrequent and geographically constrained. Additionally, GNSS-R data collection is less affected by adverse weather conditions and does not require complex and resource-intensive data processing algorithms [25]. The spaceborne GNSS-R research commenced with the TechDemoSat-1 (TDS-1) mission, launched by the UK Space Agency (UKSA), which carried the SGR-ReSI payload to demonstrate GNSS-R’s feasibility for environmental monitoring [23]. This pioneering mission paved the way for subsequent advancements, notably the Cyclone Global Navigation Satellite System (CYGNSS), launched by NASA in December 2016. Comprising eight microsatellites, CYGNSS provides frequent and comprehensive measurements. Initially scheduled to terminate on September 30, 2023, the mission has been extended due to its excellent operational performance and high-quality data products. The extension of CYGNSS’s mission underscores the significant potential and broad applicability of GNSS-R technology. This technique not only enhances our ability to monitor and understand oceanic and atmospheric conditions but also promises a robust future for environmental remote sensing. By integrating GNSS-R capabilities with advanced machine learning models, researchers can achieve unprecedented accuracy in SWH retrieval, contributing significantly to marine navigation, weather forecasting, and climate research. The continued operation of CYGNSS beyond its planned termination date highlights its importance and efficacy in environmental monitoring, affirming the broad and meaningful impact of GNSS-R technology.
Studies by Ruf et al. [48] have demonstrated strong correlations between CYGNSS-derived roughness measurements and buoy data, validating the approach for SWH retrieval. Additionally, Clarizia et al. [29] have significantly improved noise reduction in DDMs, enhancing the precision of SWH measurements. In addition to these data analysis methods, machine learning techniques have also been widely applied to SWH retrieval. These techniques have notably enhanced the capabilities of CYGNSS in retrieving various environmental parameters. For instance, neural networks have been applied to retrieve sea surface wind speeds with high accuracy. Methods developed by Stopa and Cheung [23] and Jensen et al. [24] utilize large datasets of co-located buoy and satellite data to train models that accurately retrieve wind speeds based on GNSS-R signal characteristics. Neural networks trained on extensive datasets of co-located buoy and satellite data have modeled the relationship between reflected signal characteristics and SWH with high precision. For example, Morris et al. [31] and Gleason et al. [30] used neural networks to retrieve SWH from CYGNSS data, demonstrating significant improvements in accuracy. Furthermore, deep learning approaches, such as those explored by Li et al. [32], have shown promise in extracting SWH from GNSS-R data, leveraging the ability of deep neural networks to handle complex and non-linear relationships. Recent studies, such as those by Patanè et al., have proposed the use of LSTM-based estimation models [36], while Bu et al. combined ERA5 data with CNN networks for SWH retrieval research [37].
Despite these advancements, there are still limitations in using machine learning for SWH retrieval compared to wind speed retrieval. The models used for SWH are often less sophisticated, and the amount of training data is relatively sparse. Research on SWH retrieval lags behind that of wind speed, where more advanced models like transformers have been applied. For instance, the use of hybrid transformer networks and ConvLSTM models in wind speed forecasting has shown significant improvements in accuracy and prediction horizons [34,35]. In contrast, SWH models tend to rely on older, simpler architectures, which may not capture the complex dynamics as effectively. Overall, the integration of advanced machine learning techniques with GNSS-R data from CYGNSS not only improves the accuracy and precision of SWH retrievals but also broadens the scope of applications, making it a crucial tool for contemporary remote sensing. However, the field still faces challenges, particularly in the sophistication of models used for SWH retrieval compared to wind speed retrieval.
In China, the current approach to SWH retrieval predominantly utilizes data from the Cyclone-1 (CFOSAT) satellite. Launched in 2018, CFOSAT employs GNSS-R to monitor ocean surface wind speeds and SWH. However, its single-satellite design limits both data coverage and temporal resolution. With the launch of the FY-3E series satellites in 2021, equipped with GNOS(GNSS Occultation Sounder) payloads capable of GNSS-R data collection, China’s spaceborne GNSS-R technology has seen rapid development. Comparatively, CYGNSS, composed of eight microsatellites, offers high-frequency data that is particularly useful for monitoring extreme weather conditions. CYGNSS processes DDMs from reflected GPS signals to maintain data quality under adverse weather, while FY-3E’s GNOS leverages multi-frequency DDMs to improve measurement accuracy and reduce noise [14,15,16]. The GNOS sensor on FY-3E also demonstrates superior capabilities in monitoring polar ice changes and high-latitude ocean environments, complementing the strengths of CYGNSS [17,18,19,20,21,40]. Despite significant progress in sea surface wind speed retrieval[38,44], comprehensive research on SWH retrieval using FY-3E data has yet to be conducted. Developing this capability would significantly enhance the utility and accuracy of FY-3E products. By focusing on SWH retrieval, FY-3E can provide more reliable data, crucial for applications such as marine navigation, weather forecasting, and climate research[39,41,42]. This research would not only improve the precision of FY-3E’s measurements but also extend its applicability, thereby solidifying its role in global oceanographic monitoring [21,22].
Given the challenges in machine learning-based GNSS-R retrieval methods, this study focuses on using FY-3E GNOS payload data with ERA5 as the reference data for SWH retrieval. The contributions of this paper are as follows:
First Use of FY-3E GNOS Data for SWH Retrieval: This study is the first to utilize FY-3E GNOS payload data for SWH retrieval, achieving promising accuracy.
Proposal of the ViT-Wave Model: Combining the latest transformer models with Vision Transformer (ViT) models tailored for the task, we propose a specialized model, ViT-Wave, for SWH retrieval.
Global Ocean Analysis: The global ocean analysis demonstrates that the model significantly improves the retrieval accuracy of high wave heights and enhances the overall precision distribution across different sea states.
The structure of this paper is as follows:
Section 2 introduces the data encountered in the experiment.
Section 3 describes the experimental methods and models.
Section 4 details the experimental process.
Section 5 provides a summary and discussion of the experimental results.
Section 6 concludes the paper with final remarks. By advancing the application of neural network models and integrating state-of-the-art transformer techniques, this research aims to significantly improve the accuracy and reliability of SWH retrieval using FY-3E GNOS data, contributing to the broader field of oceanographic monitoring and analysis.
5. Result and Discussion
Upon completing the experimental tests, we evaluated the finalized models using the test dataset. The performance of these models was assessed based on several predetermined evaluation metrics: RMSE, MAE, BIAS, MAPE, and R². The test results are summarized in
Table 3. Among the evaluated models, the ViT-Wave model demonstrated superior performance across all five metrics. Specifically, the ViT-Wave model achieved an RMSE of 0.4052 meters, indicating that it had the lowest overall error distribution. This suggests that the ViT-Wave model is highly effective in capturing the variability in the data and minimizing retrieval errors. The MAE for the ViT-Wave model was 0.27 meters, reflecting its capability to maintain low absolute errors, which highlights the model’s robustness in providing accurate retrievals with minimal deviations from the observed values. In terms of BIAS, the ViT-Wave model achieved a value of -0.0015 meters. This near-zero bias indicates that the model’s retrievals are almost unbiased, with negligible systematic errors, thereby enhancing its reliability for practical applications. The MAPE for the ViT-Wave model was 18.02%, highlighting its efficiency in minimizing percentage errors relative to the observed values. This metric is crucial for assessing the model’s performance in contexts where relative error measurements are critical. However, the Hybrid-Wave model achieved a slightly lower MAPE of 17.71%, indicating its superior performance in scenarios where minimizing relative errors is particularly important. This suggests that the Hybrid-Wave model may be more effective in applications requiring precise percentage error reduction, possibly because it is less influenced by extreme values. Finally, the R² value of 0.9117 for the ViT-Wave model signifies a high level of correlation between the retrieved and observed values. This strong correlation indicates that the model explains a substantial proportion of the variance in the observed data, further validating its effectiveness.
In addition to these evaluations, we conducted comparative studies with other models that utilize CYGNSS data for SWH retrieval. When compared with traditional data statistical analysis methods such as SNR[49] and NCDW LES[49], the ViT-Wave model exhibited improved performance in terms of RMSE and MAE. These improvements highlight the advantages of leveraging advanced neural network architectures over conventional statistical methods. Furthermore, the ViT-Wave model outperformed several other machine learning models, including ANN[32], BT [32], and DCNN[37]. The enhancements in retrieval accuracy with the ViT-Wave model underscore its potential for more accurate and reliable SWH retrievals compared to these existing methods.
To better illustrate the performance improvements of the ViT-Wave model,
Table 4 presents the percentage enhancements of key parameters when compared to other models.
Scatter density plots were generated for each model to comprehensively evaluate their performance as shown in
Figure 8. In these plots, the black solid line represents the ideal
line, indicating perfect agreement between the observed and retrieved values. Additionally, a red dashed line depicts the linear fit of the data, with the corresponding regression equation displayed in the bottom right corner of each plot. The scatter density plots reveal that the ViT-Wave model demonstrates the most favorable distribution of data points around the ideal
line, indicating a high degree of accuracy in its retrievals. The linear regression equation for the ViT-Wave model,
, underscores the model’s superior fitting performance. This close alignment with the ideal line suggests that the ViT-Wave model effectively captures the underlying relationship between the observed and retrieved SWH values, resulting in minimal deviations and high fidelity in its predictions.
In comparison, other models show varying degrees of dispersion around the y=x line, reflecting differences in retrieval accuracy and consistency. The scatter density plots reveal that models such as the CNN-Wave and Trans-Wave have more scattered points, indicating higher retrieval errors and less reliable performance. The fitting equations for these models also exhibit greater deviations from the ideal line, further highlighting their comparative inferiority. Overall, the scatter density analysis reinforces the earlier findings from the quantitative metrics, solidifying the ViT-Wave model’s status as the most robust and accurate model for SWH retrieval among those evaluated. Its superior performance across both quantitative metrics and visual scatter plots underscores its potential for practical applications in wave height prediction and oceanographic research.
We also conducted a segmented error analysis based on the range of SWH values. The SWH data were divided into five segments: 0-2 m, 2-4 m, 4-6 m, 6-8 m, and >8 m. The blue column represents the ANN-Wave model; the orange column represents the CNN-Wave model; the blue column represents the ANN-Wave model; the gray column represents the Hybrid-Wave model; the red column represents the Tans-Wave model; the green column represents the ViT-Wave model. The errors for each model within these segments were calculated and presented in the form of bar charts (see
Figure 9 Figure 10 Figure 11). From these bar charts, it is evident that the ViT-Wave model consistently demonstrates the lowest errors across the SWH range of 0-8 meters, indicating its superior performance in this range. Notably, the error range of 0-4 meters is where all models perform best across all metrics, which may be related to the data distribution, as shown in
Figure 7. This suggests that the ViT-Wave model is highly effective in accurately retrieving SWH values, particularly within the common range encountered in oceanographic observations. Interestingly, for SWH values greater than 8 meters, the Trans-Wave model shows the smallest errors, outperforming other models in this higher SWH segment. This indicates that while the ViT-Wave model excels in general SWH conditions, the Trans-Wave model has a notable advantage in extreme wave conditions, where its architecture might better capture the complex features associated with higher wave heights. This also aligns with the test results in
Table 3 and
Table 4, showing the ViT-Wave model’s limitations under extreme conditions. The segmented error analysis highlights the strengths of both the ViT-Wave and Trans-Wave models, each excelling in different SWH ranges and collectively offering robust solutions for a wide spectrum of wave height retrieval scenarios.
To better understand the global distribution of SWH across the ocean surface, we utilized reference points based on ERA5 SWH data along with their corresponding latitudes and longitudes. Upon evaluating the test dataset, it was observed that 95% of the SWH values were concentrated in the 0-5 meter range. To enhance the visibility of the overall SWH distribution, a customized color bar was employed, as shown in
Figure 12, where darker colors indicate lower SWH values and lighter colors indicate higher SWH values. From
Figure 12 and
Figure 13, it can be seen that most of the colors are consistent, indicating that the ViT-Wave model’s retrieval results are close to the ERA5 SWH. However, in the central part of the Southern Hemisphere, the color in
Figure 13 is lighter than in
Figure 12, indicating that the ViT-Wave model underestimates the SWH compared to the ERA5 data. This suggests that the ViT-Wave model requires further optimization in high-value ranges.
From in
Figure 12, which represents the ERA5 data, it is evident that SWH values tend to be higher in high latitude regions, particularly in the Southern Hemisphere. This observation cannot be made using CYGNSS data due to its limited coverage between 33°N and 33°S latitude, highlighting the advantage of FY-3E data. Overall, the ViT-Wave model shows a distribution that closely resembles the ERA5 reference, indicating its superior performance in capturing the global SWH distribution.
To evaluate the error distribution of the ViT-Wave model across the global ocean surface, we generated bias distribution maps (see
Figure 14). These maps visually represent the differences between the ViT-Wave model predictions and the ERA5 reference data. In these maps, negative biases (indicating model underestimation) are shown in red, with darker shades representing larger discrepancies. Conversely, positive biases (indicating overestimation) are shown in blue, with darker shades indicating greater deviations. Regions where the model predictions match the ERA5 data are displayed in white. From
Figure 14, it is evident that the ViT-Wave model tends to underestimate SWH values overall. The biases are more pronounced in high-latitude regions compared to areas near the equator, suggesting that the model’s performance is less accurate in these regions. This indicates a need for further refinement and optimization of the ViT-Wave model to improve its accuracy, particularly in high-latitude areas. Future research should focus on addressing these discrepancies to enhance the model’s reliability and performance in various oceanographic conditions.