1. Introduction
Six major U.S. climate-model development institutions (NOAA GFDL, NCAR, NASA GISS, DOE ACME, NASA GMAO, and NCEP CFS—list of acronyms in
Table S1) have coordinated the efforts that could be translated into a better understanding of sources of climate predictability in almost a decade [
1,
2]. The motivation behind these modeling efforts is to improve operational sub-seasonal forecasts (10 to 60-day); however, climate predictability at such temporal lead times is still a major challenge [
4,
5,
6,
7]. Of particular interest is the sub-seasonal forecasting skill and predictability of precipitation in the United States Northern Great Plains (NGP) during late spring and early summer, May through July [
8]. During this season, extended precipitation event, or their absence, may lead to natural disasters such as the 1993 flood in the Midwest or the 1988 and 2012 droughts (
Figure 1 and
Figure 2), considered some of the costliest events in the history of the United States with estimated damages of 20 and 40 billion dollars [
9,
10]. Losses in the NGP included drops in corn yields of about 30% for Nebraska, Iowa, Minnesota, and Illinois, which affected local farming and regional economies [
11]. These states lie in the U.S. Midwest, where the corn economy is valued at about 50 billion dollars [
12]. Also, this region is considered a production hub for corn, soybean, and cattle and halves in the United States [
13,
14], making climate diagnostics and prognostics key to food and biofuel production and water resources security [
15,
16,
17,
18,
19,
20,
21,
22].
Precipitation variability in the NGP has been related to the spatial and temporal variability of the Great Plains low-level jet (GP-LLJ) [
24,
25,
26]. During the summer season, NGP precipitation and the GP-LLJ are linked through the transport of moisture from the Gulf of Mexico, the region's primary moisture source. The GP-LLJ can be identified using wind data from rawinsonde stations [
24] and wind profiler observations [
26]. Wind at 900-mb is in several retrospective reanalyzes [
27], such as NCEP-NCAR [
28] and the North American Regional Reanalysis (NARR)[
29]. The GP-LLJ, which has its maximum annual cycle during May through July (with a peak in July), allows an efficient moisture transport through an extensive plains surface center at 900-mb and along 95ºW [
27].
Rainfall variability of the northern Great Plains is also linked to large-scale atmospheric teleconnections [
27,
30,
31]. At 200 mb, the association between the GP-LLJ index and geopotential height (HGT) shows a positive strength center over Tennessee, acting over the country's eastern half, and a Rossby-wave train pattern [
32]. This geopotential-height pattern matches the interannual variability characteristic of the atmospheric circumglobal teleconnection (CGT) [
33]. Observational and modeling indicate that CGT affects summer precipitation in the United States [
30,
31]. The CGT pattern with one center of action located over North America is essential in driving the variability of summer precipitation over the Northern Hemisphere [
33]. The authors [
30] showed that the CGT pattern affects the rainfall distribution during the summer. This evidence can be a source of predictability for precipitation in North America and East Asia. Further, [
31] found two CGT climatic modes that affect North America's summer precipitation in the Southern and the Northern Great Plains. The pattern in the Northern Great Plains is responsible for 16% of the early summer (June-July) variability, as evidenced by the application of Empirical Orthogonal Function (EOF) analyses [
30]. These authors showed that the maximum strengthening of the CGT during July matches the maximum transport of low-level moist air from the Gulf of Mexico into the Great Plains. How these two components of U.S. climate variability (the CGT and the Low-Level Jets) affect the predictability of summer precipitation over the NGP region is a major interest of this study.
Although NGP precipitation variability has been extensively studied for several decades [
24,
26,
27], the predictability in the sub-seasonal range is still a challenge [
34,
35]. The authosrs [
36] reported the limited skill of the Climate Forecast System (CFS) to predict rainfall at ranges higher than 30 days. After many improvements on CFS version 2, they discussed that it is as good forecasting as its predecessor version 1. However, at a 15-day range, they found a significant improvement in precipitation forecast from CFSv1 to CFSv2, which is attributed to the improved initial state in the tropical atmosphere. A multi-model ensemble prediction seems to be a promising approach, at least for some variables such as sea surface temperature [
3,
36], but comparisons among several models from major national modeling centers and CFS reveal a similar skill for precipitation [
36]. Further understanding of the regional-to-global modes of variability, such as the GP-LLJ and CGT, and their interaction can help improve sub-seasonal forecasts, especially in agricultural landscapes reliant on precipitation forecasts for crops.
The following questions arise: How strong is the link between the GP-LLJ and CGT in a modeling framework—concerning the evolution of summer precipitation over the Northern Great Plains? Furthermore, how does precipitation predictability vary when the internal dynamics of a GCM capture these two modes of climate variability? In other words, whether the interaction between the GP-LLJ and CGT can influence precipitation predictability within a prognostic 30-day range. We hypothesize that the forecast skill of precipitation over the NGP can be better assessed if GP-LLJ and CGT's patterns of variability—at the daily-to-sub-seasonal and daily scales—are adequately simulated by the model used. We consider that the regional scale of the GP-LLJ combined with the large-scale circulation of the CGT could reveal the underlying mechanisms responsible for the improvement of sub-seasonal predictability in the NGP. The objectives are three-folded: (1) Estimate the GP-LLJ and CGT indices based on NARR and CFS diagnostic and CFS forecast products for at least 30 years; (2) Estimate the correlation threshold to assess the unconditional and conditional causality between NGP precipitation and the GP-LLJ and CGT indices; and (3) evaluate the 30-day forecast skill of the CFS-based daily precipitation forecast products for NARR data using the conditional association between NGP precipitation and the GP-LLJ and CGT indices. The lead time of 30-day length was selected based on the CFS's reported limited 20-day forecast skill [
36]. Then the performance of CFS-based precipitation forecasts was then evaluated with the GP-LLJ and CGT simulation for the extended wet and drought events of 1993 and 1988, respectively.
The paper's organization is as follows:
Section 2 describes the sources of data,
Section 3 is the Methodology, which explains the estimation of GP-LLJ and CGT indices, correlation thresholds, and associations between precipitation estimates and the forecast products.
Section 4 analyzes the results, and
Section 5 discusses the paper's central thesis.
4. Discussion
The predictability of precipitation has been evaluated by addressing two scientific questions: How strong is the link between the GP-LLJ and CGT in a modeling framework—concerning the evolution of summer precipitation over the Northern Great Plains? Furthermore, how does precipitation predictability vary when the internal dynamics of a GCM capture these two modes of climate variability?
The first question was answered by showing the GP-LLJ and the CGT's role in the variability of NGP precipitation at sub-seasonal and interannual scales. The link between the GP-LLJ, CGT, and NGP precipitation is strong, as shown by the NARR and CFS-RR using EOF analysis and Pearson correlation. In both reanalyses, the GP-LLJ contributes to the rainfall variability in the NGP at a daily scale (
Figure S2). The EOF analysis of V900 for the NARR for 30 years identifies diagnostic patterns of spatiotemporal variability for the GP-LLJ that explain 26% of the variance; the same GP-LLJ pattern from the CFS-RR explains 23% of the variance. The EOF analysis on 200-mb HGT reveals the CGT as the second dominant pattern for the precipitation in the NGP, with an explained variance of 11%. We have found that the association between precipitation GP-LLJ and CGT occurs at the daily and sub-seasonal scales. Thus, NGP precipitation responds to regional-to-global moisture transport mechanisms in the lower troposphere. At the same time, these mechanisms are modulated by the CGT at the sub-seasonal scale, as shown on HGT anomalies caused by the upper-level jet stream.
The GP-LLJ, the CGT, and the NGP precipitation relationship identified in the CFS reanalysis define the metric to evaluate their role in a forecasting framework. Using a Hovmoller diagram of precipitation (
Figure 5), this study depicts how the region's GP-LLJ peaks coincide with major convective storms. A correlation analysis between both gives a value of 0.4 (p < 0.05), which confirms previous research using alternative techniques [
26]. Similarly, the CGT index reveals a statistically significant link with GP-LLJ with a correlation of 0.5 (p<0.05) between them. The GP-LLJ, CGT, and NGP precipitation relationship facilitates a metric to evaluate the 30-day predictability of precipitation. This analysis indicates how this relationship is maintained in space (by the CGT index) and time (by the GP-LLJ index). This process-based approach could incorporate the predictability of the dominant drivers of precipitation into the analysis. Further, the treatment of scale separation was beneficial when assessing the contribution of the dominant modes of climate variability at the global (CGT) scale.
For the second question, the 30-day NGP precipitation was evaluated in the context of the GP-LLJ-and-CGT relationship maintained in space and time. A study by [
48] concluded that correct simulation of the GP-LLJ is necessary, but more is needed for adequately representing NGP precipitation. As hypothesized here, precipitation forecast skill increases in response to the enhanced simulation of the GP-LLJ and CGT at daily and sub-seasonal scales. The improved performance of the CFS precipitation in a forecast mode is due to the proper simulation of the GP-LLJ--CGT individual cases. At least 43% of the selected "good" cases outperform when they adequately represent the GP-LLJ and the CGT. Thus, it gives an objective process-based approach to quantify the role of these two modes of climate variability. Precipitation prediction is better when the relationship between these two climate variables is maintained.
The CFS, in a forecast mode, simulates limited cases of 30-day precipitation (r>0.5 [p<0.05]). This study has found that only 126 cases (22.9% of the total analyzed) were able to represent the 30-day precipitation variability with a correlation higher than 0.35 (p<0.05). In addition, from this group, 40.5% show correlation values higher than 0.5 (p<0.05); therefore, only 9% of the total simulated cases. A similar range of improvement can be found when the GP-LLJ and the CGT have the same thresholds. From the total 126 selected cases, 43.6% were able to simulate the GP-LLJ, and 37.3% of the CGT exceeded the threshold (r>0.35 [p<0.05]). These 43.6% and 37.3% of the cases demonstrate the importance of the GP-LLJ-and-CGT link for the predictability of NGP precipitation. A deeper analysis of the multidimensional covariance with a three-dimensional PDF also reveals that high precipitation correlation coefficients are associated with a better predictive GP-LLJ and CGT. Authors [
27] also highlighted how essential the GP-LLJ is in the NGP precipitation. However, this research demonstrates that predictability is improved when the GP-LLJ and the CGT signals are adequately simulated. As the temporal variability of the CGT is higher in the range of 10-60 days, the CGT provides predictability in these cases. However, it is essential to consider the 6-day variability of the GP-LLJ, which plays a secondary role in extending the range of precipitation predictability to 30 days.
The GP-LLJ-and-CGT analysis suggests that its link is passed through the model during the initialization process. This model ability was confirmed when the CFS was evaluated using its reanalysis version. Perfect boundary conditions enable the modeling framework to efficiently simulate the GP-LLJ and CGT at the sub-seasonal and interannual scales. This outcome suggested that the initialization processes are critical. Thus, rainfall generation in the NGP in GCMs is not independent of the initial conditions. This condition may indicate substantial uncertainty in the modeling evolution of the GP-LLJ-and-CGT flux, but the large-scale flow well informs a few regional weather patterns.
Although the CFS-R, as analyzed by [
36], showed meager predictability skill on average, this study showed that individual cases could reach higher skill levels when separated by how two significant drivers of precipitation are simulated NGP. Improved predictability of precipitation occurs when the GP-LLJ and the CGT are adequately simulated. In this context, predictability is limited to a few cases within what [
49] call "windows of opportunities." This outcome could motivate researchers to explore further the initialization process of those dominant modes of variability or how the GP-LLJ-CGT link could be built into the internal dynamic of the modeling framework. Author [
31] found that the CGT has two modes of variability that influence U.S. precipitation. This study encourages researchers to continue exploring how these two climate modes affect the predictability of precipitation. The combined use of these modes of variability could potentially help the improvement of predictive frameworks for water resources management and governance [
19,
50], phenotype predictability [
22], and water supply for agriculture [
51,
52,
53]; infrastructure risk and resilience [
20,
54]. Furthermore, the characterization of integration of variables such as soil moisture and soft computing can enhance the diagnostics and prognostics of extreme events associated with precipitation [
55,
56,
57,
58,
59,
60].
Author Contributions
Conceptualization, C-M.C. and F.M-A.; methodology, C-M.C. and F.M-A.; software, C-M.C. and F.M-A.; validation, C-M.C. and F.M-A.; formal analysis, C-M.C. and F.M-A.; investigation, C-M.C. and F.M-A.; resources, F.M-A.; data curation, C-M.C.; writing—original draft preparation, C-M.C. and F.M-A.; writing—review and editing, C-M.C., F.M-A., and L.C.; visualization, C-M.C.; supervision, C-M.C. and F.M-A.; project administration, F.M-A.; funding acquisition, F.M-A. All authors have read and agreed to the published version of the manuscript.
Figure 1.
Summer precipitation anomaly (ΔP) of the 1993 July–August (JA) season. Precipitation units are in mm/day and the climatology is used from the 1950-2014 period. The dataset is from [
23] Livneh et al. (2013), and the box defines a region (37.5º-45ºN; 103º-90ºW) for a precipitation index used in the next figures.
Figure 1.
Summer precipitation anomaly (ΔP) of the 1993 July–August (JA) season. Precipitation units are in mm/day and the climatology is used from the 1950-2014 period. The dataset is from [
23] Livneh et al. (2013), and the box defines a region (37.5º-45ºN; 103º-90ºW) for a precipitation index used in the next figures.
Figure 2.
(a) Precipitation climatology (P) in bars for the Northern Great Plains-Midwest region (MW=37.5º-45ºN; 103º-90ºW). The precipitation associated with the 1993 flood and 1988 drought are shown as a black line to illustrate the monthly changes in comparison with the 1981-2010 climatology. (b) Interannual variability of late spring- early summer (May through August) precipitation over the Northern Great Plains – Midwest region (37.5-45N; 103-90W). The 1993 historical flood event is highlighted as well as other major wet (red arrows) and dry (dark blue) years since 1950. The dataset is from [
23].
Figure 2.
(a) Precipitation climatology (P) in bars for the Northern Great Plains-Midwest region (MW=37.5º-45ºN; 103º-90ºW). The precipitation associated with the 1993 flood and 1988 drought are shown as a black line to illustrate the monthly changes in comparison with the 1981-2010 climatology. (b) Interannual variability of late spring- early summer (May through August) precipitation over the Northern Great Plains – Midwest region (37.5-45N; 103-90W). The 1993 historical flood event is highlighted as well as other major wet (red arrows) and dry (dark blue) years since 1950. The dataset is from [
23].
Figure 3.
(a) Spatial Empirical Orthogonal Function (EOF1) pattern of 900-mb meridional wind from the North American Regional Reanalysis (NARR) obtained from daily fields from May 1 through September 1, 1993, with explained variance of 30%. (b) Spatial correlation patterns among the temporal Principal Component (PC1; from top figure) and both daily precipitation and (c) daily 200-mb geopotential height (HGT). Oblique lines represent significant values at the t-test 95% level of confidence.
Figure 3.
(a) Spatial Empirical Orthogonal Function (EOF1) pattern of 900-mb meridional wind from the North American Regional Reanalysis (NARR) obtained from daily fields from May 1 through September 1, 1993, with explained variance of 30%. (b) Spatial correlation patterns among the temporal Principal Component (PC1; from top figure) and both daily precipitation and (c) daily 200-mb geopotential height (HGT). Oblique lines represent significant values at the t-test 95% level of confidence.
Figure 4.
Spatial pattern of the Empirical Orthogonal Function mode 2 (EOF2) from 200-mb geopotential height anomalies (ΔHGT) from the North America Regional Reanalysis (NARR) obtained from daily fields from May 1 through September 1 (MJJA), 1993, with explained variance of 11%.
Figure 4.
Spatial pattern of the Empirical Orthogonal Function mode 2 (EOF2) from 200-mb geopotential height anomalies (ΔHGT) from the North America Regional Reanalysis (NARR) obtained from daily fields from May 1 through September 1 (MJJA), 1993, with explained variance of 11%.
Figure 5.
Longitudinal-time Hovmoller diagram of observed precipitation for the 1993 summer season. The latitudinal average is taken over 40ºN-50ºN. Superimposed solid lines are GP-LLJ index (magenta) and CGT index (green) obtained using Empirical Orthogonal Function analysis using the North American Regional Reanalysis (NARR). Big arrows and horizontal lines are added to indicate the cases of high CFS precipitation correlation.
Figure 5.
Longitudinal-time Hovmoller diagram of observed precipitation for the 1993 summer season. The latitudinal average is taken over 40ºN-50ºN. Superimposed solid lines are GP-LLJ index (magenta) and CGT index (green) obtained using Empirical Orthogonal Function analysis using the North American Regional Reanalysis (NARR). Big arrows and horizontal lines are added to indicate the cases of high CFS precipitation correlation.
Figure 6.
Spatial patterns of the two dominant Empirical Orthogonal Functions (EOF1 and EOF2) of 900-mb meridional wind for (A) the North American Regional Reanalysis (NARR), and (B) the Climate Forecast System Retrospective Reanalysis (CFS-RR). Both obtained from the periods 1979-2014 and 1979-2010. The explained variance of these EOFs is shown at the top of each graph in percentage.
Figure 6.
Spatial patterns of the two dominant Empirical Orthogonal Functions (EOF1 and EOF2) of 900-mb meridional wind for (A) the North American Regional Reanalysis (NARR), and (B) the Climate Forecast System Retrospective Reanalysis (CFS-RR). Both obtained from the periods 1979-2014 and 1979-2010. The explained variance of these EOFs is shown at the top of each graph in percentage.
Figure 7.
Unconditional probability density functions (PDF) of the correlation values for the three indices: NPG precipitation (A), GP-LLJ index (B), and CGT index (C)—cyan bars. The total number of cases for each distribution is 701; and the continues line is the fitted normal PDF. The x-axis represents the correlation values between the observation and reanalysis and the forecasted valued from CFS-R for each index, respectively. The null distribution (gray bars) is constructed by bootstrap of this pool of cases and with a randomization of the original time series before computing the correlation.
Figure 7.
Unconditional probability density functions (PDF) of the correlation values for the three indices: NPG precipitation (A), GP-LLJ index (B), and CGT index (C)—cyan bars. The total number of cases for each distribution is 701; and the continues line is the fitted normal PDF. The x-axis represents the correlation values between the observation and reanalysis and the forecasted valued from CFS-R for each index, respectively. The null distribution (gray bars) is constructed by bootstrap of this pool of cases and with a randomization of the original time series before computing the correlation.
Figure 8.
Conditional bi-dimensional probability distribution (2D-PDF) constructed with equation 2 for two pair of datasets: (A) the GP-LLJ and NGP precipitation; and (B) the CGT index and NGP precipitation (Z200, PREC). The data used represents the total available cases between 1982 and 2009 (701 cases).
Figure 8.
Conditional bi-dimensional probability distribution (2D-PDF) constructed with equation 2 for two pair of datasets: (A) the GP-LLJ and NGP precipitation; and (B) the CGT index and NGP precipitation (Z200, PREC). The data used represents the total available cases between 1982 and 2009 (701 cases).
Figure 9.
The conditional and unconditional probability distribution of correlation for the NGP precipitation. The x-axis is the precipitation correlation values between observation and CFS-R for the period 1982-2009 based on 701 cases. The unconditional distribution is one-dimensional PDF and is the same as in Fig S5a; here is added for comparison. The conditional distribution is based on the simultaneous occurrence of GP-LLJ and CGT threshold, computed with equation 2 in the text. The conditional probability is plotted for this condition: [GP-LLJ index]r (NARR, CFS-R) > 0.4 and [CGT index]r (NARR, CFS-R) >0.4.
Figure 9.
The conditional and unconditional probability distribution of correlation for the NGP precipitation. The x-axis is the precipitation correlation values between observation and CFS-R for the period 1982-2009 based on 701 cases. The unconditional distribution is one-dimensional PDF and is the same as in Fig S5a; here is added for comparison. The conditional distribution is based on the simultaneous occurrence of GP-LLJ and CGT threshold, computed with equation 2 in the text. The conditional probability is plotted for this condition: [GP-LLJ index]r (NARR, CFS-R) > 0.4 and [CGT index]r (NARR, CFS-R) >0.4.
Figure 10.
Thirty-day Pearson temporal correlation between observed precipitation and the Climate Forecast System reforecast (CFS-R) precipitation for an area average over the Northern Great Plains (37.5°-45°N; 103°-90°W) described by bocks in two colors. The light-blue blocks represent statistically significant correlations higher than 0.35 but lower than 0.5, and the dark-blue blocks correlate higher than 0.5. Unfilled spaces have a correlation value lower than 0.35. The location of the block defines the initialization of the CFS-R. The season of analysis is from May 1 through September 1 from 1982-2009.
Figure 10.
Thirty-day Pearson temporal correlation between observed precipitation and the Climate Forecast System reforecast (CFS-R) precipitation for an area average over the Northern Great Plains (37.5°-45°N; 103°-90°W) described by bocks in two colors. The light-blue blocks represent statistically significant correlations higher than 0.35 but lower than 0.5, and the dark-blue blocks correlate higher than 0.5. Unfilled spaces have a correlation value lower than 0.35. The location of the block defines the initialization of the CFS-R. The season of analysis is from May 1 through September 1 from 1982-2009.
Figure 11.
As in
Figure 10, for Great Plains low-level jet (GP-LLJ) index correlation between the North American Regional Reanalysis (NARR) and the Climate Forecasting System reforecast (CFS-R) simulations for 1982-2009. (a) Correlations are shown only for the precipitation cases that are statistically significant, as defined in
Figure 10. These correlations are noted in this plot with the same light-blue and dark-blue blocks when the LLJ correlation is not significant. Statistically significant correlations are classified in four groups using the following ranges: 0.35, 0.45, 0.55, and 0.65 in orange tones. Only cases from May 1 through July 31 were analyzed. (b) Correlation for the 200-mb geopotential height index (Z200) between the North American Regional Reanalysis (NARR) and Climate Forecasting System (CFS) simulations.
Figure 11.
As in
Figure 10, for Great Plains low-level jet (GP-LLJ) index correlation between the North American Regional Reanalysis (NARR) and the Climate Forecasting System reforecast (CFS-R) simulations for 1982-2009. (a) Correlations are shown only for the precipitation cases that are statistically significant, as defined in
Figure 10. These correlations are noted in this plot with the same light-blue and dark-blue blocks when the LLJ correlation is not significant. Statistically significant correlations are classified in four groups using the following ranges: 0.35, 0.45, 0.55, and 0.65 in orange tones. Only cases from May 1 through July 31 were analyzed. (b) Correlation for the 200-mb geopotential height index (Z200) between the North American Regional Reanalysis (NARR) and Climate Forecasting System (CFS) simulations.
Table 1.
The Pearson correlation for the precipitation index (NGP precipitation), the Great Plains low-level jet (GP-LLJ) index, and the circumglobal teleconnection (CGT) index between the Climate Forecast System reforecast (CFS-R) and the North American Regional Reanalysis (NARR) for 30- day of simulations. The initial simulation time is indicated in the first column, and statistically significant results (p < 0.05) are bolded. The correlation values were aggregated for 100º-60ºW region, as observed in
Figure S6.
Table 1.
The Pearson correlation for the precipitation index (NGP precipitation), the Great Plains low-level jet (GP-LLJ) index, and the circumglobal teleconnection (CGT) index between the Climate Forecast System reforecast (CFS-R) and the North American Regional Reanalysis (NARR) for 30- day of simulations. The initial simulation time is indicated in the first column, and statistically significant results (p < 0.05) are bolded. The correlation values were aggregated for 100º-60ºW region, as observed in
Figure S6.
Model Initialization |
NGP precipitation |
GP-LLJ index |
CGT index |
1988.05.11 |
0.44 |
0.14 |
0.12 |
1988.05.21 |
0.77 |
0.36 |
0.13 |
1988.06.25 |
0.46 |
0.21 |
0.31 |
1988.06.30 |
0.37 |
0.39 |
0.57 |
1988.07.15 |
0.36 |
0.38 |
-0.33 |
1988.08.09 |
0.42 |
0.36 |
-0.39 |
|
|
|
|
1993.07.05 |
0.38 |
0.71 |
0.12 |
1993.07.20 |
0.48 |
0.60 |
0.3 |
1993.08.09 |
0.55 |
0.56 |
-0.22 |
1993.08.14 |
0.38 |
0.57 |
0.29 |
1993.08.24 |
0.45 |
0.01 |
0.12 |