1. Introduction
Preterm birth is a major public health issue concerning perinatal mortality, long-term morbidity, and economic burden, with a worldwide prevalence of 5-18%, 90% of which occurs in developing countries [
1,
2]. Spontaneous preterm birth (SPB) is defined as the delivery of the fetus before 37 weeks’ gestation, calculated by the last menstrual period or reliable first-trimester ultrasound, without any medical intervention to induce this outcome [
3]. Although evidence suggests that preterm labor is a heterogeneous condition triggered by multiple factors [
4,
5], the inflammatory response elicited at the maternal-fetal interface is considered a hallmark of the pathology. The increase in the local cytokine release that orchestrates the inflammatory response is observed cases with intrauterine infection and spontaneous preterm labor that is not associated with infection, thereby described as the intrauterine inflammatory response syndrome [
6].
In vivo, cytokines are part of the molecular network mediating innate immune and inflammatory responses [
7]. All cytokines expressed locally in the cervix participate in complex interactions with prostaglandins and nitric oxide, which regulate the production of extracellular matrix proteases and other factors associated with the cervical shortening, rupture of membranes, and uterine contractions that lead to labor either at preterm or term [
6,
8,
9]. An increase in the pro-inflammatory cytokines IL-6 and IL-8 in the cervical-vaginal fluid has been associated with cervical effacement [
10,
11], while an increased TNF-α is associated with cervical ripening [
12]. On the other hand, IL-10 concentration has been identified as the main anti-inflammatory cytokine involved in PB pathogenesis [
13]. So far, the results of studies analyzing the relationship between cytokines and PB are inconsistent, and none have identified a biomarker that can accurately predict preterm delivery [
7]. These inconsistencies can be explained by the complexity of the local inflammatory processes and the heterogeneity of the pathways that trigger PB.
There is a pathophysiological association between preterm premature rupture of membranes (PPROM) and PB, both triggered when the intrauterine or maternal environment is hostile. Moreover, PPROM is among the leading causes of preterm birth [
14], probably due to the onset of an intrauterine infectious/inflammatory process, which produces an imbalance in cytokine production, disrupting the tight junctions of the membranes and rupturing the amniotic sac [
15]. Some cytokines associated with PPROM and PB in placental tissue are IL-1β, IL-6, IL-8, and transforming growth factor (TGF-β) [
16].
Among the screening techniques to detect women at high-risk for PB, the most used measurement in clinical practice is cervical length (CL), obtained by vaginal ultrasound in the second trimester of gestation, with a detection rate of 50 to 70%. Also, some clinical calculators incorporate CL, gestational age, and obstetric history to provide a patient-specific risk for developing PB. The calculator that is considered the gold standard in clinical practice is the Fetal Medicine Foundation (FMF) calculator, having a detection rate for preterm spontaneous birth <28 weeks = 75%, 28-30 weeks = 57%, 31-33 weeks = 46%, and 34-36 weeks = 24%, considering a false positive rate of 10% [
17].
In recent years, artificial intelligence has been used in different healthcare fields to predict, prevent, diagnose, and monitor different pathologies, even in obstetrics [
18]. Models using machine learning are more accurate than risk calculators as they analyze massive data and, through algorithms, identify patterns to make predictions [
19]; furthermore, it has been proposed that machine learning models can be helpful in personalized pregnancy management, especially in low- and middle-income countries [
18]. The goal of identifying women at high-risk for developing PB is to individualize the clinical follow-up and offer medical preventive strategies (e.g., progesterone) that reduce by up to 90% the risk of preterm birth in women with a history of this outcome and by 42% in pregnant women with short cervix detected in second-trimester screening [
20].
Although the role of cytokines as critical mediators in the inflammatory process that triggers labor has been demonstrated [
21], very few predictive models consider their measurement in the PB screening process [
22,
23,
24]. However, none have been implemented in clinical practice. Therefore, this study aimed to characterize the cytokine profile in cervical-vaginal mucus in pregnant women with high- and low-risk for PB and then integrate them in a PB screening model using a predictive machine learning analysis.
3. Discussion
In this study, we first focused on the characterization of inflammatory cytokine profiles in cervical-vaginal mucus in pregnant women with high- and low-risk for PB, as classified according to the CL measured in the second trimester of gestation. In agreement with previous reports, a higher concentration of IL-2, IL-6, and IFN-γ was found in the cervical-vaginal samples of the high-risk group, and a higher concentration of IL-1ra in the low-risk group [
26,
27,
28].
On the other hand, IL-4 and IL-10 were found to be increased in the high-risk group, in contrast to what was expected according to the inflammatory pathogenesis of PB [
29]. These differences could be explained by the complexity of the inflammatory processes involved in PB and owing to the different methodologies and designs used in other studies [
30]. Elevation of anti-inflammatory cytokines in the high-risk group for PB could be a physiological attempt to moderate the inflammatory process to maintain homeostasis; this hypothesis was previously proposed by Wang et al., as they found that IL-37 (an anti-inflammatory cytokine) was elevated in the fetal membranes of women with preterm labor, as an attempt to stop the inflammatory process caused by IL-6 [
31].
IL-6 is a critical mediator in infection and inflammation and one of the most studied biomarkers associated with cervical shortening in preterm labor. In our study, IL-6 was found to increase in cervical-vaginal samples from the high-risk group, which coincides with that reported by other studies [
32]. Goepfert et al. found that cervical IL-6 concentrations measured at 24 weeks of gestational age (wGA) were elevated in women who had preterm delivery before 32 wGA compared to women whose pregnancies were carried to term, and this finding was even more evident in women with a history of preterm delivery [
32,
33,
34]. In our study, 40% of the women in the high-risk group had a history of a previous delivery before 37 wGA [
33,
34,
35].
In our study, a higher concentration of IFN-γ was found in the group of patients at high-risk for preterm delivery. Accordingly, in a recent report by Sandoval-Colin et al. it was found a positive correlation between elevated IFN-γ, TNF-α, IL-1β, IL-6, and the onset of labor [
22], in what appears to be a functional “maturation” of the immune system due to the inflammatory response. Additionally, it has been described that the activation of Th1 cells may increase the secretion of TNF-α, INF-γ, and IL-1β in the fetal membranes and the amniotic fluid in preterm labor [
36].
IL-1ra belongs to the IL-1 family of cytokines with an anti-inflammatory action [
37]; our study found IL-1ra to be higher in patients at low-risk for PB. To our knowledge, there is no literature on the cervical concentration of this cytokine in the pathogenesis of preterm delivery; however, our IL-1ra results could be well compared with those of the soluble IL-6 receptor (sIL-6r) that has been associated with a reduction in the risk of preterm delivery (RR 0.4 CI 95% 0.15–0.80) [
38,
39].
IL-2 is crucial for maintaining immune homeostasis, and a correlation has been previously demonstrated between elevated IL-2 levels and chronic inflammation [
40]. Few studies have demonstrated the relevance of IL-2 and its receptor in the pathogenesis of preterm labor; the possible explanation is that in normal pregnancy, the concentrations of IL-2 and its receptor are relatively low and difficult to measure, but in the case of preterm labor, an increase of this interleukin has been demonstrated that could imply the chronicity of an inflammatory process and not the acute response as in the case of IL-6 [
41].
In most previous publications concerning cytokines in cervical-vaginal fluid, the elevated concentrations of IL-10 have been proposed as a protective factor for preterm delivery [
42,
43], as belongs to the group of cytokines with anti-inflammatory activity. In our study, the highest concentrations were observed in the high-risk group, which is in agreement with the study by Vogel et al. reporting that elevated IL-10 concentrations could also be associated with an increased risk of preterm delivery (RR 3.1 CI 95% 0.96–9.7) [
44], as a response to an inflammatory mechanism that was previously initiated [
42].
In the second phase of our study, we developed an artificial intelligence model for PB prediction using machine learning (ML), incorporating the quantification of cytokines, as they are the crucial mediators of the inflammatory process observed in preterm labor. Machine learning is the engine that is driving advances in the development of artificial intelligence. The main areas that may benefit from ML techniques in the medical field are diagnosis and outcome prediction; ML can transform how medicine works [
19,
45]. AI methods in medical care could facilitate individual pregnancy management and improve public health, especially in low- and middle-income countries. ML allows us to analyze interactions between variables different from what we are conventionally used to, overcoming limitations such as sample size and data distribution. In our study, two predictive models were performed, the full and the adjusted models. The full model included all the predictor variables (obstetric history, CL, and concentration of all the cytokines measured). The ML analysis allowed us to choose only the variables with the highest predictive significance to build and train the adjusted model that included only maternal age, CL, and IL-2 concentration. Our results demonstrated that the adjusted model, even when it included a smaller number of variables, had a better predictive performance; this is possible because of the type of analysis used, where the variables that full the model are eliminated to improve performance [
45,
46,
47].
In most studies involving cytokines in high- and low-risk groups for PB, the classification only consideres obstetric history [
48]. So, one of the main strengths of our work is that we studied the cervical-vaginal fluid concentration of the principal cytokines involved in the inflammatory process in patients classified as high- and low-risk for PB by CL, the gold standard in current screening and a variable which itself indicates the onset of cervical shortening and therefore the phase prior to the onset of labor. Additionally, the screening was performed at the appropriate weeks of gestation according to international guidelines [
49] and was not biased by interventions such as cerclage or progesterone before measurement. Besides, the presence of vaginal infection during sampling for cytokine determination was ruled out.
Another contribution of our work is comparing the FMF calculator and the predictive model obtained by the Random Forest analysis. With this analysis, we demonstrate that if we add the measurement of IL-2 to the prediction model, we can increase by 8 percentage points the detection rate reported by the gold standard in clinical practice, which would help to identify more accurately those cases at high-risk for PB allowing to implement preventive medical strategies (e.g., progesterone) that can offer an efficiency up to 90% [
50,
51].
The main limitation of our study is the number of patients included; however, it was possible to identify differences between the groups with significant statistical power. Considering the construction of the model, this fact is compensated by the nature of the analysis performed since the Random Forest analysis builds the ideal model and then replicates it thousands of times to test its efficiency [
52]. However, we consider that a more significant number of patients is required to strengthen the model to be used as a reference for PB prediction in clinical practice [
53].
As we aimed to design an effective screening model for PB with the inclusion of clinically applicable markers based on cost, availability, efficacy, and timeliness of the test, the next step is undoubtedly to perform a cost/benefit analysis to assess whether adding the IL-2 measurement to the screening model is endorsed by the resources saved with the risk detection and prevention of PB.
Our results contribute to a better understanding of the cervical-vaginal inflammatory network elicited in patients classified as high- and low-risk for preterm delivery according to the current screening gold standard. The prediction model integrating cytokines in cervical-vaginal mucus showed a DR 8 points higher than the gold standard used in clinical practice, with a lower FNR. The increase in DR with the reduction in FNR may allow us to identify women at risk for PB more accurately and implement strategies to prevent this outcome. Our work could represent a significant advance in screening and reducing the prevalence of PB, a goal that has not being achieved in the last 30 years yet.
4. Materials and Methods
4.1. Ethics statement
The study was conducted at the Instituto Nacional de Perinatologia in Mexico City between January 2019 and December 2021. The protocol was approved by the Ethics, Research, and Biosafety Internal Review Boards (2017-2-69). Women who met the inclusion criteria were invited to participate, and they read and signed the informed consent form.
4.2. Study population
Recruitment at convenience was performed by considering women who attended the Maternal-Fetal Medicine Department at the Instituto Nacional de Perinatologia in Mexico City to screen for PB with a singleton 18–23.6 weeks of gestation pregnancy. Clinical data, including age, weight, pregestational weight, socioeconomic status, smoking, history of preterm delivery, gestational age by date of last period and corroborated by first trimester US, presence of urinary or vaginal infections, use of antibiotics, follow-up of pregnancy until its termination, were collected. Fetuses with structural alterations or women with a confirmed diagnosis of isthmic cervical incompetence were not included. The primary outcome was considered to be spontaneous preterm birth, defined as the delivery of the fetus before 37 weeks’ gestation, calculated by the last menstrual period or reliable first-trimester ultrasound, without any medical intervention that produced this outcome [
3].
Participants were classified as having high- or low-risk for developing PB according to the cervical length measurement, using a cut-off point <20 mm in patients with a history of preterm delivery and <25 mm for patients without a history of PB [
25]. Our institutional protocol was applied in high-risk patients by administering micronized progesterone 200 mg vaginally every 24 h, with a follow-up every 2–3 weeks. Progesterone treatment was initiated after being classified as a high-risk patient based on CL measurement. No patient was receiving progesterone before being classified.
4.3. Sample collection
A cervical-vaginal mucus sample from the posterior vaginal fornix was obtained using a dacron polyester-tipped swab that was rinsed in collection buffer containing 1X PBS, 0.05% Tween-20, 1% BSA, and a protease Inhibitor Cocktail (Roche). Samples in collection buffer were centrifuged at 3200 rpm, 15 min at 4 °C, and the supernatants were stored at -80°C until further cytokine analysis. Before preservation, an aliquot was tested to rule out subclinical vaginal or uterine infection by fresh examination and negative microbiological cultures.
4.4. Cervical-vaginal cytokine quantification
Cytokine concentration in cervical-vaginal mucus was measured using the Luminex X-Map platform with the Bio-Plex Pro Human Cytokine 10-plex that includes IL-2, IL-4, IL-6, IL-8, IL-10, INF-γ, TNF-α, IL-1β, IL-12p70, and IL-1ra (Cat 12020756). The protocol was performed according to the manufacturer's instructions, and the value for each cytokine was expressed as pg/ml. Inter- and intra-assay variation was <10%.
4.5. Statistical analysis
Data were analyzed with the SPSS software, version 24. Descriptive statistics were performed to characterize groups: for qualitative variables, frequency measures expressed in percentages were used, while for quantitative variables, measures of central tendency (mean, median), and measures of dispersion standard deviation (SD). Data distribution was verified before the statistical analysis with Kolmogorov–Smirnov test. Cytokine concentration was evaluated, and differences between groups were assessed using the Mann-Whitney U test. The Chi-square test was used to calculate the difference in proportions. No sample size calculation was performed beforehand, but the statistical power was calculated for all variables with significant differences to verify that it was greater than 80%.
4.6. Machine learning model
A Random Forest classifier model was created to predict the incidence of preterm infants using 12 clinical variables (women´s age, weight, pregestational weight, socioeconomic status, smoking history, parity, previous preterm delivery, gestational age at screening (date of last period and corroborated by first-trimester US), precedent urinary or vaginal infections, use of antibiotics, and gestational age at the time of pregnancy resolution) and 10 cervical-vaginal cytokine determinations (IL-2, IL-4, IL-6, IL-8, IL-10, INF-γ, TNF-α, IL-1β, IL-12p70, and IL-1ra), to create the first full model. To adjust the model, we performed hyperparameter selection by out-of-bag error and simple cross-validation, resulting in 50 trees, 1 mtry (number of predictors considered in each split), and 1 as maximum depth, thus generating the adjusted model. The models were analyzed in a mathematical matrix to determine false positive rates (FPR) and false negative rates (FNR), their overall efficiency, as well as to compare the area under the ROC curves (AUC).
Author Contributions
Conceptualization, H.B.-O.; methodology, M.J.R.-S., A.E.-N. and A.F.-P.; formal analysis, H.B.-O., G.E.-G. and J.M.-O.; writing—original draft, H.B.-O. and G.E.-G.; funding acquisition, H.B.-O.; resources, H.B.-O., G.E.-G., M.J.R.-S., I.C.-A., A.E.-N., A.F.-P., R.G.-C. and J.C.E.-A.; writing—review and editing, H.B.-O., G.E.-G., I.C.-A., and A.F.-P. All authors have read and agreed to the published version of the manuscript.