1. Introduction
COVID-19 is a disease in response to a viral infection, the Severe Acute Respiratory Syndrome CoronaVirus 2 (SARS-CoV-2), which emerged in Wuhan, China, in December 2019. The clinical pathway may progress from asymptomatic to severe illness, like pneumonia, acute respiratory distress syndrome (ARDS) and fatal multi-organ dysfunction. Its effects can be characterised through clinical severity, regarding medical complications experienced by infected individuals. In this sense, this disease has been classified according to different definitions such as those of the World Health Organization [
1].
The present study was conducted in the frame of the Multidisciplinary European hospital network for research, prevention and control of the COVID-19 pandemic (I-MOVE-COVID-19 hospital network) [
2]. This consortium came from many partners already involved in the studies within the Influenza Monitoring Vaccine Effectiveness in Europe (I-MOVE) network [
3] who rapidly adapted their infrastructure and protocols to include COVID-19 to their surveillance system for hospitalised cases in several countries when the novel coronavirus epidemic arrived. Spain joined this project with the participation of two hospitals that were able to provide a valuable information for COVID-19 surveillance in Europe [
4], heading also a pilot model for COVID-19 and SARI surveillance at a national level [
5].
During the period included in the present study, two notable variants of concern (VOCs) circulated, which were associated to different severity presentations. They were Alpha / B.1.1.7 in winter 2020 and Delta / B.1.617.2 in spring 2021 (named by Phylogenetic Assignment of Named Global Outbreak (PANGO) / lineage designation) [
4]. Although COVID-19 vaccines have proven to be key to control the pandemic by reducing the incidence of COVID-19 infections, hospitalizations and deaths, the burden of COVID-19 clinical management is high [
6] and SARS-CoV-2 continues evolving into new variants that are being monitored by international health authorities [
7]. However, mainly host-related factors rather than viral genetic mutations drive the pathogenesis, and the severity of the COVID-19 episode depends especially on the host’s health condition level, and the presence of underlying risk factors and comorbidities [
8,
9].
Since clinical presentation of COVID-19 is polymorphic [
10], it seemed obvious that patients with a phenotype characterized by a pool of risk factors showed several joined features at admission, which could predict the severity outcome. COVID-19 patients show a wide clinical spectrum, since COVID-19 affects multiple organ systems, including respiratory manifestations (e.g. shortness of breath, cough, sore throat), neuropsychiatric and cognitive symptoms (e.g. fatigue, headaches, loss of smell), cardiovascular presentations (e.g. chest pain, arrhythmias, thromboembolic events), and manifestations from other organ systems, such as endocrine, gastrointestinal, renal, and skin [
11]; however, a meta-analysis in 5000 patients found that only dyspnoea was associated with severity [
12]. Moreover, comorbidity and underlying conditions play a major role on disease prognosis [
8,
9], and they have been found associated to severe clinical outcomes [
11], also being related to an encountered number of biomarkers [
13], highlighting its effect on the systemic inflammation and immune response [
14]. Furthermore, COVID-19 severity outcomes are also affected by sociodemographic factors, such as age, sex, occupation, housing situation (with crowded and/or intergenerational housing), and barriers in access to healthcare [
15].
In this sense, the combination of sociodemographic, clinical, comorbidity, and metabolomics data improve the prediction accuracy of the models, and it seems useful to study associations with severity outcomes. Nevertheless, and to best of our knowledge, combining all those kind of features with the aim of predict several severity outcomes it only has been done until now in a small size of American COVID-19 patients [
16]. Despite this, other previous studies had studied epidemiological and clinical characteristics regarding some severity outcome in other populations [
10,
17,
18,
19]. However, there are few published studies with large sample sizes describing the phenotype associated to severity outcomes in Spanish hospitalised COVID-19 patients [
10,
20].
Therefore, the present article aims to describe the phenotype at hospital admission associated to severe COVID-19 disease, in a sample of COVID-19 cases admitted in two reference hospitals in Spain between March 2020 and December 2021, in the context of the I-MOVE-COVID-19 project. This study provides a comprehensive assessment of a wide board of health-related features and their potential impact as predictors of severity, using ventilation, ICU admission and mortality as main outcomes.
2. Materials and Methods
2.1. Study Setting and Design
A retrospective multi-site observational study was conducted in the two tertiary care hospitals participating as the Spanish component of the I-MOVE-COVID-19 network, with data collected between 16th March 2020 and the end of 2021. The Miguel Servet (HUMS) and the Virgen de las Nieves (HUVN) University Hospitals, located in Zaragoza and Granada, respectively, are both reference hospitals for surveillance of respiratory infections in their region, with a capacity of more than 1300 bed each and covering a total catchment area of over 880.000 people.
2.2. Participants
Following the I-MOVE-COVID-19 surveillance, risk factors, and vaccine effectiveness studies protocols [
21,
22,
23], patients admitted to these hospitals during the study period were tested for SARS-CoV-2 detection and data were collected in a structured online questionnaire, including a comprehensive set of demographic, epidemiological and clinical variables. To manage the workload of data collection in periods of high incidence of COVID-19 admissions (from July 2020 and until the end of the study) a weekly sample of patients were systematically selected (those admitted on every Tuesday and Wednesday).
This study was conducted on a representative sample of 2050 patients who had a positive test for SARS-CoV-2 detection and were admitted to one of the aforementioned hospitals between two days prior and up to 28 days after their index positive swab. Patients who tested negative for SARS-CoV-2 PCR on nasopharyngeal swab, and those with missing data were excluded.
2.3. Laboratory Testing
The diagnosis and confirmation of SARS-CoV-2 were based on molecular tests performed on patients’ nasopharyngeal swab specimen collected by trained nurses. Concretely, CE-IVD Real-time PCR (real time reverse transcriptase-polymerase chain reaction) with at least two specific targets of SARS-CoV-2 RNA and an internal control to check the quality of the sample. Results were categorized according to the specifications of the manufacturer and following this classification: negative (absence of SARS-CoV2 target detection or detection over 38 Ct and detection of internal control), positive (detection of two SARS-CoV-2 target detection under 38 Ct) and inconclusive (only one SARS-CoV-2 target detection). Inconclusive samples were retested for clarification. If an inconclusive result persisted, a new sample of the participant was analysed.
Other laboratory determinations (shown on tables below) were performed on blood samples collected at the time of admission using standardised and similar methods available in the hospitals.
2.4. Data Collection
Data were obtained with standardized data collection forms from medical electronic records. They were anonymised and linked in a data repository. Consent was not required from individual patients, due to the pandemic emergency exemption and because these data were included as part of the pilot surveillance system of the future integrated respiratory surveillance in Spain [
5]. Nonetheless, only summary data were extracted to minimize the risk of disclosure.
2.5. Outcomes of Interest
We described the epidemiological and clinical characteristics in relation to the main severity outcomes as dependent variables. These were the requirement for ventilation, intensive care unit (ICU) admission and death in COVID-19 hospitalised patients. As independent variables, we considered a long list of socio-demographics, underlying diseases, symptoms, and biochemical determinations (shown on tables below). Definitions of the variables were outlined in the generic published protocol [
23].
2.6. Statistical Analysis
Descriptive analysis of each epidemiological and clinical variable by sex. C categorical variables as absolute and relative frequencies (n, %), and continuous variables are presented as mean and standard deviations (x̄ ± SD), but age as median and interquartile range (m, IQR). We exclude participants with >30% records with missing data for each specific analysis. Variables were excluded from the analysis if they had >30% records with missing data. Group wise comparisons were performed using Student’s t-test or Chi-square (χ2), and by applying Haldane Ascombe correction when needed.
Bivariate logistic regression models were used to assess the association between clinical variables and the main outcomes: need for ventilation, ICU admission and in-hospital mortality, with the OR and confidence interval (95%). Models were refitted with categories for continuous variables such as biochemistry results or clinical measurements at admission according to accepted values for standard range to obtain odds ratios by categories of low/within range/high values as appropriate.
Variables with statistical significance or near were carried forward to a multivariate logistic regression analysis based on the Akaike information criterion (AIC), assessing the independent association of each specific variable with each outcome. However, vaccination and admission centre were not included in these models because of potential biases (due to differences in vaccine availability). Final model was determined by stepwise selection criteria based on improvement in AIC, and its suitability was assessed using the area under the receiver operating characteristic curve (ROC AUC score), Nagelkerke’s R2 coefficient, accuracy, specificity and sensitivity values. Finally, nomograms for each model were established to provide a clinical tool so that, after visually normalising the ORs for each variable in the multivariate model, the sum of the points that the presence of each risk factor confers allows an estimate of the risk for each subject to be made. The score to determine the level of risk (low, high) for each nomogram was established from the percentile value of the predictors corresponding to the frequency found in the sample for each outcome.
Two-tailed p value was considered statistically significant when minor 0.05. All analyses and graphs have been carried out with free R software, MS Excel and MS PowerPoint.
2.7. Ethical Approval
This study was conducted in accordance with the Declaration of Helsinki and its subsequent modifications. The requirement for written informed consent was waived, given the context of emerging infectious disease. Nevertheless, the protocol was approved by the Ethics Committee of Andalucía, and by the managements of both centres.
3. Results
3.1. Descriptive Analysis
The characteristics of the studied sample of 2050 COVID-19 hospitalized COVID-19 patients, overall and stratified by sex, are presented in
Table 1.
Regarding overall sociodemographic and clinical factors, there were slightly more men than women (53% vs. 47%), their median age was 64 years, and the most common age range was from 40 to 64 years (40%). HUVN admitted 41% participants whereas 59% were recruited at HUMS. Their vaccine coverage was 10% for COVID-19 (24% since it was available) and 39% for flu. They account for an average of 2.5 previous hospitalisations and 7 medical visits in the year prior to their admission (15% added up to more than 12).
In terms of the main severity outcomes, 15% cases died, 9% were admitted to ICU and 27% needed ventilation. On average, it took less than a week from the onset of symptoms to hospitalisation and 10 days to ICU admission. The length of stay was over a week for 60% participants and, on average, the stay was over 12 days at the hospital and almost 24 days at the ICU. The time between admission to hospital and ICU was about 4 days, and less than 8% of participants waited more than a week.
Chronic conditions were present in COVID-19 (almost 3 on average), being obesity and hypertension the most prevalent in nearly half of the sample (48%) with a BMI of 29.4 kg/m2 on average. Other frequent disorders were rheumatic disease (24%), diabetes (24%), and heart disease (20%).
Their most prevalent symptoms were cough (71 %), feverish (67%), dyspnoea (66%) and malaise (62%), followed by fever (34%), general deterioration (33%), myalgia (24%), diarrhoea (18%), and headache (16%).
Clinical measures were low on average for oxygen saturation (92%) and high respiratory rate (26 rpm on average). Indeed, low oxygen saturation was present in 62% participants, whilst other parameters such as systolic and diastolic blood pressure (SBP, SDP) and long QT were out of range in less than 15%.
Regarding biochemistry alterations, the most frequently observed deviations were high levels of ferritin and C-reactive protein (CRP) (99%), urea (95%), lactate dehydrogenase (LDH) (77%), D-dimer (68%), gamma-glutamyl transferase (GGT) (57%), and aspartate transaminase (AST) (47%), being also frequent to found high levels of alanine transaminase (ALT) (34%), hypoalbuminaemia (28%), high levels of creatinine phosphokinase (CPK) (25%), neutrophilia (18%), thrombocytopenia and high prothrombin time (17%).
Compared to women, men were younger and more likely to need ventilation and ICU admission, to have heart and lung diseases, to show fever and long QT, to show higher figures for diastolic and systolic blood pressure, and elevated figures for ALT, AST, total bilirubin, C-reactive protein, GGT, prothrombin time, and blood urea, as well as to present thrombocytopenia. Otherwise, men were less likely to be vaccinated against influenza, to account for medical visits, to have anaemia, asthma, dementia and rheumatic diseases, to show chest pain, coryza, diarrhoea, general deterioration, headache, nausea and vomiting, to show low SBP, DBP, and oxygen saturation, to show elevated D-dimer, neutrophilia and thrombocytophilia.
Additionally, temporal distribution in terms of month of admission for our COVID-19 hospitalised cases resulting from the recruitment implementation described in the 2.2 is shown in
Figure 1.
Furthermore, their distribution by age and sex is represented in
Supplementary Figure A1, showing there were more women in their 80’s but more men in their 50’s.
3.2. Bivariate Analysis
As
Table A1 shows, mortality was more likely among those who were older, visited more their physician previously, needed ventilation or admission at ICU, stayed longer at the hospital, waited longer for ICU, had more underlying conditions (including anaemia, cancer, dementia, diabetes, heart disease, hypertension, ictus, kidney disease, neuromuscular disorders, lung or rheumatic disease), showed confusion, general deterioration, tachycardia, neutrophilia, thrombocytopenia, hypoalbuminaemia, low SBP, DBP, oxygen saturation, elevated AST, bilirubin, CPK, D-dimer, LDH, prothrombin time and urea. Contrary, mortality was less likely among those who were flu vaccinated, and those who showed ageusia, anosmia, chest pain, cough, diarrhoea, fever, feverish, headache, myalgia, sore throat, and vomiting, lower figures for DBP and oxygen saturation, elevated ALT and GGT.
Besides,
Table A2 shows that ICU admission was more likely among those of extreme age groups, who were hospitalised at HUVN, who had more than 12 medical visits last year, died or needed ventilation, whose stay was longer, who showed higher BMI, obesity, diabetes, showed abdominal pain, dyspnoea, low SBP and oxygen saturation, elevated ALT, AST, bilirubin, CPK, GGT, LDH, urea and neutrophilia. Otherwise, ICU admission was less frequent among those who were men, COVID-19 vaccinated, showed confusion, malaise, and higher oxygen saturation.
Table A3 shows that need for ventilation was more likely among those COVID-19 patients who were older, men, admitted at HUVN, flu vaccinated, had more previous medical visits last year, deceased, stayed at ICU, stayed longer, had higher BMI, cancer, diabetes, heart disease, hypertension, lung disease, and obesity, showed dyspnoea, low SBP and oxygen saturation, elevated AST, bilirubin, CRP, CPK, D-dimer, GGT, LDH, urea and neutrophilia. Contrary, ventilation were needed less likely among those who were COVID-19 vaccinated, showed anaemia, rheumatic disease, ageusia, anosmia, chest pain, coryza, diarrhoea, feverish, malaise, lower figures for SBP, DBP and oxygen saturation.
3.3. Multivariate Analysis
The independent associations of several risk factors with the severity outcomes computed by adjusted multivariate analysis are shown as forest plots in
Figure 2. The statistics for the predictive models are shown in
Table 2 including those for all the dataset and also its application on each hospital.
Mortality was directly associated with several factors: need for ventilation (OR 6.9, IC 95% 4.3-11.1), having advanced age (OR 3.5, IC 95% 2.6-4.7), ICU admission (OR 3.2, IC 95% 1.8-5.6), neuromuscular disorders (OR 2.9, IC 95% 1.2-2.9), thrombocytopenia (OR 2.2, IC 95% 1.4-3.6), elevated prothrombin time (OR 2.1, IC 95% 1.2-3.3), hypoalbuminaemia (OR 2.1, IC 95% 1.3-3.2), dementia (OR 2.0, IC 95% 1.1-3.6), cancer (OR 1.9, IC 95% 1.0-3.4), elevated CPK (OR 1.9, IC 95% 1.2-2.9) and neutrophilia (OR 1.8, IC 95% 1.1-2.9). Inverse associations with mortality were also found: having cough (OR 0.5, IC 95% 0.4-0.8), sore throat (OR 0.2, IC 95% 0.1-0.8) and vomiting (OR 0.1, IC 95% 0.0-06). Statistics for this model were AUC ROC = 0.803, Nagelkerke’s R2 = 0.294, accuracy = 0.813, specificity = 0.956, and sensitivity = 0.279.
ICU admission was found to be directly associated with need for ventilation (OR 371, IC 95% 90.8-1516.8), abdominal pain (OR 3.6, IC 95% 1.2-10.8), and elevated LDH (OR 2.2, IC 95% 1.2-10.8). Likewise, ICU admission was inversely associated with advanced age (OR 0.5, IC 95% 0.4-0.7). Statistics for this model were AUC ROC = 0.927, Nagelkerke’s R2 = 0.520, accuracy = 0.905, specificity = 0.984, and sensitivity = 0.141.
Need for ventilation was directly associated with low oxygen saturation (OR 5.2, IC 95% 2.8-9.4), elevated LDH (OR 3.0, IC 95% 1.5-6.3), elevated CPK (OR 2.4, IC 95% 1.5-3.7), diabetes (OR 2.3, IC 95% 1.4-3.9), neutrophilia (OR 2.2, IC 95% 1.3-3.7), obesity (OR 1.9, IC 95% 1.2-3.0) and elevated GGT (OR 1.7, IC 95% 1.1-2.7). On the contrary, hypertension was inversely associated with the need for ventilation (OR 0.5, IC 95% 0.3-0.8). Statistics for this model were AUC ROC = 0.906, Nagelkerke’s R2 = 0.471, accuracy = 0.881, specificity = 0.967, and sensitivity = 0.352.
Some factors lost their statistical significance when the models were applied to each hospital. For death, these were cough, hypoalbuminaemia, elevated CPK, sore throat and vomiting in HUMS, and cancer, dementia, neutrophilia, elevated prothrombin time, neuromuscular disorders, sore throat and vomiting in HUVN. For ICU admission, significance were lost for abdominal pain and elevated LDH in HUMS, and for elevated LDH and ventilation in HUVN. Ventilation did not show significance for elevated GGT, neutrophilia, hypertension and obesity in HUMS; and for diabetes, elevated CPK, GGT, LDH, neutrophilia and obesity in HUVN.
Finally, nomograms in
Figure 3 were constructed for each outcome based on the above results. The risk of death was set as high for a score higher than 67, corresponding to a predictive value of 0.338, correspondent to nearly 70 points. High risk of ICU admission was set for a score of more than 85 points, corresponding to a model predictive value of more than 0.455. High risk of ventilation was set for a score of more than 58 points, corresponding to a model predictive value of 0.293.
4. Discussion
We described the clinical phenotypes at hospital admission associated to three different severity outcomes for COVID-19 confirmed cases: mortality, ICU admission and the need for ventilation. A large sample size (n = 2050) and a broad range of data collected during nearly two years since the start of the COVID-19 pandemic in Spain allowed us to assess a wide range of clinical parameters to characterize these phenotypes with a worst prognosis of the COVID-19 episode, including socio-demographic, underlying comorbidities and chronic conditions, and symptoms, physiological and biochemical markers presented during admission. Furthermore, describing associations between the phenotype characteristics and the severity outcomes allow us to compute a predictive model and to establish nomograms as clinical tools for each severity outcome.
There were a large number of studies regarding each feature and outcome included in our aim individually, but very few includes all of them or were made in such a large sample of Spanish COVID-19 patients. Despite this, since our data were shared on an European project, we can compare some assessments with the European data set results [
4]. Regarding main outcomes, the situation of severity for the COVID-19 hospitalised patients appears to have been more favourable in Spain, as we found lower rates for death (15.1% vs. 18.8%), ICU admission (8.7% vs. 13.7%) and use of ventilation (26.7% vs. 38.4%), even accounting for a similar age and sex distribution on both studies (median age was 64.0 vs. 63.8 years, and 53.1% were men vs. 54.6%). On the contrary, our Spanish figures were much higher for pluripathology (two or more underlying conditions were present in 62.2% for Europeans but 70.6% for Spaniards –data not shown), and the prevalence for obesity (48.4% vs. 27.1%), hypertension (47.8% vs. 43.6%) and rheumatic diseases (24.5% vs. 6.3%) were higher, albeit heart disease (19.8% vs. 32.6%) and diabetes (23.8% vs. 26.1%) were lower. Nevertheless, on Spanish general population and beyond the I-MOVE study, the metabolic risk factors related with mortality and an elevation for the risk for transmissible diseases are also included at the top of the list of risk factors for mortality (hypertension 18.5%, diet 17.4%, smoking 14.1%, excess weight 12%, hyperglycaemia 8.7% and hyperlipidaemia 6.5%) [
24], and smoking is even above the European median in Spain (23.1% vs 19.7%) [
25].
Therefore, additional factors must be involved in these differences on severity outcomes by country, such as the way in which the pandemic crisis and health intervention strategies were managed. In this sense, Spanish Healthcare System has universal coverage and, before the pandemic, it was known as one of the best and most efficient in Europe, according to the World Economic Forum and to Bloomberg [
26,
27]. In addition, Spanish health system’s institutions and health professionals are highly trusted by the public opinion [
28] and are supported by the government, which has a strong role [
29]. This favoured and encouraged the implementation of early strong mandatory physical distancing measures, and prioritised vaccination policies for the elders, conditioning implications for patient management. Accordingly, our study reflects this situation not only on the severity outcomes, but on the higher median ICU stay for Spanish COVID-19 hospitalised patients, when compared to their European counterparts (23.9 vs. 18 days).
On the other hand, our findings on predictive risk factors for severity outcomes are consistent with literature [
30], and in line with reports from previous large size studies in Spanish COVID-19 hospitalised patients, which described some risk factors strongly associated to death hazard, including a number of underlying conditions and higher inflammatory parameters [
10], such as advanced age, low oxygen saturation, and high C-reactive protein [
20].
Our final models were good or even greater according with the figures for AUC-ROC, accuracy and specificity (0.80-0.93, 0.81-0.90, and 0.96-0.98 respectively, although figures for Nagerkerke’s R2 and sensitivity were not so good (0.294-0.520 and 0.141-0.352). Dichotomisation of variables simplifies the interpretation and construction of the models, but can also facilitate such features profile.
It should be noted that none of the identified predictors appeared in the final models for all severity outcomes (however, age, elevated CPK, LDH, and neutrophilia appeared in two of them), and even several of them was found to be associated in a different (advanced age and hypertension) or not expected direction (vomiting, sore throat and cough for mortality). In this regard, we should point out some issues.
Advanced age may be directly associated with death and inversely with admission to the ICU, due to this condition does not benefit the patient when applying admission triage in a situation of work overload [
31]. Besides, hypertension has been observed as a protective factor for need of ventilation. In this sense, it should be highlighted that SARS-CoV-2 uses angiotensin-converting enzyme 2 (ACE2) as a cell receptor for viral entry whilst ACE is the target for some therapeutic treatments for hypertension that could prevent vasoconstriction and disease conditions in Raynaud’s phenomenon; thus, patients treated for hypertension may have a different immune response to SARS-CoV-2 which could influence the severity of the disease and thus the need for respiratory support [
32]. In addition, we may point out how the reporting of some symptoms (such as vomiting, sore throat and cough) may work as indicators of the disease manifestation, which may have facilitated its early diagnosis and treatment [
33], fitting with the timeline of symptom presentation reported previously by other authors [
34].
It is also important to note that none of the final models included sex, medical contacts (either admissions or visits), or time between events (such as days of stay) as predictors. This can easily be explained by the fact that it is not these factors per se but some associated conditions also present in the models that are playing a role in predicting the severity of COVID-19. However, these are easily identifiable factors, so their consideration could be useful in practice.
Besides, some factors deserve additional regard, although (due to our sample characteristics) we have excluded their presence in our final predictive models to avoid different kind of biases. COVID-19 and flu vaccination has been shown to be effective in reducing severity outcomes which is reflected on its inverse associations with mortality (for flu vaccination), need of ventilation and ICU admission (for COVID-19 vaccination). However, flu vaccination was found directly associated to the need of ventilation; this may be explained by the increased efforts of vaccination campaigns in those at higher risk (e.g. COPD patients), as well as a very low vaccine coverage. Anyway, the lack of effectiveness of COVID-19 vaccination for mortality, or even its perceived risk for the need for ventilation, may be biased on our participants by the timeliness of vaccination availability, as we already mentioned.
Finally, models did not fit the same when applied to each hospital’s data, showing several statistical differences. This was to be expected since the hospital datasets reflected important differences, e.g. more participants admitted to ICU and in need of ventilation at HUVN. However, working with both datasets together allowed us to build a more robust and useful predictive model for other Spanish hospitals. Notwithstanding, it should be noted that severity outcomes do not depend only on differences on SARS-CoV-2 incidence and strains, but also on characteristics of the catchment area of population and resources availability of each health care centre.
4.1. Limitations and Strengths
Firstly, COVID-19 mortality figures could be overestimated as cause of death as this was not included in the definition of the outcome (primary outcome was all-cause mortality during the index hospital admission, regardless of the recorded cause of death), since it would compromise the timeliness of the surveillance system.
On the other hand, data collection methods affected the data completeness of certain variables, resulting in missing data. Even though missing data was excluding only for the analysis of each variable (to prevent loss of power), this may have impacted the accuracy of results and their generalisability. Poor reporting from the health care workers when they were overloaded with work (especially at first cases) also may contribute to this type of bias. Additionally, covid-19 vaccination coverage data may be biased by vaccine availability and difficulty of calculation, as the population was vaccinated in a staggered manner according to age and other priority conditions.
In this sense, the lower predictive ability for the need of ventilation model could be explained by the fact that our analysis did not include specification of the type of ventilation (i.e. whether ventilation was mechanical or not). However, including only those patients with mechanical ventilation would not have substantially improved our results either, as the sample size would be smaller and thus also the power and reliability of our findings.
Despite these challenges, the design of definitions of the variables studied and data collection were carried out according to a published robust protocol from a European project of excellence (H2020). A wide list of clinical sociodemographic and clinical variables was assessed on a high sized sample. All this has implications for the high accuracy revealed by the high AUC scores obtained in the multivariate analysis. Furthermore, the study was conducted in two tertiary referral hospitals, located in different Spanish regions; therefore, it may be representative of the country population.
In addition, to our knowledge, this is the first study describing predictive models and nomogram tools on the phenotype association to mortality, ICU stay, and the need of ventilation, all at once, in Spanish COVID-19 hospitalised patients, and through a wide range of factors at admission, including underlying conditions, as well as symptoms and biochemical parameters.
Furthermore, this study contributed to the European I-MOVE-COVID-19 surveillance system, and laid the groundwork for launching COVID-19 Spanish official surveillance system. Likewise, it highlights the importance of monitoring hospitalised patients to understand phenotypes features, which may predict severe outcomes, in order to be useful to public health response, especially regarding emergent diseases.
5. Conclusions
This study assessed the clinical phenotype of Spanish COVID-19 hospitalised patients regarding three main severity outcomes by computing predictive models, which in turn have enabled the development of nomograms as clinical tools for prognosis.
Mortality, ICU admission and need for ventilation were found affected by the role of several risk factors (advanced age; diagnose of obesity, cancer, diabetes and neuromuscular disorders; presence of abdominal pain and cough; low blood levels of oxygen saturation, platelets and albumin; elevated blood levels of CPK, GGT, LDH, neutrophils, and elevated prothrombin time), as well as protective factors (diagnose of hypertension; presence of cough, sore throat and vomiting). Predictive models including these features allowed the construction of nomograms as clinical tools to assess the severity risk of illness for different outcomes. Furthermore, additional useful associations have been identified, such as vaccination status, number of underlying conditions (diseases and previous medical visits or admissions), and characteristics of the stay (such as time between events or the centre were being admitted).
These findings should be considered in order to improve COVID-19 patient management and strategic interventions including policy measures to prevent and combat COVID-19, allocating the available resources more efficiently, and avoiding the over-stressing of the healthcare system. Studies with similar approaches may be useful to address these issues on future emerging diseases.
Author Contributions
Conceptualization, Clara Mazagatos and Amparo Larrauri; Data curation, Miriam Latorre-Millán, María Rodríguez del Águila and Clara Mazagatos; Formal analysis, Miriam Latorre-Millán, María Rodríguez del Águila and Laura Clusa; Funding acquisition, Clara Mazagatos and Amparo Larrauri; Investigation, Miriam Latorre-Millán, María Rodríguez del Águila, Laura Clusa, Clara Mazagatos, María Fernández and Ana Milagro; Methodology, Miriam Latorre-Millán, María Rodríguez del Águila, Clara Mazagatos, Amparo Larrauri, María Fernández and Ana Milagro; Project administration, Miriam Latorre-Millán, Clara Mazagatos, Amparo Larrauri, María Fernández and Ana Milagro; Resources, Clara Mazagatos, Amparo Larrauri, María Fernández, Antonio Rezusta and Ana Milagro; Supervision, Amparo Larrauri, María Fernández and Ana Milagro; Visualization, Clara Mazagatos and Amparo Larrauri; Writing – original draft, Miriam Latorre-Millán and María Rodríguez del Águila; Writing – review & editing, Miriam Latorre-Millán, María Rodríguez del Águila, Laura Clusa, Clara Mazagatos, Amparo Larrauri, María Fernández, Antonio Rezusta and Ana Milagro.