1. Introduction
Systemic sclerosis (SSc) is a chronic connective tissue disorder characterized by widespread fibrosis and impaired microcirculation [
1,
2]. Endothelial and fibroblast dysfunction leading to tissue hypoxia, together with altered immune responses, play key roles in disease pathogenesis (1,2). In SSc, fibrosis and vasculopathy are closely associated and result in heterogeneous multi-organ clinical manifestations with a variable prognosis [
1,
2].
Besides the skin, internal organs are often affected, particularly the lungs (2), representing one of the major causes of death in SSc [
3,
4]. SSc interstitial lung disease (SSc-ILD) is characterized by early pulmonary infiltration of inflammatory cells, mainly B cells, promoting the generation and persistence of myofibroblasts, accumulation of excess extracellular matrix and subsequent fibrosis of the lung parenchyma [
2,
5].
All patients with SSc are at risk of SSc-ILD. Established risk factors for SSc-ILD are respiratory symptoms, smoking history, ethnicity (native American, African heritage), male gender, diffuse cutaneous (dcSSc) subset, and anti-topo I/Scl-70 antibodies [
3,
6]. Although the probability seems to be higher in the early stages of the disease [
3,
7], a recent study regarding 826 patients with SSc-ILD from the EUSTAR database found that only 21% of patients had a disease duration < 3 years, and about half of the SSc patients had no evident risk factors for SSc-ILD [
8]. Therefore, the identification of additional factors associated to SSc-ILD remain an open research question [
3,
7,
8].
The primary tool for diagnosing SSc-ILD is high-resolution computed tomography-HRCT [
6,
9]. Moreover, baseline and longitudinal changes in pulmonary function tests (PFTs,) mostly forced vital capacity (FVC) and diffusing capacity of the lungs for carbon monoxide (DLco) [
4,
6,
9] are found to be associated with SSc-ILD. Other measures directly related with ILD in SSc have been recently investigated, i.e., patient reported measures of symptoms severity and quality of life [
3,
10,
11], and the extent of oesophageal involvement [
11,
12]. Furthermore, the usefulness of nailfold videocapillaroscopy (NVC) in SSc-ILD has been addressed [
13,
14], highlighting that abnormal NVC patterns (i.e., active and late) are linked with the presence of SSc-ILD, and emphasizing that NVC could potentially be used as a biomarker in screening algorithms for SSc-ILD [
13,
14]. Nevertheless, none of the available tools alone can detect SSc-ILD with appropriate sensitivity and specificity compared to HRCT [
3,
6,
7,
9].
Therefore, we aimed to identify additional factors mainly linked to the clinical/biometric interface of pulmonary tissue damage that might be associated with the presence of ILD in SSc patients, trying to build a useful algorithm for further strengthen and personalize this association in daily clinical practice.
2. Patients and Methods
The study population consisted of 78 non-consecutive outpatients with SSc, without a known diagnosis of ILD, aged≥18 years, defined by the current classification criteria [
15], enrolled between January and November 2023. Written informed consent was obtained from all participants, in accordance with the Declaration of Helsinki.
Patients who were unable to undergo NVC (due to reduced visibility or amputation of the phalanges), or unable to perform PFTs, patients with a high probability of PAH according to the DETECT algorithm [
16] and/or with PAH diagnosed by right heart catheterization RHC, patients with SSc overlapping with other systemic connective tissue diseases, and/or those with chronic lung diseases other than SSc-ILD, and those taking antifibrotic drugs were excluded from the study.
Study approval was obtained in the context of a cross-sectional database of patients with RP secondary to SSc (
RAYnaud pheno
Men
ON Database, RAYMOND study, n. 257-2020 ID 1650), According to the study intentions, and in agreement with the current recommendations of SSc-ILD [
3,
6], all patients had periodic HRCT scans to screen ILD, every 12-24 months depending on their symptoms and the clinical phenotype.
Demographic, Clinical and Laboratory Data
Basic information included age, sex, and disease duration (from first non-Raynaud symptom). Patients were classified in limited (lcSSc) and diffuse SSc (dcSSc) [
17]. None fit the strict definition of SSc ‘sine scleroderma’ [
17]. The modified Rodnan skin score (mRSS) was used to assess the extent of skin thickness. The score was calculated by summing the rating from all 17 areas (range 0–51) [
18]. We gathered other clinical data such as oesophageal symptoms (dysphagia, reflux) and dyspnoea, antinuclear antibodies (ANA), anti-extractable nuclear antigens (anti-ENA), SSc-related (anti-centromere/CENP-B, anti-topoisomerase I/Scl70, anti-RNA polymerase III) and non-SSc related (anti-U3 RNP, anti-SSA) antibodies.
HRCT Assessment and Visual Reader-Based Disease Quantification
All HRCT examinations were performed according to a standard protocol, using a CT 64GE light Speed VCT power scanner with a rotation tube scanning time of 0.65 s. Scans were acquired in full inspiration from the apex to the lung base in supine position, at 120 kV and 300 mAs, and slice thickness and spacing of scans of 1.25 and 7 mm, respectively. Contrast media agents were not employed.
The HRCTs were evaluated independently by a radiologist (MC – consultant with 20 years of experience in the field of musculoskeletal radiology) and a rheumatologist (FS – trained in HRCT interpretation) blinded to clinical and respiratory functional findings. In case of disagreement, a third reader (AG-expert radiologist) examined the scans to obtain a final consensus decision. The intraclass correlation coefficient (ICC) of the radiologists’ agreement level on total HRCT scores, as previously calculated by the CoVR method, was 0.80 [
19,
20]. The lung parenchymal abnormalities were assessed according to the Warrick score [
21]. A point value was assigned to each abnormality as follows: ground-glass appearance = 1, irregular pleural margins = 2, septal/subpleural lines = 3, honeycombing = 4, subpleural cysts = 5. In each patient the “severity of disease” score was obtained by adding single point values. An “extent of disease” score was obtained by counting the number of bronchopulmonary segments involved for each abnormality: one to three segments scored as 1; four to nine segments scored as 2; more than nine segments scored as 3. The severity and extent of disease were then calculated as total HRCT score (range from 0 to 30). To estimate intra-observer reliability, each reader examined all HRCTs twice, with an interval of at least four weeks.
Patient-Reported Measures
The following patient-centred measures were obtained: dyspnoea severity using the modified-Borg Dyspnoea Scale (m-Borg) by the interviewer-administered paper version [
20,
22], a numerical scale for assessing perceived dyspnoea (breathing discomfort) with a scale of 0 = no breathlessness at all, 0.5 = very, very slight (just noticeable), 1 = very slight, 2 = slight breathlessness, 3 = moderate, 4 = somewhat severe, 5 = severe breathlessness, 7 = very severe breathlessness, 9 = very, very severe (almost maximum) and 10 = maximum, and the Health Assessment Questionnaire-Disability Index (HAQ-DI) [
20,
23], which is a generic, self-administrated patient-reported outcome (PRO) instrument targeted for musculoskeletal impairment, defined as a condition-specific measure of functional status (assessing activities of daily living) [
20,
23]. The standard HAQ-DI is calculated as an ordinal variable, from 0 = no disability to 3 = severe disability.
Gerd-Q is a simple self-administered questionnaire consisting of six items for assessing the risk of gastroesophageal reflux disease-GERD [
24]. Four of these assess symptoms and situations considered to be positive predictors for GERD: heartburn, regurgitation, sleep disturbances (caused by gastric symptoms) and taking antacids, in addition to those medically prescribed, while the remaining two assess symptoms considered to be negative predictors for reflux (nausea and epigastric discomfort). The answers refer to the frequency of symptoms over the past week, using a Likert scale of 0 to 3 for positive characteristics, and 3 to 0 for negative characteristics [
25]. The maximum score that can be achieved is 18. A GERD-Q cut-off of 9 was associated with the best ratio of sensitivity and specificity for the diagnosis of gastro-esophageal reflux [
25].
Pulmonary Function Tests
PFTs were performed within 2 weeks from the HRCT scan by a flow-sensing spirometer and a body plethysmograph connected to a computer for data analysis while the patient was at rest in a seated position. These tests consisted in spirometry using a computerised lung analyser (MasterScreen Diffusion, Jaeger GmbH, Höchber, Germany). FVC (% predicted value) and DLco (% predicted value, corrected for haemoglobin) were obtained. At least three measurements were taken for each variable to guarantee repeatability [
20].
Nailfold Capillaroscopy
All patients underwent NVC within 3 months before/after the HRCT scan, using a videocapillaroscope with a 200x magnification optical contact probe (DinoLite Digital Microscope, Dino-Lite Europe, Almere, The Netherlands). All fingers of both hands, excluding thumbs, were examined. Four contiguous fields of 1 mm in the middle of the nailfold were captured from each finger at least [
26]. The corresponding images were stored and analysed by means of Dino Capture 2.0 Software (version 1.5 14.b, AnMo Electronic Corporation). An experienced investigator (FF), blinded for the clinical data, reviewed, and rated all NVC images. For each image, the total number of capillaries/mm was counted, regardless of morphology, as described for the “scleroderma pattern” [
27]: U-shaped loops, tortuous capillaries, crossed loops, ramified/bushy capillaries, bizarre loops, giant capillaries (width of limbs >50 μm), both homogenously and inhomogeneously shaped. Micro-haemorrhages and micro-thrombosis were excluded from the counting process. A consensus concerning image acquisition and analysis, scoring system, reliability of image acquisition and interpretation had already been reached by the authors (RDA, EC, FF) [
26,
28] (
Figure 1).
Statistical Analysis
All data were entered into a Microsoft Excel database and analysed with MedCalc® version 18.6 (MedCalc Software, Ostend, Belgium). The values were expressed both as mean±standard deviation (SD) or as median (interquartile range [IQR]). A two-sample “t” test was used to compare continuous variables and the χ2 test to compare categorical variables. The relationships among the lung analysis, the readers and the PFTs results were calculated using univariate regression analysis and Pearson’s product moment correlation (Pearson r values). Furthermore, multivariate regression analyses were performed to identify factors associated with higher percentage of pulmonary fibrosis at HRCT.
We included in the multivariate models the following covariates: age, sex, disease duration, anti-topoisomerase I antibodies, mRSS, m-Borg, GERD-Q, HAQ-DI, FVC and DLco (predicted), and capillary density (number of capillary/mm). Ethnicity was not included because all the enrolled patients were Caucasians.
The results were expressed as multivariate regression coefficient (R) and square regression coefficient corrected (R2) for the number of variables entered in the analysis. This enables the calculation and the predictivity of each multivariate model according to the number of variables entered in the model itself. Significance level was set at p <0.05. The predictive performance of each variable was estimated by the area under the receiver operating characteristic curve (AUC-ROC). Youden’s index on the ROC curve analysis was used to determine the optimal cut-off point for the single variables.
A model based on Bayes’ theorem was then constructed to determine post-test probability using the pre-test probability of illness and the product of the positive likelihood ratio (+LR) of the items. The Fagan nomogram was used to graphically represent the model. The predictive variables included in the model are those that demonstrated significance on logistic regression analysis.
3. Results
The baseline characteristics (continuous variables) of the 78 SSc patients are summarized in
Table 1.
The patients’ age ranged from 22 to 76 years (mean 63.60±10.34); 88.5% were women. The mean±SD duration of the disease was 10.53±7.38 years.
Thirty-three (42.31%) patients showed on HRCT features of ILD (Warrick score >7, average total HRCT score= 11.62±7.79). The mean disease extent score was 5.64 (±3.69), the mean disease severity score was 5.97 (±4.03). Analysis of the ILD subtypes showed that UIP was found in 32 (41.0%) patients, NSIP in 21 (26.9%) patients, and unclassifiable ILD in 25 (32.1%) patients. No patient showed a diffuse alveolar damage (DAD) pattern, nor an organizing pneumonia pattern [
3,
19,
20].
Thirty-eight (48.7%) patients were classified as having dcSSc and 40 (51.3%) patients as having lcSSc, with a mean mRSS value of 10.7±7.97. Comparison of the two groups showed that patients with dcSSc were older (65.98±10.30 vs 60.97±11.04 years; p<0.05) and with a longer disease duration (12.04±7.15 vs 9.99±8.56 years; p<0.05) than those with lcSSc. ACA were present in 35 patients (44.8%) and anti-Scl-70 in 33 (42.3%). The proportion of anti-Scl-70 was significantly higher (p=0.03) in patients with dcSSc.
The mean predicted FVC (FVC %pred) was 87.72±18.51 and the mean predicted DLco (DLco % pred) was 73.41±16.76. FVC and DLco did not differ between patients with lcSSc and dcSSc. Time interval between PFTs and HRCT was 4.9±1.6 days (range: 0–6 days).
On average, the HAQ-DI suggested a moderate functional impairment (0.96±0.45). As expected, patients with dcSSc showed higher mean HAQ-DI scores than those with lcSSc (1.12±0.39 vs 0.81±0.32; p<0.05). The mean m-Borg scale score was 2.73 (±2.22) and the Gerd-Q was 8.19 ±4.03. Finally, the mean capillary number/mm was found to be 5.44± 2.22.
3.1. Variables Associated with SSc-ILD
Table 2 reports the result of the multivariate regression analysis. DLco, M-Borg, GERD-Q, and capillary density were significantly associated with the presence of ILD-SSc at HRCT. This model had a coefficient of determination R
2 of 0.697. Age, gender, disease duration, anti-topo I antibodies, FVC predicted, HAQ-DI, and mRSS were not significantly associated with ILD.
The predictive performance of each variable was estimated by the AUC-ROC analysis (
Figure 2 A-B-C-D).
The DLco predicted had an AUC of 0.861 (p<0.001) and an optimal cut-off point of ≤ 72.3 (sensitivity 78.8%, specificity 91,1%, +LR 8.86) (
Figure 2A) (
Table A1,
Appendix A). The m-Borg scale demonstrated excellent discriminative ability, with an AUC of 0.883 (p<0.001) (
Figure 2B) and an optimal cut-off point of >2 (sensitivity 84.8%, specificity 82.2%, +LR 4.77) (
Table A2,
Appendix A). GERD-Q showed an AUC of 0.815 (p<0.001) and an optimal cut-off point >7 (sensitivity 72.7%, specificity 86.7%, +LR 5.45) (
Figure 2C) (
Table A3,
Appendix B), and the capillary density showed an AUC of 0.815 (p<0.001) and an optimal cut-off point ≤ 4.78 (sensitivity 87.9%, specificity 68.9%, +LR 2.82 (
Figure 2D) (
Table A4,
Appendix B).
The four variables (DLco, m-Borg, GERD-Q, and the mean capillary density/n. of capillary/mm) have been applied to the Fagan nomogram to calculate the post-test probability of having SSc-ILD. Calculation of the post-test probability is based on the pre-test probability (42.3% in our case series, which may vary depending on different local settings) and the product of the +LRs of the four items in the individual patient.
Figure 3 shows an example of this calculation.
4. Discussion
Our study identified four variables indicating a significant association with SSc- ILD, namely DLco, m-Borg, GERD-Q, and capillary density. Moreover, the concomitant inclusion of the single measures obtained in a post-test algorithm, tailored to these specific patient characteristics, significantly increases the likelihood of this association.
All patients with SSc should be screened for ILD at the time of diagnosis [
3,
9], as SSc-ILD can be progressive, even in “lower risk” patients [
7]. The estimated prevalence of ILD in SSc patients is highly variable depending on the population studied and the used methodology [
3]. In a Canadian registry, 64% of 289 patients with SSc were diagnosed with ILD by HRCT, compared to 26% by physical examination and 22% by chest X-ray [
29]. In a Norwegian population-based cohort of 650 patients with SSc, ILD at baseline was manifest on HRCT in half of the patients [
30]. Our ILD prevalence was slightly lower (42.31%), presumably due to the single-centre design of the study and the type of the methodology used for reading out the HRCT images, carried out by a radiologist and a rheumatologist with extensive experience in ILD evaluation [
19,
20,
21].
Experts in SSc agree that comparative analysis of PFTs at the time of diagnosis may be relevant to estimate and predict SSc-ILD outcome [
3,
6,
9,
29,
31], but PFTs alone are not sufficient as a tool for detecting patients with no obvious ILD on HRCT [
3,
31,
32,
33] since they reproducibility may vary [
11,
31,
33], and can be supplemented with additional information [
30,
31,
32,
33]. We found that a reduced DLco of 72.3% exhibits the best probability to be associated with SSc-ILD. Of note, DLco could be reduced if concomitant PAH and/or emphysema are present, but it should be mentioned that in our study, patients with a high probability of PAH according to the DETECT algorithm and/or PAH diagnosed by RHC and/or with chronic lung disease other than SSc-ILD were excluded [
11,
16,
28,
31,
34].
Questionnaires assessing dyspnoea in SSc-ILD as well as quality of life are used in clinical practice [
10,
20,
35], especially for correlation with the extent of pulmonary involvement [
10,
20]. In our study we found a significant ability of m-Borg in identifying patients with SSc-ILD, similarly to the DLco. Obviously, m-Borg and DLCO are linked, one being the clinical counterpart of dyspnoea and the other reflecting alveolar-capillary exchanges, which are altered in ILDs [
36,
37].
Some studies suggested that gastro-oesophageal reflux may play a significant role in the pathogenesis of SSc-ILD [
38,
39,
40]. Abnormalities in oesophageal peristalsis and the reduced oesophageal sphincter pressure are believed to lead to recurring micro-aspiration of gastric acid into the respiratory tract, resulting in chronic inflammation and progressive tissue damage [
39,
40,
41]. Results of our study provide evidence that the self-administration of Gerd-Q, a validated questionnaire addressing the symptoms of gastro-oesophageal reflux [
24,
25,
42], can be used to assess the presence of SSc-ILD, being this association more likely when the score is >7.
Microvascular damage is one of the characteristic findings in SSc, which may occur early in the disease course. Moreover, it is thought to be strongly involved in the SSc-ILD pathogenesis [
2,
14,
41,
43]. A causal relationship between microcirculation abnormalities and pulmonary fibrosis could be found in the association of SSc-ILD with structural capillary changes detected by NVC [
13,
14]. Among the typical abnormalities belonging to the “scleroderma pattern” capillary loss appears to be the one most correlated with ILD, through a qualitative (“active and “late” pattern) and quantitative assessment [
13,
44,
45,
46]. In particular, a significant lower capillary density was reported in 48 patients with SSc-ILD diagnosed by HRCT [
44] and, similarly, in 58 patients [
45]. Our findings confirmed this association, also strengthen its value in identifying SSc-ILD when the number of capillary is lower than 5/mm. This result is greatly useful, since capillary density was found to be the parameter with highest inter-reader reliability [
13,
26,
27] among other nailfold abnormalities, understanding that our NVC examination was obtained, as in both previous studies, using a videocapillaroscope with a 200× magnification contact lens [
44,
45].
It seems clear that the four measurements represent the clinical/biometric interface of the pathogenesis of pulmonary involvement, especially of the early phase of SSc (alteration of alveolar gas exchanges, chronic inflammation due to gastric micro-aspirations, chief role of autoimmune-type microcirculation damage as a primer of mesenchymal cell activation) [
3,
11,
41,
43] and appear suitable to be used as variables indicating SSc-ILD [
11].
The lack of significance regarding age, gender, disease duration, mRSS and the presence of anti-topo I was similar to our previous findings (20]. The non-significant association with sex could be explained by the small number of male patients included in the study (67 F/11 M). Similarly, the lack of association with disease duration may be explained by recall bias. Further, mRSS results are not surprising, since this score has reproducibility issues [
11,
20]. Finally, our findings are limited by the cross-sectional design, and the sample study characteristic, needing to be validated in an external cohort.
It should be emphasized that results obtained after inclusion of the four item measurements in a post-test algorithm further increase the probability to identify patients with SSc-ILD. The inclusion of the four variables (DLco, m-Borg, GERD-Q and capillary density) was possible by using Fagan’s nomogram, which allows a prompt assessment in individual SSc patient, using the pre-test probability previous obtained. We performed simulations by entering the LR+ for the four variables into the algorithm, obtaining a higher post-test probability the more the variables differed from the normal. This calculation may be suitable to be performed using a smartphone app, following the example of that already published for Covid-19 pulmonary involvement [
47]. Such a tool incorporating a set of input parameters may be usefully employed to assess the risk of lung tissue damage progression in longitudinal studies, even before evidence on HRCT [
11,
33].