Material And Methods
Study population. The analysis was run in the pool of two cohorts of middle-aged men (40-59 years old) belonging to the IRA of the Seven Countries Study of Cardiovascular Diseases (SCS). The defined samples included a total of 1735 men and 1712 were examined at start in 1960 (participation rate of 98.7%). More information is available elsewhere [
7].
Mortality data. Follow-up for life status, mortality and causes of death was extended for 61 years and only one man was lost to follow-up after 50 years when he was aged 91 years while 3 men were still alive. Causes of death were allocated by reviewing and combining information from death certificates, interim examinations, hospital and medical records, interviews with physicians and relatives of the deceased and any other witnesses of fatal events. Causes of death were determined by a single coder (AM) following defined criteria, and employing the 8
th Revision of the WHO-ICD (ICD-8) [
8]. In the presence of multiple causes and uncertainty about the principal cause, a hierarchical preference was adopted with violence, cancers, coronary heart disease, stroke and other in that order [
9].
The end-points of this analysis were the following groups of cardiovascular diseases:
1) CHD including cases of myocardial infarction, acute ischemic attacks, and sudden coronary death occurring within 2 hours from onset of symptoms, after the exclusion of other possible causes (ICD-8 codes 410, 411, 412, 413, 795); cases with only mention or evidence of chronic coronary heart diseases (part of ICD-8 code 412) or classified as other ischemic heart disease (ICD-8 Coded 414) were not included in this group for reason given elsewhere [
1,
2], but healed myocardial infarction was retained in this group;
2) Stroke including any type of cerebrovascular diseases (ICD-8 codes 430-438);
3) among other types of CVD we selected and created a large subgroup on heart disease of uncertain etiology (HDUE) including symptomatic heart disease (ICD-8 code 427) (thus CHF in 62% of this group), hypertensive heart disease, usually poorly documented (ICD-8 codes 400-404) (13%), and cases vaguely quoted as chronic or other types of coronary heart disease, (ICD code 412, 414) in the absence of any evidence of typical coronary syndromes (25%); usually they were manifested as heart failure, arrhythmia and blocks.
All other rare or etiologically defined cardiovascular disease were not included in this analysis.
Risk factors. A large number of potential risk factors were measured but for the purpose of this analysis only 12 were selected, to be conservative and taking into account their role shown in previous analyses run on shorter follow-up periods [
7].
There were: a) age in years, approximated to the nearest birthday; b) family history of heart attacks, among closed relatives (0=no; 1 = yes); c) cigarettes smoked on average per day (n/day), which was adopted as a metrics since preliminary analyses showed that in the long run ex-smoker had a risk rather similar to the non-smokers (both group being classified with zero cigarettes); d) body mass index (kg/m
2) following the technique reported in the WHO Cardiovascular Survey Methods ( the WHO Manual) [
10]; e) subscapular skinfold thickness (in mm) following the technique reported in the WHO Manual [
10]; f) arm circumference (in mm) following the technique reported in the WHO Manual [
10] with the crude measurement cleaned for the contribution of subcutaneous tissue using a formula that included the tricipital skinfold thickness [
11]; g) systolic blood pressure (mmHg) measured in supine position at the end of a physical examination, using a mercury sphygmomanometer, following the technique reported in the WHO Manual [
10] having the average of two measurements taken one minute apart as analytical variable; h) heart rate (beats/min) derived from a resting ECG; i) vital capacity (in L/m
2) following the technique reported in the WHO Manual [
10] having the best of two attempts used for analysis; l) serum cholesterol (mg/dl) measured in casual blood samples following the technique by Anderson and Keys [
12]; m) urine glucose (yes=1; no=0) measured by uristix in casual urine samples, definite presence used as a proxi of diabetes; n) prevalence of major cardiovascular diseases at entry (0=no; 1 = yes) as defined by the SCS criteria [
13].
Baseline data were collected in 1960 before the era of the Helsinki Declaration and acceptance was implied with participation at entry examinations while only later oral informed consent was obtained in view of collecting follow-up data.
Statistical Analysis. A descriptive statistic was computed for death rates of the CVD end-points during 61 years and Kaplan-Meier curves were produced for each CVD group.
A simplified description of mortality trends was created computing death rates for each CVD group in 3 time-blocks of about 20 years whose denominator was made by subjects alive at the beginning of each time-block. For each CVD group age at death was computed.
Cox proportional hazards models were solved with 3 cardiovascular end-points and 12 risk factors as covariates. For each end-point, calibration was computed (proportion of cases in quintile classes of estimated risk).
To test the possible competing risks, the Fine-Gray procedure was used to challenge CHD mortality versus all other causes of death (that is all causes other than CHD). Two models were solved, one with CHD as primary event and the other causes as secondary event, another the other way round, both as a function of the risk factors previously used in Cox models.
Mean levels of baseline risk factors significantly predicting at least one CVD end-point were computed in men who died from any CVD in 3 subsequent time-periods of about 20 years and compared by ANOVA. Statistical significance was set at p value <0.05.
Results
In the 61-year follow-up, among the 1712 men examined at baseline there were 1708 deaths (99.8%), while 3 men were still alive with an age range of 102 to 106 years and one man was lost to follow-up after 50 years when he was aged 91 years. All types of CVD deaths covered 46% of total mortality while those considered for this analysis were 93% of all CVD deaths (N=727). Among them, 38.7% were diagnosed as CHD, 31.6% as stroke, and 28.5% as HDUE.
Kaplan-Meier curves showed a highly statistically significant difference (p<0.0001) among the 3 components of CVD mortality with a clear-cut and constant rightward displacement of the lifetime HDUE curve as compared to those of CHD and stroke (
Figure 1).
On the other hand, the time-trends of the 3 major conditions showed large differences in the 20-year periods covering the whole follow-up. In fact, CHD death rate sharply increased between period 1 and 2, but largely decreased in period 3 losing its first rank position; death rates for stroke increased from period 1 to period 3, maintaining its second rank position; while death rates for HDUE continuously increased becoming the first ranking in the third time period (
Figure 2). All this suggested that HDUE, beyond other characteristics, had a different natural history compared with the other two major CVD categories. Moreover, age at death was largely different across the CVD groups, i.e. 73.2 (±19.4) years for CHD, 75.2 (±10.6) years for stroke and 79.7 (±10.8) years for HDUE (ANOVA p<0.0001).
Mean levels of the selected risk factors at entry examination are reported in
Table 2 where those related to the whole study population reflect the situation for men in Italian rural environments in the mid of last century. Among them, there were relatively high levels of blood pressure and cigarette consumption and relatively low levels of serum cholesterol. Prevalence of family heart attack was rather high but the vague term used in the questionnaire was preferred to a more technical term (like myocardial infarction) that could be difficult to understand those times.
Baseline mean levels of risk factors across CVD mortality groups did not show significant differences except for heart rate (p=0.0075), and serum cholesterol (p=0.0066).
Cox models showed a variety of combinations of significant risk factors for CHD, stroke and HDUE as based on hazard ratios and their 95% confidence intervals (
Table 2,
Table 3 and
Table 4). In particular, significant risk factors for CHD were age, cigarette smoking, systolic blood pressure, serum cholesterol and vital capacity; for stroke age, systolic blood pressure, serum cholesterol, urine glucose and vital capacity; for HDUE age, cigarette smoking, systolic blood pressure and heart rate. Significant risk factors common to the three CVD end-points were only age and systolic blood pressure. Among risk factors used in these models we found a relatively high correlation and a collinearity problem between BMI and subscapular skinfold but, after removal of the latter, the new multivariable coefficients of risk factors were not significantly different from those of the original models.
The performance of the Cox models was estimated through the calibration procedure, reported in
Table 6 for the 3 CVD end-points. Calibration was poor for all end-points since there was no clear trends in the proportion of cases distributed in quintile classes of estimated risk. We tried to find an explanation for the poor performance of the Cox models and
Table 7 may contribute, at least partially. Using only risk factors with significant coefficients in at least one CVD end-point, mean levels were reported for those who died from any end-point in each time period of about 20 years. A definite trend occurred from time period 1 to time period 3, compatible with early deaths among those carrying unhealthy levels.
Table 5.
Cox model for HDUE deaths (N=216) in 61 years as a function of 12 risk factors.
Table 5.
Cox model for HDUE deaths (N=216) in 61 years as a function of 12 risk factors.
Variable |
Delta |
HR |
95% CI |
P of coefficient |
Age |
5 |
2.34 |
1.98 2.77 |
<0.0001 |
Family heart attack |
1 |
1.17 |
0.89 1.54 |
0.2663 |
Cigarettes |
9 |
1.21 |
1.06 1.38 |
0.0052 |
Body mass index |
4 |
0.95 |
0.70 1.30 |
0.7612 |
Subscapular skinfold |
6 |
0.95 |
0.75 1.22 |
0.7014 |
Arm circumference |
24 |
0.91 |
0.74 1.11 |
0.3426 |
Systolic blood pressure |
20 |
1.36 |
1.16 1.58 |
<0.0001 |
Heart rate |
13 |
0.81 |
0.69 0.96 |
0.0145 |
Vital capacity |
0.24 |
0.92 |
0.78 1.09 |
0.3329 |
Serum cholesterol |
40 |
1.07 |
0.93 1.23 |
0.3467 |
Urine glucose |
1 |
1.72 |
0.89 3.32 |
0.1037 |
CVD prevalence |
1 |
1.74 |
0.91 3.33 |
0.0950 |
Findings of the competing risks by Fine-Gray procedure (
Table 8) suggested that serum cholesterol was the critical risk factor in competition between CHD and other causes with a large positive and significant coefficient in the first model and a negative non-significant coefficient in the second model. On the other hand, age played a different role with positive higher level of coefficient in the second model.
Discussion
The study based on a follow-up of 61 years in a cohort of middle-aged men showed large differences across mortality from 3 major CVD conditions, i.e. CHD, HDUE (mostly composed of CHF) and stroke that related to several aspects of natural history and prediction thus suggesting that different diseases might be implicated and that etiology, mainly of HDUE, is still largely unknown but different from CHD. Although it was authoritatively suggested to base conclusions more on effect size, direction, plausibility, consistency, repeatability and clinical or practical utility rather than statistical significance [
14] there are occasions, like the opposite sign (and thus direction) seen in this study for the predictive capacity of cholesterol versus CHD or other causes of death in the competing risk approach (
Table 8), that deserve special attention as a statistical divergence, which is an important result presented. There are moreover, several other characteristics that were different across the 3 CVD mortality end-points used in this analysis, that is, death rates, mortality time-trends, age at death, baseline levels of some risk factors and long-term risk factor predictive power. In particular, CHD was the most common condition. It declined in the last 20 years of follow-up, while during the same period there was a sharp increase of HDUE (including in large proportions CHF) mortality. Age at death was significantly lower for CHD, higher for HDUE and intermediate for stroke. The latter result in the IRA cohorts was recently confirmed in another 8 cohorts of the SCS [
15].
Baseline mean levels of serum cholesterol were higher for those who died from CHD, lower for those who died from HDUE, intermediate for those who died from stroke, while it was significantly predictive (with a positive coefficient) of CHD events and stroke events and not predictive for HDUE events. In particular, a further novel finding in this analysis was the significant positive coefficient of serum cholesterol predicting stroke mortality since in previous analyses on shorter follow-up periods this was not found [
16]. It is possible that in the late years more thrombotic events added up but this conclusion remains uncertain since in the majority of cases we were unable to segregate thrombotic from hemorrhagic stroke.
All these important differences explicitly point to the fact that when comparing CHD with HDUE we are probably looking at different diseases, what might also be concluded if CHD is compared, by Fine-gray model that takes competition into account, to all other causes of death whereby a major risk factor such as cholesterol has a divergent role. As a consequence, it may not be appropriate to pool those 3 conditions into a single CVD end-point, simply because they relate to the same anatomical-physiological system although sharing the role of age and systolic blood pressure as significant risk factors, as is commonly but inappropriately done in composite end-point analyses in the current Literature. A Cox model including all CVD conditions (not reported in detail) may make sense in view of identifying risk factors that probably share at least some of the various end-points and for general preventive purposes. However, this does not help to improve the understanding of possible etiologies of the single conditions that, for certain aspects, remain different diseases.
The hypothesis was made of an important explanatory contribution of competing risks, a concept already documented on the same material using the 50-year follow-up data [
17,
18]. In other words, those with high cholesterol (and possibly other risk factors) are hit by a CHD event at younger age while the others (relatively protected) become more exposed to other types of events that finally occur but years later and at older ages. This likely explains part of the significant rightward displacement of the Kaplan-Meier curve for HDUE as compared to CHD and stroke (
Figure 1) and the different age at death mainly comparing CHD with HDUE.
Some coefficients of major risk factors were smaller than in models run in the past for shorter follow-up period [
7,
16]. This can be explained by the fact that during the first periods subjects with higher levels were eliminated opening the chance to people with lower levels of risk factors to reach the disease status. In fact, the poor performance of the Cox models could partly be explained by the attrition phenomenon consisting in the early drop-out of subjects with worse risk factor levels. As a consequence, those dying later were usually carrying better levels of risk factors, contributing to dilute the magnitude of multivariable coefficients. This seems a universal phenomenon but probably enhanced by the lifelong follow-up that we have here.
Changes of risk factors level during the follow-up might also influence the multivariable coefficients. By sure there has been a sharp decline of cigarette smoking, a definite increase of blood pressure and a sizeable increase of serum cholesterol, but from previous analyses we know that the baseline risk factor levels were predictive for at least 40 years when tested in partitioned blocks of follow-up tim
e [19].
It was almost impossible to find in the Literature studies really comparable to this one in terms of follow-up duration, practically extinction of the cohort and dealing with the same groups of CVD mortality with well-defined groups based on the international classification of diseases. The only “key word” providing somewhat similar analyses and references were “lifetime CVD risk”. Many studies claim to have estimated the lifetime risk of CHD but almost all have simply applied to their data a lifetime Framingham risk score [
20]. Actually, they do not deserve a direct mention since they simply produced theoretical estimates applying an external score to their baseline data.
On the other hand, the Framingham score [
20] was based on the use of total and HDL cholesterol, systolic and diastolic blood pressure, smoking habits and body mass index, all transformed into a variable number of discrete classes. The lifetime exposure was actually stopped at the age of 95 years and the overall death rate of the observed cohort was of about 30%. Moreover, the assumption was made that those reaching the age of 75 years could be considered as “dead” although the majority of them died later. In fact, the concept of "lifetime” was interpreted in a substantially arbitrary way and the analytical approach had nothing to do with our approach where a cohort was followed-up until practical extinction (99.8%).
Other contributions based on the Framingham study dealing with the same general problem were mainly devoted to numerical estimates of lifelong CVD incidence-mortality based on incomplete follow-up [
21], including a specific analysis dedicated to CHF (in people without and with previous myocardial infarction) [
22] and another to stroke [
23]. The same authors made comparisons of 10 years versus lifelong events prediction [
24,
25] concluding that a new specific approach should be developed.
A huge analysis was conducted in the US pooling several large studies with data collected from 1964 through 2008 and exploiting the predictive role of blood pressure, smoking habits, diabetes and serum cholesterol and posing the age limit of 95 years. The outcome, defined by a pool of different major CVD, was predicted with largely different probabilities depending on the combination of high-low levels of the considered risk factors [
26].
A Swedish study based on a risk factor population screening of 92,000 adults aged 40, 50 and 60 years, covered a maximum follow-up of 25 years [
27]. Measured risk factors were serum cholesterol, smoking habits, blood pressure, body mass index, physical activity, self-reported health, education and marital status, all considered in discrete classes and combining them to determine 3 different risk profiles. The end-points were the occurrence of CHD, stroke, CHF and a pool of CVD. There was a coherence between the risk profiles and the so-called lifetime risk of CHD, but not for stroke and CHF, suggesting that this study had some similarity with our findings. However, the authors did not comment this aspect and simply said that the distinction between specific CVD events was not a relevant problem since the real problem is the prevention through action on risk factors. This conclusion has a practical sense, but excludes any attention for possible differences across various CVD types and their possible etiology, alike what we tried here.
A Chinese study published in 2015 involved almost 22,000 adults aged 35-84 where a few risk factors were measured and the follow-up was extended for 18 years [
28] with pooled CVD events as end-point. The estimate of lifetime CVD risk, with limit posed at the age of 80 years, showed the ability of a few risk factors to segregate high from low risk for CVD events.
In 3 large meta-analyses run on the same population studies, but separately exploring different risk factors, it was impossible to identify a subgroup of common heart diseases, outside CHD (that is somewhat similar to those we call HDUE) since they were usually pooled with other vascular diseases preventing the possibility to describe carefully their characteristics [29-31].
It is clear that genomics analyses may improve long-term risk prediction and some contributions start to be of great interest [
32] but this aspect could obviously not be considered in our data collected in the mid of last century.
Finally, we have more recently investigated in the context of 10 cohorts of 9,063 middle-aged men of the Seven Countries Study followed for 60 years to extinction whether competing risks of CHD mortality versus other causes of death, comparatively assessed by Fine-Gray versus Cox models, might help explaining why regions with initially high serum cholesterol have higher mortality from CHD and lower from stroke and HDUE, while the reverse is found for those with initially lower serum cholesterol [
33]. There were 5 important conclusions that lead support to the results presented here: 1) CHD mortality is bound to different risk factor relationships when put in competition with all other causes of death and with the sum of HDUE and stroke; 2) competing risk analyses (by Fine-Gray methods) are needed to assess risk factor capabilities in specific outcome types of CVD since they are different depending on outcome types considered (CHD versus HDUE plus stroke ); 3) there are important differences among predictive covariates, in terms of presence/absence and opposition of the coefficients’ algebraic sign when the inversion is performed of primary versus the secondary outcome in case of Fine-Gray, what is much less evident or absent by Cox which point to the need of a certain power, namely number of enrolled individuals, in order to assess these aspects; 4) the changing algebraic sign of country coefficients, treated in the study as dummy variables, comparing direct versus inverse models might suggest a possible “ecologic“ interpretation of the Fine-Gray analysis; 5) it is clear that individual outcomes of CVD should not be summed-up or equated, which should be considered in all future investigations.
A major strength of the present study is the practical extinction of the original cohort and the extremely long duration of follow-up that with detailed diagnosis of outcome results, what has few if no comparative examples in the Literature. However, these results cannot be extended to women that were not included since at starting of the Seven Countries Study it was estimated that the low incidence of heart disease would have requested too large groups to be enrolled, unacceptable financial and organizational requirements in early sixties. It is extremely important to underline that composite end-point analyses of CVD mortality have the inherent risk to distort the relations among factors and outcomes to the extent that, at least, CHD versus HDUE (or also possibly the sum of HDUE and stroke) and versus all other causes of death should be considered, not mixing them up for a composite end-point analysis of CVD, like mostly done in the Literature. Moreover, there might be predictive differences in the analyzed covariates when a cohort presents around 39% of CHD deaths versus lower or higher proportions, which depends, among other factors also on the respective proportions of stroke and HDUE. This fact has important consequences for epidemiology, public health and clinical practice since the potential different etiology between CHD and HDUE (largely composed of CHF) points to the need of dissecting these causes in large ongoing and future investigations also separating stroke from the other components of CVD [
34] and certainly not addressing composite end-points just to unduly increase the power of studies which is done more than often [
35] and may have unpredictable consequences depending on the analyzed cohorts.