1. Introduction
Maximal heart rate (HRmax) is a widely used variable to recommend training intensity by exercise practitioners [
1] and prescribe treatment by medical professionals [
2]. HRmax usually means the highest heart rate (HR) achieved during maximum symptom-limited cardiopulmonary exercise test (CPET) [
3]. Verification of achieving HRmax can be carried out based on volitional CPET termination or by HR plateau (lack of growth in the HR with increasing intensity) [
4,
5]. Currently performed CPET often leads to nondiagnostic results because participants terminate them at the submaximal intensity (without signs of ischemia or <85% of HRmax) based only on prespecified HRmax calculated from various prediction models. The clinical value of such CPET is sub-optimal and this was confirmed by ECG and imaging data [
4,
5].
HRmax shows significant variability [
1,
6]. Measuring the HR during exercises, and calculating its proportion to HRmax, resting HR, or relative workload (% of HRmax) is a common practice and has been implemented in many wearable devices [
6]. Results help to optimize exercise intensity for both healthy endurance athletes (EA) and rehabilitating patients. It has been confirmed that people with better fitness level, in particular EA, are able to achieve higher HRmax and maintain slower declines in HRmax with ageing [
6,
7]. They also have lower all-cause mortality, especially due to CVD [
6].
CPET results differ significantly between treadmill and cycle modality[
8]. The differences apply not only to HRmax but also to other performance indicators[
9,
10]. Usually, higher scores are observed in running CPET[
9]. Cycling CPET has a lower occurrence of attributes because it is performed in a more stable position[
8]. In consequence, cycling test is more recommended when higher measurement accuracy is required (e.g. in the clinical settings)[
3].
There is a demand for individualized, risk-based, stratified therapy and exercise programs. It points out that medical decisions, intensity prescriptions, and diagnostic examinations are precisely tailored to the individual [
11,
12]. Knowledge of the endurance capacity remains therefore essential. Prediction models are usually developed to share decision-making [
11]. Numerous regression equations were derived to obtain the HRmax setpoint without maximal CPET [
13]. However, their accuracy is often questioned and there is a lack of large studies comparing different prediction models among EA and stratifying them between CPET modalities. Recently, endurance sports are gaining popularity, hence the number of EAs with suspected CVD and veteran EAs, for whom the exact value of HRmax is particularly important, increased.
The majority of prediction models include a baseline value (around 200-220) and different age covariates [
14]. Their authors postulate that such a univariate analysis with the inclusion of only one key factor is sufficient and other variables remain negligible [
2]. Indirect estimation of HRmax has found wide application in fitness equipment and is a common method for preparing medical recommendations (e.g. for treatment of heart failure) [
1,
15].
The aims of this study are: (1) to assess the accuracy of HRmax prediction models in the population of EA independently for running and cycling CPET, (2) to evaluate their practical application based on the precision of estimations, and (3) to provide further research recommendations to improve the accuracy of novel prediction models. According to our knowledge, this is the first external validation of HRmax prediction models stratified between CPET type that includes up to 13 formulae and has been performed on such a large EAs population.
2. Materials and Methods
2.1. Study setting
This is a population study on a cohort of healthy adult EA who declared regular training and a planned exercise regimen (≥3 weekly training sessions). A retrospective analysis of CPET data collected between 2013-2021 from the tertiary care sports medicine center SportsLab (
www.sportslab.pl, Warsaw, Poland) was performed. All CPETs were conducted at the individual request of athletes as part of the optimization of a training program or periodic performance evaluation. Exclusion criteria were: (1) age <18 years old, (2) any medical contraindications (both acute and chronic), (3) usage of any medications at the date of study (both acutely and chronically), (4) smoking, (5) missing data in HRmax. The selection procedure is presented in
Figure 1. Validation was performed following the TRIPOD guidelines (see
Supplementary Material 1 S1: TRIPOD Checklist: Prediction Model Development and Validation) [
16].
2.2. Previously published HRmax prediction models
Candidates of prediction models were identified from systematic reviews for normative exercise reference values by Paap et al. and Takken et al. up to 2019 [
17,
18]. Screening for models derived between 2019 and 2023 was conducted by a manual literature search in four electronic databases PubMed, MEDLINE, Scopus, and Web of Science using keywords: "prediction model", "prediction equation", “prediction algorithm”, “endurance athletes”, “cardiopulmonary exercise testing” and “maximal heart rate”. Inclusion criteria were: (1) usage only of somatic or exercise variables which were available in our database, (2) providing HRmax defined as the peak value (not averaged), and (3) providing data about the primarily derived population stratified by CPET modality. Exclusion criteria were: (1) being derived primarily for pediatric or geriatric populations, (2) being derived exclusively for one sex, and (3) focusing on clinical population, Additionally, Fox et al. [
19] and Tanaka et al. [
2] equations were added due to their wide usage. Finally, 13 different prediction models from 9 studies were qualified. Their classification is presented in the left part of
Table 2. Original derivation studies are presented in
Supplementary Material 3 S3: Selected prediction models for maximal heart rate.
2.3. Cardiopulmonary exercise testing procedures
All CPETs were performed in a single laboratory under unified protocols both for treadmill and cycle ergometry. EA underwent graded maximal effort CPET on either a mechanical treadmill (h/p/Cosmos quasar, Germany) or cycle ergometer (Cyclus 2, RBM elektronik-automation GmbH, Leipzig, Germany). Briefly, the testing modality was selected by the agreement of the subject and the physiologist to suit the primary training discipline.
The cycling CPET started with a 5 min free wheel pedaling as a warmup and continued with a gradual increase in resistance every 2 min until termination (20W for females and 30W for males). The running CPET begins with a 5 min walking or slow jogging warmup and continued with a gradual increase in velocity every 2 min (1 km·h
-1 for both females and males). The treadmill was set at a constant inclination equal to 1%. The intensity was adjusted by the physiologist in conjunction with the trainee to reach their maximum exertion. The termination points considered as a maximal effort were: (1) volitional exhaustion and inability to continue the protocol with declared exertion ≥18 in Borg’s RPE, (2) an HR or oxygen uptake (VO
2) plateau (a stable level of HR or levelling-off in VO
2, defined as an increase <100 mL·min
−1 with growing exercise intensity before CPET termination)[
20,
21,
22].
Exercise indices were obtained breath-by-breath by the Hans Rudolph V2 Mask (Hans Rudolph, Inc, Shawnee, KS, USA), a gas exchange monitor Cosmed Quark CPET (Rome, Italy), and analyzed using dedicated software Omnia. HR was measured via ANT+ chest strap as a part of the Cosmed Quark CPET set (manufacturer product accuracy comparable to ECG; ± 1 beats·min−1). HRmax was defined as the peak value and was not averaged in the interval preceding CPET termination. Maximal VO2 (VO2max) was considered as the average VO2 during the 15-s period at the end of the CPET. The maximal oxygen pulse was calculated as VO2max/HRmax, maximal respiratory exchange ratio as the maximal volume of exhaled carbon dioxide/ maximal volume of oxygen uptake, and maximal minute ventilation efficiency as maximal ventilation/ maximal volume of exhaled carbon dioxide.
2.4. Data Analysis
Continuous variables are presented as mean (standard deviation; SD) and 95% confidence intervals (CI). Categorical variables are presented as numbers (percentages). Equations were tested independently for running and cycling CPET. Data distribution was assessed by the quantile-quantile plots. Differences between both disciplines (running/cycling) were calculated by Student t-test.
Predictive performance of the selected formulae was compared by the MAPE (mean absolute percentage error), MAE (mean absolute error), and RMSE (root mean square error). MAE and RMSE were additionally adjusted to the percentage of observed HRmax (by dividing the error by the mean of observed HRmax). Differences between observed and predicted HRmax were also calculated by Student t-test.
The significance agreement was adopted at two-tailed p=0.05. Statistical analyzes were performed in the SPSS program (version 29.0, IBM, Chicago, IL, USA).
4. Discussion
In this retrospective analysis of data from CPET conducted at the tertiary care sports diagnostic center, we examine the relationship between directly measured and predicted HRmax by the 13 commonly used regression equations. We demonstrate that: (1) the predicted values differed significantly for the majority of formulae, (2) the underestimation between the predicted and observed values ranged from 0.18 beats·min
−1 to 4.90 beats·min
−1, (3) overestimation of HRmax was observed less often than underestimation among EA, and (4) overestimation ranged from 0.08 up to 4.94 beats·min
−1. By definition, external validation is ‘assessing the predictive agreement of a prediction model in a research population other than the one from which the model was developed’ [
23]. We underline the lack of external validation studies on EA performed comprehensively for numerous HRmax equations. The main novelties of the present research are a wide cohort of EA at different levels of fitness and independent analysis adjusted for treadmill and cycling CPET. Such an approach enables reliable validation showing whether current models are transferable and suitable for EAs at both testing modalities.
Accurate prediction of HR might be helpful to confirm that the maximal effort has been achieved, compare training intensity with the maximal capacity of the individual, or consider clinically focused CPET as valuable in diagnosing CVDs [
6]. The most accurate way to obtain HRmax is to perform laboratory CPET or maximum effort during the competition [
15]. Although, those methods are not always possible due to the limited availability of specialized diagnostic centers, high costs of fees, or participant health restrictions and limitations in locations of events[
24]. Despite significant inaccuracy, indirect measurement is widely applied in practice. So far, the accuracy of Fox et al. [
19] and Tanaka et al. [
2] models have been most frequently evaluated. Along with their popularity, they already have some inaccuracies. Nes et al. postulate bias at the level of 4-7 beats·min
−1 for the Tanaka et al. and up to 35 beats·min
−1 for the Fox et al. in certain subjects [
14]. Furthermore, Magri et al. report that in a clinical population consisting of patients with heart failure, Fox et al. led to 37.60% and Tanaka et al. led to 42.60% errors compared to directly measured HRmax [
25]. However, a comprehensive evaluation of other previously derived models is missing, and most studies focus only on a few particular formulae.
Our validation approach directed at the EA enables a comprehensive assessment of whether models are fairly replicable. Current HRmax formulae were originally derived from varied samples. Briefly, our results indicate that they do not perform precisely in EA, despite we only selected models derived from healthy, active cohorts with comparable demographic to our’s subjects. This is especially important due to their wide usage in sports diagnostics. The bias for most of the formulae ranged from 0 to 5 beats·min
−1. However, the one provided by Machado et al.[
26] overestimated HRmax for up to 4.94 beats·min
−1.
Relying on inaccurate results when setting the exercise intensity may lead to suboptimal, non-diagnostic effort[
3]. As a rule, the intensity for exercise medical programs should be 70-80% of HRmax for moderate-intensity steady-state activity and >85% of HRmax for high-intensity interval training [
27]. An underestimated HRmax of ~5 beats·min
−1 (i.e. ~3% of HRmax for 30-year male individuals according to widely used Fox et al. algorithm) does not seem to be a wide inaccuracy. This level of bias fills in the range of adjusted intensity for medical programs. Thus, predicted HRmax could be applied to EA in a medical setting but we underline that this relationship for the general and clinical populations need to be confirmed.
So far, previous studies have usually postulated that models may underestimate HRmax in people with a higher level of fitness [
14,
25]. Physical activity allows maintaining high and stable HRmax, despite increasing age. In addition, people with lower endurance capacity experience a steeper decline [
28]. Our results showed a similar relationship and only ~40% (i.e. 3 of 8 for running CPET and 2 of 5 for cycling CPET) overestimated HRmax in the athletic cohort.
Calculating MAPE is our additional approach for the evaluation of HRmax prediction performance. Briefly, MAPE indicates function loss by a regression model, and it's intuitive to interpret relative inaccuracy. MAPE indicated that Arena et al. and Tanaka et al. perform with substantial accuracy for each modality. Their MAPE was the lowest both for treadmill and cycling data. We also noticed that Arena et al. (
p=0.80 for cycling), Tanaka et al. (
p=0.16 for cycle) and Fox et al. (
p=0.30 for treadmill and
p=0.23 for cycling) equations were the only ones that did not differ significantly from measured values. Our results indicate that these models perform quite precisely for EAs. They were characterized by the lowest inaccuracies. Perhaps because they were derived from numerous populations from synthesized various studies[
2,
19] or cohorts with above-average physical activity levels[
29]. Thus, the ratio between basic (~200-220) and age covariates requires further studies to find the most precise values.
It is worth to underline that MAPE is more biased toward clinical than healthy, athletic populations. Our MAPE was lower for the athletic cohort than for the patients with heart failure in Magri et al.[
30] study (4.25% and 4.41% vs 37.60% for Fox et al. or 3.96% and 4.11% vs 42.60% for Tanaka et al.). This indicates that HRmax prediction could be more accurate for active than diseased individuals and perhaps find its wider practical application among EA. We stipulate that these results emerge other confounding variables in participants with heart failure or CVDs (i.e. impact of
medications, heart anatomy, etc.) [
30].
All selected studies included just two variables, i.e. age and sex. The formulas used varied proportions of multipliers. It is well documented, that EAs achieve higher HRmax and a slower decline in HRmax with age than general population [
7]. However, the majority of them differ significantly compared to observed values. We suggest that predicting HRmax only in this way may not be the most optimal method, and other estimation possibilities could exist. Previous reports indicate that there could be an impact of resting HR or submaximal HR at first and second ventilatory thresholds, BMI, body mass, VO
2max, body fat, and testing modality [
31,
32]. All of them are parameters regularly measured by most diagnostic centers. Recently derived more specified models consider blood counts, left ventricular ejection fraction (LVEF), and diet [
30]. However, these variables are difficult to implement in a unified form under practical circumstances. Measuring them for all subjects could be problematic. Therefore, staying with predicting HRmax based on additional somatic and exercise variables other than the age-only approach seems to be simultaneously a more accurate and feasible tool.
The limitation of this study is the slightly underrepresented group of female EA (15.01%). Our cohort was also more advanced in age. We recommend that future research consider a more equal proportion of male-to-female and include a higher number of younger participants. We also advise validation on equations derived exclusively for one sex.
Author Contributions
PSK: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Software, Validation, Visualization, Writing - Original Draft, Writing - Review & Editing. SW: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Supervision, Visualization, Writing - Review & Editing. IC: Data curation, Validation, Formal analysis, Resources, Methodology, Software. TT: Investigation, Project Administration, Validation, Supervision, Visualization. JL: Data curation. ML: Funding acquisition. MB: Funding acquisition. AM: Supervision. DŚ: Funding acquisition, Project administration.