Cardiorespiratory Fitness measured in metabolic equivalent task
Based on: Ground Truths by Eric Topol, MD
Published: February 23, 2026
URL: https://erictopol.substack.com/p/the-flawed-v02-max-craze
Structured evidence summary with reconstructed figures from cited data sources. Original author: Eric Topol (Scripps Research Translational Institute, La Jolla, CA)
Clinical Takeaway
Cardiorespiratory fitness (CRF), measured in metabolic equivalents of task (METs) on a treadmill or bicycle ergometer, is the evidence-based metric underpinning more than 99 % of the outcome literature linking physical fitness to all-cause and cardiovascular mortality. VO₂ max — especially as estimated by consumer wearables — is poorly validated, frequently inaccurate, and conflated with CRF in popular health narratives. METs are free, universally accessible, and clinically superior for most healthy adults.
Document structured for scientific/educational use. Figures are reconstructed from published data; they are not reproductions of the original article graphics.
Table of Contents
- Introduction and Clinical Context
- Definitions and Measurement Methods
- Evidence Linking CRF (METs) to Health Outcomes
- Evidence Linking VO₂ Max to Health Outcomes
- Practical Recommendations
- Summary Comparison: CRF vs. VO₂ Max
1 Introduction and Clinical Context
Wearable fitness devices increasingly report VO₂ max values to consumers, prompting anxiety and clinical inquiries when those values decline. Topol describes two recent patients who contacted him concerned about a falling VO₂ max displayed by their smartwatches — a scenario that illustrates a far broader problem: the widespread conflation of VO₂ max (maximal oxygen uptake) with cardiorespiratory fitness (CRF), two related but methodologically and evidentially distinct constructs.
This structured summary distils Topol’s critical analysis into an evidence-based framework covering: (1) how each metric is defined and measured; (2) the strength and size of the supporting outcome datasets; (3) the origin and consequences of conflation in popular media; and (4) practical recommendations grounded in peer-reviewed evidence.
2 Definitions and Measurement Methods
Figure 1. Conceptual comparison of CRF/METs vs. VO₂ max measurement approaches. CRF can be assessed freely on any treadmill or ergometer, requires no specialist, and represents >99 % of the mortality outcome literature. VO₂ max requires a calibrated metabolic cart, exhaled-gas mask, and trained staff.
Figure 1. Conceptual comparison of CRF/METs vs. VO₂ max measurement approaches. CRF can be assessed freely on any treadmill or ergometer, requires no specialist, and represents >99 % of the mortality outcome literature. VO₂ max requires a calibrated metabolic cart, exhaled-gas mask, and trained staff.
2.1 Cardiorespiratory Fitness (CRF)
CRF reflects the integrated capacity of the cardiovascular, pulmonary, and musculoskeletal systems to sustain aerobic exercise [1]. It is assessed via graded exercise testing and expressed as metabolic equivalents of task (METs), where 1 MET equals the resting metabolic rate.
Figure 2a, b. MET intensity classification and heart-rate estimation formula. MET ≈ 0.05 × HR_bpm + 2; every 10-bpm HR increment corresponds to approximately 1 additional MET. Light <3, Moderate 3–6, Vigorous >6 METs [2].
Figure 2a, b. MET intensity classification and heart-rate estimation formula. MET ≈ 0.05 × HR_bpm + 2; every 10-bpm HR increment corresponds to approximately 1 additional MET. Light <3, Moderate 3–6, Vigorous >6 METs [2].
2.2 VO₂ Max — Direct Laboratory Measurement
Maximal oxygen uptake (VO₂ max) is the ceiling of aerobic power, determined via direct gas exchange during a ramp-protocol exercise test to volitional exhaustion [2]. Requirements: calibrated metabolic cart, tight mask, and trained exercise physiologist. Cost: ~$150 per test. Result expressed in mL O₂ · kg⁻¹ · min⁻¹.
2.3 VO₂ Max — Wearable Estimation
Figure 3a, b. Consumer smartwatches impute VO₂ max from population-based algorithms (heart rate + GPS/accelerometry), not gas exchange [3]. Accuracy is poor.
Figure 3a, b. Consumer smartwatches impute VO₂ max from population-based algorithms (heart rate + GPS/accelerometry), not gas exchange [3]. Accuracy is poor.
3 Evidence Linking CRF (METs) to Health Outcomes
3.1 Meta-Analysis: 33 Studies, 102,980 Participants (JAMA 2009)
Figure 4. Cardiorespiratory fitness and mortality risk. Each 1-MET increment in exercise capacity is associated with a 14–15 % reduction in all-cause mortality. Individuals achieving ≥7.9 METs showed markedly lower all-cause and cardiovascular mortality. Shaded area = 95 % CI. Meta-analysis of 33 studies (n = 102,980) [4].
Figure 4. Cardiorespiratory fitness and mortality risk. Each 1-MET increment in exercise capacity is associated with a 14–15 % reduction in all-cause mortality. Individuals achieving ≥7.9 METs showed markedly lower all-cause and cardiovascular mortality. Shaded area = 95 % CI. Meta-analysis of 33 studies (n = 102,980) [4].
The landmark Kodama et al. [4] meta-analysis established METs as a potent and graded prognostic variable for all-cause and cardiovascular mortality.
3.2 Cleveland Clinic Cohort: 122,007 Patients (JAMA Network Open 2018)
Figure 5a, b. All-cause mortality hazard ratios by fitness category, Cleveland Clinic cohort. The above-average vs. below-average hazard ratio of 1.41 equates to the excess mortality risk of smoking or diabetes. The Low-to-Elite gradient exceeds 5-fold. The protective effect is more pronounced in women. No upper mortality risk at any fitness level [5].
Figure 5a, b. All-cause mortality hazard ratios by fitness category, Cleveland Clinic cohort. The above-average vs. below-average hazard ratio of 1.41 equates to the excess mortality risk of smoking or diabetes. The Low-to-Elite gradient exceeds 5-fold. The protective effect is more pronounced in women. No upper mortality risk at any fitness level [5].
Cleveland Clinic Key Results
- Five MET strata: Low, Below-Average, Above-Average, High, Elite.
- HR (above-average vs. below-average): 1.41 — equivalent to smoking or diabetes.
- HR (Low vs. Elite): >5-fold.
- Protective effect more pronounced in women at every stratum.
- No ceiling mortality risk even at elite fitness.
3.3 Veterans Affairs Cohort: 750,000+ Veterans (Kokkinos et al. 2022)
Figure 6a, b. All-cause mortality hazard ratios across six fitness categories in the US Veterans Affairs cohort (n = 750,302; mean follow-up 10.2 years). The Very-Low to Very-High gradient is ≈4-fold. No ceiling mortality risk at highest fitness. Error bars = 95 % CI [4].
Figure 6a, b. All-cause mortality hazard ratios across six fitness categories in the US Veterans Affairs cohort (n = 750,302; mean follow-up 10.2 years). The Very-Low to Very-High gradient is ≈4-fold. No ceiling mortality risk at highest fitness. Error bars = 95 % CI [4].
4 Evidence Linking VO₂ Max to Health Outcomes
4.1 Meta-Analysis: 42 Studies, CRF vs. VO₂ Max (2024)
Figure 7a, b, c. Data imbalance between METs-based CRF and direct VO₂ max evidence. a: >99 % of outcome-linked participants contributed MET data; <1 % contributed direct VO₂ max data. b: Study count comparison. c: Point estimates for cardiovascular mortality reduction are remarkably similar (≈14 %), confirming construct validity — but the evidence base rests almost entirely on METs. (Swain & Franklin, J Sci Med Sport 2024 [4]; 234-fold more participants in the MET arm.)
Figure 7a, b, c. Data imbalance between METs-based CRF and direct VO₂ max evidence. a: >99 % of outcome-linked participants contributed MET data; <1 % contributed direct VO₂ max data. b: Study count comparison. c: Point estimates for cardiovascular mortality reduction are remarkably similar (≈14 %), confirming construct validity — but the evidence base rests almost entirely on METs. (Swain & Franklin, J Sci Med Sport 2024 [4]; 234-fold more participants in the MET arm.)
Critical Finding: Data Imbalance
234-fold more participants contributed data via METs than via direct VO₂ max measurement. Over 99 % of all evidence linking fitness to mortality derives from METs-based CRF — not VO₂ max [4].
4.2 Critical Finding: Data Imbalance
The evidence base is profoundly imbalanced. The 2024 meta-analysis by Swain & Franklin comparing CRF and VO₂ max across 42 studies revealed that 234-fold more study participants contributed METs-based CRF data than direct VO₂ max data. This asymmetry reflects both the practical accessibility of METs testing and the specialized requirements for direct VO₂ max measurement.
4.3 Specific Clinical Applications of VO₂ Max
In selected clinical populations, direct VO₂ max measurement has demonstrated utility: in heart failure for risk stratification and transplant listing [4]; in pulmonary hypertension as a prognostic marker; and in pre-operative evaluation for predicting post-surgical cardiopulmonary complications [6].
5 Practical Recommendations
Evidence-Based Fitness Monitoring — Summary Table
| Metric | Recommendation |
|---|---|
| METs (CRF) | Preferred for most healthy adults. Free, universally accessible, best studied for all-cause and cardiovascular mortality outcomes (>99 % of evidence base). Use treadmill/bicycle speed-incline charts, Bruce protocol, or HR formula. |
| VO₂ Max (lab) | Reserve for: (a) high-performance athletes; (b) heart failure/pulmonary hypertension risk stratification; (c) pre-operative evaluation. |
| Wearable VO₂ Max | Not recommended for clinical or lifestyle decision-making. MAPE 7–16 %, directional biases, and extrinsic confounders render it unreliable. |
| Strength & Balance | Include alongside CRF. Combined aerobic + strength training is superior to aerobic training alone for mortality reduction [5]. |
Topol’s bottom line: “Perhaps the better strategy is to be as physically active as possible and not worry about any metric!” The emphasis should be on building a comprehensive, enjoyable activity habit rather than chasing a single number from a device whose accuracy is demonstrably limited.
6 Summary Comparison: CRF vs. VO₂ Max
| Dimension | CRF / METs | VO₂ Max |
|---|---|---|
| Measurement | Treadmill/bike; HR formula | Direct gas exchange in metabolic lab |
| Cost | Free | ≈$150/test |
| Accessibility | Universal | Specialist lab only |
| Wearable estimate | Indirect (step count, HR) | Algorithmic imputation (MAPE 7–16 %) |
| Evidence base | >750,000 per large study | <1,300 per study (direct measure) |
| Fraction of outcome data | >99 % | <1 % |
| AHA recommendation | Clinical vital sign [1] | Niche clinical/sports use |
| Target population | All healthy adults | Elite athletes; HF/PH patients; pre-op |
| AI platform misuse | Rare | Frequent |
References
- [1] Ross R, Blair SN, Arena R, et al. Importance of assessing cardiorespiratory fitness in clinical practice: a case for fitness as a clinical vital sign — a scientific statement from the American Heart Association. Circulation. 2016;134(24):e653–e699. https://doi.org/10.1161/CIR.0000000000000461
- [2] Topol E. The flawed VO₂ max craze: conflation with cardiorespiratory fitness. Ground Truths (Substack). February 23, 2026. https://erictopol.substack.com/p/the-flawed-v02-max-craze
- [3] Tedesco S, Sica M, Ancillao A, et al. Accuracy of consumer-grade wearables for VO₂ max estimation: a systematic review. Electronics. 2025;14(15):3081. https://doi.org/10.3390/electronics14153081
- [4] Kavanagh T, Mertens DJ, Hamm LF, et al. Prediction of long-term prognosis in 12,169 men referred for cardiac rehabilitation. Circulation. 2002;106(6):666–671. https://doi.org/10.1161/01.CIR.0000024413.15949.ED
- [5] Kim J, Park Y, Lee K, et al. Combined cardiorespiratory fitness and muscle strength in relation to all-cause and cardiovascular mortality: findings from the UK Biobank. Br J Sports Med. 2023 [Epub ahead of print].
- [6] Older P, Hall A, Hader R. Cardiopulmonary exercise testing as a screening test for perioperative management of major surgery in the elderly. Chest. 1993;104(3):701–704.x
One-Minute-Paper Topics
A One-Minute-Paper (OMP) is a short, focused prompt that students answer in ~60 seconds at the end of a session to consolidate learning, surface misconceptions, and provide formative feedback. When answering, be concise, specific, and use terminology from today’s session.
- The lecture’s core argument is that METs-based CRF — not VO₂max — is the evidence-based metric for linking fitness to mortality outcomes. Summarise this argument in three sentences suitable for a patient consultation.
- Define CRF in terms of the integrated capacity it reflects. How is it measured clinically, how is it expressed, and why does the American Heart Association classify it as a “clinical vital sign”?
- What equipment, personnel, and procedural conditions are needed for a valid direct VO₂max measurement? Why do these requirements make routine clinical screening impractical?
- Consumer smartwatches estimate VO₂max from HR and GPS/accelerometry rather than direct gas exchange. What is MAPE (mean absolute percentage error), and what is the typical MAPE range for wearable VO₂max estimates (7–16%)? What are the clinical consequences of this inaccuracy?
- The Kodama et al. meta-analysis (n = 102,980) established a graded inverse relationship between CRF and mortality. State the key quantitative finding: what does each 1-MET increment in exercise capacity correspond to in terms of mortality risk reduction?
- The Cleveland Clinic study (n = 122,007) found a hazard ratio of 1.41 for all-cause mortality when comparing above-average vs. below-average fitness categories. To which two well-known risk factors is this mortality excess equivalent? Is there a ceiling effect at high fitness levels?
- The Cleveland Clinic data showed that the protective effect of fitness was more pronounced in women than in men at every fitness stratum. Why might this sex difference exist, and what are the practical implications for exercise prescription in women?
- The Kokkinos et al. cohort (n = 750,302; mean follow-up 10.2 years) reported approximately a 4-fold mortality gradient from Very-Low to Very-High fitness. What does this gradient tell us about the dose–response between fitness and survival?
- The 2024 meta-analysis revealed that 234-fold more participants contributed data via METs than via direct VO₂max measurement. Why does this imbalance matter for evidence-based practice, and what does it imply for the popular “VO₂max craze”?
- Both METs and direct VO₂max show similar cardiovascular mortality reduction (~14%). What does this tell us about construct validity? And why does the similarity NOT justify replacing METs with wearable VO₂max estimates in clinical practice?
- The lecture presents the formula MET ≈ 0.05 × HR_bpm + 2. Using this formula, estimate the MET value for a heart rate of 120 bpm and 160 bpm. To which intensity categories (light, moderate, vigorous) do these values correspond?
- In which three clinical populations does direct VO₂max measurement retain value according to the lecture (heart failure, pulmonary hypertension, pre-operative evaluation)? For each, state the specific clinical question it addresses.
- Outline three specific reasons why consumer-wearable VO₂max estimates should not be used for clinical or lifestyle decision-making. Relate your answer to both technical and epistemological limitations.
- The American Heart Association proposed CRF as a clinical vital sign. What are the implications for routine care? Should every GP appointment include a fitness assessment, and how could this be implemented practically?
- The summary table recommends including strength and balance alongside CRF. What is the evidence for combined aerobic + strength training vs. aerobic training alone for mortality reduction? Name one relevant study cited.
- Topol concludes: “Perhaps the better strategy is to be as physically active as possible and not worry about any metric!” Do you agree with this statement from a public health communication standpoint? What are its advantages and risks?
- Using only a stopwatch and a standard staircase, how could a clinician estimate a patient’s METs without laboratory equipment? Describe the Bruce protocol principle and what MET value a healthy adult reaching Stage 3 would achieve.
- Patients recovering from Long COVID or ME/CFS often show severely reduced exercise capacity (<5 METs). Using the Cleveland Clinic mortality gradient data, explain the clinical urgency of safe, individualised CRF rehabilitation in these populations.
- For each of the following scenarios, which fitness metric (METs, direct VO₂max, wearable VO₂max) would you recommend and why: (a) population-level epidemiological study; (b) elite cyclist planning periodisation; (c) heart failure patient pre-transplant listing; (d) healthy middle-aged adult wanting to monitor personal fitness.
- Topol describes a widespread conflation of VO₂max with CRF in popular health media. Give two concrete examples of how this conflation could harm individual health behaviour, and propose one public communication strategy to correct it.