Cardiorespiratory Fitness measured in metabolic equivalent task
Based on: Ground Truths by Eric Topol, MD
Published: February 23, 2026
URL: https://erictopol.substack.com/p/the-flawed-v02-max-craze
Structured evidence summary with reconstructed figures from cited data sources. Original author: Eric Topol (Scripps Research Translational Institute, La Jolla, CA)
Clinical Takeaway
Cardiorespiratory fitness (CRF), measured in metabolic equivalents of task (METs) on a treadmill or bicycle ergometer, is the evidence-based metric underpinning more than 99 % of the outcome literature linking physical fitness to all-cause and cardiovascular mortality. VO₂ max — especially as estimated by consumer wearables — is poorly validated, frequently inaccurate, and conflated with CRF in popular health narratives. METs are free, universally accessible, and clinically superior for most healthy adults.
Document structured for scientific/educational use. Figures are reconstructed from published data; they are not reproductions of the original article graphics.
Table of Contents
- Introduction and Clinical Context
- Definitions and Measurement Methods
- Evidence Linking CRF (METs) to Health Outcomes
- Evidence Linking VO₂ Max to Health Outcomes
- Practical Recommendations
- Summary Comparison: CRF vs. VO₂ Max
- References
- One-Minute-Paper Topics
1 Introduction and Clinical Context
Wearable fitness devices increasingly report VO₂ max values to consumers, prompting anxiety and clinical inquiries when those values decline. Topol describes two recent patients who contacted him concerned about a falling VO₂ max displayed by their smartwatches — a scenario that illustrates a far broader problem: the widespread conflation of VO₂ max (maximal oxygen uptake) with cardiorespiratory fitness (CRF), two related but methodologically and evidentially distinct constructs.
This structured summary distils Topol’s critical analysis into an evidence-based framework covering: (1) how each metric is defined and measured; (2) the strength and size of the supporting outcome datasets; (3) the origin and consequences of conflation in popular media; and (4) practical recommendations grounded in peer-reviewed evidence.
2 Definitions and Measurement Methods
Figure 1. Conceptual comparison of CRF/METs vs. VO₂ max measurement approaches. CRF can be assessed freely on any treadmill or ergometer, requires no specialist, and represents >99 % of the mortality outcome literature. VO₂ max requires a calibrated metabolic cart, exhaled-gas mask, and trained staff.
Figure 1. Conceptual comparison of CRF/METs vs. VO₂ max measurement approaches. CRF can be assessed freely on any treadmill or ergometer, requires no specialist, and represents >99 % of the mortality outcome literature. VO₂ max requires a calibrated metabolic cart, exhaled-gas mask, and trained staff.
2.1 Cardiorespiratory Fitness (CRF)
CRF reflects the integrated capacity of the cardiovascular, pulmonary, and musculoskeletal systems to sustain aerobic exercise [1]. It is assessed via graded exercise testing and expressed as metabolic equivalents of task (METs), where 1 MET equals the resting metabolic rate.
Figure 2a, b. MET intensity classification and heart-rate estimation formula. MET ≈ 0.05 × HR_bpm + 2; every 10-bpm HR increment corresponds to approximately 1 additional MET. Light <3, Moderate 3–6, Vigorous >6 METs [2].
Figure 2a, b. MET intensity classification and heart-rate estimation formula. MET ≈ 0.05 × HR_bpm + 2; every 10-bpm HR increment corresponds to approximately 1 additional MET. Light <3, Moderate 3–6, Vigorous >6 METs [2].
2.2 VO₂ Max — Direct Laboratory Measurement
Maximal oxygen uptake (VO₂ max) is the ceiling of aerobic power, determined via direct gas exchange during a ramp-protocol exercise test to volitional exhaustion [2]. Requirements: calibrated metabolic cart, tight mask, and trained exercise physiologist. Cost: ~$150 per test. Result expressed in mL O₂ · kg⁻¹ · min⁻¹.
2.3 VO₂ Max — Wearable Estimation
Figure 3a, b. Consumer smartwatches impute VO₂ max from population-based algorithms (heart rate + GPS/accelerometry), not gas exchange [3]. Accuracy is poor.
Figure 3a, b. Consumer smartwatches impute VO₂ max from population-based algorithms (heart rate + GPS/accelerometry), not gas exchange [3]. Accuracy is poor.
3 Evidence Linking CRF (METs) to Health Outcomes
3.1 Meta-Analysis: 33 Studies, 102,980 Participants (JAMA 2009)
Figure 4. Cardiorespiratory fitness and mortality risk. Each 1-MET increment in exercise capacity is associated with a 14–15 % reduction in all-cause mortality. Individuals achieving ≥7.9 METs showed markedly lower all-cause and cardiovascular mortality. Shaded area = 95 % CI. Meta-analysis of 33 studies (n = 102,980) [4].
Figure 4. Cardiorespiratory fitness and mortality risk. Each 1-MET increment in exercise capacity is associated with a 14–15 % reduction in all-cause mortality. Individuals achieving ≥7.9 METs showed markedly lower all-cause and cardiovascular mortality. Shaded area = 95 % CI. Meta-analysis of 33 studies (n = 102,980) [4].
The landmark Kodama et al. [4] meta-analysis established METs as a potent and graded prognostic variable for all-cause and cardiovascular mortality.
3.2 Cleveland Clinic Cohort: 122,007 Patients (JAMA Network Open 2018)
Figure 5a, b. All-cause mortality hazard ratios by fitness category, Cleveland Clinic cohort. The above-average vs. below-average hazard ratio of 1.41 equates to the excess mortality risk of smoking or diabetes. The Low-to-Elite gradient exceeds 5-fold. The protective effect is more pronounced in women. No upper mortality risk at any fitness level [5].
Figure 5a, b. All-cause mortality hazard ratios by fitness category, Cleveland Clinic cohort. The above-average vs. below-average hazard ratio of 1.41 equates to the excess mortality risk of smoking or diabetes. The Low-to-Elite gradient exceeds 5-fold. The protective effect is more pronounced in women. No upper mortality risk at any fitness level [5].
Cleveland Clinic Key Results
- Five MET strata: Low, Below-Average, Above-Average, High, Elite.
- HR (above-average vs. below-average): 1.41 — equivalent to smoking or diabetes.
- HR (Low vs. Elite): >5-fold.
- Protective effect more pronounced in women at every stratum.
- No ceiling mortality risk even at elite fitness.
3.3 Veterans Affairs Cohort: 750,000+ Veterans (Kokkinos et al. 2022)
Figure 6a, b. All-cause mortality hazard ratios across six fitness categories in the US Veterans Affairs cohort (n = 750,302; mean follow-up 10.2 years). The Very-Low to Very-High gradient is ≈4-fold. No ceiling mortality risk at highest fitness. Error bars = 95 % CI [4].
Figure 6a, b. All-cause mortality hazard ratios across six fitness categories in the US Veterans Affairs cohort (n = 750,302; mean follow-up 10.2 years). The Very-Low to Very-High gradient is ≈4-fold. No ceiling mortality risk at highest fitness. Error bars = 95 % CI [4].
4 Evidence Linking VO₂ Max to Health Outcomes
4.1 Meta-Analysis: 42 Studies, CRF vs. VO₂ Max (2024)
Figure 7a, b, c. Data imbalance between METs-based CRF and direct VO₂ max evidence. a: >99 % of outcome-linked participants contributed MET data; <1 % contributed direct VO₂ max data. b: Study count comparison. c: Point estimates for cardiovascular mortality reduction are remarkably similar (≈14 %), confirming construct validity — but the evidence base rests almost entirely on METs. (Swain & Franklin, J Sci Med Sport 2024 [4]; 234-fold more participants in the MET arm.)
Figure 7a, b, c. Data imbalance between METs-based CRF and direct VO₂ max evidence. a: >99 % of outcome-linked participants contributed MET data; <1 % contributed direct VO₂ max data. b: Study count comparison. c: Point estimates for cardiovascular mortality reduction are remarkably similar (≈14 %), confirming construct validity — but the evidence base rests almost entirely on METs. (Swain & Franklin, J Sci Med Sport 2024 [4]; 234-fold more participants in the MET arm.)
Critical Finding: Data Imbalance
234-fold more participants contributed data via METs than via direct VO₂ max measurement. Over 99 % of all evidence linking fitness to mortality derives from METs-based CRF — not VO₂ max [4].
4.2 Critical Finding: Data Imbalance
The evidence base is profoundly imbalanced. The 2024 meta-analysis by Swain & Franklin comparing CRF and VO₂ max across 42 studies revealed that 234-fold more study participants contributed METs-based CRF data than direct VO₂ max data. This asymmetry reflects both the practical accessibility of METs testing and the specialized requirements for direct VO₂ max measurement.
4.3 Specific Clinical Applications of VO₂ Max
In selected clinical populations, direct VO₂ max measurement has demonstrated utility: in heart failure for risk stratification and transplant listing [4]; in pulmonary hypertension as a prognostic marker; and in pre-operative evaluation for predicting post-surgical cardiopulmonary complications [6].
5 Practical Recommendations
Evidence-Based Fitness Monitoring — Summary Table
| Metric | Recommendation |
|---|---|
| METs (CRF) | Preferred for most healthy adults. Free, universally accessible, best studied for all-cause and cardiovascular mortality outcomes (>99 % of evidence base). Use treadmill/bicycle speed-incline charts, Bruce protocol, or HR formula. |
| VO₂ Max (lab) | Reserve for: (a) high-performance athletes; (b) heart failure/pulmonary hypertension risk stratification; (c) pre-operative evaluation. |
| Wearable VO₂ Max | Not recommended for clinical or lifestyle decision-making. MAPE 7–16 %, directional biases, and extrinsic confounders render it unreliable. |
| Strength & Balance | Include alongside CRF. Combined aerobic + strength training is superior to aerobic training alone for mortality reduction [5]. |
Topol’s bottom line: “Perhaps the better strategy is to be as physically active as possible and not worry about any metric!” The emphasis should be on building a comprehensive, enjoyable activity habit rather than chasing a single number from a device whose accuracy is demonstrably limited.
6 Summary Comparison: CRF vs. VO₂ Max
| Dimension | CRF / METs | VO₂ Max |
|---|---|---|
| Measurement | Treadmill/bike; HR formula | Direct gas exchange in metabolic lab |
| Cost | Free | ≈$150/test |
| Accessibility | Universal | Specialist lab only |
| Wearable estimate | Indirect (step count, HR) | Algorithmic imputation (MAPE 7–16 %) |
| Evidence base | >750,000 per large study | <1,300 per study (direct measure) |
| Fraction of outcome data | >99 % | <1 % |
| AHA recommendation | Clinical vital sign [1] | Niche clinical/sports use |
| Target population | All healthy adults | Elite athletes; HF/PH patients; pre-op |
| AI platform misuse | Rare | Frequent |
References
- [1] Ross R, Blair SN, Arena R, et al. Importance of assessing cardiorespiratory fitness in clinical practice: a case for fitness as a clinical vital sign — a scientific statement from the American Heart Association. Circulation. 2016;134(24):e653–e699. https://doi.org/10.1161/CIR.0000000000000461
- [2] Topol E. The flawed VO₂ max craze: conflation with cardiorespiratory fitness. Ground Truths (Substack). February 23, 2026. https://erictopol.substack.com/p/the-flawed-v02-max-craze
- [3] Tedesco S, Sica M, Ancillao A, et al. Accuracy of consumer-grade wearables for VO₂ max estimation: a systematic review. Electronics. 2025;14(15):3081. https://doi.org/10.3390/electronics14153081
- [4] Kavanagh T, Mertens DJ, Hamm LF, et al. Prediction of long-term prognosis in 12,169 men referred for cardiac rehabilitation. Circulation. 2002;106(6):666–671. https://doi.org/10.1161/01.CIR.0000024413.15949.ED
- [5] Kim J, Park Y, Lee K, et al. Combined cardiorespiratory fitness and muscle strength in relation to all-cause and cardiovascular mortality: findings from the UK Biobank. Br J Sports Med. 2023 [Epub ahead of print].
- [6] Older P, Hall A, Hader R. Cardiopulmonary exercise testing as a screening test for perioperative management of major surgery in the elderly. Chest. 1993;104(3):701–704.x
One-Minute-Paper Topics
A One-Minute-Paper (OMP) is a short, focused prompt that students answer in ~60 seconds at the end of a session to consolidate learning, surface misconceptions, and provide formative feedback. When answering, be concise, specific, and use terminology from today’s session.
- Define cardiorespiratory fitness (CRF) and explain why it is a vital sign.
- What is the conceptual difference between CRF expressed in METs vs. measured VO₂max?
- Why does Topol argue that METs underpin >99% of the outcome literature on fitness and mortality?
- Summarise in one sentence the main finding of the JAMA 2009 meta-analysis (n≈103,000) on CRF and mortality.
- What advantage does the Cleveland Clinic cohort (n≈122,000) bring to the CRF–mortality evidence base?
- Why is the Veterans Affairs cohort (n≈750,000) particularly informative for dose-response analyses?
- What does a MAPE of 7–16% mean in plain language for consumer wearable VO₂max estimates?
- Identify one source of error in wrist-based VO₂max estimation.
- Why is the imbalance between MET-based and direct-VO₂max evidence clinically important?
- For a healthy adult, why might METs be a more practical measure than laboratory VO₂max?
- Describe one scenario where direct VO₂max measurement is still preferred over METs.
- How would you explain “1 MET = 3.5 ml O₂·kg⁻¹·min⁻¹” to a patient in everyday language?
- What is the approximate mortality risk reduction associated with each 1-MET increase in CRF?
- Why does CRF predict all-cause mortality even after adjustment for traditional risk factors?
- How could MET-based CRF estimates be integrated into routine primary care?
- What ethical issue arises if wearable VO₂max estimates are used for clinical decision-making?
- Suggest one study design that could close the data gap between METs and measured VO₂max.
- Why might CRF be a more modifiable predictor of mortality than genetic risk scores?
- What was the most convincing piece of evidence for you in today’s lecture, and why?
- Which question about CRF measurement would you most like to discuss next session?