Study record · meta analysis · 2025
Apple watch accuracy in monitoring health metrics: a systematic review and meta-analysis
Choe JP, Kang M
Physiological Measurement · 2025
Why this study matters to CircaTest
Most comprehensive published meta-analysis of Apple Watch accuracy: 56 studies, 270 effect sizes. Editorially load-bearing because it gives a definitive answer to two common reader questions: (1) Is Apple Watch heart rate accurate? Yes (mean bias -0.12 bpm, none of the subgroups exceed the 10% MAPE threshold). (2) Is Apple Watch energy expenditure accurate? No (every subgroup exceeds the 10% MAPE threshold). Important limitation for CircaTest's editorial focus: this meta-analysis covers HR, energy expenditure, and step counts, NOT sleep stage classification. For Apple Watch sleep accuracy, see Schyvens 2025 and Walch 2019.
Abstract
Objective. Wearable technology like the Apple Watch is increasingly important for monitoring health metrics. Accurate measurement is crucial, as inaccuracies can impact health outcomes.…
Read the full abstract on the source →
Source: PUBMED · Excerpt for fair-use commentary; full abstract via the source link
Population
Age
adult, varies across included studies
Reference standard
ecg
Systematic review and meta-analysis pooling 56 individual Apple Watch validation studies, totaling 270 effect sizes (71 for energy expenditure, 148 for heart rate, 51 for step counts). Subgroups span device series, age, health status, activity intensity, and activity type.
Devices and metrics
Apple Watch (multiple series; meta-analysis covers the device family)
All studies for this device →| Metric | Value | 95% CI | Note |
|---|---|---|---|
| Bias (minutes) | -0.12 | — | Mean bias for heart rate in beats per minute (NOT minutes; the metric kind is recorded as bias_minutes for schema consistency but the unit is bpm). Limits of agreement -11.06 to +10.81 bpm. None of the HR subgroups exceeded the 10% MAPE validity threshold. |
| Bias (minutes) | 0.3 | — | Mean bias for energy expenditure in kcal/min (NOT minutes). Limits of agreement -2.09 to +2.69. ALL EE subgroups exceeded the 10% MAPE validity threshold, meaning the Apple Watch is NOT considered validated for energy expenditure measurement. |
| Bias (minutes) | -1.83 | — | Mean bias for step count in steps/min (NOT minutes). Limits of agreement -9.08 to +5.41. Some subgroups exceeded the 10% MAPE threshold, indicating variability. |
Apple Watch (Choe & Kang meta-analysis covers many series; per-series breakdown is in the full paper)
All studies for this device →| Metric | Value | 95% CI | Note |
|---|---|---|---|
| Bias (minutes) | see source | — | The meta-analysis pools studies across many Apple Watch generations. CircaTest does not extrapolate the pooled estimate to Series 11 specifically; consult the full paper for the per-series breakdown. |
Cite this study
Choe JP, Kang M (2025). Apple watch accuracy in monitoring health metrics: a systematic review and meta-analysis. Physiological Measurement. https://doi.org/10.1088/1361-6579/adca82
Source links
Added to the CircaTest meta-analysis on 2026-04-06. How CircaTest evaluates studies →