Study record · validation · 2026
Assessing the performance of a portable electroencephalographic sleep monitor against level 1 polysomnography
Lanthier M, Foti MC, Fonseca K, Higginson C, and Oksit D
Sleep Advances, zpaf089 · 2026
Why this study matters to CircaTest
The most recent and most rigorous published validation of the Muse-S headband against gold-standard level 1 PSG. Cohen's kappa of 0.76 is substantially HIGHER than the best wrist-worn device in the Schyvens 2025 multi-device comparison (Apple Watch Series 8 at kappa 0.53). This is the strongest published evidence to date that EEG-based consumer headbands meaningfully outperform PPG/accelerometry-based wrist devices for sleep stage classification — which makes editorial sense, given that the Muse-S actually measures brain activity rather than inferring it. The study tested the standard Muse-S; whether the figures generalize to the newer Muse-S Athena variant requires verification against the full paper.
Abstract
STUDY OBJECTIVES: To assess the performance of a portable electroencephalography device for sleep monitoring against polysomnography. METHOD: Fifty-six adults underwent one night of in-laboratory sleep recording with the Muse-S headband and simultaneous level 1 polysomnography. Muse-S data were scored by an automated sleep staging algorithm. A registered technologist, blind to the Muse-S automated sleep scoring, scored the polysomnography data. RESULTS: Good quality data were available for 47 (84 per cent) participants (53 per cent females; 20-71 years old; 17 per cent with sleep-related breathing disorder). Epoch-by-epoch analyses showed substantial agreement between the Muse-S and polysomnography (full night Cohen's Kappa = 0.76). Cohen's Kappa were in the fair agreement range for non-rapid eye movement (NREM) 1, substantial agreement range for NREM2 and NREM3, and near-perfect agreement range for rapid eye movement sleep and wake. Accuracy ranged from 88 per cent to 96 per cent across all sleep stages, with a sensitivity of 79-92 per cent and a specificity of 90-99 per cent. Similar results were observed in the subgroup with sleep-related breathing disorder.
Source: PUBMED · Licensed under CC-BY 4.0
Population
Sample size
n = 47
Age
20-71 years
Reference standard
psg
56 adults enrolled, good-quality data from 47 (84 percent). 53 percent female. Age 20-71. 17 percent had a sleep-related breathing disorder. Single in-laboratory night with simultaneous Muse-S and level 1 PSG.
Devices and metrics
Muse-S headband
All studies for this device →| Metric | Value | 95% CI | Note |
|---|---|---|---|
| Cohen's kappa | κ = 0.76 | — | Full-night Cohen's kappa for epoch-by-epoch agreement vs PSG. Substantial agreement range. The highest published kappa for any consumer sleep wearable in the CircaTest corpus. |
| Accuracy | 96% | — | Upper bound of accuracy across sleep stages. |
| Accuracy | 88% | — | Lower bound of accuracy across sleep stages. |
| Sensitivity | 92% | — | Upper bound of sensitivity across sleep stages. |
| Sensitivity | 79% | — | Lower bound of sensitivity across sleep stages. |
| Specificity | 99% | — | Upper bound of specificity across sleep stages. |
| Specificity | 90% | — | Lower bound of specificity across sleep stages. |
Cite this study
Lanthier M, Foti MC, Fonseca K, Higginson C, and Oksit D (2026). Assessing the performance of a portable electroencephalographic sleep monitor against level 1 polysomnography. Sleep Advances. https://doi.org/10.1093/sleepadvances/zpaf089
Source links
Added to the CircaTest meta-analysis on 2026-04-06. How CircaTest evaluates studies →