Standardized image-based polysomnography database and deep learning algorithm for sleep-stage classification

Abstract Study Objectives Polysomnography (PSG) scoring is labor-intensive, subjective, and often ambiguous. Recently several deep learning (DL) models for automated sleep scoring have been developed, they are tied to a fixed amount of input channels and resolution. In this study, we constructed a s...

Full description

Saved in:
Bibliographic Details
Published inSleep (New York, N.Y.) Vol. 46; no. 12; p. 1
Main Authors Jeong, Jaemin, Yoon, Wonhyuck, Lee, Jeong-Gun, Kim, Dongyoung, Woo, Yunhee, Kim, Dong-Kyu, Shin, Hyun-Woo
Format Journal Article
LanguageEnglish
Published US Oxford University Press 11.12.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Abstract Study Objectives Polysomnography (PSG) scoring is labor-intensive, subjective, and often ambiguous. Recently several deep learning (DL) models for automated sleep scoring have been developed, they are tied to a fixed amount of input channels and resolution. In this study, we constructed a standardized image-based PSG dataset in order to overcome the heterogeneity of raw signal data obtained from various PSG devices and various sleep laboratory environments. Methods All individually exported European data format files containing raw signals were converted into images with an annotation file, which contained the demographics, diagnoses, and sleep statistics. An image-based DL model for automatic sleep staging was developed, compared with a signal-based model, and validated in an external dataset Results We constructed 10253 image-based PSG datasets using a standardized format. Among these, 7745 diagnostic PSG data were used to develop our DL model. The DL model using the image dataset showed similar performance to the signal-based dataset for the same subject. The overall DL accuracy was greater than 80%, even with severe obstructive sleep apnea. Moreover, for the first time, we showed explainable DL in the field of sleep medicine as visualized key inference regions using Eigen-class activation maps. Furthermore, when a DL model for sleep scoring performs external validation, we achieved a relatively good performance. Conclusions Our main contribution demonstrates the availability of a standardized image-based dataset, and highlights that changing the data sampling rate or number of sensors may not require retraining, although performance decreases slightly as the number of sensors decreases. Graphical Abstract
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0161-8105
1550-9109
1550-9109
DOI:10.1093/sleep/zsad242