Audio-based snore detection using deep neural networks

•We proposed an end-to-end deep neural network model (CNN+LSTM) combined with constant Q transformation to detect snore on audio data.•We used audio data recorded in a hospital (sleep lab) to train and validate our model.•We investigated the influence of microphone placement on snore detection perfo...

Full description

Saved in:
Bibliographic Details
Published inComputer methods and programs in biomedicine Vol. 200; p. 105917
Main Authors Xie, Jiali, Aubert, Xavier, Long, Xi, van Dijk, Johannes, Arsenali, Bruno, Fonseca, Pedro, Overeem, Sebastiaan
Format Journal Article
LanguageEnglish
Published Ireland Elsevier B.V 01.03.2021
Subjects
Online AccessGet full text
ISSN0169-2607
1872-7565
1872-7565
DOI10.1016/j.cmpb.2020.105917

Cover

Loading…
More Information
Summary:•We proposed an end-to-end deep neural network model (CNN+LSTM) combined with constant Q transformation to detect snore on audio data.•We used audio data recorded in a hospital (sleep lab) to train and validate our model.•We investigated the influence of microphone placement on snore detection performance. Background and Objective: Snoring is a prevalent phenomenon. It may be benign, but can also be a symptom of obstructive sleep apnea (OSA) a prevalent sleep disorder. Accurate detection of snoring may help with screening and diagnosis of OSA. Methods: We introduce a snore detection algorithm based on the combination of a convolutional neural network (CNN) and a recurrent neural network (RNN). We obtained audio recordings of 38 subjects referred to a clinical center for a sleep study. All subjects were recorded by a total of 5 microphones placed at strategic positions around the bed. The CNN was used to extract features from the sound spectrogram, while the RNN was used to process the sequential CNN output and to classify the audio events to snore and non-snore events. We also addressed the impact of microphone placement on the performance of the algorithm. Results: The algorithm achieved an accuracy of 95.3 ± 0.5%, a sensitivity of 92.2 ± 0.9%, and a specificity of 97.7 ± 0.4% over all microphones in snore detection on our data set including 18412 sound events. The best accuracy (95.9%) was observed from the microphone placed about 70 cm above the subject's head and the worst (94.4%) was observed from the microphone placed about 130 cm above the subject's head. Conclusion: Our results suggest that our method detects snore events from audio recordings with high accuracy and that microphone placement does not have a major impact on detection performance.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0169-2607
1872-7565
1872-7565
DOI:10.1016/j.cmpb.2020.105917