Audio-based snore detection using deep neural networks

Bibliographic Details
Published in	Computer methods and programs in biomedicine Vol. 200; p. 105917
Main Authors	Xie, Jiali; Aubert, Xavier; Long, Xi; van Dijk, Johannes; Arsenali, Bruno; Fonseca, Pedro; Overeem, Sebastiaan
Format	Journal Article
Language	English
Published	Ireland: Elsevier B.V., 01.03.2021
Subjects	Audio signal processing; Body-position in sleep; Constant Q transformation; Convolutional neural network; Humans; Neural Networks, Computer; Polysomnography; Recurrent neural network; Sleep Apnea, Obstructive - diagnosis; Snore detection; Snoring - diagnosis; Sound
ISSN	0169-2607; 1872-7565
DOI	10.1016/j.cmpb.2020.105917

Summary:
•We proposed an end-to-end deep neural network model (CNN+LSTM) combined with constant Q transformation to detect snoring in audio data.
•We used audio data recorded in a hospital (sleep lab) to train and validate our model.
•We investigated the influence of microphone placement on snore detection performance.

Background and Objective: Snoring is a prevalent phenomenon. It may be benign, but it can also be a symptom of obstructive sleep apnea (OSA), a prevalent sleep disorder. Accurate detection of snoring may help with screening and diagnosis of OSA.

Methods: We introduce a snore detection algorithm based on the combination of a convolutional neural network (CNN) and a recurrent neural network (RNN). We obtained audio recordings of 38 subjects referred to a clinical center for a sleep study. All subjects were recorded by a total of 5 microphones placed at strategic positions around the bed. The CNN was used to extract features from the sound spectrogram, while the RNN was used to process the sequential CNN output and to classify the audio events as snore or non-snore events. We also addressed the impact of microphone placement on the performance of the algorithm.

Results: The algorithm achieved an accuracy of 95.3 ± 0.5%, a sensitivity of 92.2 ± 0.9%, and a specificity of 97.7 ± 0.4% over all microphones in snore detection on our data set, which included 18412 sound events. The best accuracy (95.9%) was observed for the microphone placed about 70 cm above the subject's head, and the worst (94.4%) for the microphone placed about 130 cm above the subject's head.

Conclusion: Our results suggest that our method detects snore events from audio recordings with high accuracy and that microphone placement does not have a major impact on detection performance.
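The accuracy, sensitivity, and specificity quoted in the Results are ratios over the counts of correctly and incorrectly classified snore and non-snore events. The following is a minimal Python sketch of those standard definitions, not the authors' evaluation code; the names `ConfusionCounts` and `detection_metrics` and the example counts are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class ConfusionCounts:
    """Event-level counts for a binary snore / non-snore classifier."""
    tp: int  # snore events classified as snore
    fn: int  # snore events classified as non-snore
    tn: int  # non-snore events classified as non-snore
    fp: int  # non-snore events classified as snore


def detection_metrics(c: ConfusionCounts) -> Dict[str, float]:
    """Return accuracy, sensitivity, and specificity as fractions in [0, 1]."""
    positives = c.tp + c.fn   # all true snore events
    negatives = c.tn + c.fp   # all true non-snore events
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one snore and one non-snore event")
    return {
        "accuracy": (c.tp + c.tn) / (positives + negatives),
        "sensitivity": c.tp / positives,   # share of snore events detected
        "specificity": c.tn / negatives,   # share of non-snore events rejected
    }


# Hypothetical counts, chosen only to illustrate the arithmetic.
example = ConfusionCounts(tp=90, fn=10, tn=190, fp=10)
metrics = detection_metrics(example)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Because accuracy pools both classes, it depends on the ratio of snore to non-snore events in the data set, whereas sensitivity and specificity are each computed within a single class.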