Classification of Respiration Sounds Using Deep Pre-trained Audio Embeddings

In this work we present the use of an end-to-end deep learning based pre-trained audio embeddings generator, and apply it to the purpose of classification of respiration sounds. With this approach, there is no need to pre-compute spectral representations, e.g. MFCC or filterbanks, since the classifi...

Full description

Saved in:
Bibliographic Details
Published in2021 IEEE Latin American Conference on Computational Intelligence (LA-CCI) pp. 1 - 5
Main Authors Galindo Meza, Carlos A., del Hoyo Ontiveros, Juan A., Lopez-Meyer, Paulo
Format Conference Proceeding
LanguageEnglish
Published IEEE 02.11.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In this work we present the use of an end-to-end deep learning based pre-trained audio embeddings generator, and apply it to the purpose of classification of respiration sounds. With this approach, there is no need to pre-compute spectral representations, e.g. MFCC or filterbanks, since the classification model uses raw audio as the input. Transfer learning was used to train an audio classifier for sounds of respiratory cycles as defined in the ICBHI 2017 challenge. The results on this dataset show that this end-to-end model represents a viable alternative to more common spectral-based classifiers, while achieving state-of-the-art performance.
DOI:10.1109/LA-CCI48322.2021.9769831