Automatic classification of dog barking using deep learning

Bibliographic Details
Published in	Behavioural processes Vol. 218; p. 105028
Main Authors	Gómez-Armenta, José Ramón, Pérez-Espinosa, Humberto, Fernández-Zepeda, José Alberto, Reyes-Meza, Verónica
Format	Journal Article
Language	English
Published	Netherlands: Elsevier B.V., 01.05.2024
Subjects	Acoustic characterization; Audio analysis; Deep learning; Dog barks
Summary:	Barking and other dog vocalizations have acoustic properties related to emotions, physiological reactions, attitudes, or other internal states. In the field of intelligent audio analysis, researchers use methods based on signal processing and machine learning to analyze the properties of digitized acoustic signals and obtain relevant information. The present work describes a method to classify the identity, breed, age, sex, and context associated with each bark. This information can support the decisions of people who regularly interact with animals, such as dog trainers, veterinarians, rescuers, police, and people with visual impairment. Our approach uses deep neural networks to generate trained models for each classification task. We worked with 19,643 barks recorded from 113 dogs of different breeds, ages, and sexes. Our methodology consists of three stages. First, the pre-processing stage prepares the data and transforms it into the appropriate format for each classification model. Second, the characterization stage evaluates different representation models to identify the most suitable one for each task. Third, the classification stage trains each classification model and selects the best hyperparameters. After tuning and training each model, we evaluated its performance. We analyzed the most relevant features extracted from the audio and the most appropriate deep neural network architecture for that feature type. Although our method is not yet ready for use in ethological practice, our evaluation showed that it surpasses previous research results on this topic and provides a basis for further technological development.
• Analyzed diverse dog breed data, optimizing computational efficiency.
• Largest variety & number of bark samples in barking classification.
• Employed efficient characterization techniques, reducing computational cost.
• Achieved classifier success with tailored deep neural network architecture.
• Optimized classification with neural network hyperparameter tuning.
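
The three stages named in the summary (pre-processing, characterization, classification) can be illustrated with a toy sketch. This is not the authors' implementation, and several things here are assumptions: the "barks" are synthetic noisy sine tones, the features are just zero-crossing rate and RMS energy, and a nearest-centroid classifier stands in for the deep neural networks used in the paper. All function names and the `SAMPLE_RATE` value are hypothetical.

```python
import math
import random

SAMPLE_RATE = 8000  # Hz; assumed value for this toy example


def preprocess(signal):
    """Stage 1: remove the DC offset and peak-normalize to [-1, 1]."""
    mean = sum(signal) / len(signal)
    centered = [s - mean for s in signal]
    peak = max(abs(s) for s in centered) or 1.0
    return [s / peak for s in centered]


def characterize(signal):
    """Stage 2: map a waveform to a small feature vector.

    Zero-crossing rate and RMS energy are placeholders (assumption) for
    the richer representations a real system would compare.
    """
    crossings = sum(1 for a, b in zip(signal, signal[1:]) if (a < 0) != (b < 0))
    zcr = crossings / (len(signal) - 1)
    rms = math.sqrt(sum(s * s for s in signal) / len(signal))
    return [zcr, rms]


def train(features, labels):
    """Stage 3 (stand-in): nearest-centroid model, NOT a deep network."""
    sums, counts = {}, {}
    for vec, label in zip(features, labels):
        acc = sums.setdefault(label, [0.0] * len(vec))
        for i, v in enumerate(vec):
            acc[i] += v
        counts[label] = counts.get(label, 0) + 1
    return {label: [v / counts[label] for v in acc] for label, acc in sums.items()}


def predict(centroids, vec):
    """Return the label whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: math.dist(centroids[label], vec))


def synthetic_bark(freq_hz, rng, n=800):
    """Hypothetical stand-in for a recorded bark: a noisy sine tone."""
    return [math.sin(2 * math.pi * freq_hz * t / SAMPLE_RATE) + rng.gauss(0, 0.05)
            for t in range(n)]


rng = random.Random(0)
# Two made-up classes that differ only in pitch.
dataset = [(synthetic_bark(f, rng), label)
           for label, f in (("low", 300), ("high", 1200))
           for _ in range(20)]
features = [characterize(preprocess(sig)) for sig, _ in dataset]
labels = [label for _, label in dataset]

centroids = train(features[::2], labels[::2])       # even indices: training
held_out = list(zip(features[1::2], labels[1::2]))  # odd indices: evaluation
accuracy = sum(predict(centroids, v) == y for v, y in held_out) / len(held_out)
```

After running the block, `accuracy` holds the fraction of held-out synthetic examples that were labeled correctly. Keeping the stages as separate functions mirrors the summary's description: the feature extractor or the classifier can be swapped out independently when comparing representations and models.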
ISSN:	0376-6357 1872-8308
DOI:	10.1016/j.beproc.2024.105028