Depression Detection and Recognition Research Based on Audio Analysis and Artificial Intelligence Algorithms

In response to the prevalent symptoms of depression, the complexity of diagnostic processes, and individuals' resistance to psychological testing in contemporary society, this study proposes an automated depression detection method based on audio analysis. This method aims to enhance the accura...

Full description

Saved in:
Bibliographic Details
Published in2024 IEEE 1st International Workshop on Future Intelligent Technologies for Young Researchers (FITYR) pp. 67 - 72
Main Authors Wang, Zhuozheng, Wang, Yunlong, Zhao, Xixi, Chen, Bingxu, Chen, Haonan, Wang, Gang, Feng, Lei
Format Conference Proceeding
LanguageEnglish
Published IEEE 15.07.2024
Subjects
Online AccessGet full text
DOI10.1109/FITYR63263.2024.00017

Cover

Loading…
Abstract In response to the prevalent symptoms of depression, the complexity of diagnostic processes, and individuals' resistance to psychological testing in contemporary society, this study proposes an automated depression detection method based on audio analysis. This method aims to enhance the accuracy of depression state recognition by comparing the efficiency of different models, including machine learning and deep learning. Initially, the study collected audio sample data from 50 individuals, including both healthy subjects and patients with depression. During the audio data processing phase, features related to depression recognition, such as Mel-frequency cepstral coefficients (MFCC) and fundamental frequency, were extracted from these samples. Subsequently, a series of recognition comparison experiments were conducted based on the extracted features, involving various algorithmic models such as Support Vector Machine (SVM), Random Forest (RF), and Convolutional Neural Network (CNN). Experimental results demonstrated that the Convolutional Neural Network exhibited higher accuracy in recognizing depression states from audio data, with an average recognition accuracy rate of 95.82%. This finding indicates that deep learning models, especially Convolutional Neural Networks, have significant advantages in addressing such issues, providing robust technical support for the future automatic detection of depression.
AbstractList In response to the prevalent symptoms of depression, the complexity of diagnostic processes, and individuals' resistance to psychological testing in contemporary society, this study proposes an automated depression detection method based on audio analysis. This method aims to enhance the accuracy of depression state recognition by comparing the efficiency of different models, including machine learning and deep learning. Initially, the study collected audio sample data from 50 individuals, including both healthy subjects and patients with depression. During the audio data processing phase, features related to depression recognition, such as Mel-frequency cepstral coefficients (MFCC) and fundamental frequency, were extracted from these samples. Subsequently, a series of recognition comparison experiments were conducted based on the extracted features, involving various algorithmic models such as Support Vector Machine (SVM), Random Forest (RF), and Convolutional Neural Network (CNN). Experimental results demonstrated that the Convolutional Neural Network exhibited higher accuracy in recognizing depression states from audio data, with an average recognition accuracy rate of 95.82%. This finding indicates that deep learning models, especially Convolutional Neural Networks, have significant advantages in addressing such issues, providing robust technical support for the future automatic detection of depression.
Author Wang, Zhuozheng
Chen, Bingxu
Wang, Yunlong
Zhao, Xixi
Wang, Gang
Chen, Haonan
Feng, Lei
Author_xml – sequence: 1
  givenname: Zhuozheng
  surname: Wang
  fullname: Wang, Zhuozheng
  email: Wangzhuozheng@bjut.edu.cn
  organization: Beijing University of Technology,Faculty of Information Technology,Beijing,China
– sequence: 2
  givenname: Yunlong
  surname: Wang
  fullname: Wang, Yunlong
  email: wyl1231@emails.bjut.edu.cn
  organization: Beijing University of Technology,Faculty of Information Technology,Beijing,China
– sequence: 3
  givenname: Xixi
  surname: Zhao
  fullname: Zhao, Xixi
  email: zhaoxixi@ccmu.edu.cn
  organization: Beijing Anding Hospital, Capital Medical University,Beijing Key Laboratory of Mental Disorders,National Clinical Research Center for Mental Disorders & National Center for Mental Disorders,Beijing,China
– sequence: 4
  givenname: Bingxu
  surname: Chen
  fullname: Chen, Bingxu
  email: chenbingxu3@emails.bjut.edu.cn
  organization: Beijing University of Technology,Faculty of Information Technology,Beijing,China
– sequence: 5
  givenname: Haonan
  surname: Chen
  fullname: Chen, Haonan
  email: cheng_hnn@emails.bjut.edu.cn
  organization: Beijing University of Technology,Faculty of Information Technology,Beijing,China
– sequence: 6
  givenname: Gang
  surname: Wang
  fullname: Wang, Gang
  email: gangwangdoc@ccmu.edu.cn
  organization: Beijing Anding Hospital, Capital Medical University,Beijing Key Laboratory of Mental Disorders,National Clinical Research Center for Mental Disorders & National Center for Mental Disorders,Beijing,China
– sequence: 7
  givenname: Lei
  surname: Feng
  fullname: Feng, Lei
  email: flxlm@ccmu.edu.cn
  organization: Beijing Anding Hospital,Beijing Key Laboratory of Mental Disorders,National Clinical Research Center for Mental Disorders & National Center for Mental Disorders,Capital Beijing,China
BookMark eNotj9FKwzAYhSPohc69gUJeYDXJnzbNZd2cFgZC6Y1XI0n_doEsHU292Ns7p1fnnA_OgfNAbuMYkZBnzjLOmX7Z1u1XU4AoIBNMyIwxxtUNWWqlSwCeg1a5uidhg6cJU_JjpBuc0c2_zsSONujGIfprbjChmdyBvpqEHb2Q6rvzI62iCefk07VQTbPvvfMm0DrOGIIfMDqkVRjGyc-HY3okd70JCZf_uiDt9q1df6x2n-_1utqtvObzqi9AW2GsVH2OgpVdaZ0oc8TcCM6lcbLUjgO3OrfWSm2LyyUE2WsAJ7mBBXn6m_WIuD9N_mim854zVUCpJPwA-llX7w
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/FITYR63263.2024.00017
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798331539757
EndPage 72
ExternalDocumentID 10763874
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i91t-f639b2ab47f5e208d8bc285ee5a2114ac489c131b95bbb49b6798e34f933c41a3
IEDL.DBID RIE
IngestDate Wed Aug 06 17:54:14 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i91t-f639b2ab47f5e208d8bc285ee5a2114ac489c131b95bbb49b6798e34f933c41a3
PageCount 6
ParticipantIDs ieee_primary_10763874
PublicationCentury 2000
PublicationDate 2024-July-15
PublicationDateYYYYMMDD 2024-07-15
PublicationDate_xml – month: 07
  year: 2024
  text: 2024-July-15
  day: 15
PublicationDecade 2020
PublicationTitle 2024 IEEE 1st International Workshop on Future Intelligent Technologies for Young Researchers (FITYR)
PublicationTitleAbbrev FITYR
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.8791134
Snippet In response to the prevalent symptoms of depression, the complexity of diagnostic processes, and individuals' resistance to psychological testing in...
SourceID ieee
SourceType Publisher
StartPage 67
SubjectTerms Accuracy
cnn
Convolutional neural networks
Deep learning
Depression
depression recognition
Feature extraction
Radio frequency
random forest
Random forests
Resistance
Support vector machines
svm
Testing
Title Depression Detection and Recognition Research Based on Audio Analysis and Artificial Intelligence Algorithms
URI https://ieeexplore.ieee.org/document/10763874
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEA7akycVK77JwevWzSbZTY7VWlrBIqVCPZU8ZrVYd6XuXvz1JmnXFwjekkBIyCTMTPJ9XxA69zCdlGY80gAsYlTHkYoNRMIqy7iVLsb3ROHbUTq4ZzdTPl2T1QMXBgAC-Aw6vhje8m1pan9V5k64Ow0iY5to02VuK7LWmpVDYnnRH04exqmLR6jL-xKvih2Tn7-mBKfR30ajZrgVVuS5U1e6Y95_KTH-ez47qP3Fz8N3n55nF21AsYcWvQbUWuAeVAFiVWBVWDxuQEKu3iDt8KVzXxa7lm5t5yVuxElCh-4yIIjc1sTDb5KduLt4LJfz6unlrY0m_evJ1SBa_6UQzSWpotwFIjpRmmU5hyQWVmiTCA7AlcsAmTJMSEMo0ZJrrZnU_nEGKMslpYYRRfdRqygLOEBY5i7osllimaZMGKoJSS1LYi6sVbmVh6jtV2r2ulLLmDWLdPRH-zHa8tby96WEn6BWtazh1Dn6Sp8FA38AESyrfw
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NS8MwGA46D3pSceK3OXjtbJqkTY7TOTbdhowKehr5qg5nK7O9-OtNstUvELy1gdDyvgnPm-R5ngBw5mg6MU5oII0hAcEyDESoTMC00IRqbmt8JxQejuLeHbm-p_dLsbrXwhhjPPnMtNyjP8vXharcVpmd4XY2sISsgjUL_BQt5FpLXQ4K-Xm3nz6MY1uRYLvyi5wvdoh-3pviYaO7CUb1BxdskedWVcqWev_lxfjvP9oCzS-FHrz9xJ5tsGLyHTDr1LTWHHZM6UlWORS5huOaJmTfa64dvLAApqFtaVd6WsDansR3aM89h8gOTtj_ZtoJ27PHYj4tn17emiDtXqWXvWB5m0Iw5agMMluKyEhIkmTURCHTTKqIUWOosGtAIhRhXCGMJKdSSsKlO54xmGQcY0WQwLugkRe52QOQZzb6Ook0kZgwhSVCsSZRSJnWItN8HzRdpCavC7-MSR2kgz_aT8F6Lx0OJoP-6OYQbLjMud1TRI9Ao5xX5tjCfilPfLI_AC-Grsg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2024+IEEE+1st+International+Workshop+on+Future+Intelligent+Technologies+for+Young+Researchers+%28FITYR%29&rft.atitle=Depression+Detection+and+Recognition+Research+Based+on+Audio+Analysis+and+Artificial+Intelligence+Algorithms&rft.au=Wang%2C+Zhuozheng&rft.au=Wang%2C+Yunlong&rft.au=Zhao%2C+Xixi&rft.au=Chen%2C+Bingxu&rft.date=2024-07-15&rft.pub=IEEE&rft.spage=67&rft.epage=72&rft_id=info:doi/10.1109%2FFITYR63263.2024.00017&rft.externalDocID=10763874