Depression Detection and Recognition Research Based on Audio Analysis and Artificial Intelligence Algorithms

In response to the prevalent symptoms of depression, the complexity of diagnostic processes, and individuals' resistance to psychological testing in contemporary society, this study proposes an automated depression detection method based on audio analysis. This method aims to enhance the accura...

Full description

Saved in:

Bibliographic Details
Published in	2024 IEEE 1st International Workshop on Future Intelligent Technologies for Young Researchers (FITYR) pp. 67 - 72
Main Authors	Wang, Zhuozheng, Wang, Yunlong, Zhao, Xixi, Chen, Bingxu, Chen, Haonan, Wang, Gang, Feng, Lei
Format	Conference Proceeding
Language	English
Published	IEEE 15.07.2024
Subjects	Accuracy cnn Convolutional neural networks Deep learning Depression depression recognition Feature extraction Radio frequency random forest Random forests Resistance Support vector machines svm Testing
Online Access	Get full text
DOI	10.1109/FITYR63263.2024.00017

Cover

Loading…

Abstract	In response to the prevalent symptoms of depression, the complexity of diagnostic processes, and individuals' resistance to psychological testing in contemporary society, this study proposes an automated depression detection method based on audio analysis. This method aims to enhance the accuracy of depression state recognition by comparing the efficiency of different models, including machine learning and deep learning. Initially, the study collected audio sample data from 50 individuals, including both healthy subjects and patients with depression. During the audio data processing phase, features related to depression recognition, such as Mel-frequency cepstral coefficients (MFCC) and fundamental frequency, were extracted from these samples. Subsequently, a series of recognition comparison experiments were conducted based on the extracted features, involving various algorithmic models such as Support Vector Machine (SVM), Random Forest (RF), and Convolutional Neural Network (CNN). Experimental results demonstrated that the Convolutional Neural Network exhibited higher accuracy in recognizing depression states from audio data, with an average recognition accuracy rate of 95.82%. This finding indicates that deep learning models, especially Convolutional Neural Networks, have significant advantages in addressing such issues, providing robust technical support for the future automatic detection of depression.
AbstractList	In response to the prevalent symptoms of depression, the complexity of diagnostic processes, and individuals' resistance to psychological testing in contemporary society, this study proposes an automated depression detection method based on audio analysis. This method aims to enhance the accuracy of depression state recognition by comparing the efficiency of different models, including machine learning and deep learning. Initially, the study collected audio sample data from 50 individuals, including both healthy subjects and patients with depression. During the audio data processing phase, features related to depression recognition, such as Mel-frequency cepstral coefficients (MFCC) and fundamental frequency, were extracted from these samples. Subsequently, a series of recognition comparison experiments were conducted based on the extracted features, involving various algorithmic models such as Support Vector Machine (SVM), Random Forest (RF), and Convolutional Neural Network (CNN). Experimental results demonstrated that the Convolutional Neural Network exhibited higher accuracy in recognizing depression states from audio data, with an average recognition accuracy rate of 95.82%. This finding indicates that deep learning models, especially Convolutional Neural Networks, have significant advantages in addressing such issues, providing robust technical support for the future automatic detection of depression.
Author	Wang, Zhuozheng Chen, Bingxu Wang, Yunlong Zhao, Xixi Wang, Gang Chen, Haonan Feng, Lei
Author_xml	– sequence: 1 givenname: Zhuozheng surname: Wang fullname: Wang, Zhuozheng email: Wangzhuozheng@bjut.edu.cn organization: Beijing University of Technology,Faculty of Information Technology,Beijing,China – sequence: 2 givenname: Yunlong surname: Wang fullname: Wang, Yunlong email: wyl1231@emails.bjut.edu.cn organization: Beijing University of Technology,Faculty of Information Technology,Beijing,China – sequence: 3 givenname: Xixi surname: Zhao fullname: Zhao, Xixi email: zhaoxixi@ccmu.edu.cn organization: Beijing Anding Hospital, Capital Medical University,Beijing Key Laboratory of Mental Disorders,National Clinical Research Center for Mental Disorders & National Center for Mental Disorders,Beijing,China – sequence: 4 givenname: Bingxu surname: Chen fullname: Chen, Bingxu email: chenbingxu3@emails.bjut.edu.cn organization: Beijing University of Technology,Faculty of Information Technology,Beijing,China – sequence: 5 givenname: Haonan surname: Chen fullname: Chen, Haonan email: cheng_hnn@emails.bjut.edu.cn organization: Beijing University of Technology,Faculty of Information Technology,Beijing,China – sequence: 6 givenname: Gang surname: Wang fullname: Wang, Gang email: gangwangdoc@ccmu.edu.cn organization: Beijing Anding Hospital, Capital Medical University,Beijing Key Laboratory of Mental Disorders,National Clinical Research Center for Mental Disorders & National Center for Mental Disorders,Beijing,China – sequence: 7 givenname: Lei surname: Feng fullname: Feng, Lei email: flxlm@ccmu.edu.cn organization: Beijing Anding Hospital,Beijing Key Laboratory of Mental Disorders,National Clinical Research Center for Mental Disorders & National Center for Mental Disorders,Capital Beijing,China
BookMark	eNotj9FKwzAYhSPohc69gUJeYDXJnzbNZd2cFgZC6Y1XI0n_doEsHU292Ns7p1fnnA_OgfNAbuMYkZBnzjLOmX7Z1u1XU4AoIBNMyIwxxtUNWWqlSwCeg1a5uidhg6cJU_JjpBuc0c2_zsSONujGIfprbjChmdyBvpqEHb2Q6rvzI62iCefk07VQTbPvvfMm0DrOGIIfMDqkVRjGyc-HY3okd70JCZf_uiDt9q1df6x2n-_1utqtvObzqi9AW2GsVH2OgpVdaZ0oc8TcCM6lcbLUjgO3OrfWSm2LyyUE2WsAJ7mBBXn6m_WIuD9N_mim854zVUCpJPwA-llX7w
CODEN	IEEPAD
ContentType	Conference Proceeding
DBID	6IE 6IL CBEJK RIE RIL
DOI	10.1109/FITYR63263.2024.00017
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
EISBN	9798331539757
EndPage	72
ExternalDocumentID	10763874
Genre	orig-research
GroupedDBID	6IE 6IL CBEJK RIE RIL
ID	FETCH-LOGICAL-i91t-f639b2ab47f5e208d8bc285ee5a2114ac489c131b95bbb49b6798e34f933c41a3
IEDL.DBID	RIE
IngestDate	Wed Aug 06 17:54:14 EDT 2025
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i91t-f639b2ab47f5e208d8bc285ee5a2114ac489c131b95bbb49b6798e34f933c41a3
PageCount	6
ParticipantIDs	ieee_primary_10763874
PublicationCentury	2000
PublicationDate	2024-July-15
PublicationDateYYYYMMDD	2024-07-15
PublicationDate_xml	– month: 07 year: 2024 text: 2024-July-15 day: 15
PublicationDecade	2020
PublicationTitle	2024 IEEE 1st International Workshop on Future Intelligent Technologies for Young Researchers (FITYR)
PublicationTitleAbbrev	FITYR
PublicationYear	2024
Publisher	IEEE
Publisher_xml	– name: IEEE
Score	1.8791134
Snippet	In response to the prevalent symptoms of depression, the complexity of diagnostic processes, and individuals' resistance to psychological testing in...
SourceID	ieee
SourceType	Publisher
StartPage	67
SubjectTerms	Accuracy cnn Convolutional neural networks Deep learning Depression depression recognition Feature extraction Radio frequency random forest Random forests Resistance Support vector machines svm Testing
Title	Depression Detection and Recognition Research Based on Audio Analysis and Artificial Intelligence Algorithms
URI	https://ieeexplore.ieee.org/document/10763874
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEA7akycVK77JwevWzSbZTY7VWlrBIqVCPZU8ZrVYd6XuXvz1JmnXFwjekkBIyCTMTPJ9XxA69zCdlGY80gAsYlTHkYoNRMIqy7iVLsb3ROHbUTq4ZzdTPl2T1QMXBgAC-Aw6vhje8m1pan9V5k64Ow0iY5to02VuK7LWmpVDYnnRH04exqmLR6jL-xKvih2Tn7-mBKfR30ajZrgVVuS5U1e6Y95_KTH-ez47qP3Fz8N3n55nF21AsYcWvQbUWuAeVAFiVWBVWDxuQEKu3iDt8KVzXxa7lm5t5yVuxElCh-4yIIjc1sTDb5KduLt4LJfz6unlrY0m_evJ1SBa_6UQzSWpotwFIjpRmmU5hyQWVmiTCA7AlcsAmTJMSEMo0ZJrrZnU_nEGKMslpYYRRfdRqygLOEBY5i7osllimaZMGKoJSS1LYi6sVbmVh6jtV2r2ulLLmDWLdPRH-zHa8tby96WEn6BWtazh1Dn6Sp8FA38AESyrfw
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NS8MwGA46D3pSceK3OXjtbJqkTY7TOTbdhowKehr5qg5nK7O9-OtNstUvELy1gdDyvgnPm-R5ngBw5mg6MU5oII0hAcEyDESoTMC00IRqbmt8JxQejuLeHbm-p_dLsbrXwhhjPPnMtNyjP8vXharcVpmd4XY2sISsgjUL_BQt5FpLXQ4K-Xm3nz6MY1uRYLvyi5wvdoh-3pviYaO7CUb1BxdskedWVcqWev_lxfjvP9oCzS-FHrz9xJ5tsGLyHTDr1LTWHHZM6UlWORS5huOaJmTfa64dvLAApqFtaVd6WsDansR3aM89h8gOTtj_ZtoJ27PHYj4tn17emiDtXqWXvWB5m0Iw5agMMluKyEhIkmTURCHTTKqIUWOosGtAIhRhXCGMJKdSSsKlO54xmGQcY0WQwLugkRe52QOQZzb6Ook0kZgwhSVCsSZRSJnWItN8HzRdpCavC7-MSR2kgz_aT8F6Lx0OJoP-6OYQbLjMud1TRI9Ao5xX5tjCfilPfLI_AC-Grsg
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2024+IEEE+1st+International+Workshop+on+Future+Intelligent+Technologies+for+Young+Researchers+%28FITYR%29&rft.atitle=Depression+Detection+and+Recognition+Research+Based+on+Audio+Analysis+and+Artificial+Intelligence+Algorithms&rft.au=Wang%2C+Zhuozheng&rft.au=Wang%2C+Yunlong&rft.au=Zhao%2C+Xixi&rft.au=Chen%2C+Bingxu&rft.date=2024-07-15&rft.pub=IEEE&rft.spage=67&rft.epage=72&rft_id=info:doi/10.1109%2FFITYR63263.2024.00017&rft.externalDocID=10763874