Depression Detection and Recognition Research Based on Audio Analysis and Artificial Intelligence Algorithms
In response to the prevalent symptoms of depression, the complexity of diagnostic processes, and individuals' resistance to psychological testing in contemporary society, this study proposes an automated depression detection method based on audio analysis. This method aims to enhance the accura...
Saved in:
Published in | 2024 IEEE 1st International Workshop on Future Intelligent Technologies for Young Researchers (FITYR) pp. 67 - 72 |
---|---|
Main Authors | , , , , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
15.07.2024
|
Subjects | |
Online Access | Get full text |
DOI | 10.1109/FITYR63263.2024.00017 |
Cover
Loading…
Abstract | In response to the prevalent symptoms of depression, the complexity of diagnostic processes, and individuals' resistance to psychological testing in contemporary society, this study proposes an automated depression detection method based on audio analysis. This method aims to enhance the accuracy of depression state recognition by comparing the efficiency of different models, including machine learning and deep learning. Initially, the study collected audio sample data from 50 individuals, including both healthy subjects and patients with depression. During the audio data processing phase, features related to depression recognition, such as Mel-frequency cepstral coefficients (MFCC) and fundamental frequency, were extracted from these samples. Subsequently, a series of recognition comparison experiments were conducted based on the extracted features, involving various algorithmic models such as Support Vector Machine (SVM), Random Forest (RF), and Convolutional Neural Network (CNN). Experimental results demonstrated that the Convolutional Neural Network exhibited higher accuracy in recognizing depression states from audio data, with an average recognition accuracy rate of 95.82%. This finding indicates that deep learning models, especially Convolutional Neural Networks, have significant advantages in addressing such issues, providing robust technical support for the future automatic detection of depression. |
---|---|
AbstractList | In response to the prevalent symptoms of depression, the complexity of diagnostic processes, and individuals' resistance to psychological testing in contemporary society, this study proposes an automated depression detection method based on audio analysis. This method aims to enhance the accuracy of depression state recognition by comparing the efficiency of different models, including machine learning and deep learning. Initially, the study collected audio sample data from 50 individuals, including both healthy subjects and patients with depression. During the audio data processing phase, features related to depression recognition, such as Mel-frequency cepstral coefficients (MFCC) and fundamental frequency, were extracted from these samples. Subsequently, a series of recognition comparison experiments were conducted based on the extracted features, involving various algorithmic models such as Support Vector Machine (SVM), Random Forest (RF), and Convolutional Neural Network (CNN). Experimental results demonstrated that the Convolutional Neural Network exhibited higher accuracy in recognizing depression states from audio data, with an average recognition accuracy rate of 95.82%. This finding indicates that deep learning models, especially Convolutional Neural Networks, have significant advantages in addressing such issues, providing robust technical support for the future automatic detection of depression. |
Author | Wang, Zhuozheng Chen, Bingxu Wang, Yunlong Zhao, Xixi Wang, Gang Chen, Haonan Feng, Lei |
Author_xml | – sequence: 1 givenname: Zhuozheng surname: Wang fullname: Wang, Zhuozheng email: Wangzhuozheng@bjut.edu.cn organization: Beijing University of Technology,Faculty of Information Technology,Beijing,China – sequence: 2 givenname: Yunlong surname: Wang fullname: Wang, Yunlong email: wyl1231@emails.bjut.edu.cn organization: Beijing University of Technology,Faculty of Information Technology,Beijing,China – sequence: 3 givenname: Xixi surname: Zhao fullname: Zhao, Xixi email: zhaoxixi@ccmu.edu.cn organization: Beijing Anding Hospital, Capital Medical University,Beijing Key Laboratory of Mental Disorders,National Clinical Research Center for Mental Disorders & National Center for Mental Disorders,Beijing,China – sequence: 4 givenname: Bingxu surname: Chen fullname: Chen, Bingxu email: chenbingxu3@emails.bjut.edu.cn organization: Beijing University of Technology,Faculty of Information Technology,Beijing,China – sequence: 5 givenname: Haonan surname: Chen fullname: Chen, Haonan email: cheng_hnn@emails.bjut.edu.cn organization: Beijing University of Technology,Faculty of Information Technology,Beijing,China – sequence: 6 givenname: Gang surname: Wang fullname: Wang, Gang email: gangwangdoc@ccmu.edu.cn organization: Beijing Anding Hospital, Capital Medical University,Beijing Key Laboratory of Mental Disorders,National Clinical Research Center for Mental Disorders & National Center for Mental Disorders,Beijing,China – sequence: 7 givenname: Lei surname: Feng fullname: Feng, Lei email: flxlm@ccmu.edu.cn organization: Beijing Anding Hospital,Beijing Key Laboratory of Mental Disorders,National Clinical Research Center for Mental Disorders & National Center for Mental Disorders,Capital Beijing,China |
BookMark | eNotj9FKwzAYhSPohc69gUJeYDXJnzbNZd2cFgZC6Y1XI0n_doEsHU292Ns7p1fnnA_OgfNAbuMYkZBnzjLOmX7Z1u1XU4AoIBNMyIwxxtUNWWqlSwCeg1a5uidhg6cJU_JjpBuc0c2_zsSONujGIfprbjChmdyBvpqEHb2Q6rvzI62iCefk07VQTbPvvfMm0DrOGIIfMDqkVRjGyc-HY3okd70JCZf_uiDt9q1df6x2n-_1utqtvObzqi9AW2GsVH2OgpVdaZ0oc8TcCM6lcbLUjgO3OrfWSm2LyyUE2WsAJ7mBBXn6m_WIuD9N_mim854zVUCpJPwA-llX7w |
CODEN | IEEPAD |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/FITYR63263.2024.00017 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 9798331539757 |
EndPage | 72 |
ExternalDocumentID | 10763874 |
Genre | orig-research |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-LOGICAL-i91t-f639b2ab47f5e208d8bc285ee5a2114ac489c131b95bbb49b6798e34f933c41a3 |
IEDL.DBID | RIE |
IngestDate | Wed Aug 06 17:54:14 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i91t-f639b2ab47f5e208d8bc285ee5a2114ac489c131b95bbb49b6798e34f933c41a3 |
PageCount | 6 |
ParticipantIDs | ieee_primary_10763874 |
PublicationCentury | 2000 |
PublicationDate | 2024-July-15 |
PublicationDateYYYYMMDD | 2024-07-15 |
PublicationDate_xml | – month: 07 year: 2024 text: 2024-July-15 day: 15 |
PublicationDecade | 2020 |
PublicationTitle | 2024 IEEE 1st International Workshop on Future Intelligent Technologies for Young Researchers (FITYR) |
PublicationTitleAbbrev | FITYR |
PublicationYear | 2024 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 1.8791134 |
Snippet | In response to the prevalent symptoms of depression, the complexity of diagnostic processes, and individuals' resistance to psychological testing in... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 67 |
SubjectTerms | Accuracy cnn Convolutional neural networks Deep learning Depression depression recognition Feature extraction Radio frequency random forest Random forests Resistance Support vector machines svm Testing |
Title | Depression Detection and Recognition Research Based on Audio Analysis and Artificial Intelligence Algorithms |
URI | https://ieeexplore.ieee.org/document/10763874 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEA7akycVK77JwevWzSbZTY7VWlrBIqVCPZU8ZrVYd6XuXvz1JmnXFwjekkBIyCTMTPJ9XxA69zCdlGY80gAsYlTHkYoNRMIqy7iVLsb3ROHbUTq4ZzdTPl2T1QMXBgAC-Aw6vhje8m1pan9V5k64Ow0iY5to02VuK7LWmpVDYnnRH04exqmLR6jL-xKvih2Tn7-mBKfR30ajZrgVVuS5U1e6Y95_KTH-ez47qP3Fz8N3n55nF21AsYcWvQbUWuAeVAFiVWBVWDxuQEKu3iDt8KVzXxa7lm5t5yVuxElCh-4yIIjc1sTDb5KduLt4LJfz6unlrY0m_evJ1SBa_6UQzSWpotwFIjpRmmU5hyQWVmiTCA7AlcsAmTJMSEMo0ZJrrZnU_nEGKMslpYYRRfdRqygLOEBY5i7osllimaZMGKoJSS1LYi6sVbmVh6jtV2r2ulLLmDWLdPRH-zHa8tby96WEn6BWtazh1Dn6Sp8FA38AESyrfw |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NS8MwGA46D3pSceK3OXjtbJqkTY7TOTbdhowKehr5qg5nK7O9-OtNstUvELy1gdDyvgnPm-R5ngBw5mg6MU5oII0hAcEyDESoTMC00IRqbmt8JxQejuLeHbm-p_dLsbrXwhhjPPnMtNyjP8vXharcVpmd4XY2sISsgjUL_BQt5FpLXQ4K-Xm3nz6MY1uRYLvyi5wvdoh-3pviYaO7CUb1BxdskedWVcqWev_lxfjvP9oCzS-FHrz9xJ5tsGLyHTDr1LTWHHZM6UlWORS5huOaJmTfa64dvLAApqFtaVd6WsDansR3aM89h8gOTtj_ZtoJ27PHYj4tn17emiDtXqWXvWB5m0Iw5agMMluKyEhIkmTURCHTTKqIUWOosGtAIhRhXCGMJKdSSsKlO54xmGQcY0WQwLugkRe52QOQZzb6Ook0kZgwhSVCsSZRSJnWItN8HzRdpCavC7-MSR2kgz_aT8F6Lx0OJoP-6OYQbLjMud1TRI9Ao5xX5tjCfilPfLI_AC-Grsg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2024+IEEE+1st+International+Workshop+on+Future+Intelligent+Technologies+for+Young+Researchers+%28FITYR%29&rft.atitle=Depression+Detection+and+Recognition+Research+Based+on+Audio+Analysis+and+Artificial+Intelligence+Algorithms&rft.au=Wang%2C+Zhuozheng&rft.au=Wang%2C+Yunlong&rft.au=Zhao%2C+Xixi&rft.au=Chen%2C+Bingxu&rft.date=2024-07-15&rft.pub=IEEE&rft.spage=67&rft.epage=72&rft_id=info:doi/10.1109%2FFITYR63263.2024.00017&rft.externalDocID=10763874 |