Investigation of speech-based language-independent possibilities of depression recognition

The presented research examined whether it is possible to use the same method to create different language-specific models with similar performance, and whether it is possible to implement language-independent recognition of depression. A depression database one in German and one in Hungarian were u...

Full description

Saved in:
Bibliographic Details
Published in2022 45th International Conference on Telecommunications and Signal Processing (TSP) pp. 226 - 229
Main Author Kiss, Gabor
Format Conference Proceeding
LanguageEnglish
Published IEEE 13.07.2022
Subjects
Online AccessGet full text
DOI10.1109/TSP55681.2022.9851347

Cover

Loading…
Abstract The presented research examined whether it is possible to use the same method to create different language-specific models with similar performance, and whether it is possible to implement language-independent recognition of depression. A depression database one in German and one in Hungarian were used to perform the experiments. The x-vector architecture published by Snyder et al. was used for feature extraction, and Support Vector Regression was used to predict the severity of depression. Classification (depressed / healthy) based on regression results was also implemented. Monolingual and multilingual experiments were performed. Based on the results, it can be stated that it is possible to create different language models with similar performance using the same method. Furthermore, it can be stated that it is possible to create a model valid for multiple languages. Research is currently at an early stage. In the future, it is necessary to expand the number of speech databases used in the experiments.
AbstractList The presented research examined whether it is possible to use the same method to create different language-specific models with similar performance, and whether it is possible to implement language-independent recognition of depression. A depression database one in German and one in Hungarian were used to perform the experiments. The x-vector architecture published by Snyder et al. was used for feature extraction, and Support Vector Regression was used to predict the severity of depression. Classification (depressed / healthy) based on regression results was also implemented. Monolingual and multilingual experiments were performed. Based on the results, it can be stated that it is possible to create different language models with similar performance using the same method. Furthermore, it can be stated that it is possible to create a model valid for multiple languages. Research is currently at an early stage. In the future, it is necessary to expand the number of speech databases used in the experiments.
Author Kiss, Gabor
Author_xml – sequence: 1
  givenname: Gabor
  surname: Kiss
  fullname: Kiss, Gabor
  email: kiss.gabor@vik.bme.hu
  organization: Budapest University of Technology and Economics,Faculty of Electrical Engineering and Informatics,Department of Telecommunications and Media Informatics,Budapest,Hungary
BookMark eNotj91Kw0AQRlfQC9v6BCLkBRJnd3aTzaUUfwoFhVYQb8ommcSBuAnZKPj2bmlvZmDONwe-hbj0gych7iRkUkJ5v9-9GZNbmSlQKiutkaiLC7GQeW50Xmr7cS0-N_6Xwsydm3nwydAmYSSqv9LKBWqS3vnux3WUsm9opDj8nIxDCFxxzzNTOL5ENFG8RcFE9dB5PspW4qp1faCb816K96fH_fol3b4-b9YP25QV4JwaAKws6JoIbYNOlYWu2wotVHVkyhaoQUtqUYI1CCViG7POEercQoFLcXvyMhEdxom_3fR3ONfFfw9RUSo
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/TSP55681.2022.9851347
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 166546948X
9781665469487
EndPage 229
ExternalDocumentID 9851347
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i203t-5003b804cee38d3a2974cfb380bc50028734041ef3108530933fceeaae3468073
IEDL.DBID RIE
IngestDate Thu Jun 29 18:38:16 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i203t-5003b804cee38d3a2974cfb380bc50028734041ef3108530933fceeaae3468073
PageCount 4
ParticipantIDs ieee_primary_9851347
PublicationCentury 2000
PublicationDate 2022-July-13
PublicationDateYYYYMMDD 2022-07-13
PublicationDate_xml – month: 07
  year: 2022
  text: 2022-July-13
  day: 13
PublicationDecade 2020
PublicationTitle 2022 45th International Conference on Telecommunications and Signal Processing (TSP)
PublicationTitleAbbrev TSP
PublicationYear 2022
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.8244481
Snippet The presented research examined whether it is possible to use the same method to create different language-specific models with similar performance, and...
SourceID ieee
SourceType Publisher
StartPage 226
SubjectTerms cross-lingual
Depression
Feature extraction
Signal processing
speech
Support vector machines
SVMs
Telecommunications
x-vector
Title Investigation of speech-based language-independent possibilities of depression recognition
URI https://ieeexplore.ieee.org/document/9851347
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09a8MwED2STJ3akpR-o6Fj5TiSbMtzaQmFlEATCF2CJZ1pKNihcZb--p4cx_2gQzdjJCQkpHfvdO8O4CY1ZBaLXPNMJiFXmaB7MMlTrpVAazAVTnhx8uQpHs_V4yJadOC21cIgYh18hoH_rN_yXWm33lU2TMk8kCrpQpeI206r1YhyRmE6nD1P63RaRPqECJq2P4qm1JjxcAiT_Wi7UJG3YFuZwH78SsT43-kcweBLncemLe4cQweLPrx8S5hRFqzM2WaNaF-5hynH9m5JvmrL3lZsXW6a4Fiiy75LGxZbsDawqCwGMH-4n92NeVM3ga9EKCse0Uk1OlQ0D6mdzARxBpsbqUNjI0-yEqlCNcJceumBfwqVObXNMpQq1nTmT6BXlAWeAiNwd3FmyEazRCVisg0wEi7SRKIcChufQd-vy3K9S42xbJbk_O_fF3Dg98a7RkfyEnrV-xavCNMrc11v5ic5WqQA
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwED6VMsAEqEW88cCI09R2XjOiKtBWlWiliqWKnYuokJKKpgu_nnOahocY2KLIli1b9nff-b47gJtIk1ks0pDHMnC5igXdg0Ea8VAJNBojkQgrTh6O_P5UPc68WQNuay0MIpbBZ-jYz_ItP8nN2rrKOhGZB1IFO7BLuK-ijVqrkuV03agzeR6XCbWI9gnhVK1_lE0pUaN3AMPteJtgkTdnXWjHfPxKxfjfCR1C-0ufx8Y18hxBA7MWvHxLmZFnLE_ZaoloXrkFqoRtHZN8URe-LdgyX1XhsUSYbZc6MDZjdWhRnrVh2ruf3PV5VTmBL4QrC-7RWdWhq2geMkxkLIg1mFTL0NXGszQrkMpVXUylFR_Yx1CZUts4Rqn8kE79MTSzPMMTYATviR9rstIMkQmfrAP0ROKFRKMSFMY_hZZdl_lykxxjXi3J2d-_r2GvPxkO5oOH0dM57Nt9so7SrryAZvG-xktC-EJflRv7CQMVp1A
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2022+45th+International+Conference+on+Telecommunications+and+Signal+Processing+%28TSP%29&rft.atitle=Investigation+of+speech-based+language-independent+possibilities+of+depression+recognition&rft.au=Kiss%2C+Gabor&rft.date=2022-07-13&rft.pub=IEEE&rft.spage=226&rft.epage=229&rft_id=info:doi/10.1109%2FTSP55681.2022.9851347&rft.externalDocID=9851347