Speech intelligibility prediction method using machine learning for outdoor public address systems
Subjective speech intelligibility assessment is important for the development of outdoor public address system. However, as this assessment is difficult in many cases, we propose an objective speech intelligibility evaluation system that includes a machine learning technique. In this talk, we have p...
Saved in:
Published in | The Journal of the Acoustical Society of America Vol. 140; no. 4; p. 3192 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
01.10.2016
|
Online Access | Get full text |
ISSN | 0001-4966 1520-8524 |
DOI | 10.1121/1.4970044 |
Cover
Loading…
Abstract | Subjective speech intelligibility assessment is important for the development of outdoor public address system. However, as this assessment is difficult in many cases, we propose an objective speech intelligibility evaluation system that includes a machine learning technique. In this talk, we have proved a subjective evaluation and objective prediction of speech intelligibility using the outdoor public address systems at 10 locations in Sendai City, where impulse responses were recorded after the Great East Japan Earthquake. First, the results of the subjective intelligibility evaluation by different test word lists with the same sound field conditions showed that the root mean squared error (RMSE) was very small, not exceeding 7.0%. Next, we generated the intelligibility prediction model trained with true/false results of 22 subjects using the support vector machine (SVM). This prediction model extracted the feature vector, using the ITU-T P.563 speech quality feature set of the test speech signal. We evaluated the predictive performance of the prediction model using data that was not used in training, and the RMSE obtained was 4.0%. This result was shown to be highly accurate with a value even less than the subject experiment result. |
---|---|
AbstractList | Subjective speech intelligibility assessment is important for the development of outdoor public address system. However, as this assessment is difficult in many cases, we propose an objective speech intelligibility evaluation system that includes a machine learning technique. In this talk, we have proved a subjective evaluation and objective prediction of speech intelligibility using the outdoor public address systems at 10 locations in Sendai City, where impulse responses were recorded after the Great East Japan Earthquake. First, the results of the subjective intelligibility evaluation by different test word lists with the same sound field conditions showed that the root mean squared error (RMSE) was very small, not exceeding 7.0%. Next, we generated the intelligibility prediction model trained with true/false results of 22 subjects using the support vector machine (SVM). This prediction model extracted the feature vector, using the ITU-T P.563 speech quality feature set of the test speech signal. We evaluated the predictive performance of the prediction model using data that was not used in training, and the RMSE obtained was 4.0%. This result was shown to be highly accurate with a value even less than the subject experiment result. |
Author | Kondo, Kazuhiro Ohta, Kengo Kobayashi, Yosuke Sakamoto, Shuichi |
Author_xml | – sequence: 1 givenname: Yosuke surname: Kobayashi fullname: Kobayashi, Yosuke organization: Graduate School of Eng., Muroran Inst. of Technol., 27-1 Mizumoto-cho, Muroran 050-8585, Japan, ykobayashi@csse.muroran-it.ac.jp – sequence: 2 givenname: Kengo surname: Ohta fullname: Ohta, Kengo organization: Anan College, National Inst. of Technol., Anan, Japan – sequence: 3 givenname: Kazuhiro surname: Kondo fullname: Kondo, Kazuhiro organization: Graduate School of Sci. and Eng., Yamagata Univ., Yonezawa, Japan – sequence: 4 givenname: Shuichi surname: Sakamoto fullname: Sakamoto, Shuichi organization: Res. Inst. of Elec. Commun., Tohoku Univ., Sendai, Japan |
BookMark | eNp9kD1rwzAYhEVJoUnaof9AawtOJVmyrbGEfkGgQ7MbWXoVq9iSkZQh_74OydzpuOPh4G6FFj54QOiRkg2ljL7QDZc1IZzfoCUVjBSNYHyBloQQWnBZVXdoldLvbEVTyiXqfiYA3WPnMwyDO7jODS6f8BTBOJ1d8HiE3AeDj8n5Ax6V7p0HPICK_hzYEHE4ZhNmnY7d4DRWxkRICadTyjCme3Rr1ZDg4aprtH9_228_i933x9f2dVfoSvLCVrVizGgtysYoTRolO7ANZVbIitVag1G1YAJqQ6iWlCheisZKAVVnGSnLNXq61OoYUopg2ym6UcVTS0l7_qal7fWbmX2-sEm7rM4r_4H_ALNuZ-o |
CODEN | JASMAN |
ContentType | Journal Article |
Copyright | Acoustical Society of America |
Copyright_xml | – notice: Acoustical Society of America |
DBID | AAYXX CITATION |
DOI | 10.1121/1.4970044 |
DatabaseName | CrossRef |
DatabaseTitle | CrossRef |
DatabaseTitleList | CrossRef |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Physics |
EISSN | 1520-8524 |
EndPage | 3192 |
ExternalDocumentID | 10_1121_1_4970044 |
GroupedDBID | --- --Z -~X .DC .GJ 123 186 29L 3O- 4.4 41~ 5-Q 53G 5RE 5VS 6TJ 85S AAAAW AAEUA AAPUP AAYIH ABDNZ ABEFF ABEFU ABJNI ABNAN ABPPZ ABTAH ABZEH ACBNA ACBRY ACCUC ACGFO ACGFS ACNCT ACXMS ACYGS ADCTM AEGXH AENEX AETEA AFFNX AFHCQ AGKCL AGLKD AGMXG AGTJO AGVCI AHPGS AHSDT AI. AIAGR AIDUJ AIZTS ALMA_UNASSIGNED_HOLDINGS AQWKA BAUXJ CS3 D0L DU5 EBS EJD ESX F5P G8K H~9 M71 M73 MVM NEJ NHB OHT OK1 P2P RAZ RIP RNS ROL RQS S10 SC5 SJN TN5 TWZ UCJ UHB UPT UQL VH1 VOH VQA WH7 XFK XJT XOL XSW YQT ZCG ZXP ZY4 ~02 ~G0 AAGWI AAYXX ABJGX ADMLS AEILP CITATION |
ID | FETCH-LOGICAL-c694-f67a22dcc538dac08a9bef812f59627cceda7525e7d01c910a4358f95e6bf2033 |
ISSN | 0001-4966 |
IngestDate | Tue Jul 01 01:15:44 EDT 2025 Fri Jun 21 00:14:42 EDT 2024 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 4 |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-c694-f67a22dcc538dac08a9bef812f59627cceda7525e7d01c910a4358f95e6bf2033 |
PageCount | 1 |
ParticipantIDs | crossref_primary_10_1121_1_4970044 scitation_primary_10_1121_1_4970044 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 20161000 2016-10-01 |
PublicationDateYYYYMMDD | 2016-10-01 |
PublicationDate_xml | – month: 10 year: 2016 text: 20161000 |
PublicationDecade | 2010 |
PublicationTitle | The Journal of the Acoustical Society of America |
PublicationYear | 2016 |
SSID | ssj0005839 |
Score | 2.1817696 |
Snippet | Subjective speech intelligibility assessment is important for the development of outdoor public address system. However, as this assessment is difficult in... |
SourceID | crossref scitation |
SourceType | Index Database Publisher |
StartPage | 3192 |
Title | Speech intelligibility prediction method using machine learning for outdoor public address systems |
URI | http://dx.doi.org/10.1121/1.4970044 |
Volume | 140 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1NT9tAEF2lQYheKkpB0Ba0Kr1Vhniz_tgjKiCESjkklbhZ6901CYgYkfgAf6p_seOd9dpAK0EvVuKvyJ6XmTf2mxlCvuYi5TIswkCCIwh4omP4z-UiSGSRh0kqVZLUhcJnP-OTX_z0Irro9X53VEvVIt9TD3-tK_kfq8I6sGtdJfsKy_qTwgr4DPaFJVgYli-y8ejWGPvIAttqotD1vq7711McAY4Dor9V9onAjRVOmmZSBCooy2qhy9L3uwZHZItH5p1O5lctojr81dakqNJOA6tbijj5Z81s8S2Q9-XgMu7rmU3W3Zfz6tqD6XyC5BXc_WXZ7j_TJSo9HqrJ9M5vGMlrCciy20aTagpX031oEcZe_tY6YkCGiF0XbOd7IZNNIyyp9s4Zmzk5FPLMzjptRUHodcGNsE4Eb74-jw6sjg7hHhd1U3_ehsDmtf-TyOj1ijZTYmEWZu7QN2SJQV7C-mTp4PDsx6hVFaVDl3Hh9blmVnDwvv_dRxRoBZgOii46vGa8St45g9IDRNd70jOzNbJshcFq_oHkiDH6BGO0xRhFjFGLMeowRhuMUcAYdRijiDHqMEYdxtbJ-Pho_P0kcHM5AhULHhRxIhnTSkGs1FINUilyUwBRLOwkJ6WMlknEIpPoQaiAjkqg5GkhIhPnBRsMhxukPytnZpPQSBdgdAm0Uww5ZPqQrGiIQcBZGRdcyi3ypblT2S12X8me2WKL7Pp7-O-9Pr7kVJ_I2xatn0l_cVeZbSCdi3zHGfoP6SaGfA |
linkProvider | EBSCOhost |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Speech+intelligibility+prediction+method+using+machine+learning+for+outdoor+public+address+systems&rft.jtitle=The+Journal+of+the+Acoustical+Society+of+America&rft.au=Kobayashi%2C+Yosuke&rft.au=Ohta%2C+Kengo&rft.au=Kondo%2C+Kazuhiro&rft.au=Sakamoto%2C+Shuichi&rft.date=2016-10-01&rft.issn=0001-4966&rft.eissn=1520-8524&rft.volume=140&rft.issue=4_Supplement&rft.spage=3192&rft.epage=3192&rft_id=info:doi/10.1121%2F1.4970044&rft.externalDBID=n%2Fa&rft.externalDocID=10_1121_1_4970044 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0001-4966&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0001-4966&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0001-4966&client=summon |