Speech intelligibility prediction method using machine learning for outdoor public address systems

Subjective speech intelligibility assessment is important for the development of outdoor public address system. However, as this assessment is difficult in many cases, we propose an objective speech intelligibility evaluation system that includes a machine learning technique. In this talk, we have p...

Full description

Saved in:
Bibliographic Details
Published inThe Journal of the Acoustical Society of America Vol. 140; no. 4; p. 3192
Main Authors Kobayashi, Yosuke, Ohta, Kengo, Kondo, Kazuhiro, Sakamoto, Shuichi
Format Journal Article
LanguageEnglish
Published 01.10.2016
Online AccessGet full text
ISSN0001-4966
1520-8524
DOI10.1121/1.4970044

Cover

Loading…
Abstract Subjective speech intelligibility assessment is important for the development of outdoor public address system. However, as this assessment is difficult in many cases, we propose an objective speech intelligibility evaluation system that includes a machine learning technique. In this talk, we have proved a subjective evaluation and objective prediction of speech intelligibility using the outdoor public address systems at 10 locations in Sendai City, where impulse responses were recorded after the Great East Japan Earthquake. First, the results of the subjective intelligibility evaluation by different test word lists with the same sound field conditions showed that the root mean squared error (RMSE) was very small, not exceeding 7.0%. Next, we generated the intelligibility prediction model trained with true/false results of 22 subjects using the support vector machine (SVM). This prediction model extracted the feature vector, using the ITU-T P.563 speech quality feature set of the test speech signal. We evaluated the predictive performance of the prediction model using data that was not used in training, and the RMSE obtained was 4.0%. This result was shown to be highly accurate with a value even less than the subject experiment result.
AbstractList Subjective speech intelligibility assessment is important for the development of outdoor public address system. However, as this assessment is difficult in many cases, we propose an objective speech intelligibility evaluation system that includes a machine learning technique. In this talk, we have proved a subjective evaluation and objective prediction of speech intelligibility using the outdoor public address systems at 10 locations in Sendai City, where impulse responses were recorded after the Great East Japan Earthquake. First, the results of the subjective intelligibility evaluation by different test word lists with the same sound field conditions showed that the root mean squared error (RMSE) was very small, not exceeding 7.0%. Next, we generated the intelligibility prediction model trained with true/false results of 22 subjects using the support vector machine (SVM). This prediction model extracted the feature vector, using the ITU-T P.563 speech quality feature set of the test speech signal. We evaluated the predictive performance of the prediction model using data that was not used in training, and the RMSE obtained was 4.0%. This result was shown to be highly accurate with a value even less than the subject experiment result.
Author Kondo, Kazuhiro
Ohta, Kengo
Kobayashi, Yosuke
Sakamoto, Shuichi
Author_xml – sequence: 1
  givenname: Yosuke
  surname: Kobayashi
  fullname: Kobayashi, Yosuke
  organization: Graduate School of Eng., Muroran Inst. of Technol., 27-1 Mizumoto-cho, Muroran 050-8585, Japan, ykobayashi@csse.muroran-it.ac.jp
– sequence: 2
  givenname: Kengo
  surname: Ohta
  fullname: Ohta, Kengo
  organization: Anan College, National Inst. of Technol., Anan, Japan
– sequence: 3
  givenname: Kazuhiro
  surname: Kondo
  fullname: Kondo, Kazuhiro
  organization: Graduate School of Sci. and Eng., Yamagata Univ., Yonezawa, Japan
– sequence: 4
  givenname: Shuichi
  surname: Sakamoto
  fullname: Sakamoto, Shuichi
  organization: Res. Inst. of Elec. Commun., Tohoku Univ., Sendai, Japan
BookMark eNp9kD1rwzAYhEVJoUnaof9AawtOJVmyrbGEfkGgQ7MbWXoVq9iSkZQh_74OydzpuOPh4G6FFj54QOiRkg2ljL7QDZc1IZzfoCUVjBSNYHyBloQQWnBZVXdoldLvbEVTyiXqfiYA3WPnMwyDO7jODS6f8BTBOJ1d8HiE3AeDj8n5Ax6V7p0HPICK_hzYEHE4ZhNmnY7d4DRWxkRICadTyjCme3Rr1ZDg4aprtH9_228_i933x9f2dVfoSvLCVrVizGgtysYoTRolO7ANZVbIitVag1G1YAJqQ6iWlCheisZKAVVnGSnLNXq61OoYUopg2ym6UcVTS0l7_qal7fWbmX2-sEm7rM4r_4H_ALNuZ-o
CODEN JASMAN
ContentType Journal Article
Copyright Acoustical Society of America
Copyright_xml – notice: Acoustical Society of America
DBID AAYXX
CITATION
DOI 10.1121/1.4970044
DatabaseName CrossRef
DatabaseTitle CrossRef
DatabaseTitleList
CrossRef
DeliveryMethod fulltext_linktorsrc
Discipline Physics
EISSN 1520-8524
EndPage 3192
ExternalDocumentID 10_1121_1_4970044
GroupedDBID ---
--Z
-~X
.DC
.GJ
123
186
29L
3O-
4.4
41~
5-Q
53G
5RE
5VS
6TJ
85S
AAAAW
AAEUA
AAPUP
AAYIH
ABDNZ
ABEFF
ABEFU
ABJNI
ABNAN
ABPPZ
ABTAH
ABZEH
ACBNA
ACBRY
ACCUC
ACGFO
ACGFS
ACNCT
ACXMS
ACYGS
ADCTM
AEGXH
AENEX
AETEA
AFFNX
AFHCQ
AGKCL
AGLKD
AGMXG
AGTJO
AGVCI
AHPGS
AHSDT
AI.
AIAGR
AIDUJ
AIZTS
ALMA_UNASSIGNED_HOLDINGS
AQWKA
BAUXJ
CS3
D0L
DU5
EBS
EJD
ESX
F5P
G8K
H~9
M71
M73
MVM
NEJ
NHB
OHT
OK1
P2P
RAZ
RIP
RNS
ROL
RQS
S10
SC5
SJN
TN5
TWZ
UCJ
UHB
UPT
UQL
VH1
VOH
VQA
WH7
XFK
XJT
XOL
XSW
YQT
ZCG
ZXP
ZY4
~02
~G0
AAGWI
AAYXX
ABJGX
ADMLS
AEILP
CITATION
ID FETCH-LOGICAL-c694-f67a22dcc538dac08a9bef812f59627cceda7525e7d01c910a4358f95e6bf2033
ISSN 0001-4966
IngestDate Tue Jul 01 01:15:44 EDT 2025
Fri Jun 21 00:14:42 EDT 2024
IsPeerReviewed true
IsScholarly true
Issue 4
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c694-f67a22dcc538dac08a9bef812f59627cceda7525e7d01c910a4358f95e6bf2033
PageCount 1
ParticipantIDs crossref_primary_10_1121_1_4970044
scitation_primary_10_1121_1_4970044
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 20161000
2016-10-01
PublicationDateYYYYMMDD 2016-10-01
PublicationDate_xml – month: 10
  year: 2016
  text: 20161000
PublicationDecade 2010
PublicationTitle The Journal of the Acoustical Society of America
PublicationYear 2016
SSID ssj0005839
Score 2.1817696
Snippet Subjective speech intelligibility assessment is important for the development of outdoor public address system. However, as this assessment is difficult in...
SourceID crossref
scitation
SourceType Index Database
Publisher
StartPage 3192
Title Speech intelligibility prediction method using machine learning for outdoor public address systems
URI http://dx.doi.org/10.1121/1.4970044
Volume 140
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1NT9tAEF2lQYheKkpB0Ba0Kr1Vhniz_tgjKiCESjkklbhZ6901CYgYkfgAf6p_seOd9dpAK0EvVuKvyJ6XmTf2mxlCvuYi5TIswkCCIwh4omP4z-UiSGSRh0kqVZLUhcJnP-OTX_z0Irro9X53VEvVIt9TD3-tK_kfq8I6sGtdJfsKy_qTwgr4DPaFJVgYli-y8ejWGPvIAttqotD1vq7711McAY4Dor9V9onAjRVOmmZSBCooy2qhy9L3uwZHZItH5p1O5lctojr81dakqNJOA6tbijj5Z81s8S2Q9-XgMu7rmU3W3Zfz6tqD6XyC5BXc_WXZ7j_TJSo9HqrJ9M5vGMlrCciy20aTagpX031oEcZe_tY6YkCGiF0XbOd7IZNNIyyp9s4Zmzk5FPLMzjptRUHodcGNsE4Eb74-jw6sjg7hHhd1U3_ehsDmtf-TyOj1ijZTYmEWZu7QN2SJQV7C-mTp4PDsx6hVFaVDl3Hh9blmVnDwvv_dRxRoBZgOii46vGa8St45g9IDRNd70jOzNbJshcFq_oHkiDH6BGO0xRhFjFGLMeowRhuMUcAYdRijiDHqMEYdxtbJ-Pho_P0kcHM5AhULHhRxIhnTSkGs1FINUilyUwBRLOwkJ6WMlknEIpPoQaiAjkqg5GkhIhPnBRsMhxukPytnZpPQSBdgdAm0Uww5ZPqQrGiIQcBZGRdcyi3ypblT2S12X8me2WKL7Pp7-O-9Pr7kVJ_I2xatn0l_cVeZbSCdi3zHGfoP6SaGfA
linkProvider EBSCOhost
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Speech+intelligibility+prediction+method+using+machine+learning+for+outdoor+public+address+systems&rft.jtitle=The+Journal+of+the+Acoustical+Society+of+America&rft.au=Kobayashi%2C+Yosuke&rft.au=Ohta%2C+Kengo&rft.au=Kondo%2C+Kazuhiro&rft.au=Sakamoto%2C+Shuichi&rft.date=2016-10-01&rft.issn=0001-4966&rft.eissn=1520-8524&rft.volume=140&rft.issue=4_Supplement&rft.spage=3192&rft.epage=3192&rft_id=info:doi/10.1121%2F1.4970044&rft.externalDBID=n%2Fa&rft.externalDocID=10_1121_1_4970044
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0001-4966&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0001-4966&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0001-4966&client=summon