Multimodal control system for autonomous vehicles using speech and gesture recognition

Bibliographic Details
Published in The Journal of the Acoustical Society of America Vol. 140; no. 4; pp. 2963-2964
Main Authors Nakagawa, Takuma; Kitaoka, Norihide
Format Journal Article
Language English, Japanese
Published 01.10.2016
Abstract Autonomous vehicles have attracted much attention recently, but operating them may be too complex for average users. We therefore propose an intuitive, multimodal interface that uses speech and gesture recognition to interpret and execute user commands. For example, if the user says “turn there” while pointing at a landmark, the vehicle can combine the utterance and the gesture to understand and comply with the user’s intent. To achieve this, we designed a two-part interface consisting of a multimodal understanding component and a dialog control component. The two components can be viewed as a concatenation of two separate transducers, one for multimodal understanding and the other for a conventional dialog system, from which we construct a single combined transducer. We developed various scenarios that might arise while operating an autonomous vehicle and displayed these scenes on a monitor. Subjects were then asked to operate a virtual car with speech commands and pointing gestures while observing the monitor. The questionnaire results show that subjects felt they were able to operate the autonomous vehicle easily and naturally using utterances and gestures.
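
The abstract does not give the transducers themselves, so the following is only a minimal sketch of how two transducers might be combined into one. It assumes deterministic, epsilon-free finite-state transducers and uses standard transducer composition as the combining step; the state names, the input tokens (such as `speech:turn_there`), and the helper functions `compose` and `run` are all hypothetical and are not taken from the paper.

```python
from typing import Dict, Hashable, List, Tuple

# A transducer is modeled as: (state, input_symbol) -> (next_state, output_symbol).
State = Hashable
Transducer = Dict[Tuple[State, str], Tuple[State, str]]

# Hypothetical multimodal understanding transducer:
# reads speech/gesture tokens and emits semantic concepts.
UNDERSTANDING: Transducer = {
    ("u0", "speech:turn_there"): ("u1", "act:turn"),
    ("u1", "gesture:point_landmark"): ("u0", "target:landmark"),
    ("u0", "speech:stop"): ("u0", "act:stop"),
}

# Hypothetical dialog control transducer:
# reads semantic concepts and emits system actions.
DIALOG: Transducer = {
    ("d0", "act:turn"): ("d1", "wait_for_target"),
    ("d1", "target:landmark"): ("d0", "execute_turn_at_landmark"),
    ("d0", "act:stop"): ("d0", "execute_stop"),
}


def compose(first: Transducer, second: Transducer) -> Transducer:
    """Combine two transducers by matching outputs of `first` to inputs of `second`.

    States of the result are pairs (state_of_first, state_of_second).
    """
    combined: Transducer = {}
    for (s1, sym_in), (n1, mid) in first.items():
        for (s2, sym_mid), (n2, out) in second.items():
            if mid == sym_mid:
                combined[((s1, s2), sym_in)] = ((n1, n2), out)
    return combined


def run(transducer: Transducer, start: State, inputs: List[str]) -> List[str]:
    """Feed input symbols through the transducer and collect its outputs."""
    state = start
    outputs: List[str] = []
    for symbol in inputs:
        if (state, symbol) not in transducer:
            raise ValueError(f"no transition from {state!r} on {symbol!r}")
        state, out = transducer[(state, symbol)]
        outputs.append(out)
    return outputs


# Usage: build the combined transducer once, then drive it with user input.
COMBINED = compose(UNDERSTANDING, DIALOG)
ACTIONS = run(COMBINED, ("u0", "d0"), ["speech:turn_there", "gesture:point_landmark"])
```

In this toy setup the combined transducer maps raw speech and gesture tokens directly to system actions, so the intermediate semantic symbols never need to be materialized at run time. A real system would likely need weights, epsilon transitions, and time alignment between modalities, none of which are modeled here.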
Author Nakagawa, Takuma (Tokushima Univ., 2-1 Minami-Johsanjima-cho, Tokushima-shi, Tokushima 770-8506, Japan, c501637005@tokushima-u.ac.jp)
Kitaoka, Norihide (Tokushima Univ., 2-1 Minami-Johsanjima-cho, Tokushima-shi, Tokushima 770-8506, Japan, c501637005@tokushima-u.ac.jp)
CODEN JASMAN
CitedBy_id crossref_primary_10_1109_ACCESS_2020_3005956
crossref_primary_10_1049_iet_cds_2018_5225
ContentType Journal Article
Copyright Acoustical Society of America
DOI 10.1121/1.4969161
Discipline Physics
EISSN 1520-8524
EndPage 2964
ISSN 0001-4966
IsPeerReviewed true
IsScholarly true
Issue 4
PageCount 2
StartPage 2963
URI http://dx.doi.org/10.1121/1.4969161
Volume 140