Multimodal control system for autonomous vehicles using speech and gesture recognition
The recent development of autonomous vehicles has attracted much attention, but operating these vehicles may be too complex for average users. Therefore, we propose an intuitive, multimodal interface for the control of autonomous vehicles using speech and gesture recognition to interpret and execute...
Published in | The Journal of the Acoustical Society of America Vol. 140; no. 4; pp. 2963 - 2964 |
---|---|
Main Authors | Nakagawa, Takuma; Kitaoka, Norihide |
Format | Journal Article |
Language | English Japanese |
Published | 01.10.2016 |
Abstract | The recent development of autonomous vehicles has attracted much attention, but operating these vehicles may be too complex for average users. Therefore, we propose an intuitive, multimodal interface for the control of autonomous vehicles using speech and gesture recognition to interpret and execute the commands of users. For example, if the user says “turn there” while pointing at a landmark, the vehicle can utilize this behavior to correctly understand and comply with the user’s intent. To achieve this, we designed a two-part interface consisting of a multimodal understanding component and a dialog control component. Our multimodal understanding and dialog control components can be seen as a concatenation of two separate transducers. One transducer is used for multimodal understanding and the other for a conventional dialog system. We then construct a combined transducer from these two transducers. We developed various scenarios which might arise while operating an autonomous vehicle and displayed these scenes on a monitor. Subjects were then asked to operate a virtual car using speech commands and pointing gestures to control the vehicle while observing the monitor. The questionnaire results show that subjects felt they were able to easily and naturally operate the autonomous vehicle using utterances and gestures. |
---|---|
Author | Nakagawa, Takuma; Kitaoka, Norihide |
Author_xml | – sequence: 1 givenname: Takuma surname: Nakagawa fullname: Nakagawa, Takuma organization: Tokushima Univ., 2-1 Minami-Johsanjima-cho, Tokushima-shi, Tokushima 770-8506, Japan, c501637005@tokushima-u.ac.jp – sequence: 2 givenname: Norihide surname: Kitaoka fullname: Kitaoka, Norihide organization: Tokushima Univ., 2-1 Minami-Johsanjima-cho, Tokushima-shi, Tokushima 770-8506, Japan, c501637005@tokushima-u.ac.jp |
CODEN | JASMAN |
CitedBy_id | crossref_primary_10_1109_ACCESS_2020_3005956 crossref_primary_10_1049_iet_cds_2018_5225 |
ContentType | Journal Article |
Copyright | Acoustical Society of America |
Copyright_xml | – notice: Acoustical Society of America |
DBID | AAYXX CITATION |
DOI | 10.1121/1.4969161 |
DatabaseName | CrossRef |
DatabaseTitle | CrossRef |
DatabaseTitleList | CrossRef |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Physics |
EISSN | 1520-8524 |
EndPage | 2964 |
ExternalDocumentID | 10_1121_1_4969161 |
ID | FETCH-LOGICAL-c1751-e1c07cd7203bf0cf4d9e2efb44ea3f9bc92bdbaf57ad8d4cbb4a7f6c574f89443 |
ISSN | 0001-4966 |
IngestDate | Thu Apr 24 23:07:23 EDT 2025 Tue Jul 01 01:15:43 EDT 2025 Fri Jun 21 00:14:43 EDT 2024 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 4 |
Language | English Japanese |
LinkModel | OpenURL |
PageCount | 2 |
ParticipantIDs | scitation_primary_10_1121_1_4969161 crossref_citationtrail_10_1121_1_4969161 crossref_primary_10_1121_1_4969161 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 20161000 2016-10-01 |
PublicationDateYYYYMMDD | 2016-10-01 |
PublicationDate_xml | – month: 10 year: 2016 text: 20161000 |
PublicationDecade | 2010 |
PublicationTitle | The Journal of the Acoustical Society of America |
PublicationYear | 2016 |
SSID | ssj0005839 |
Score | 2.1819327 |
SourceID | crossref scitation |
SourceType | Enrichment Source Index Database Publisher |
StartPage | 2963 |
Title | Multimodal control system for autonomous vehicles using speech and gesture recognition |
URI | http://dx.doi.org/10.1121/1.4969161 |
Volume | 140 |
hasFullText | 1 |
inHoldings | 1 |
linkProvider | EBSCOhost |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Multimodal+control+system+for+autonomous+vehicles+using+speech+and+gesture+recognition&rft.jtitle=The+Journal+of+the+Acoustical+Society+of+America&rft.au=Nakagawa%2C+Takuma&rft.au=Kitaoka%2C+Norihide&rft.date=2016-10-01&rft.issn=0001-4966&rft.eissn=1520-8524&rft.volume=140&rft.issue=4_Supplement&rft.spage=2963&rft.epage=2964&rft_id=info:doi/10.1121%2F1.4969161&rft.externalDBID=n%2Fa&rft.externalDocID=10_1121_1_4969161 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0001-4966&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0001-4966&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0001-4966&client=summon |
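
The abstract describes the interface as two transducers, one for multimodal understanding and one for dialog control, joined into a combined transducer. The sketch below is only an illustration of that idea and is not the authors' implementation: this record does not describe their transducers, so every table entry, state name, and the `combined_step` function are hypothetical. It also applies the two mappings in sequence rather than building a single composed finite-state transducer offline.

```python
from typing import Dict, Optional, Tuple

# Hypothetical multimodal understanding transducer:
# (spoken phrase, pointing target or None) -> semantic command.
UNDERSTAND: Dict[Tuple[str, Optional[str]], str] = {
    ("turn there", "landmark"): "TURN_AT_LANDMARK",
    ("turn there", None): "UNRESOLVED_TURN",
    ("stop", None): "STOP",
}

# Hypothetical dialog control transducer:
# (dialog state, semantic command) -> (next dialog state, system action).
DIALOG: Dict[Tuple[str, str], Tuple[str, str]] = {
    ("idle", "TURN_AT_LANDMARK"): ("idle", "execute turn at indicated landmark"),
    ("idle", "STOP"): ("idle", "execute stop"),
    ("idle", "UNRESOLVED_TURN"): ("await_target", "ask: where should I turn?"),
    ("await_target", "TURN_AT_LANDMARK"): ("idle", "execute turn at indicated landmark"),
}

# Fallback action when either stage has no matching entry (an assumption).
FALLBACK_ACTION = "ask: please repeat"


def combined_step(state: str, speech: str, gesture: Optional[str]) -> Tuple[str, str]:
    """Feed the understanding output into the dialog stage for one user turn."""
    semantic = UNDERSTAND.get((speech, gesture))
    if semantic is None:
        # The understanding stage could not interpret the input; stay in place.
        return state, FALLBACK_ACTION
    return DIALOG.get((state, semantic), (state, FALLBACK_ACTION))


if __name__ == "__main__":
    # Speech plus a pointing gesture resolves the turn in a single step.
    print(combined_step("idle", "turn there", "landmark"))
    # Speech alone leaves the target open, so the dialog stage asks for it.
    print(combined_step("idle", "turn there", None))
```

Keeping the two tables separate mirrors the two-part design in the abstract: the understanding table can change (for example, to add new gestures) without touching the dialog logic.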