Cross-modal Association between Auditory and Visuospatial Information in Mandarin Tone Perception in Noise by Native and Non-native Perceivers

Speech perception involves multiple input modalities. Research has indicated that perceivers establish cross-modal associations between auditory and visuospatial events to aid perception. Such intermodal relations can be particularly beneficial for speech development and learning, where infants and...

Full description

Saved in:

Bibliographic Details
Published in	Frontiers in psychology Vol. 8; p. 2051
Main Authors	Hannah, Beverly, Wang, Yue, Jongman, Allard, Sereno, Joan A, Cao, Jiguo, Nie, Yunlong
Format	Journal Article
Language	English
Published	Switzerland Frontiers Media S.A 04.12.2017
Subjects	audio-visual cross-modal association English gesture Mandarin Psychology tone perception English tone perception Mandarin cross-modal association audio-visual gesture
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Speech perception involves multiple input modalities. Research has indicated that perceivers establish cross-modal associations between auditory and visuospatial events to aid perception. Such intermodal relations can be particularly beneficial for speech development and learning, where infants and non-native perceivers need additional resources to acquire and process new sounds. This study examines how facial articulatory cues and co-speech hand gestures mimicking pitch contours in space affect non-native Mandarin tone perception. Native English as well as Mandarin perceivers identified tones embedded in noise with either congruent or incongruent Auditory-Facial (AF) and Auditory-FacialGestural (AFG) inputs. Native Mandarin results showed the expected ceiling-level performance in the congruent AF and AFG conditions. In the incongruent conditions, while AF identification was primarily auditory-based, AFG identification was partially based on gestures, demonstrating the use of gestures as valid cues in tone identification. The English perceivers' performance was poor in the congruent AF condition, but improved significantly in AFG. While the incongruent AF identification showed some reliance on facial information, incongruent AFG identification relied more on gestural than auditory-facial information. These results indicate positive effects of facial and especially gestural input on non-native tone perception, suggesting that cross-modal (visuospatial) resources can be recruited to aid auditory perception when phonetic demands are high. The current findings may inform patterns of tone acquisition and development, suggesting how multi-modal speech enhancement principles may be applied to facilitate speech learning.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 Reviewed by: Gang Peng, Hong Kong Polytechnic University, Hong Kong; Bencie Woll, University College London, United Kingdom Edited by: Leher Singh, National University of Singapore, Singapore This article was submitted to Language Sciences, a section of the journal Frontiers in Psychology
ISSN:	1664-1078 1664-1078
DOI:	10.3389/fpsyg.2017.02051