Interplay of Lexical and Vocal Cues in Emotional Voice Perception
This study examined listeners’ ability to detect emotion from various speech samples, including both spontaneous conversations and actor-posed speech. We investigated the interaction between lexical content and vocal cues (i.e., tone of voice) and how these factors influence emotion perception from...
Saved in:
Published in | International Journal of Advanced Culture Technology(IJACT) Vol. 12; no. 4; pp. 451 - 465 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
국제문화기술진흥원
31.12.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | This study examined listeners’ ability to detect emotion from various speech samples, including both spontaneous conversations and actor-posed speech. We investigated the interaction between lexical content and vocal cues (i.e., tone of voice) and how these factors influence emotion perception from voice. Our approach involved two experimental conditions. In the text condition, participants assessed emotional attributes solely from written transcripts, devoid of vocal information. In the voice condition, participants evaluated emotions by listening to audio recordings. Results indicated that vocal cues generally enhanced emotional expressions, overriding lexical meaning for certain negative emotions in spontaneous speech. Correlation analysis revealed that lexical meanings suggesting anger or hostility could be perceived as positive affective states in some spontaneous speech when vocal cues were present. In perceiving sadness or anger in posed speech, much higher correct response rates were obtained in the voice condition compared to the text condition, indicating successful use of vocal cues alongside lexical meaning. However, when identifying happiness in posed speech, listeners showed little improvement from vocal cues. The correlation between acoustic parameters and emotional ratings was analyzed to understand how listeners utilized vocal cues. The vocal cues with the highest correlations were pitch and harmonicity to noise ratio. Listeners perceived angry and happy voices as pitch values increased, but distinguished happiness from anger when the voice contained more noise compared to harmonicity. Sad or timid voices were perceived as pitch values and variation decreased. Our research highlights the complex interaction between lexical content and vocal cues in emotional communication. |
---|---|
Bibliography: | http://www.ipact.kr/eng/iconf/ijact/sub05.php |
ISSN: | 2288-7202 2288-7318 |
DOI: | 10.17703/IJACT.2024.12.4.451 |