Attentional Modulation of Hierarchical Speech Representations in a Multitalker Environment

Abstract Humans are remarkably adept in listening to a desired speaker in a crowded environment, while filtering out nontarget speakers in the background. Attention is key to solving this difficult cocktail-party task, yet a detailed characterization of attentional effects on speech representations...

Full description

Saved in:
Bibliographic Details
Published inCerebral cortex (New York, N.Y. 1991) Vol. 31; no. 11; pp. 4986 - 5005
Main Authors Kiremitçi, Ibrahim, Yilmaz, Özgür, Çelik, Emin, Shahdloo, Mo, Huth, Alexander G, Çukur, Tolga
Format Journal Article
LanguageEnglish
Published United States Oxford University Press 01.10.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Abstract Humans are remarkably adept in listening to a desired speaker in a crowded environment, while filtering out nontarget speakers in the background. Attention is key to solving this difficult cocktail-party task, yet a detailed characterization of attentional effects on speech representations is lacking. It remains unclear across what levels of speech features and how much attentional modulation occurs in each brain area during the cocktail-party task. To address these questions, we recorded whole-brain blood-oxygen-level-dependent (BOLD) responses while subjects either passively listened to single-speaker stories, or selectively attended to a male or a female speaker in temporally overlaid stories in separate experiments. Spectral, articulatory, and semantic models of the natural stories were constructed. Intrinsic selectivity profiles were identified via voxelwise models fit to passive listening responses. Attentional modulations were then quantified based on model predictions for attended and unattended stories in the cocktail-party task. We find that attention causes broad modulations at multiple levels of speech representations while growing stronger toward later stages of processing, and that unattended speech is represented up to the semantic level in parabelt auditory cortex. These results provide insights on attentional mechanisms that underlie the ability to selectively listen to a desired speaker in noisy multispeaker environments.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1047-3211
1460-2199
1460-2199
DOI:10.1093/cercor/bhab136