Cortical processing of pitch: Model-based encoding and decoding of auditory fMRI responses to real-life sounds

Pitch is a perceptual attribute related to the fundamental frequency (or periodicity) of a sound. So far, the cortical processing of pitch has been investigated mostly using synthetic sounds. However, the complex harmonic structure of natural sounds may require different mechanisms for the extractio...

Full description

Saved in:

Bibliographic Details
Published in	NeuroImage (Orlando, Fla.) Vol. 180; no. Pt A; pp. 291 - 300
Main Authors	De Angelis, Vittoria, De Martino, Federico, Moerel, Michelle, Santoro, Roberta, Hausfeld, Lars, Formisano, Elia
Format	Journal Article
Language	English
Published	United States Elsevier Inc 15.10.2018 Elsevier Limited
Subjects	Accuracy Acoustic Stimulation Adult Auditory cortex Auditory Cortex - physiology Auditory system Brain Mapping - methods Computer applications Cortex (auditory) Decoding Encoding Energy Evoked Potentials, Auditory - physiology Female fMRI Frequency Functional magnetic resonance imaging Hemispheric laterality Humans Hypotheses Image Processing, Computer-Assisted - methods Magnetic Resonance Imaging - methods Male Models, Neurological Neural coding Neuroimaging Noise control Perceptions Periodicity Pitch Perception - physiology Pitch processing Real-life sounds Sound Superior temporal gyrus Temporal cortex Temporal gyrus Temporal variations Real-life sounds fMRI Pitch processing Encoding Decoding Auditory cortex
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Pitch is a perceptual attribute related to the fundamental frequency (or periodicity) of a sound. So far, the cortical processing of pitch has been investigated mostly using synthetic sounds. However, the complex harmonic structure of natural sounds may require different mechanisms for the extraction and analysis of pitch. This study investigated the neural representation of pitch in human auditory cortex using model-based encoding and decoding analyses of high field (7 T) functional magnetic resonance imaging (fMRI) data collected while participants listened to a wide range of real-life sounds. Specifically, we modeled the fMRI responses as a function of the sounds' perceived pitch height and salience (related to the fundamental frequency and the harmonic structure respectively), which we estimated with a computational algorithm of pitch extraction (de Cheveigné and Kawahara, 2002). First, using single-voxel fMRI encoding, we identified a pitch-coding region in the antero-lateral Heschl's gyrus (HG) and adjacent superior temporal gyrus (STG). In these regions, the pitch representation model combining height and salience predicted the fMRI responses comparatively better than other models of acoustic processing and, in the right hemisphere, better than pitch representations based on height/salience alone. Second, we assessed with model-based decoding that multi-voxel response patterns of the identified regions are more informative of perceived pitch than the remainder of the auditory cortex. Further multivariate analyses showed that complementing a multi-resolution spectro-temporal sound representation with pitch produces a small but significant improvement to the decoding of complex sounds from fMRI response patterns. In sum, this work extends model-based fMRI encoding and decoding methods - previously employed to examine the representation and processing of acoustic sound features in the human auditory system - to the representation and processing of a relevant perceptual attribute such as pitch. Taken together, the results of our model-based encoding and decoding analyses indicated that the pitch of complex real life sounds is extracted and processed in lateral HG/STG regions, at locations consistent with those indicated in several previous fMRI studies using synthetic sounds. Within these regions, pitch-related sound representations reflect the modulatory combination of height and the salience of the pitch percept. •Pitch processing is analyzed with a model-based fMRI encoding/decoding approach•The perceived pitch is modeled as a combination of height and salience•Lateral HG and STG responses reflect pitch height and salience•Pitch information improves the decoding of the fMRI responses to natural sounds
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ISSN:	1053-8119 1095-9572 1095-9572
DOI:	10.1016/j.neuroimage.2017.11.020