A computational model of auditory selective attention

The human auditory system is able to separate acoustic mixtures in order to create a perceptual description of each sound source. It has been proposed that this is achieved by an auditory scene analysis (ASA) in which a mixture of sounds is parsed to give a number of perceptual streams, each of whic...

Full description

Saved in:

Bibliographic Details
Published in	IEEE transactions on neural networks Vol. 15; no. 5; pp. 1151 - 1163
Main Authors	Wrigley, S.N., Brown, G.J.
Format	Journal Article
Language	English
Published	United States IEEE 01.09.2004
Subjects	Action Potentials - physiology Animals Attention - physiology Auditory Cortex - physiology Auditory Pathways - physiology Auditory Perception - physiology Auditory system Biological Clocks - physiology Computational modeling Computer networks Ear Frequency Gaussian distribution Humans Image analysis Memory - physiology Models, Neurological Neural Networks (Computer) Neurons - physiology Normal Distribution Oscillators Psychology Synapses - physiology Synaptic Transmission - physiology
Online Access	Get full text
ISSN	1045-9227
DOI	10.1109/TNN.2004.832710

Cover

Loading…

More Information
Summary:	The human auditory system is able to separate acoustic mixtures in order to create a perceptual description of each sound source. It has been proposed that this is achieved by an auditory scene analysis (ASA) in which a mixture of sounds is parsed to give a number of perceptual streams, each of which describes a single sound source. It is widely assumed that ASA is a precursor of attentional mechanisms, which select a stream for attentional focus. However, recent studies suggest that attention plays a key role in the formation of auditory streams. Motivated by these findings, this paper presents a conceptual framework for auditory selective attention in which the formation of groups and streams is heavily influenced by conscious and subconscious attention. This framework is implemented as a computational model comprising a network of neural oscillators, which perform stream segregation on the basis of oscillatory correlation. Within the network, attentional interest is modeled as a Gaussian distribution in frequency. This determines the connection weights between oscillators and the attentional process, which is modeled as an attentional leaky integrator (ALI). Acoustic features are held to be the subject of attention if their oscillatory activity coincides temporally with a peak in the ALI activity. The output of the model is an "attentional stream," which encodes the frequency bands in the attentional focus at each epoch. The model successfully simulates a range of psychophysical phenomena.
Bibliography:	ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 ObjectType-Article-1 ObjectType-Feature-2
ISSN:	1045-9227
DOI:	10.1109/TNN.2004.832710