Parametric multichannel audio coding: synthesis of coherence cues

Parametric multichannel audio coding represents an audio signal as one single audio channel plus side information. The side information contains estimates of perceptually relevant differences between the original audio channels. Usually, time difference, level difference, and coherence cues are cons...

Full description

Saved in:

Bibliographic Details
Published in	IEEE transactions on audio, speech, and language processing Vol. 14; no. 1; pp. 299 - 310
Main Author	Faller, C.
Format	Journal Article
Language	English
Published	Piscataway, NJ IEEE 01.01.2006 Institute of Electrical and Electronics Engineers
Subjects	Applied sciences Audio coding Audio signals Auditory spatial image Channels Coding Coding, codes Coherence Computational complexity Cues Decoding Delay effects diffuse sound Exact sciences and technology Filters Information, signal and communications theory late reverberation Miscellaneous Multichannel parametric multichannel audio coding Reverberation Signal and communications theory Signal generators Signal processing Signal synthesis spatial perception surround Synthesis Telecommunications and information theory Testing Performance evaluation Acoustic reverberation Multichannel transmission Auditory spatial image Audio signal processing diffuse sound late reverberation Speech synthesis Parametric method Coding spatial perception parametric multichannel audio coding surround
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Parametric multichannel audio coding represents an audio signal as one single audio channel plus side information. The side information contains estimates of perceptually relevant differences between the original audio channels. Usually, time difference, level difference, and coherence cues are considered. These cues determine, to a large degree, the auditory spatial image that is perceived when playing back multichannel audio signals. Level difference and time difference synthesis is simple: Different gain factors and delays are applied to the sum signal in subbands for generating the different decoder output channels. However, it is not as obvious how coherence cues can be synthesized. Several heuristic methods for coherence synthesis were proposed previously. In this paper, we are proposing a systematic approach for coherence synthesis. The coherence that is measured in the encoder between a pair of channels is reproduced in the decoder. For that purpose, de-correlation filters modeling late reverberation with impulse responses of a length of several hundred milliseconds are used, resulting in the ability of the scheme to generate naturally sounding diffuse sound. A method for reducing the computational complexity of the scheme is presented. The results of a subjective test indicate that the proposed scheme achieves good audio quality. Furthermore, the scheme was compared to a previous scheme without multichannel coherence synthesis and performs significantly better for all items tested.
Bibliography:	ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23
ISSN:	1558-7916 1558-7924
DOI:	10.1109/TSA.2005.854105