Author affiliation: Agere Syst, Media Signal Proc Res Dept, Allentown, PA 18109 USA
Publication: IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING (IEEE Trans Speech Audio Process)
Year, volume, issue: 2003, Vol. 11, No. 6
Pages: 520-531
Subjects: audio coding; auralization; binaural signal; HRTF; multichannel audio; spatial image; spatial rendering; stereo audio; surround sound
Abstract: Binaural Cue Coding (BCC) is a method for multichannel spatial rendering based on one down-mixed audio channel and side information. The companion paper (Part I) covers the psychoacoustic fundamentals of this method and outlines principles for the design of BCC schemes. The BCC analysis and synthesis methods of Part I are motivated and presented in the framework of stereophonic audio coding. This paper, Part II, generalizes the basic BCC schemes presented in Part I. It includes BCC for multichannel signals and employs an enhanced set of perceptual spatial cues for BCC synthesis. A scheme for multichannel audio coding is presented. Moreover, a modified scheme is derived that allows flexible rendering of the spatial image at the receiver, supporting dynamic control. All aspects of complete BCC encoder and decoder implementations are discussed, such as down-mixing of the input signals, low-complexity estimation of the spatial cues, and quantization and coding of the side information. Application examples are given, and the performance of the coder implementations is evaluated and discussed based on subjective listening test results.