Author affiliation: Arizona State Univ, Ctr Telecommun Res, Dept Elect Engn, Tempe, AZ 85287, USA
Publication: Proceedings of the IEEE
Year, volume, issue: 2000, vol. 88, no. 4
Pages: 451-513
Core indexing:
Funding: Intel Corp
Subjects: AC-2; AC-3; advanced audio coding (AAC); MPEG; ATRAC; audio coding; audio coding standards; audio signal processing; data compression; digital audio radio (DAR); digital broadcast audio (DBA); filter banks; high-definition TV (HDTV); linear predictive coding; lossy compression; modified discrete cosine transform (MDCT); MP3; MPEG-1; MPEG-2; MPEG-4; MPEG audio; multimedia signal processing; perceptual audio coding (PAC); perceptual coding; perceptual model; pseudoquadrature mirror filter (PQMF); psychoacoustic model; psychoacoustics; SDDS; signal compression; signal-processing applications; sinusoidal coding; subband coding; transform coding
Abstract: During the last decade, CD-quality digital audio has essentially replaced analog audio. Emerging digital audio applications for network, wireless, and multimedia computing systems face a series of constraints such as reduced channel bandwidth, limited storage capacity, and low cost. These new applications have created a demand for high-quality digital audio delivery at low bit rates. In response to this need, considerable research has been devoted to the development of algorithms for perceptually transparent coding of high-fidelity (CD-quality) digital audio. As a result, many algorithms have been proposed, and several have now become international and/or commercial product standards. This paper reviews algorithms for perceptually transparent coding of CD-quality digital audio, including both research and standardization activities. This paper is organized as follows. First, psychoacoustic principles are described, with the MPEG psychoacoustic signal analysis model 1 discussed in some detail. Next, filter bank design issues and algorithms are addressed, with a particular emphasis placed on the modified discrete cosine transform, a perfect reconstruction cosine-modulated filter bank that has become of central importance in perceptual audio coding. Then, we review methodologies that achieve perceptually transparent coding of FM- and CD-quality audio signals, including algorithms that manipulate transform components, subband signal decompositions, sinusoidal signal components, and linear prediction parameters, as well as hybrid algorithms that make use of more than one signal model. These discussions concentrate on architectures and applications of those techniques that utilize psychoacoustic models to efficiently exploit the masking characteristics of the human receiver. 
Several algorithms that have become international and/or commercial standards receive in-depth treatment, including the ISO/IEC MPEG family (-1, -2, -4), the Lucent Technologies PAC/EPAC/MPAC, the Dolby(