Search Results - Inner Mongolia University Library

An Efficient Time-Frequency Representation for Parametric-Based Audio Object Coding

ETRI JOURNAL, 2011, vol. 33, no. 6, pp. 945-948

Authors: Beack, Seungkwon; Lee, Taejin; Kim, Minje; Kang, Kyeongok (ETRI Broadcasting & Telecommun Convergence Res Lab, Taejon, South Korea)

Object-based audio coding can provide new music applications with interactivity. To efficiently compress a lot of target audio objects, a subband-based parametric coding scheme has been adopted for MPEG spatial audio object coding. In this letter, the time-frequency (T/F) subband analysis structure is investigated. A reconfigured T/F structure is also proposed to enhance the generating performance of sound scenes such as 'karaoke' and 'solo' play in interactive music scenarios. From the experimental results, it was confirmed that the proposed scheme remarkably improves the SNR and sound quality.

Keywords: SAOC; parametric audio coding; spatial cue

Construction of Ambience Bases from Weighing Matrices with Application in Spatial Audio Coding

IEEE International Workshop on Signal Processing Systems (IEEE SiPS)

Author: Gorlow, Stanislaw (Dolby Sweden, Stockholm, Sweden)

ISBN: (print) 9781538663189

Parametric spatial audio coding schemes, such as advanced joint channel coding in Dolby's next-generation audio coding system AC-4, achieve a higher data compression ratio as a result of a lower-dimensional intermediate signal representation, known as the downmix. During the inverse process, the upmix, which is guided by side information, the covariance between the source signals is reconstructed to preserve perceptually important cues such as ambience or source width. In this manuscript, a systematic approach for the construction of ambience bases from weighing matrices is presented. Furthermore, the basis vectors are generalized to accommodate for nonunitary mixing weights, and a new basis is derived. Round figures from internal listening tests are shared to underpin the utility of the approach.

Keywords: variance-covariance reconstruction; Gaussian single-channel mixture; decorrelation; parametric audio coding; spatial audio coding

A CROSS-DOMAIN APPROACH TO TEMPORAL ENVELOPE SHAPING IN PARAMETRIC STEREO CODING USING DEEP LEARNING

18th International Workshop on Acoustic Signal Enhancement (IWAENC)

Authors: Kechichian, Patrick; Ravi, Akshaya; Schuijers, Erik (Philips, Eindhoven, Netherlands)

ISBN: (print) 9798350361865; 9798350361858

In parametric stereo audio coding, at the encoder a stereo signal is downmixed to a mono signal along with a set of time-frequency dependent stereo parameters. At the decoder, using a decorrelator, a decorrelated signal is first generated from the downmix signal. A replica of the stereo signal is subsequently reconstructed based on the time-frequency dependent stereo parameters, the downmix and the decorrelated signal. A disadvantage of traditional decorrelators is that they have trouble following the temporal envelope of the mono signal due to frequency-dependent delays introduced in their processing. This is especially problematic for signals with strong, short energy bursts like transients, and leads to unwanted smearing of the decorrelated signal. In this work, we introduce a cross-domain deep learning approach for reshaping a decorrelated signal's temporal envelope in the subband domain, making use of envelope features learned from the time-domain downmix.

Keywords: parametric audio coding; decorrelator; temporal envelope; hybrid QMF filterbank; convolutional neural networks

Fast implementation of an improved parametric audio coder based on a mixed dictionary

SIGNAL PROCESSING, 2006, vol. 86, no. 3, pp. 432-443

Authors: Vera-Candeas, P; Ruiz-Reyes, N; Rosa-Zurera, M; Cuevas-Martínez, JC; López-Ferreras, F (Univ Jaen, Polytech Sch, Elect & Telecommun Engn Dept, Jaen 23700, Spain; Univ Alcala De Henares, Polytech Sch, Signal Theory & Commun Dept, Madrid 28871, Spain)

This paper deals with the application of adaptive signal models for representing transients and sinusoids at the same stage in a parametric audio coder. To accomplish such a goal, we search for sparse approximations by means of matching pursuit with a mixed dictionary, instead of using two different dictionaries that operate in cascade. In such sense, complex exponentials and wavelet packets are chosen for modeling the tonal and transient features of an audio signal, respectively. At each iteration of the pursuit, the mixed dictionary function that extracts the most energy from the residue is selected. This function will be either a complex exponential or a wavelet packet, depending on the characteristics of the residue at that iteration. Experimental results clearly show the objective (compression rate) and subjective (% preference) advantages of the mixed dictionary over two cascaded dictionaries. The approach proposed in this paper is successfully applied for parametric audio coding purposes, assuring better perceptual audio quality than MPEG2/4-AAC at 16 Kbits/s for most of the CD-quality one channel audio signals considered for testing. (C) 2005 Elsevier B.V. All rights reserved.

Keywords: matching pursuit; overcomplete dictionary; sparse approximation; parametric audio coding; wavelet packets; complex exponentials
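
As a rough illustration of the selection rule described in this abstract, the following minimal sketch runs matching pursuit over a single mixed dictionary and, at each iteration, picks whichever atom extracts the most energy from the residue. It is not the authors' implementation. As simplifying assumptions, real cosines and unit impulses stand in for the complex exponentials and wavelet packets, and the names build_mixed_dictionary and matching_pursuit are hypothetical.

```python
import math


def unit(vector):
    """Scale a vector to unit Euclidean norm."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]


def build_mixed_dictionary(n):
    """Hypothetical mixed dictionary: cosines (tonal) plus unit impulses (transient).

    Assumption: these simple atoms replace the complex exponentials and
    wavelet packets named in the abstract, purely to keep the sketch short.
    """
    atoms = []
    for k in range(1, n // 2):
        wave = [math.cos(2 * math.pi * k * t / n) for t in range(n)]
        atoms.append(("tonal", k, unit(wave)))
    for p in range(n):
        atoms.append(("transient", p, [1.0 if t == p else 0.0 for t in range(n)]))
    return atoms


def inner(a, b):
    """Inner product of two equal-length real vectors."""
    return sum(x * y for x, y in zip(a, b))


def matching_pursuit(signal, atoms, iterations):
    """Greedy pursuit: select the atom with the largest |<residue, atom>|.

    For unit-norm atoms, that atom removes the most energy from the residue.
    Returns the chosen (kind, index, coefficient) triples and the final residue.
    """
    residue = list(signal)
    chosen = []
    for _ in range(iterations):
        best = max(atoms, key=lambda atom: abs(inner(residue, atom[2])))
        coeff = inner(residue, best[2])
        residue = [r - coeff * x for r, x in zip(residue, best[2])]
        chosen.append((best[0], best[1], coeff))
    return chosen, residue
```

With a test signal made of one cosine plus one impulse, the pursuit is free to pick a tonal atom at one iteration and a transient atom at the next, which is the behaviour the abstract contrasts with two dictionaries operating in cascade.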

Bark scale-based perceptual matching pursuit for improving sinusoidal audio modeling

DIGITAL SIGNAL PROCESSING, 2009, vol. 19, no. 2, pp. 229-240

Authors: Vera-Candeas, P.; Ruiz-Reyes, N.; Lopez-Ferreras, F. (Univ Jaen, Polytech Sch, Dept Telecommun Engn, Jaen, Spain; Univ Alcala de Henares, Polytech Sch, Signal Theory & Commun Dept, Madrid, Spain)

In this paper we propose an improved sinusoidal modeling method based on perceptual matching pursuits computed in the bark scale for parametric audio coding applications. Complex exponentials compose the overcomplete dictionary for matching pursuits. The main contribution is the minimization of a perceptual distortion measure defined in the bark scale to select the optimum atom at each iteration of the pursuits. Furthermore, a psychoacoustic stopping criterion for the pursuits is presented. The proposed sinusoidal modeling method is suitable to be integrated into a parametric audio coder based on the three-part model of sines, transients and noise (STN model), as can be appreciated in experimental results. Our method provides significant advantages regarding previous works mainly because it operates in the bark scale rather than in frequency domain. (C) 2008 Elsevier Inc. All rights reserved.

Keywords: Sinusoidal modeling; Matching pursuit; Bark scale; Complex exponentials; Overcomplete dictionary; parametric audio coding; Psychoacoustics

Variable dimension trellis-coded quantization of sinusoidal parameters

IEEE SIGNAL PROCESSING LETTERS, 2008, vol. 15, pp. 17-20

Authors: Larsen, Morten Holm; Christensen, Mads Graesboll; Jensen, Soren Holdt (Aalborg Univ, Dept Elect Syst, DK-9220 Aalborg, Denmark)

In this letter, we propose joint quantization of the parameters of a set of sinusoids based on the theory of trellis-coded quantization. A particular advantage of this approach is that it allows for joint quantization of a variable number of sinusoids, which is particularly relevant in variable rate parametric audio coding. Under high-resolution assumptions and based on a perceptually relevant distortion measure, we derive analytical expressions for the optimal design subject to an entropy constraint. Numerical experiments show a significant performance gain compared to optimal spherical quantization at the cost of a slight increase in computational complexity.

Keywords: parametric audio coding; spherical quantization; trellis-coded quantization; variable dimension vector quantization

Sparse and structured decompositions of signals with the Molecular Matching Pursuit

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, vol. 14, no. 5, pp. 1808-1816

Author: Daudet, Laurent (Univ Paris 06, Lab Acoust Musicale, F-75015 Paris, France)

This paper describes the Molecular Matching Pursuit (MMP), an extension of the popular Matching Pursuit (MP) algorithm for the decomposition of signals. The MMP is a practical solution which introduces the notion of structures within the framework of sparse overcomplete representations; these structures are based on the local dependency of significant time-frequency or time-scale atoms. We show that this algorithm is well adapted to the representation of real signals such as percussive audio signals. This is at the cost of a slight sub-optimality in terms of the rate of convergence for the approximation error, but the benefits are numerous, most notably a significant reduction in the computational cost, which facilitates the processing of long signals. Results show that this algorithm is very promising for high-quality adaptive coding of audio signals.

Keywords: matching pursuit; overcomplete representations; parametric audio coding; time-frequency transforms

DYNAMIC STRATEGY FOR WINDOW SPLITTING, PARAMETERS ESTIMATION AND INTERPOLATION IN SPATIAL PARAMETRIC AUDIO CODERS

IEEE International Conference on Acoustics, Speech and Signal Processing

Authors: Capobianco, Julien; Pallone, Gregory; Daudet, Laurent (France Telecom Orange Labs, TECH OPERA, Av Pierre Marzin, F-22307 Lannion, France; ESPCI, Paris Diderot Univ, F-75005 Paris, France; ESPCI, Inst Langevin, F-75005 Paris, France)

ISBN: (print) 9781467300469

In most parametric stereo audio coders, sets of spatial parameters are extracted from the audio channels in a time-frequency domain. In order to reduce the amount of data, the parameters plane is highly down-sampled, and transmitted together with a mono downmix. Then, in the decoding process, it is necessary to interpolate the upmix matrix computed from these parameters. Usually, this is done in the same way for each portion of signal, regardless of its nature. In this article, we propose a dynamic strategy of window splitting, estimation of the parameters and interpolation of the upmix matrix based on transient detection in the audio signal. Subjective tests show an improvement when applied to the new stereo parametric tool from MPEG USAC.

Keywords: parametric audio coding; stereo
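
To make the interpolation step in this abstract concrete, the sketch below shows the fixed, signal-independent baseline the abstract describes as usual practice: the upmix gains are linearly interpolated across a frame and applied to the mono downmix. It does not implement the paper's transient-driven strategy. As assumptions, the upmix matrix is reduced to one left and one right gain, processing is done on a broadband time-domain frame rather than per band, and the function names are hypothetical.

```python
def interpolate_upmix(prev_gains, next_gains, frame_len):
    """Linearly interpolate (left, right) upmix gains across one frame.

    prev_gains holds the gains at the end of the previous frame and
    next_gains the gains reached at the end of this frame.
    """
    gains = []
    for n in range(frame_len):
        w = (n + 1) / frame_len  # interpolation weight, reaches 1.0 at frame end
        gains.append(tuple((1 - w) * p + w * q for p, q in zip(prev_gains, next_gains)))
    return gains


def upmix(mono, prev_gains, next_gains):
    """Apply the interpolated gains to a mono downmix frame."""
    gains = interpolate_upmix(prev_gains, next_gains, len(mono))
    left = [g[0] * m for g, m in zip(gains, mono)]
    right = [g[1] * m for g, m in zip(gains, mono)]
    return left, right
```

A dynamic strategy of the kind proposed would presumably change the window boundaries or the interpolation curve when a transient is detected, instead of always applying the same linear ramp.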

SINUSOIDAL SUBSTITUTION - AN INTEGRATED PARAMETRIC TOOL FOR ENHANCEMENT OF TRANSFORM-BASED PERCEPTUAL AUDIO CODERS

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Authors: Disch, Sascha; Schubert, Benjamin (Fraunhofer Inst Integrated Circuits IIS, Erlangen, Germany)

ISBN: (print) 9781479928934

Transform-based audio coders are the preferred technique for music data compression. However, at low bitrates, traditional coders based on Modified Discrete Cosine Transform are prone to strong warbling and roughness artifacts originating from sparsely coded tonal components. Parametric coders, in turn, suffer from an unpleasantly artificial sound and do not scale well up to perceptual transparency. Hybrid transform-based and parametric coding could potentially overcome the limits of the individual approaches. Yet, existing hybrid coders are hampered by the lack of integrative interplay between both techniques. We outline our ideas on how to tightly integrate transform-based coding and parametric coding to obtain an enhanced perceptual quality and scalability. Also, we provide listening test results which demonstrate the benefits of our hybrid coder design.

Keywords: Codecs; parametric audio coding; Signal Synthesis

SINUSOIDAL COMPONENT SELECTION BASED ON PARTIAL LOUDNESS CRITERIA

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Krishnamoorthi, Harish; Spanias, Andreas (Arizona State Univ, SenSIP Ctr, Sch Elect Comp & Energy Engn, Tempe, AZ 85287, USA)

ISBN: (print) 9781479903566

Sinusoidal models are widely used in parametric speech and audio coding schemes. A common requirement in these applications is to select only a subset of components that provide the greatest perceptual benefit particularly at low bitrates. Usually, perceptual sinusoidal component selection algorithms make use of greedy algorithms that are computationally expensive. In this paper, we present a new algorithm that selects sinusoidal components based on the partial loudness model proposed by Moore & Glasberg. We compare the performance of the proposed algorithm in terms of perceptual benefit and computational complexity to other existing sinusoidal selection algorithms.

Keywords: loudness; sinusoidal models; parametric audio coding; audio coding; auditory patterns
