This paper focuses on stream-oriented applications with real-time constraints for on-chip multiprocessors, e.g. video/audio coding. In such an application the same function, e.g. frame decoding, is executed over and o...
详细信息
ISBN:
(纸本)9780780392274
This paper focuses on stream-oriented applications with real-time constraints for on-chip multiprocessors, e.g. video/audio coding. In such an application the same function, e.g. frame decoding, is executed over and over again. In general, the execution time is data-dependent. Thus, at run-time a situation may arise when the execution time exceeds the deadline specified in the real-time constraints and a preventive action should be taken. For stream-oriented applications pipelined execution of the function is very important for achieving the required throughput. An accurate execution time estimation method supporting pipelined execution on a multiprocessor architecture is proposed in this paper
An additional coding of auditory objects for packet loss concealment has been proven to be effective in music streaming applications. This paper describes a new extension to our previous method in separating drum obje...
详细信息
An additional coding of auditory objects for packet loss concealment has been proven to be effective in music streaming applications. This paper describes a new extension to our previous method in separating drum objects from polyphonic music signals with improved performance. After a simple time domain separation method employed in our early system, we propose in this paper a novel frequency domain technique, a tonal-components tracking and attenuation (TTA), to suppress quasi-stationary auditory objects such as singing voice in the separated drum objects. Experimental results show that the new method is an effective pre-processing step to separate drum objects from polyphonic music signals. This method helps to improve accuracy of drum clustering and to mitigate the pitch and harmonic structure mismatch problem when applied in packet loss recovery in music streaming.
In this paper, we describe a scalable (i.e., lossy-to-lossless) watermarking scheme which overcomes the problem of non-invertible distortion introduced by the watermark signal. The scheme is based on a standardized sc...
详细信息
In this paper, we describe a scalable (i.e., lossy-to-lossless) watermarking scheme which overcomes the problem of non-invertible distortion introduced by the watermark signal. The scheme is based on a standardized scalable audio coder (R.S. Yu, et al, 2004) -as a result, the embedded watermark inherits the scalability of the audio coder. We elaborate how the scalability can be used to realize the recovery of the lossless audio signal after watermark embedding. The experimental results demonstrate the validity of the proposed watermarking scheme in terms of robustness, data expansion and perceptual quality.
Sinusoidal modelling is a key technology in low rate audio coding, and methods for efficient quantization of sinusoidal parameters are therefore of high importance. We derive analytical formulas for the optimal entrop...
详细信息
Sinusoidal modelling is a key technology in low rate audio coding, and methods for efficient quantization of sinusoidal parameters are therefore of high importance. We derive analytical formulas for the optimal entropy constrained unrestricted spherical quantizers for amplitude, phase and frequency, using a perceptual distortion measure. This is done both for a single sinusoid, and for multiple sinusoids distributed over multiple segments. The quantizers minimize a high-resolution approximation of the expected distortion, while the corresponding quantization indices satisfy an entropy constraint. The quantizers turn out to be flexible and of low complexity, in the sense that they can be determined easily for varying bit rate requirements, without any sort of retraining or iterative procedures. In objective and subjective comparison tests, the proposed method is shown to outperform an existing state-of-the-art sinusoidal quantization scheme, where quantization of frequency parameters is done independently.
This paper presents an SoC platform based design for the implementation of an AAC audio decoder. We present the approach not only for the characteristics of the algorithm, but also provide the numerical decision for e...
详细信息
ISBN:
(纸本)0780388348
This paper presents an SoC platform based design for the implementation of an AAC audio decoder. We present the approach not only for the characteristics of the algorithm, but also provide the numerical decision for evaluation of the various approaches. The overall system is first analyzed and profiled with the ARM profiler. Then the decoder system is partitioned into software part and hardware part respectively based on the property of analysis. The software part is developed for the implementation of intensive decision making operations needed for audio bitstreams. The hardware part is a dedicated hardware for the regular and computation intensive operations in AAC audio decoding. The decoder system is realized on the ARM integrator platform where the hardware and software is communicated efficiently with the AMBA architecture.
This paper gives a brief overview of the current research of digital audio coding and decoding standard, and introduces the newly established AVS audio standard. It also draws the roadmap of AVS/spl ***/M audio standa...
详细信息
This paper gives a brief overview of the current research of digital audio coding and decoding standard, and introduces the newly established AVS audio standard. It also draws the roadmap of AVS/spl ***/M audio standard that will be made soon.
This paper discusses methods to implement the IMDCT filter bank, noiseless decoder, inverse quantiser and scale factor application modules of MPEG-2 advanced audio coding decoder more efficiently when implemented on F...
详细信息
This paper discusses methods to implement the IMDCT filter bank, noiseless decoder, inverse quantiser and scale factor application modules of MPEG-2 advanced audio coding decoder more efficiently when implemented on FPGAs. The efficiency of the algorithms has been validated through implementation on Xilinx Virtex II FPGAs.
In this paper, dynamic algorithm transforms (DTA) for reconfigurable real-time processor for audio coding based on the adaptive wavelet packet (WP) decomposition are presented DAT techniques is to constrain a minimum ...
详细信息
ISBN:
(纸本)0769517307;0769517315
In this paper, dynamic algorithm transforms (DTA) for reconfigurable real-time processor for audio coding based on the adaptive wavelet packet (WP) decomposition are presented DAT techniques is to constrain a minimum cost subband decomposition of wavelet transform by maximizing the minimum masking threshold (which is limited by the perceptual entropy) in every subband for the given embedded processor architecture and temporal resolution.
The use of psychoacoustical. masking models for audio coding applications has been wide spread over the past decades. In such applications, it is typically assumed that the original input signal serves as a masker for...
详细信息
ISBN:
(纸本)0780374029
The use of psychoacoustical. masking models for audio coding applications has been wide spread over the past decades. In such applications, it is typically assumed that the original input signal serves as a masker for the distortions that are introduced by the lossy coding method that is used. Such masking models are based on the peripheral bandpass filtering properties of the auditory system and basically evaluate the distortion-to-masker ratio within each auditory filter. Up to now these models have been based on the assumption that the masking of distortions is governed by the auditory filter for which the ratio between distortion and masker is largest. This assumption, however, is not in line with some new findings within the field of psychoacoustics. A more accurate assumption would be that the human auditory system is able to integrate distortions that are present within a range of auditory filters, In this contribution a new model is presented which is in line with new psychoacoustical studies and which is suitable for application within an audio codec. Although this model can be used to derive a masking curve, the model also gives a measure for the detectability of distortions provided that distortions are not too large.
暂无评论