The main objective of the MPEG Reconfigurable Video coding (RVC) standard is to establish a framework for a more flexible usage of standard video coding technology. The framework not only supports multiple standards a...
详细信息
The main objective of the MPEG Reconfigurable Video coding (RVC) standard is to establish a framework for a more flexible usage of standard video coding technology. The framework not only supports multiple standards and new coding configurations, but also provides an incremental and modular approach to innovation in video compression development and design. This paper provides an overview of the main objectives of RVC, standard accompanied with a presentation of the components of the framework for both normative parts and supporting tools useful for the final implementation of RVC codecs. These elements include: the Video Tool Library (VTL), the new standard RVC-CAL language used for the specification of the library, the Bitstream Syntax Description (BSD) used for the specification of the compressed bitstreams, as well as the Functional unit Network Description (FND) that constitutes the specification of a modular library. Technologies and tools that support the RVC standard are also briefly introduced.
We present strategies for perceptual improvements of embedded audio coding based on psychoacoustic weighting and spectral envelope restoration. The encoding schemes exhibit fine-grain bitrate scalability via the set p...
详细信息
We present strategies for perceptual improvements of embedded audio coding based on psychoacoustic weighting and spectral envelope restoration. The encoding schemes exhibit fine-grain bitrate scalability via the set partitioning in hierarchical trees (SPIHT) algorithm. Weighting factors and envelope parameters are transmitted under careful consideration of the amount of side information. For low bitrates, where the number of actually transmitted waveform coefficients is low, missing coefficients are shaped w.r.t. the spectral envelope. In our approach, the envelope information is transmitted in form of band-wise values of the l 1 -norm. Sets of standardized audio files as well as various audio data of contemporary music are encoded and the results are analyzed with objective measures of perceptual quality. The proposed coding scheme competes in perceptual quality with existing state-of-the-art fixed bitrate coders such as MPEG-2/4 AAC. For low bitrates, the proposed embedded coding envelope restoration (ECER) improves the perceptual audio quality notably.
Media authentication is important in content delivery via untrusted intermediaries, such as peer-to-peer (P2P) file sharing. Many differently encoded versions of a media file might exist. Our previous work applied dis...
详细信息
Media authentication is important in content delivery via untrusted intermediaries, such as peer-to-peer (P2P) file sharing. Many differently encoded versions of a media file might exist. Our previous work applied distributed source coding not only to distinguish the legitimate diversity of encoded images from tampering but also to localize tampered regions in an image already deemed to be in authentic. In both cases, authentication requires a Slepian-Wolf encoded image projection that is supplied to the decoder. We extend our scheme to authenticate images that have undergone affine warping. Our approach incorporates an expectation maximization algorithm into the Slepian-Wolf decoder. Experimental results demonstrate that the proposed algorithm can distinguish legitimate encodings of authentic images from illegitimately modified versions, despite an arbitrary affine warping, using authentication data of less than 250 bytes per image.
A framework for implementing the forward adaptive Karhunen-Loeve transform (FAKLT) is described. Unlike backward adaptive methods, FAKLT computes transform coefficients using basis vectors derived from the most recent...
详细信息
A framework for implementing the forward adaptive Karhunen-Loeve transform (FAKLT) is described. Unlike backward adaptive methods, FAKLT computes transform coefficients using basis vectors derived from the most recent signal frame. As a result, it exhibits improved energy compaction compared to the backward adaptive KLT. The method encodes only the KLT coefficients and a small amount of side information, the KLT basis vectors (eigenvectors) are not encoded.
A perceptual image compression codec exploits the characteristics of the human senses to minimize the perceivable quality loss of digital images under compression. Such a codec has an even higher value if the resultin...
详细信息
A perceptual image compression codec exploits the characteristics of the human senses to minimize the perceivable quality loss of digital images under compression. Such a codec has an even higher value if the resulting code-streams are compatible to an existing standard, and are thus decode-able by all-day, existing applications. This work describes strategies how to implement perceptual coding in standardized environments, namely JPEG, JPEG 2000 and JPEG-XR.
H.264 SVC is the scalable extension of H.264/AVC standard. This article provides an overview of the technical features of which enhanced compression performance and provision of a ldquonetwork-friendlyrdquo video repr...
详细信息
H.264 SVC is the scalable extension of H.264/AVC standard. This article provides an overview of the technical features of which enhanced compression performance and provision of a ldquonetwork-friendlyrdquo video representation addressing ldquoconversationalrdquo (video telephony) and ldquonon-conversationalrdquo (storage, broadcast, or streaming) applications. And we describe the subjective assessment methodologies, make a prediction about the switching on the decoder side, then give a recommendation for it.
In JPEG2000, embedded block coding with optimized truncation (EBCOT) requires a lot of computations. To enhance the performance of EBCOT, an efficient EBCOT algorithm is proposed. This algorithm consists of pass-predi...
详细信息
ISBN:
(纸本)9781424455430
In JPEG2000, embedded block coding with optimized truncation (EBCOT) requires a lot of computations. To enhance the performance of EBCOT, an efficient EBCOT algorithm is proposed. This algorithm consists of pass-prediction and pass-propagation. Pass prediction predicts the pass state in the next bit-plane. In particular, pass propagation generates the state of neighboring passes in the next bit-plane. This work also accomplishes boundary pass classification. With the propagated state, the pass state of the code-block can be classified for indicating the correct coding method. Various simulations have been conducted. The experimental results show that the proposed algorithm reduces the number of scan times by 42% to 52.7% for gray-level images and 11.8% to 34% for color images. The comparison demonstrates that the performance of our proposed method is higher than that of previous methods.
In this paper, we propose a novel two-pass reconstruction algorithm for the Wyner-Ziv (WZ) frames in distributed video coding (DVC), in which the traditional reconstructed WZ frame is utilized to perform motion estima...
详细信息
In this paper, we propose a novel two-pass reconstruction algorithm for the Wyner-Ziv (WZ) frames in distributed video coding (DVC), in which the traditional reconstructed WZ frame is utilized to perform motion estimation to obtain a more accurate motion field. During the motion estimation, the block, as well as its neighboring pixels are concerned. An overlapped block motion compensation is subsequently performed with the help of the motion field, consequently, an enhanced prediction for the WZ frame can be obtained, based on which an improved reconstruction can be achieved. Simulation results show that both the objective and subjective quality of WZ frames can be improved significantly.
This paper presents a novel stream-level joint layered coding scheme for unified reliable and secure media transmission over wireless networks. The proposed scheme simultaneously protects both compressed media content...
详细信息
This paper presents a novel stream-level joint layered coding scheme for unified reliable and secure media transmission over wireless networks. The proposed scheme simultaneously protects both compressed media content and the authentication data from wireless channel impairments. Therefore, the media quality degradation incurred by both channel noise and authentication constraints can be minimized. With a prior knowledge of source coding and channel coding, the proposed joint system integrates authentication into the media error protection components to ensure 100% effective verification probability, i.e. every source decodable media unit is authenticable. In particular, by utilizing the contextual relationship, such as coding dependency and content importance between media slices for authentication hash appending, the proposed scheme generates an extremely low authentication overhead. The proposed authentication scheme is fundamentally different from many existing systems that consider the problem of authenticating media content separately from the other operations in the media transmission system. Under this joint layered coding framework, an optimal rate allocation algorithm for source coding, channel coding and media authentication is developed to guarantee the end-to-end media quality. Experiment results on JPEG 2000 images validate the proposed scheme and demonstrate that the performance of the proposed approach is approaching its upper bound, in which case no authentication is applied to the media stream.
In the date-link transmission systems, especially video-guided systems, information data and image data are transmitted among the communication units. A kind of synthesis algorithm is proposed based on region of inter...
详细信息
In the date-link transmission systems, especially video-guided systems, information data and image data are transmitted among the communication units. A kind of synthesis algorithm is proposed based on region of interesting (ROI) coding method of JPEG2000. Since the algorithm is based on pixel data level, it is easy to realize for application. The simulation results show that reconstructed image quality decays ldB under low compression rate, while almost 0 dB under compression rate larger than 0.3 bpp.
暂无评论