A nonlinear detector is proposed for optimal DCT-domain watermark detection. The detector is derived from a watermark embedding algorithm based on an absolute-value reordering rule for DCT coefficients. The performance of the nonlinear detector is compared with that of the linear detector, using the deflection coefficient as the measure. Experimental results demonstrate that the proposed nonlinear detector is robust to some common attacks on the marked image, including JPEG compression, median filtering, and additive Gaussian noise.
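For reference, the deflection coefficient of a detection statistic is commonly defined as below. The abstract does not give its formula, so this textbook form is an assumption; the symbols $T$ (the detector output), $H_1$ (watermark present) and $H_0$ (watermark absent) are introduced here for illustration only.

```latex
d^2 = \frac{\left( \mathrm{E}[T \mid H_1] - \mathrm{E}[T \mid H_0] \right)^2}{\mathrm{Var}[T \mid H_0]}
```

Under this definition, a larger $d^2$ indicates better separation between the marked and unmarked cases.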
ISBN: 9781424459421 (print)
The recently developed video compression standard H.264/AVC outperforms previous video standards such as MPEG-2, MPEG-4 Part 2, and H.263, and is therefore expected to be selected as the video standard for most digital video applications. The widely deployed infrastructure, however, continues to use the previous standards. Heterogeneous video transcoding offers a way to resolve this problem. This paper proposes a new algorithm for H.264/AVC to MPEG-2 transcoding that uses motion vector clustering to reduce computation time with no loss of quality. The clustering reduces the number of candidate motion vectors gathered during the H.264 decoding stage. The candidates take into account the correlation between the direction and distance of the motion vectors of the variable-size blocks in H.264/AVC. The candidate motion vector with the least distortion is then selected in the MPEG-2 encoder, which can therefore use the best motion vector without performing motion estimation. Experimental results show that the proposed method maintains a good level of video quality while reducing computational complexity by 64% on average compared with a cascaded transcoder.
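The selection step described above can be sketched roughly as follows. This is a minimal illustration, not the paper's algorithm: the greedy clustering rule, the Chebyshev radius, the use of SAD as the distortion measure, and all function names are assumptions made for the example.

```python
def cluster_candidates(mvs, radius=1):
    """Greedy clustering of (dx, dy) vectors; returns one representative per cluster."""
    reps = []
    for dx, dy in mvs:
        # A vector joins an existing cluster if it lies within `radius`
        # (Chebyshev distance) of that cluster's representative;
        # otherwise it starts a new cluster.
        if not any(max(abs(dx - rx), abs(dy - ry)) <= radius for rx, ry in reps):
            reps.append((dx, dy))
    return reps


def sad(cur, ref, bx, by, mv, size):
    """Sum of absolute differences between a block in `cur` and the
    block displaced by `mv` in `ref` (assumed distortion measure)."""
    dx, dy = mv
    return sum(
        abs(cur[by + y][bx + x] - ref[by + y + dy][bx + x + dx])
        for y in range(size)
        for x in range(size)
    )


def select_best(cur, ref, bx, by, candidates, size=16):
    """Return the candidate vector with the least distortion, or None."""
    height, width = len(ref), len(ref[0])
    best, best_cost = None, None
    for dx, dy in candidates:
        # Skip candidates whose reference block falls outside the frame.
        if not (0 <= bx + dx and bx + dx + size <= width
                and 0 <= by + dy and by + dy + size <= height):
            continue
        cost = sad(cur, ref, bx, by, (dx, dy), size)
        if best_cost is None or cost < best_cost:
            best, best_cost = (dx, dy), cost
    return best
```

Because only the cluster representatives are evaluated, the encoder computes a handful of distortion values per block instead of running a full motion search.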
The constant evolution of video standards and technologies and the significant growth in the amount of video data in networks create a need for new models of video sources. In this paper, we specify a Markovian fluid model based on GoPs (Groups of Pictures) that creates a video source descriptor. This descriptor can be used to compute the loss rate observed when transmitting the video over the network. We also show how to build artificial video traffic with the same statistical characteristics as the original source.
In this paper, the BYPASS and PARALLEL modes of JPEG2000 are investigated and implemented in a software compression system for verification and validation. BYPASS and PARALLEL modes in JPEG2000 are options in the standard to facilitate fast compression and parallel computation for embedded applications. Our results show minimal performance degradation in both BYPASS and PARALLEL modes, where BYPASS mode degrades PSNR performance by 0.1 dB, PARALLEL mode degrades performance by 0.15 dB, and utilizing both BYPASS and PARALLEL modes results in a performance degradation of 0.25 dB, on average. The implementation of the different modes results in a compression speedup of approximately 10% in BYPASS mode, and a potential 3x speedup in PARALLEL mode, if independent coding passes are executed concurrently.
ISBN: 9781424475421 (print)
Within the scope of information retrieval, efficient similarity search in large document or multimedia collections is a critical task. In this paper, we present a rigorous comparison of three different approaches to the image retrieval problem: cluster-based indexing, distance-based indexing, and multidimensional scaling methods. The time and accuracy trade-offs for each of these methods are demonstrated on a large Corel image database. Similarity of images is obtained via a feature-based similarity measure using four MPEG-7 low-level descriptors. We show that optimizing the feature contributions to the distance measure can identify irrelevant features and is necessary to obtain the maximum accuracy. We further show that multidimensional scaling can achieve comparable accuracy while speeding up query times significantly by allowing the use of spatial access methods.
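A weighted, feature-based distance of the kind described above might look like the sketch below. The abstract does not specify the per-descriptor distance or the descriptor names, so the L1 distance, the weighted sum, and the names used here are assumptions for illustration.

```python
def l1(u, v):
    """L1 distance between two equal-length feature vectors."""
    return sum(abs(a - b) for a, b in zip(u, v))


def combined_distance(a, b, weights):
    """Weighted sum of per-descriptor distances.

    `a` and `b` map a descriptor name to its feature vector; `weights`
    maps the same names to that descriptor's contribution. A weight of
    0 removes a descriptor from the measure entirely.
    """
    return sum(w * l1(a[name], b[name]) for name, w in weights.items())


def rank(query, database, weights):
    """Return database keys ordered from most to least similar to the query."""
    return sorted(database, key=lambda k: combined_distance(query, database[k], weights))
```

In this form, a descriptor whose optimized weight ends up near zero contributes almost nothing to the ranking, which is one way an irrelevant feature could show up.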
This paper analyzes the implementations of current traditional WebGIS, such as CGI, Server API, and ActiveX, and summarizes their advantages and disadvantages. From this analysis, it identifies three urgent problems in traditional WebGIS, which a WebGIS based on Flex technology can remedy. Based on a comprehensive analysis of the characteristics of Flex, the paper builds a demo version of an oilfield visual management system as a practical system, using a tile cache to improve system efficiency. The results indicate that Flex is a suitable basis for WebGIS development.
The H.264/AVC intra-only frame encoder, owing to its excellent encoding performance, is well suited to image/video compression applications such as Digital Still Cameras (DSC), Digital Video Cameras (DVC), Television Studio Broadcast, and Surveillance video. The forward integer transform is an integral part of the H.264/AVC video encoder. In this paper, for image compression applications running on battery-powered electronic devices (such as DSC), we propose a low-power, area-efficient realization of the forward integer transform. The proposed solution reduces the number of operations by more than 50% (30 vs. 64) and consumes significantly less dynamic power than existing state-of-the-art designs for the forward integer transform. For video compression applications such as Television Studio Broadcast or Surveillance video, where throughput is more important, we propose a low-latency, area-efficient realization of the forward integer transform unit in the intra-frame processing chain. With the proposed solution, the effective latency of the forward integer transform is drastically reduced, as the processing unit is no longer on the critical path of the intra-frame processing chain. Moreover, the hardware implementation of the proposed solution requires half the number of operations of existing state-of-the-art designs for the forward integer transform.
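For context, the 4x4 forward integer (core) transform of H.264/AVC can be sketched with the conventional add-and-shift butterfly shown below. This is the well-known baseline form, not the reduced-operation design proposed in the paper; the function names are illustrative, and the scaling and quantization stages that follow the core transform are omitted.

```python
def transform_1d(x):
    """4-point core transform using only additions, subtractions and shifts.

    Equivalent to multiplying by the rows
    [1, 1, 1, 1], [2, 1, -1, -2], [1, -1, -1, 1], [1, -2, 2, -1].
    """
    s03, d03 = x[0] + x[3], x[0] - x[3]
    s12, d12 = x[1] + x[2], x[1] - x[2]
    return [
        s03 + s12,
        (d03 << 1) + d12,   # 2*(x0 - x3) + (x1 - x2)
        s03 - s12,
        d03 - (d12 << 1),   # (x0 - x3) - 2*(x1 - x2)
    ]


def forward_transform_4x4(block):
    """Apply the 1D transform to every row, then to every column."""
    rows = [transform_1d(r) for r in block]
    # cols[j] holds the transformed j-th column of the row-transformed block.
    cols = [transform_1d([rows[i][j] for i in range(4)]) for j in range(4)]
    # Transpose back so the result is indexed [row][column].
    return [[cols[j][i] for j in range(4)] for i in range(4)]
```

Each 1D pass here costs eight additions or subtractions and two shifts, and a 4x4 block needs eight such passes.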
This paper presents a new application-specific lossless compression scheme developed for video identification descriptors, also known as video fingerprints or signatures. In designing such a descriptor, one usually has to balance the descriptor size against discriminating power and temporal localisation performance. The proposed compression scheme alleviates this problem by efficiently exploiting the temporal redundancies present in the video fingerprint, allowing highly accurate fingerprints that also entail low transmission and storage costs. In this paper, we provide a detailed description of our compression scheme and a comparative evaluation against well-known state-of-the-art generic compression tools.
There is a strong need to estimate plays in sports videos. Plays in sports can be described as the motions of players. This paper proposes a play retrieval method based on the motion compensation vectors in MPEG sports videos. Because MPEG videos already contain motion compensation vectors, the motion vectors between adjacent frames do not need to be estimated, which avoids the heavy computation of motion estimation. This work uses a 1D degenerated description of each motion image between two adjacent frames. Connecting the 1D degenerated descriptions along the time direction yields a space-time image, which describes a sequence of frames as a two-dimensional image. Using this space-time image, this work shows a method to retrieve a small number of plays from a huge number of frames. Our experiment records a recall of 0.63, a precision of 0.84, and an F-measure of 0.70 for 139 plays in 132503 frames.
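The construction of the space-time image can be illustrated with the short sketch below. The abstract does not say how each motion image is reduced to one dimension, so the column-wise sum of vector magnitudes used here is purely an assumption, as are the function names and the data layout.

```python
def degenerate(field):
    """Collapse one motion field to a 1D description.

    `field` is a list of rows, each row a list of (dx, dy) motion
    compensation vectors. Assumed reduction: sum |dx| + |dy| down
    each column.
    """
    cols = len(field[0])
    return [sum(abs(row[c][0]) + abs(row[c][1]) for row in field) for c in range(cols)]


def space_time_image(fields):
    """Stack the 1D descriptions of consecutive motion fields along time.

    Row t of the result is the 1D description of the t-th motion field,
    so a sequence of frames becomes a single 2D image.
    """
    return [degenerate(f) for f in fields]
```

With this layout, a play would appear as a 2D pattern in the stacked image, so retrieval could be posed as a search over one image rather than over every frame.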
ISBN: 9781424458509; 9781424458493 (print)
The derivation of spatial cues representing source localization information is a typical component of multichannel spatial audio coders such as EAAC+ and MPEG Surround. Efficient compression of spatial cues based on the inter-frame difference distribution of spatial cues is investigated. Using a Bayesian Gradient model, the inter-frame correlations can be predicted more accurately. Results show that the proposed higher-order prediction method for spatial cue compression achieves about 20% bit-rate reduction with respect to the inter-freq differential coding method used in MPEG Surround.