A plethora of Image Quality Metrics (IQMs) has been proposed over the last two decades. At present, however, no accepted IQM can predict the perceptual level of image degradation across different types of visual distortion. Some measures are well suited to one set of degradations but ineffective for others; indeed, the efficiency of any IQM has been shown to depend on the type of degradation. We therefore propose a new approach that predicts the type of degradation before an IQM is applied. The basic idea is first to identify the type of distortion using a Bayesian approach, then to select the most appropriate IQM for estimating image quality for that specific type of distortion. The performance of the proposed method is evaluated in terms of classification accuracy across different types of degradation.
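The two-step idea (classify the distortion, then pick a metric) can be sketched roughly as follows. This is a minimal illustration using a Gaussian naive Bayes classifier, not the authors' method: the abstract does not specify the Bayesian model, the features, or the IQMs, so the toy feature values, the class names, and the `IQM_FOR_DISTORTION` mapping are all hypothetical placeholders.

```python
import math

# Hypothetical mapping from distortion type to a preferred metric name.
IQM_FOR_DISTORTION = {"blur": "metric_A", "noise": "metric_B"}


def gaussian_log_pdf(x, mean, var):
    """Log-density of a 1-D Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)


def fit(samples_by_class):
    """Estimate a log-prior and per-feature mean/variance for each class."""
    total = sum(len(rows) for rows in samples_by_class.values())
    model = {}
    for label, rows in samples_by_class.items():
        n, dims = len(rows), len(rows[0])
        means = [sum(r[d] for r in rows) / n for d in range(dims)]
        # Small constant keeps the variance strictly positive.
        variances = [
            sum((r[d] - means[d]) ** 2 for r in rows) / n + 1e-6
            for d in range(dims)
        ]
        model[label] = (math.log(n / total), means, variances)
    return model


def classify(model, features):
    """Return the class with the highest posterior (MAP decision)."""
    def log_posterior(label):
        log_prior, means, variances = model[label]
        return log_prior + sum(
            gaussian_log_pdf(x, m, v)
            for x, m, v in zip(features, means, variances)
        )
    return max(model, key=log_posterior)


def select_iqm(model, features):
    """Step 1: identify the distortion. Step 2: look up the metric."""
    return IQM_FOR_DISTORTION[classify(model, features)]


# Toy, made-up training features (two features per image).
training = {
    "blur": [(0.9, 0.1), (0.8, 0.2)],
    "noise": [(0.1, 0.9), (0.2, 0.8)],
}
model = fit(training)
```

With this toy model, `select_iqm(model, (0.85, 0.15))` picks the metric registered for blur. In practice the features would be extracted from the degraded image and the mapping would come from a benchmark of IQMs per distortion type.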
Video streaming applications are commonly used in both wired and wireless environments; however, wireless links are burdened by a higher packet loss ratio and delay variation. To make video transmission possible in wireless networks, MPEG video coding is usually used to meet the bandwidth constraints of the links. The video quality and compression ratio depend on the Group of Pictures (GOP) structure, which also affects the sensitivity of the video stream to distortion caused by packet losses. In this paper, the correlation between GOP size, packet loss ratio and video quality is investigated. Increasing the distance between reference frames improves coding efficiency, but it also increases the effect of error propagation due to packet losses. Our aim was to find the optimal GOP structure that maximizes coding efficiency and minimizes the quality distortion due to error propagation. We have implemented a simulation tool to analyze differently structured video streams transmitted over lossy channels.
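The error-propagation side of this trade-off can be illustrated with a very simple model. The sketch below is not the authors' simulation tool; it assumes an IPPP... GOP with no B-frames, independent losses that each destroy a whole frame with probability `frame_loss_prob`, no error concealment, and that a frame decodes correctly only if it and every earlier frame in the same GOP arrived.

```python
def intact_fraction(gop_size, frame_loss_prob):
    """Expected fraction of frames that decode without error.

    Frame k of a GOP (k = 0 is the I-frame) is intact only if frames
    0..k were all received, which happens with probability (1 - p)**(k + 1).
    """
    if gop_size < 1:
        raise ValueError("gop_size must be at least 1")
    if not 0.0 <= frame_loss_prob <= 1.0:
        raise ValueError("frame_loss_prob must be in [0, 1]")
    q = 1.0 - frame_loss_prob
    return sum(q ** (k + 1) for k in range(gop_size)) / gop_size


if __name__ == "__main__":
    # Longer GOPs let a single loss corrupt more frames.
    for n in (1, 6, 12, 24):
        print(n, round(intact_fraction(n, 0.02), 3))
```

Under these assumptions the intact fraction falls as the GOP grows, which is the cost that has to be weighed against the bitrate saved by sending fewer I-frames.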
In the CDMA-based fingerprinting scheme, a fingerprint sequence is composed of two kinds of spread spectrum sequences generated from DCT basic vectors and PN sequences. In this study, the DCT basic vectors are randomly permuted in order to reduce the interference among colluders' fingerprint sequences, an effect similar to that of the interleaver used in Turbo codes. Our experimental results reveal that the false-positive probability is reduced without degrading the traceability.
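One possible reading of the permutation step is that the sample positions of each DCT vector are shuffled with a keyed permutation, as an interleaver would do. The sketch below assumes that reading (the abstract does not give the details) and only shows that a shared permutation keeps the vectors orthonormal, so correlation-based detection still separates them. It does not model collusion or the reported drop in false positives; the seed value is a hypothetical stand-in for a secret key.

```python
import math
import random


def dct_basis(n):
    """Rows are the orthonormal DCT-II basis vectors of length n."""
    rows = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        rows.append(
            [scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
             for i in range(n)]
        )
    return rows


def permute(vector, order):
    """Reorder the samples of a vector according to `order`."""
    return [vector[i] for i in order]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


N = 8
basis = dct_basis(N)
# Keyed pseudo-random permutation of the sample positions (hypothetical key).
order = random.Random(1234).sample(range(N), N)
permuted = [permute(v, order) for v in basis]
```

Because every vector is shuffled with the same `order`, inner products between vectors are unchanged, while each vector no longer looks like a smooth cosine.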
High-resolution still image compression and storage play a very important role in aerial military reconnaissance, topographic mapping and satellite remote sensing. This paper presents an embedded high-resolution still image compression and storage system based on PowerPC, FPGA and ADV212 that implements lossy and lossless compression of 4008×5366×12-bit high-resolution aerial images. The structure of the compression and storage system and its software and hardware design methods are described. The design objective is to save one image every two seconds and to send one image every eight seconds at a transmission rate of 1.44 Mbps. The experimental results show that the system achieves the expected real-time compression and storage of high-resolution aerial images.
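The stated design figures imply a minimum compression ratio for the transmission path, which a few lines of arithmetic make explicit. This is only a back-of-the-envelope sketch: it assumes 1 Mbps means 10^6 bit/s, that 12-bit pixels are stored without padding, and that the link carries no protocol overhead, none of which the abstract states.

```python
WIDTH, HEIGHT, BITS_PER_PIXEL = 4008, 5366, 12
LINK_RATE_BPS = 1.44e6   # assumption: 1 Mbps = 10**6 bit/s
SEND_PERIOD_S = 8        # one image sent every eight seconds
STORE_PERIOD_S = 2       # one image saved every two seconds

# Size of one uncompressed image in bits.
raw_bits = WIDTH * HEIGHT * BITS_PER_PIXEL

# Bits the link can carry in one send period.
link_budget_bits = LINK_RATE_BPS * SEND_PERIOD_S

# Compression ratio needed for one image to fit in that budget.
min_compression_ratio = raw_bits / link_budget_bits

# Raw pixel data rate the compressor has to absorb, in Mbit/s.
raw_input_rate_mbps = raw_bits / STORE_PERIOD_S / 1e6

if __name__ == "__main__":
    print(raw_bits, round(min_compression_ratio, 1), round(raw_input_rate_mbps, 1))
```

Under these assumptions one raw image is 258,083,136 bits, so a transmitted image has to be compressed by roughly 22:1, while the storage path has to absorb about 129 Mbit/s of raw pixel data.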
ISBN: (print) 9781424464258; 9780769539942
Summary form only given. Time-frequency analysis is a fundamental methodology in signal processing. It provides a description of a signal in the time-frequency plane. Conventionally, in applications of time-frequency analysis, the time domain of a signal is first partitioned into intervals. Interval by interval, one computes the local frequency spectrum and then processes the signal with respect to this local spectrum. The whole procedure is repeated as the time interval changes. This is the popular, classic approach to time-frequency analysis, and to date the research and applications of time-frequency analysis have followed it. In this approach, signal contents are treated interval by interval, locally in time, which neglects the statistical dependency between local frequency spectra in neighboring time intervals. Data compression is an important and successful application of time-frequency analysis. The international standards JPEG (1990), MPEG1 (1993), MPEG2 (1994), MPEG4 (1998), H.264 (2003) and JPEG XR (2009) are all good examples. In these systems, image data are encoded using the classic approach to time-frequency analysis: separately, block by block, image data are transformed, quantized and entropy encoded. Another notable image coding scheme in the classic approach has been introduced, in which image data are transformed by an 8 x 8 DCT. After the transform, the 64 coefficients in an 8 x 8 block are treated as a depth-3 tree, without any inter-block regrouping or other operations. Within each block, the coefficients are separately quantized and encoded by the well-known EZW method, which is popular in wavelet-based image coding. In this paper, we change course and advocate a new approach to time-frequency analysis. We process signals within frequency bands rather than time intervals, treating signal contents band by band, locally in frequency. The new approach provides a good platform for exploiting the dependency between local frequency spectra. The ba
ISBN: (print) 9781424455928; 9781424455935
The new JPEG XR image encoding standard offers a high compression rate while maintaining good visual quality. Nonetheless, it has low error robustness, which makes it unusable for unreliable transmission over error-prone channels such as wireless channels. An improvement to the standard was developed that can correct transmission errors, both bit and packet losses, and that is fully compatible with legacy decoders. Data interleaving and channel coding can offer good protection against transmission errors; different levels of protection can be adopted in order to trade off error protection capability against decompressed image quality.
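Why interleaving helps against packet losses can be shown with a basic block interleaver. The abstract does not say which interleaver or channel code is used, so the row/column scheme below is an illustrative assumption: symbols are written row by row (one row per codeword) and transmitted column by column, so a burst of consecutive losses is spread over several codewords.

```python
def interleave(symbols, rows, cols):
    """Write row by row, read column by column."""
    if len(symbols) != rows * cols:
        raise ValueError("length must equal rows * cols")
    return [symbols[r * cols + c] for c in range(cols) for r in range(rows)]


def deinterleave(symbols, rows, cols):
    """Inverse of interleave: restore the original row-major order."""
    if len(symbols) != rows * cols:
        raise ValueError("length must equal rows * cols")
    out = [None] * (rows * cols)
    i = 0
    for c in range(cols):
        for r in range(rows):
            out[r * cols + c] = symbols[i]
            i += 1
    return out


def rows_hit_by_burst(start, length, rows, cols):
    """Rows (codewords) affected by a burst in the transmitted stream."""
    sent = interleave(list(range(rows * cols)), rows, cols)
    return sorted({sent[i] // cols for i in range(start, start + length)})
```

For example, with 3 codewords of 4 symbols each, a burst that wipes out the first three transmitted symbols removes one symbol from each codeword rather than three from the same one, which is the situation a per-codeword channel code handles best.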
This paper presents a copy-paste block detection method based on the characteristics of double JPEG compression. JPEG compression leaves characteristic traces in the DCT coefficients, and these traces are closely related to the quality factor. Copy-paste tampering between JPEG images disrupts the JPEG compression characteristics of the final image. The method in this paper is designed to deal with double JPEG compression in which the DCT blocks differ between the two compressions. The experiments show that our method works effectively on double JPEG compression with different quality factors and is not affected by the difference in DCT blocks.
Embedded multimedia systems are expected to fully embrace the coming many-core wave. As a consequence, parallel programming is being revamped as the only way to exploit the power of the coming chips. While waiting for them, we try to extrapolate some lessons learned from current multi-cores to influence future architectures and programming methods. In this paper we investigate the parallelism and scalability of a JPEG image encoder, a typical embedded application, on several shared memory machines using the OpenMP programming framework. We identify Huffman coding as the bottleneck that prevents the application from scaling beyond a factor of 7x. We propose a strategy to parallelize the Huffman coding that introduces a small degradation in some parts of the image and allows higher speedup factors to be reached. A factor of 18.8x has been reached on an SGI Altix 4700 using 22 threads. Contrasting these results with some previous works using message passing architectures, we consider that the use of OpenMP on top of shared memory architectures should be reconsidered for future chips in favor of message passing architectures and programming models.
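The reported figures can be put in perspective with Amdahl's law. This is an interpretive aid added here, not an analysis from the paper: it assumes the encoder splits into a strictly serial part and a perfectly parallel part, and the reading of the 7x ceiling as an asymptotic limit is likewise an assumption.

```python
def amdahl_speedup(serial_fraction, threads):
    """Speedup predicted by Amdahl's law for a given serial fraction."""
    return 1.0 / (serial_fraction + (1.0 - serial_fraction) / threads)


def implied_serial_fraction(speedup, threads):
    """Serial fraction that would yield `speedup` on `threads` threads."""
    return (threads / speedup - 1.0) / (threads - 1.0)


# Parallel efficiency of the reported result: 18.8x on 22 threads.
efficiency = 18.8 / 22

# Serial fraction consistent with that result under Amdahl's law.
serial_at_18_8 = implied_serial_fraction(18.8, 22)

# If 7x were the asymptotic limit, the serial share would be about 1/7.
serial_if_7x_limit = 1.0 / 7.0

if __name__ == "__main__":
    print(round(efficiency, 3), round(serial_at_18_8, 4), round(serial_if_7x_limit, 3))
```

Under this model, 18.8x on 22 threads corresponds to about 85% parallel efficiency and a serial share below 1%, compared with roughly 14% if sequential Huffman coding capped the speedup at 7x.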
Video identification is extremely important in video browsing, database search and security. In this paper, we present a video hashing method based on MDS (Multi-Dimensional Scaling) that works under various video transmission impairments and is resistant to signal processing. In this method, each frame of the video is divided into blocks, and the low- and middle-frequency DCT coefficients of the luminance component are computed as a disparity measurement for MDS. The video is then mapped to a two-dimensional space using MDS, and a robust hash is generated as a video signature from the distances between the points to which frames are mapped. We found that this video hash is resistant to frame geometric attacks (rotation, shift), random noise, lossy compression and other video transmission impairments. It can be instrumental in building database search, video copy detection and watermarking applications for video.
Shot segmentation is one of the most important steps in video analysis and retrieval. An effective shot segmentation algorithm is proposed that uses DC-reduced graphs built from the DC coefficients of I-frames. The algorithm first constructs the sequence of DC-reduced graphs from the DC coefficients of the I-frames, and then measures the difference between two adjacent DC-reduced graphs using color moments, the spatial distribution entropy of colors and the color histogram. Finally, shot segmentation is achieved by selecting an appropriate threshold. The experiments show that the algorithm improves the performance of shot detection and, to a certain extent, reduces the complexity of detecting gradual shot transitions.
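The final thresholding step can be sketched with one of the three features, the histogram. This is a simplified illustration rather than the proposed algorithm: it treats each DC-reduced image as a flat list of 8-bit intensities, ignores color moments and spatial entropy, and uses a hypothetical threshold of 0.5.

```python
def histogram(pixels, bins=8, max_value=256):
    """Normalized intensity histogram of one DC-reduced image."""
    counts = [0] * bins
    for p in pixels:
        counts[p * bins // max_value] += 1
    total = len(pixels)
    return [c / total for c in counts]


def histogram_distance(h1, h2):
    """Half the L1 distance; 0 for identical, 1 for disjoint histograms."""
    return sum(abs(a - b) for a, b in zip(h1, h2)) / 2


def detect_cuts(dc_images, threshold=0.5):
    """Indices i where image i differs strongly from image i - 1."""
    hists = [histogram(img) for img in dc_images]
    return [
        i for i in range(1, len(hists))
        if histogram_distance(hists[i - 1], hists[i]) > threshold
    ]


# Toy sequence: two dark I-frames followed by two bright ones.
dark = [10] * 16
bright = [200] * 16
cuts = detect_cuts([dark, dark, bright, bright])
```

Here `cuts` contains only index 2, the first bright frame. A single global threshold of this kind suits abrupt cuts; gradual transitions generally need the difference to be accumulated over several frames.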