Many user-end applications require an estimate of the quality of coded video or images without having access to the original, i.e. a no-reference quality metric. Furthermore, in many such applications, the compressed ...
详细信息
Many user-end applications require an estimate of the quality of coded video or images without having access to the original, i.e. a no-reference quality metric. Furthermore, in many such applications, the compressed video bitstream is also not available. The paper describes methods for using the statistical properties of the coded video data to estimate the quantization error caused by compression without accessing either the original pictures or the bitstream. A commonly used quality metric, the peak signal-to-noise ratio (PSNR) is subsequently computed from the estimated quantization error. Since quantization error is the most significant loss incurred during typical coding schemes, the estimated PSNR, or any PSNR-based quality metric may be used to gauge the overall quality of the pictures.
A statistical signal processing approach to multisensor image fusion is presented for concealed weapon detection (CWD). This approach is based on an image formation model in which the sensor images are described as th...
详细信息
A statistical signal processing approach to multisensor image fusion is presented for concealed weapon detection (CWD). This approach is based on an image formation model in which the sensor images are described as the true scene corrupted by additive non-Gaussian distortion. The expectation-maximization (EM) algorithm is used to estimate the model parameters and the fused image. We demonstrate the efficiency of this approach by applying this method to fusion of visual and non-visualimages with emphasis on CWD applications.
An improved Zernike moment using a region-based shape descriptor is presented. The improved Zernike moment not only has rotation invariance, but also has scale invariance that the unimproved Zernike moment does not ha...
详细信息
An improved Zernike moment using a region-based shape descriptor is presented. The improved Zernike moment not only has rotation invariance, but also has scale invariance that the unimproved Zernike moment does not have. The experimental results show that the improved Zernike moment has better invariant properties than the unimproved Zernike moment using a region-based shape descriptor.
This paper presents a segmentation algorithm to extract endocardial and epicardial walls of left ventricle in MR Cardiac images. The algorithm is based on a generalized gradient vector flow (GGVF) snake and a predicti...
详细信息
This paper presents a segmentation algorithm to extract endocardial and epicardial walls of left ventricle in MR Cardiac images. The algorithm is based on a generalized gradient vector flow (GGVF) snake and a prediction of initial contour (PIC). Especially, the proposed algorithm uses physical characteristics of endocardial and epicardial contours, cross profile correlation matching (CPCM), and a mixed interpolation model. In the experiment, the proposed method is applied to short axis MR Cardiac image sets, which are obtained by Siemens, Medinus, and GE MRI Systems. The experimental results show that the proposed algorithm can extract acceptable epicardial and endocardial walls. In addition, we calculate quantitative parameters from the segmented results, which are displayed graphically. The segmented left ventricle is visualized volumetrically by surface rendering. The proposed algorithm is implemented on Windows environment using visual C++.
Standard DCT -based video coding techniques perform good results in terms of data compaction, making feasible the use of digital video in several application frameworks. The price to pay is the introduction of annoyin...
详细信息
Standard DCT -based video coding techniques perform good results in terms of data compaction, making feasible the use of digital video in several application frameworks. The price to pay is the introduction of annoying visual distortions/artefacts in the reconstructed video. The lower the encoding bit-rate, the larger the number of artefacts. Post-processing is a practical solution that achieves a visual enhancement of the compressed images after decoding. Some of the artefacts, such as blocking (tiled-effect aspect) and ringing (ghost effect) have already been widely studied.
This paper presents an approach of the fuzzy recognition of the object nameplate in video sequences based on computer vision. After the nameplate character image is layered, the critical points and critical line part ...
详细信息
This paper presents an approach of the fuzzy recognition of the object nameplate in video sequences based on computer vision. After the nameplate character image is layered, the critical points and critical line part can be extracted in the layers of the images. According to the stroke extracting regularity, the critical points and line parts can be merged into strokes. The stroke structure is used to describe the spatial relation of the conjoint strokes. The text image can be mapped into the feature space of the stroke and stroke structure. In the visual feature space, a fuzzy classifier is extensively used. The visual fuzzy recognition can much improve the recognition of the rotated, distorted, deformed and defiled text image in the video sequences. Experiments carried out on large data sets of video sequences, automatically recognized the licenses of the automobiles in real time, show very promising results.
We propose a new blind image watermarking method in the discrete cosine transform (DCT) domain, which is widely used in compression applications and consequently in digital distribution networks. Four watermarking sch...
详细信息
ISBN:
(纸本)0780374886
We propose a new blind image watermarking method in the discrete cosine transform (DCT) domain, which is widely used in compression applications and consequently in digital distribution networks. Four watermarking schemes are presented and experimentally analyzed, two of them use fixed parameters and the others are adaptive. The characteristics of human visual systems (HVS) are exploited in two adaptive watermarking schemes, so as to achieve high visual quality of watermarked image and robustness of watermarking. Moreover, the proposed algorithms can be easily implemented by existing software or hardware systems only with small modification.
Halftoning is the rendition of continuous-tone pictures on bi-level displays. Here we first review some of the halftoning algorithms which have a direct bearing on our paper and then describe some of the more recent a...
详细信息
Halftoning is the rendition of continuous-tone pictures on bi-level displays. Here we first review some of the halftoning algorithms which have a direct bearing on our paper and then describe some of the more recent advances in the field. Dot diffusion halftoning has the advantage of pixel-level parallelism, unlike the popular error diffusion halftoning method. We first review the dot diffusion algorithm and describe a recent method to improve its image quality by taking advantage of the Human visual System function. Then we discuss the inverse halftoning problem: The reconstruction of a continuous tone image from its halftone. We briefly review the methods for inverse halftoning, and discuss the advantages of a recent algorithm, namely, the Look Up Table (LUT) Method. This method is extremely fast and achieves image quality comparable to that of the best known methods. It can be applied to any halftoning scheme. We then introduce LUT based halftoning and tree-structured LUT (TLUT) halftoning. We demonstrate how halftone image quality in between that of error diffusion and Direct Binary Search (DBS) can be achieved depending on the size of tree structure in TLUT algorithm while keeping the complexity of the algorithm much lower than that of DBS.
Recent research efforts in the development of objective quality and impairment measures for quality assessment of digital video and television have helped to improve and refine models for the human visual system. In p...
详细信息
Recent research efforts in the development of objective quality and impairment measures for quality assessment of digital video and television have helped to improve and refine models for the human visual system. In particular, recent publications by S. Winkler (see "Vision Models and Quality Metrics for images processing Applications", PhD thesis, EPFL, Lausanne, 2000), J. Lubin and D. Fibush, (see T1A1.5 Working Group Document #97-612, ANSI T1 Standards Committee, 1997) and Z. Yu et al. (see Proc. IEEE, Jan. 2002) have incorporated comprehensive vision models as part of their picture quality and impairment metric design. Applications of these vision-model-based perceptual quality or impairment metrics to image/video compression are seen as the next step in delivering perceptual image or video coding systems which cater for improvement in perceived picture quality. This paper subscribes to an approach to perceptual image coder design whereby a vision-model-based perceptual distortion metric is introduced into a conventional coder, e.g. JPEG2000 compliant coder, in place of the mean-square-error measure for rate-distortion optimization. Simulation results show that the new perceptual image coder provides better performance over the JPEG2000 coder with or without the visual masking option (see Taubman, D., IEEE Trans. image Proc., vol.7, p.1158-70, 2000).
This paper presents a study which aims to reproduce the human visual system's behavior and functioning in order to judge the embarrassment (function impairment) procured by the use of a given audiovisual service f...
详细信息
This paper presents a study which aims to reproduce the human visual system's behavior and functioning in order to judge the embarrassment (function impairment) procured by the use of a given audiovisual service for a given task. Implementation is given and results are shown for faces recognition and quality evaluation. visual embarrassment is briefly introduced and discussed.
暂无评论