The demand for human face enhancement in pictures is increasing. This paper describes an effort to utilize state-of-the-art signal processing technologies for the enhancement of the human face in pictures. First, seve...
详细信息
ISBN:
(纸本)9781479961399
The demand for human face enhancement in pictures is increasing. This paper describes an effort to utilize state-of-the-art signal processing technologies for the enhancement of the human face in pictures. First, several non-linear filters are examined, and it is demonstrated that the total variation regularization filter (TV filter) shows the remarkably best effect for skin smoothing including the removal of wrinkles, stains, moles, and freckles. The reason is analyzed in detail. Then, super-resolution technology is utilized to enhance the image quality for specific parts of the face, such as the eye line, pupil, eyelashes, and hair. Facial part extraction technology is also utilized for the enhancement of selected face parts. Interestingly, we found that the super-resolution technology not only improves the clarity of the image but also increases the brilliancy in the pupil and hair. The super-resolution technology used in this paper is based on the non-linear filtering method developed for 4K high-definition television.
Correct assessment of image fidelity is fundamental to the development of efficient image compression schemes especially to that of the schemes taking characteristics of the human visual system into account. In this p...
详细信息
ISBN:
(纸本)9781424412501
Correct assessment of image fidelity is fundamental to the development of efficient image compression schemes especially to that of the schemes taking characteristics of the human visual system into account. In this paper, a fidelity metric for assessing the visual quality of color images is presented. The proposed fidelity metric is designed to measure the perceivable distortion in the quasi-uniform color space of CIELAB. To evaluate the perceptibility of distortion, a color visual model is employed to estimate the visibility threshold of distortion for each color pixel. The visibility threshold of each color is measured as the size of the sphere of just-noticeable color difference and is modeled as a function of chroma, local luminance gradient and background uniformity. Simulation results show that the assessment of the proposed fidelity metric is more correspondent with the subjective assessment than other metrics using PSNR and CIELAB Delta E.
In social image search, most existing hypergraph methods use the visual and textual features in isolation by treating each feature term as a hyperedge. Nevertheless, they neglect the correlations of visual and textual...
详细信息
ISBN:
(纸本)9781479961399
In social image search, most existing hypergraph methods use the visual and textual features in isolation by treating each feature term as a hyperedge. Nevertheless, they neglect the correlations of visual and textual hyperedges, which are more robust to represent the high-order relationship among vertices. In this paper, we propose a hypergraph with correlated hyperedges (CHH), which introduces high-order relationship of hyperedges into hypergraph learning. Based on CHH, a pairwise visual-textual correlation hypergraph (VTCH) model is used for tagbased social image search. To overcome the large number of newly generated hybrid hyperedges, a bagging-based method is adopted to balance the accuracy and speed. Finally, adaptive hyperedges learning method is used to obtain the relevance score for social image search. The experiments conducted on MIR Flickr show the effectiveness of our proposed method.
In this paper, we propose a new method for removing coding artifacts appeared in JPEG 2000 coded images. The proposed method uses a fuzzy control model to control the weighting function for different image edges accor...
详细信息
ISBN:
(纸本)0819450235
In this paper, we propose a new method for removing coding artifacts appeared in JPEG 2000 coded images. The proposed method uses a fuzzy control model to control the weighting function for different image edges according to the gradient of pixels and membership functions. Regularized post-processing approach and recursive line algorithm are described in this paper. Experimental results demonstrate that the proposed algorithm can significantly improve image quality in terms of objective and subjective evaluation.
image matching and search is gaining significant commercial importance nowadays due to various applications it enables such as augmented reality, image-queries for internet search, etc. Many researchers have effective...
详细信息
ISBN:
(纸本)9781479902880
image matching and search is gaining significant commercial importance nowadays due to various applications it enables such as augmented reality, image-queries for internet search, etc. Many researchers have effectively used color information in an image to improve its matching accuracy. These techniques, however, cannot be directly used for large scale mobile visual search applications that pose strict constraints on the size of the extracted features, computational resources and the system accuracy. To overcome this limitation, we propose a new and effective technique to incorporate color information that can use the SIFT extraction technique. We conduct our experiments on a large dataset containing around 33, 000 images that is currently being investigated in the MPEG-Compact Descriptors for visual Search Standard and show substantial improvement compared to baseline.
Robust visual hash functions have been designed to ensure the data integrity of digital visual data. Such algorithms rely on an efficient scheme for robust visual feature extraction. We propose to use the wavelet-base...
详细信息
ISBN:
(纸本)0387244859
Robust visual hash functions have been designed to ensure the data integrity of digital visual data. Such algorithms rely on an efficient scheme for robust visual feature extraction. We propose to use the wavelet-based JPEG2000 image compression algorithm for feature extraction. We discuss the sensitivity of our proposed method against different malicious data modifications including local image alterations and Stirmark attacks.
In this paper, a simple and effective fractal-based simultaneous image denoising and interpolation scheme is proposed and implemented. The denoising is performed during the fractal encoding process while the interpola...
详细信息
ISBN:
(纸本)0780391950
In this paper, a simple and effective fractal-based simultaneous image denoising and interpolation scheme is proposed and implemented. The denoising is performed during the fractal encoding process while the interpolation is performed during the decoding process. The fractal-based image denoising involves predicting the fractal code of the original noiseless image from the statistics of the noisy observation. This fractal code can then be used to generate a fractally denoised estimate of the original image. The fractal interpolation can be easily achieved during the decoding process by iterating the predicted fractal code on a suitably sized blank intital image seed. The cycle spinning algorithm can also be incorporated in the proposed fractal joint denoising and resizing scheme in order to reduce some of the artifacts and enhance the visual quality of the fractally denoised and resized estimates.
In the present paper, we study the spatialization of the sound field in a room, in particular the evolution of room impulse responses as function of their spatial positions. The presented technique allows us to comple...
详细信息
ISBN:
(纸本)0819450235
In the present paper, we study the spatialization of the sound field in a room, in particular the evolution of room impulse responses as function of their spatial positions. The presented technique allows us to completely characterize the sound field in any arbitrary location if the sound field is known in a certain finite number of positions. Our technique simply starts from the measurements of impulse responses in a finite number of positions and with this information the total sound field can be recreated. An analytical solution of the problem is given for any rectangular room. Further, we determine the number and the spacing between the microphones needed to perfectly reconstruct the sound field up to a certain temporal frequency.
Contemporary video search and categorization are non-trivial tasks due to the massively increasing amount and content variety of videos. We put forward the study of visual saliency models in video. Such a model is emp...
详细信息
ISBN:
(纸本)9781479902880
Contemporary video search and categorization are non-trivial tasks due to the massively increasing amount and content variety of videos. We put forward the study of visual saliency models in video. Such a model is employed to identify salient objects from the image background. Starting from the observation that motion information in video often attracts more human attention compared to static images, we devise a region contrast based saliency detection model using spatial-temporal cues (RCST). We introduce and study four saliency principles to realize the RCST. This generalizes the previous static image for saliency computational model to video. We conduct experiments on a publicly available video segmentation database where our method significantly outperforms seven state-of-the-art methods with respect to PR curve, ROC curve and visual comparison.
This paper presents a new method to measure the quality of compressed images. The method is based on a Human visual System model and extracts perceptual structural information from images. This model is implemented an...
详细信息
ISBN:
(纸本)0819450235
This paper presents a new method to measure the quality of compressed images. The method is based on a Human visual System model and extracts perceptual structural information from images. This model is implemented and perceptual representations of images are built. These representations describe the structural information of images. For quality assessment, the representation of the original image, actually a reduced reference, is compared to the representation of the distorted image using similarity measures. Similarity scores have shown to be highly correlated with the quality of images produced by human observers in experiments. So the novelty of this method is that structural information is used to assess the quality. This method has been implemented in an application called "Smart Compress" (freely available on the Internet) which allows the user to compress images in JPEG format by choosing the visual quality of the output images.
暂无评论