In social image search, most existing hypergraph methods use the visual and textual features in isolation by treating each feature term as a hyperedge. Nevertheless, they neglect the correlations of visual and textual...
详细信息
ISBN:
(纸本)9781479961399
In social image search, most existing hypergraph methods use the visual and textual features in isolation by treating each feature term as a hyperedge. Nevertheless, they neglect the correlations of visual and textual hyperedges, which are more robust to represent the high-order relationship among vertices. In this paper, we propose a hypergraph with correlated hyperedges (CHH), which introduces high-order relationship of hyperedges into hypergraph learning. Based on CHH, a pairwise visual-textual correlation hypergraph (VTCH) model is used for tagbased social image search. To overcome the large number of newly generated hybrid hyperedges, a bagging-based method is adopted to balance the accuracy and speed. Finally, adaptive hyperedges learning method is used to obtain the relevance score for social image search. The experiments conducted on MIR Flickr show the effectiveness of our proposed method.
In this paper, we propose a new method for removing coding artifacts appeared in JPEG 2000 coded images. The proposed method uses a fuzzy control model to control the weighting function for different image edges accor...
详细信息
ISBN:
(纸本)0819450235
In this paper, we propose a new method for removing coding artifacts appeared in JPEG 2000 coded images. The proposed method uses a fuzzy control model to control the weighting function for different image edges according to the gradient of pixels and membership functions. Regularized post-processing approach and recursive line algorithm are described in this paper. Experimental results demonstrate that the proposed algorithm can significantly improve image quality in terms of objective and subjective evaluation.
Robust visual hash functions have been designed to ensure the data integrity of digital visual data. Such algorithms rely on an efficient scheme for robust visual feature extraction. We propose to use the wavelet-base...
详细信息
ISBN:
(纸本)0387244859
Robust visual hash functions have been designed to ensure the data integrity of digital visual data. Such algorithms rely on an efficient scheme for robust visual feature extraction. We propose to use the wavelet-based JPEG2000 image compression algorithm for feature extraction. We discuss the sensitivity of our proposed method against different malicious data modifications including local image alterations and Stirmark attacks.
image matching and search is gaining significant commercial importance nowadays due to various applications it enables such as augmented reality, image-queries for internet search, etc. Many researchers have effective...
详细信息
ISBN:
(纸本)9781479902880
image matching and search is gaining significant commercial importance nowadays due to various applications it enables such as augmented reality, image-queries for internet search, etc. Many researchers have effectively used color information in an image to improve its matching accuracy. These techniques, however, cannot be directly used for large scale mobile visual search applications that pose strict constraints on the size of the extracted features, computational resources and the system accuracy. To overcome this limitation, we propose a new and effective technique to incorporate color information that can use the SIFT extraction technique. We conduct our experiments on a large dataset containing around 33, 000 images that is currently being investigated in the MPEG-Compact Descriptors for visual Search Standard and show substantial improvement compared to baseline.
In this paper, a simple and effective fractal-based simultaneous image denoising and interpolation scheme is proposed and implemented. The denoising is performed during the fractal encoding process while the interpola...
详细信息
ISBN:
(纸本)0780391950
In this paper, a simple and effective fractal-based simultaneous image denoising and interpolation scheme is proposed and implemented. The denoising is performed during the fractal encoding process while the interpolation is performed during the decoding process. The fractal-based image denoising involves predicting the fractal code of the original noiseless image from the statistics of the noisy observation. This fractal code can then be used to generate a fractally denoised estimate of the original image. The fractal interpolation can be easily achieved during the decoding process by iterating the predicted fractal code on a suitably sized blank intital image seed. The cycle spinning algorithm can also be incorporated in the proposed fractal joint denoising and resizing scheme in order to reduce some of the artifacts and enhance the visual quality of the fractally denoised and resized estimates.
In the present paper, we study the spatialization of the sound field in a room, in particular the evolution of room impulse responses as function of their spatial positions. The presented technique allows us to comple...
详细信息
ISBN:
(纸本)0819450235
In the present paper, we study the spatialization of the sound field in a room, in particular the evolution of room impulse responses as function of their spatial positions. The presented technique allows us to completely characterize the sound field in any arbitrary location if the sound field is known in a certain finite number of positions. Our technique simply starts from the measurements of impulse responses in a finite number of positions and with this information the total sound field can be recreated. An analytical solution of the problem is given for any rectangular room. Further, we determine the number and the spacing between the microphones needed to perfectly reconstruct the sound field up to a certain temporal frequency.
This paper presents a new method to measure the quality of compressed images. The method is based on a Human visual System model and extracts perceptual structural information from images. This model is implemented an...
详细信息
ISBN:
(纸本)0819450235
This paper presents a new method to measure the quality of compressed images. The method is based on a Human visual System model and extracts perceptual structural information from images. This model is implemented and perceptual representations of images are built. These representations describe the structural information of images. For quality assessment, the representation of the original image, actually a reduced reference, is compared to the representation of the distorted image using similarity measures. Similarity scores have shown to be highly correlated with the quality of images produced by human observers in experiments. So the novelty of this method is that structural information is used to assess the quality. This method has been implemented in an application called "Smart Compress" (freely available on the Internet) which allows the user to compress images in JPEG format by choosing the visual quality of the output images.
Contemporary video search and categorization are non-trivial tasks due to the massively increasing amount and content variety of videos. We put forward the study of visual saliency models in video. Such a model is emp...
详细信息
ISBN:
(纸本)9781479902880
Contemporary video search and categorization are non-trivial tasks due to the massively increasing amount and content variety of videos. We put forward the study of visual saliency models in video. Such a model is employed to identify salient objects from the image background. Starting from the observation that motion information in video often attracts more human attention compared to static images, we devise a region contrast based saliency detection model using spatial-temporal cues (RCST). We introduce and study four saliency principles to realize the RCST. This generalizes the previous static image for saliency computational model to video. We conduct experiments on a publicly available video segmentation database where our method significantly outperforms seven state-of-the-art methods with respect to PR curve, ROC curve and visual comparison.
Collective motions, one of the coordinated behaviors in crowd system, widely exist in nature. Orderliness characterizes how well an individual will move smoothly and consistently with his neighbors in collective motio...
详细信息
ISBN:
(纸本)9781479902880
Collective motions, one of the coordinated behaviors in crowd system, widely exist in nature. Orderliness characterizes how well an individual will move smoothly and consistently with his neighbors in collective motions. It is still an open problem in computer vision. In this paper, we propose an orderliness descriptor based on correlation of interactive social force between individuals. In order to include the force correlation between two individuals in a distance, we propose a Social Force Correlation Propagation algorithm to calculate orderliness of every individual effectively and efficiently. We validate the effectiveness of the proposed orderliness descriptor on synthetic simulation. Experimental results on challenging videos of real scene crowds demonstrate that orderliness descriptor can perceive motion with low smoothness and locate disorder.
This paper presents a noise-aided dynamic range compression algorithm using a stochastic resonance model in spatial domain. An input statistics-dependent stochastic resonance (ISSR) model, that is designed for contras...
详细信息
ISBN:
(纸本)9781467373142
This paper presents a noise-aided dynamic range compression algorithm using a stochastic resonance model in spatial domain. An input statistics-dependent stochastic resonance (ISSR) model, that is designed for contrast enhancement of dark images, is used here to enhance an image with both bright and dark areas. The underilluminated regions of such an image are selected as the De Vries Rose region from a human visual system-based segmentation algorithm, and then processed using the ISSR model. It is observed that by semi-adaptively changing the processing parameters with iteration, the processed dark regions and the unprocessed bright regions of an image smoothly merge producing a quality of dynamic range compression in the image. The performance of the proposed algorithm is characterized using image quality index for tone-mapped images and a no-reference perceptual quality measure. Results and comparative analysis suggest notable performance of the proposed algorithm with fewer iteration.
暂无评论