Inexpensive computer hardware and optical devices has made image/video applications available even for private individuals. This has created a huge demand for image and multimedia databases and other systems, which wo...
详细信息
ISBN:
(纸本)0819450235
Inexpensive computer hardware and optical devices has made image/video applications available even for private individuals. This has created a huge demand for image and multimedia databases and other systems, which work with visual information. Analysis of visual information has not been completely formalized and automated yet. The reason for that is a long tradition of separation of vision and knowledge subsystems. However, brain researches show that vision is a part of a larger information system that converts visual information into knowledge structures. These structures drive vision process, resolve ambiguity and uncertainty in real images via feedback projections, and provide image understanding that is an interpretation of visual information in terms of such knowledge models. It is hard to split such system apart. Vision mechanisms can never be completely understood separately from the informational processes related to knowledge and intelligence. MPEG-7 is an industry-wide effort to incorporate knowledge into image/video code. This article describes basic principles of integration low-level imageprocessing with high-level knowledge reasoning, and shows how image Understanding systems can utilize MPEG-7 standard. Such applications can add to the standard the power of image understanding.
image segmentation and border ownership assignment are two widely studied areas in the computer vision literature. It is well known that both the segmentation and the border ownership assignment play an important role...
详细信息
ISBN:
(纸本)9781467373869
image segmentation and border ownership assignment are two widely studied areas in the computer vision literature. It is well known that both the segmentation and the border ownership assignment play an important role in the visual perception. In this study, a Markov Random Fields model which provides a dual solution for the segmentation and the border ownership assignment is proposed. The proposed system is analyzed both quantitatively and qualitatively.
The task of image coding is to improve the efficiency of visual communication channels. This entails minimizing the amount of data required to transmit the information about the radiance field. We assess this task in ...
详细信息
ISBN:
(纸本)081941543X
The task of image coding is to improve the efficiency of visual communication channels. This entails minimizing the amount of data required to transmit the information about the radiance field. We assess this task in the context of visual communication channel design including image gathering, coding, and Wiener restoration which results in channel designs with significantly improved performance. Conventional assessments are limited to the digital transmission channel beginning at the output of the image-gathering device and ending at the input to the image-display device. Our end-to-end assessment, in addition, incorporates these two devices. This assessment combines Shannon's communication theory with Wiener's restoration filter and with the critical design factors of the image gathering and display devices. This provides the metrics needed to quantify and optimize the end-to-end performance of the visual communication channel. The results are described.
In this paper, we propose a simple but efficient wavelet-based embedded image coder that employs a new inter-band magnitude relationship in the wavelet coefficients and block trees. The proposed scheme includes multi-...
详细信息
ISBN:
(纸本)0819444111
In this paper, we propose a simple but efficient wavelet-based embedded image coder that employs a new inter-band magnitude relationship in the wavelet coefficients and block trees. The proposed scheme includes multi-level dyadic wavelet decomposition, raster scanning within each subband, formation of block trees, partitioning of block trees and adaptive arithmetic entropy coding. Although the proposed scheme is simple, it produces a bitstream with a rich set of features, including SNR scalability and the embedded nature. Experimental results demonstrate that the new scheme is quite competitive to and outperforms other good image coders in the literature.
image annotation is a fundamental and challenging task in the field of semantic image retrieval. In this paper, we deal with image annotation via matrix completion. Concretely, we formulate the problem of annotating t...
详细信息
ISBN:
(纸本)9781467373142
image annotation is a fundamental and challenging task in the field of semantic image retrieval. In this paper, we deal with image annotation via matrix completion. Concretely, we formulate the problem of annotating the tags of an image into a constrained optimization problem, in which the constraint is to keep the consistency with the given initial labels and the objective is to minimize the discrepancy between the correlation in visual content and the correlation in semantic tags. We solve the optimization problem with the linearized alternating direction method. Experimental results on benchmark data demonstrate the effectiveness of our proposals.
This paper presents a new filtering scheme for the removal of impulsive noise in multichannel images. It is based on estimating the probability density function for image pixels in a filtering window by means of the k...
详细信息
ISBN:
(纸本)0819450235
This paper presents a new filtering scheme for the removal of impulsive noise in multichannel images. It is based on estimating the probability density function for image pixels in a filtering window by means of the kernel density estimation method. The filtering algorithm itself is based on the comparison of pixels with their neighborhood in a sliding filter window. The quality of noise suppression and detail preservation of the new filter is measured quantitatively in terms of the standard image quality criteria. The filtering results obtained with the new filter show its excellent ability to reduce noise while simultaneously preserving fine image details.
We propose an effective and efficient local decolorization method in this paper. It is an extension of the global decolorization method [6] which robustly reproduces visual appearance of a color image in the grayscale...
详细信息
ISBN:
(纸本)9781509028603
We propose an effective and efficient local decolorization method in this paper. It is an extension of the global decolorization method [6] which robustly reproduces visual appearance of a color image in the grayscale output. The improvement of the local extension is the effective preservation of the local color contrast which may diminish in the global method. Meanwhile the proposed local extension is efficient in that the computational complexity is O(1) for each pixel, which will be independent of the local kernel size. Quantitative evaluation among existing decolorization methods shows that our local extension performs favorable in both image quality and time cost. Meanwhile, our method can be extended into temporal domain for robust video decolorization.
Medical images generate enormous amounts of data and therefore, efficient image compression techniques need to be employed in order to save on cost and time of storage and transmission respectively. In this research w...
详细信息
ISBN:
(纸本)9781467355636;9781467355629
Medical images generate enormous amounts of data and therefore, efficient image compression techniques need to be employed in order to save on cost and time of storage and transmission respectively. In this research work, we propose a new lossy compression technique by using singular value decomposition (SVD) followed by Huffman coding. In the proposed technique firstly the image is decomposed by using SVD and then the rank is being reduced by ignoring some of the lower singular values as well as rows of hanger and aligner matrices. Then the reconstructed lossy image is being compressed again by using Huffman coding. The compression ratio is obtained by multiplication of the compression ratio achieved by using SVD with the compression ratio achieved by using Huffman coding. The proposed technique is tested on several medical images. The obtained results were also compared with those of conventional Huffman coding and JPEG2000. The quantitative and visual results are showing the superiority of the proposed compression technique over the aforementioned compression technique.
The video captured by different visual sensor in a visual sensor network is first compressed using the block-based compressive sensing algorithm. All the videos are encoded independently at different sub-rates and tra...
详细信息
ISBN:
(纸本)9781467373142
The video captured by different visual sensor in a visual sensor network is first compressed using the block-based compressive sensing algorithm. All the videos are encoded independently at different sub-rates and transmitted to a host workstation for reconstruction. Then, the proposed multi-phase joint reconstruction framework is applied to improve the reconstruction of lower subrate videos. In this case, frames extracted from higher subrate videos are used to produce some side information, which serve as prediction of the counterpart frames in lower subrate videos. Next, the difference between them at the measurement level is calculated. The difference is then added to the prediction to obtain the final reconstruction. The experimental results show that the proposed framework is able to outperform the other frameworks on average by a margin of 1dB to 2dB at different subrates on various multi-view videos.
View synthesis is dedicated to generating arbitrary views of the same scene from given inputs. As an alternative to depth-image-based rendering (DIBR), image warping based view synthesis approaches could automatically...
详细信息
ISBN:
(纸本)9781479961399
View synthesis is dedicated to generating arbitrary views of the same scene from given inputs. As an alternative to depth-image-based rendering (DIBR), image warping based view synthesis approaches could automatically generate visually plausible virtual views in real-time. Recognizing that existing techniques would lead to temporal incoherence and shape distortions in synthesized videos, this paper proposes a novel video warping algorithm which motion saliency map and global motion from reference views are incorporated into motion-aware constraints to maintain temporal coherence in virtual views. Furthermore, a salient curve based disparity constraint is imposed to prevent shape deformations and avoid possible artifacts. Extensive experiments are validated by visual comparison, which demonstrates that the proposed algorithm outperforms existing warping-based methods.
暂无评论