In this paper, we propose two designs of redundant finer directional wavelet transform (FiDWT) and explain its application to image denoising. 2-channel perfect reconstruction (PR) checkerboard-shaped filter bank (CSF...
详细信息
ISBN:
(纸本)9781479915880
In this paper, we propose two designs of redundant finer directional wavelet transform (FiDWT) and explain its application to image denoising. 2-channel perfect reconstruction (PR) checkerboard-shaped filter bank (CSFB) is at the core of the designs. The 2-channel CSFB, uses 2-D nonseparable analysis and synthesis filter responses without downsampling/upsampling matrices resulting in redundancy factor of 2. Both these designs have two lowpass and six highpass directional subbands. The hard-thresholding results for image denoising using proposed designs clearly shows improvement in PSNR as well as visual quality of the denoised images. Using the Bayes least squares-Gaussian scale mixture (BLS-GSM), a current state-of-the-art wavelet-based image denoising technique with the proposed two times redundant FiDWT design indicates encouraging results on textural images with much less computational cost.
This paper proposes a novel recommendation engine to suggest coordinated outfits to the users that complements each other. The proposed recommendation model encodes subjective knowledge of clothing experts in Multimed...
详细信息
ISBN:
(纸本)9781467385640
This paper proposes a novel recommendation engine to suggest coordinated outfits to the users that complements each other. The proposed recommendation model encodes subjective knowledge of clothing experts in Multimedia Web Ontology Language (MOWL) and makes use of evidential and causal reasoning scheme to deal with the media properties of concepts. Our approach automatically identifies the user visual personality and interprets the contextual meaning of media features of the garments in the context of input query image. As a result, personalized complementary garments based on occasion of wear are recommended to the user. We have validated our approach with garment preferences of various models with a large collection of shirts and trousers, collected from various websites.
The popular techniques to eliminate temporal redundancy in video sequences are Motion Estimation and Motion Compensation. These techniques have also been used in popular H.264, MPEG-2 and MPEG-4 video coding standards...
详细信息
ISBN:
(纸本)9781479915880
The popular techniques to eliminate temporal redundancy in video sequences are Motion Estimation and Motion Compensation. These techniques have also been used in popular H.264, MPEG-2 and MPEG-4 video coding standards. Conventional fast Block Matching Algorithms (BMA) perform exhaustive search between the current and the reference frame. Although BMA technique gives the exact result but it is computationally very expensive. Another drawback of this method is that it easily gets trapped into the local minima which eventually lead to degradation of the video quality. The proposed Motion Estimation Technique exploits the fact that the human eyes are incapable of detecting different frames when they are run at particular frame rate. The experimental results on various video sequences demonstrate that the proposed technique has outperformed all the existing conventional motion estimation techniques.
This paper describes a sparse representation based approach to learn a classifier for assessing the video quality without a reference. First we calculate the natural scene statistics (NSS) based spatial features of ea...
详细信息
ISBN:
(纸本)9781467385640
This paper describes a sparse representation based approach to learn a classifier for assessing the video quality without a reference. First we calculate the natural scene statistics (NSS) based spatial features of each frame/ image and then learn a dictionary by K-SVD algorithm from NSS features of correct frames. In this work we identified the fact that correct frame can be represented precisely in terms of dictionary atoms but while representing a distorted frame, the error drastically increases with increase in distortion thus we can easily classify the frames as correct and distorted based on error score calculated by sparse representation framework. This framework has been validated on two datasets and we observe improved accuracies as compared to state-of-art algorithms.
In recent years the need of a robust facial component tracking especially lip tracking algorithm has increased dramatically. We implement an active contour (snake) model inspired by human perception for lip tracking. ...
详细信息
ISBN:
(纸本)9781479915880
In recent years the need of a robust facial component tracking especially lip tracking algorithm has increased dramatically. We implement an active contour (snake) model inspired by human perception for lip tracking. In addition to the conventional energy terms for tension, rigidity (internal energy) and gradient magnitude (external energy) we propose to include energy terms from domain knowledge for lip shape constraint and local region profile constraint. Generalized deterministic annealing (GDA) update of the energy functional helps the solution to escape suboptimal local minima in the energy space and give better tracking result. Experimental results show that the proposed method efficiently adapts to the highly deformable lip boundaries even for lips with indistinct edges and colored (adorned) lips where gradient magnitude based or local region based tracking methods respectively fail. We have done a number of experiments to evaluate the performance of our method in comparison with the existing state-of-the-art methods.
Segmentation of cell nuclei in PAP-smear cervical images is of preeminent importance in computer-aided-diagnostic screening technique for cervical cancer. This paper proposes a novel nuclei segmentation approach which...
详细信息
ISBN:
(纸本)9781467385640
Segmentation of cell nuclei in PAP-smear cervical images is of preeminent importance in computer-aided-diagnostic screening technique for cervical cancer. This paper proposes a novel nuclei segmentation approach which builds upon the mean-shift method. The mean-shift method is applied on the cell images which first undergo a decorrelation-stretch contrast enhancement. The results of mean-shift based approach is refined further using morphological operations. We have validated results of segmentation on dataset which includes 900 images with the given ground truth. We demonstrate that our simple and efficient approach yields high validation rate on a large image dataset. In addition, we also show encouraging visual results on another set of more complex real images.
Analysis of a very long video and semantically describe the contents is a challenging task in computervision. The present approaches such as video shot detection and summarization address this problem partially while...
详细信息
ISBN:
(纸本)9781467385640
Analysis of a very long video and semantically describe the contents is a challenging task in computervision. The present approaches such as video shot detection and summarization address this problem partially while maintaining the temporal coherency. To reduce the user efforts for seeing the whole video we have introduced a new technique which combines similar content irrespective of their presence at different time instants. In this approach, we automatically identify only the representative frames corresponding to similar scenes which were captured at different instants of time. We also provide the labels of the objects that are present in the representative frames along with the compact representation for the video. We achieve the task of semantic labelling of frames in a unified framework using a deep learning framework involving pre-trained features through a convolutional neural network. We show that the proposed approach is able to address the semantic labelling effectively as justified by the results obtained for videos of different scenes captured through different modalities.
We present an improved mesh denoising method based on 3D geometric bilateral filtering. Its novelty is that it can preserve the details of the object as well as reduce the noise in an effective manner. The previous ap...
详细信息
ISBN:
(纸本)9781479915880
We present an improved mesh denoising method based on 3D geometric bilateral filtering. Its novelty is that it can preserve the details of the object as well as reduce the noise in an effective manner. The previous approach of geometric bilateral filtering for 3D-scan points has a limitation that it reduces the point density, thereby losing the details present in the object. The approach proposed by us, on the contrary, works on the surface mesh obtained after triangulating the 3D-scan points without any data downsampling. Each vertex of the mesh is repositioned appropriately based on the estimated centroid of the vertices in its local neighborhood and a Gaussian weight function. Experimental results demonstrate its strength, efficiency, and robustness.
In many common applications of Microsoft Kinect (TM) including navigation, surveillance, 3D reconstruction, and the like;it is required to estimate the geometry of mirrors or other reflecting surfaces existing in the ...
详细信息
ISBN:
(纸本)9781479915880
In many common applications of Microsoft Kinect (TM) including navigation, surveillance, 3D reconstruction, and the like;it is required to estimate the geometry of mirrors or other reflecting surfaces existing in the field of view. This often is difficult as in most positions a mirror does not support diffuse reflection of speckles and hence cannot be seen in the Kinect depth map. A mirror shows up as unknown depth. However, suitably placed objects reflecting in the mirror can provide important clues for the orientation and distance of the mirror. In this paper we present a method using a ball and its mirror image to set-up point-to-point correspondence between object and image points to solve for the geometry of the mirror. With this simple estimators are designed for the orientation and distance of a plane vertical mirror with respect to the Kinect camera. In addition an estimator is presented for the diameter of the ball. The estimators are validated through a set of experiments.
Accurate detection of optic disk and macula are of interest in automated analysis of retinal images as they are landmarks in retina and their detection aids in assessing the severity of diseases based on the locations...
详细信息
ISBN:
(纸本)9781467385640
Accurate detection of optic disk and macula are of interest in automated analysis of retinal images as they are landmarks in retina and their detection aids in assessing the severity of diseases based on the locations of abnormalities relative to these landmarks. The general strategy is to design different methods to these landmarks. In contrast, we propose a novel and unified approach for Optic disk and macula detection in this paper using the Generalized Motion Pattern (GMP) [10] [19] which is derived by inducing motion to an image to smooth out unwanted information. The proposed method is unsupervised, parallelizable and handles illumination differences efficiently but assumes a fixed protocol in image acquisition. The proposed method has been tested on five public datasets and obtained results indicate comparable performance to supervised approaches for the same problem.
暂无评论