In this paper we propose a novel real time video sailency detection algorithm based on disorder in Motion Field. the proposed algorithm operates on the basic premise that higher disorder pertains to higher information...
详细信息
ISBN:
(纸本)9781467385640
In this paper we propose a novel real time video sailency detection algorithm based on disorder in Motion Field. the proposed algorithm operates on the basic premise that higher disorder pertains to higher information in the scene. Based on the quantified value of the disorder, salient areas in the video frame are demarcated. In order to achieve real time operational capability, the algorithm operates in compressed H.264 domain rather than in pixel domain. the proposed algorithm has been evaluated on standard video sequences and results on real time video surveillance data are also presented.
the recent era of digitization is expected to digitized many old important documents which are degraded due to various reasons. Degraded document image binarization has many challenges like intensity variation, backgr...
详细信息
ISBN:
(纸本)9781467385640
the recent era of digitization is expected to digitized many old important documents which are degraded due to various reasons. Degraded document image binarization has many challenges like intensity variation, background contrast variation, bleed through, text size variation and so on. Many approaches are available for document image binarization, but none can handle all types of degradation at once. We proposed an approach which consists of three stages such as preprocessing, Text-Area detection and post-processing. Preprocessing enhances the contrast of the image. Next stage involves identifying Text-Area. Postprocessing technique takes care of false positives and false negative based on intensity values of preprocessed and gray image. the Performance is evaluated based on various quantitative measures and is compared withthe method regarded best so far. the algorithm is also expected to be independent of the script, hence is tested on Gujarati degraded document images.
Handwritten character recognition has various potential in the field of document imageprocessing. It is one of the important aspects for systems like handwritten optical character recognizer, writer identification/ve...
详细信息
ISBN:
(纸本)9781467385640
Handwritten character recognition has various potential in the field of document imageprocessing. It is one of the important aspects for systems like handwritten optical character recognizer, writer identification/verification, automatic document sorter etc. In Bangla only few attempts are made towards character recognition. In this current study a relatively new attempt is made towards finding the dependency of writer information on character recognition by varying the inputs. this study will provide a better understanding of the input data for character recognition. Also it will help to know the Bangla characters better for writer identification/verification. Here, highest accuracy of 100% is achieved in case of numeral 7 applying LibSVM classifier.
Constructing a high-resolution (HR) image from low-resolution (LR) image(s) has been a very active research topic recently with focus shifting from multi-frames to learning based single-frame super-resolution (SR). Mu...
详细信息
ISBN:
(纸本)9781424442195
Constructing a high-resolution (HR) image from low-resolution (LR) image(s) has been a very active research topic recently with focus shifting from multi-frames to learning based single-frame super-resolution (SR). Multi-frame SR algorithms attempt the exact reconstruction of reality, but are limited to small magnification factors. Learning based SR algorithms learn the correspondences between. LR and HR patches. Accurate replacements or revealing the exact underlying information is not guaranteed in many scenarios. In this paper we propose an alternate solution. We propose to capture images at right zoom such that it has just sufficient amount of information so that further resolution enhancements can be easily achieved using an v off the shelf single-frame SR algorithm. this is true under the assumption that such a zoom factor is not very high, which is true for most man-made structures. the low-resolution. image is divided into small patches and ideal resolution is predicted for every patch. the contextual information is incorporated using a Markov Random Field based prior. Training data is generated from high-quality images and can use any single-frame SR algorithm. Several constraints are proposed to minimize the extent of zoom-in. We validate the proposed approach on synthetic data and real world images to show the robustness.
In this paper, an embedded entropy based image registration scheme has been proposed. Here, Tsallis and Renyi's entropy have been embedded to form a new entropic measure. this parametrized entropy has been used to...
详细信息
ISBN:
(纸本)9781479915880
In this paper, an embedded entropy based image registration scheme has been proposed. Here, Tsallis and Renyi's entropy have been embedded to form a new entropic measure. this parametrized entropy has been used to determine the weighted mutual information (MI) for the CT and MR brain images. the embedded mutual information has been maximized to obtain registration. this notion of embedded mutual information has also been validated in feature space registration. the mutual information with respect to the registration parameter has been found to be a nonlinear curve. It has been found that the feature space registration resulted in higher value mutual information and hence registration process could be smoother. We have used Simulated Annealing algorithm to determine the maximum of this embedded mutual information and hence register the images.
Recent years have witnessed an exponential growth in the use of digital images due to development of high quality digital cameras and multimedia technology. Easy availability of image editing software has made digital...
详细信息
ISBN:
(纸本)9781479915880
Recent years have witnessed an exponential growth in the use of digital images due to development of high quality digital cameras and multimedia technology. Easy availability of image editing software has made digital imageprocessing very popular. Ready to use software are available on internet which can be easily used to manipulate the images. In such an environment, the integrity of the image can not be taken for granted. Malicious tampering has serious implication for legal documents, copyright issues and forensic cases. Researchers have come forward with large number of methods to detect image tampering. the proposed method is based on hash generation technique using singular value decomposition. Design of an efficient hash vector as proposed will help in detection and localization of image tampering. the proposed method shows that it is robust against content preserving manipulation but extremely sensitive to even very minute structural tampering.
Human authentication can now be seen as a crucial social problem. In this paper a multimodal authentication system is presented which is highly reliable and fuses iris, finger-knuckleprint and palmprint image matching...
详细信息
ISBN:
(纸本)9781467385640
Human authentication can now be seen as a crucial social problem. In this paper a multimodal authentication system is presented which is highly reliable and fuses iris, finger-knuckleprint and palmprint image matching scores. Segmented ROI are preprocessed using DCP (Differential Code Pattern) to obtain robust corner features. Later they are matched using the GOF (Global Optical Flow) based dissimilarity measure. the proposed system has been tested on Casia Interval and Lamp iris, PolyU finger-knuckle-print and PolyU and Casia palmprint, public databases. the proposed system has shown good performance over all unimodal databases while over multimodal (fusion of all three) databases it has shown perfect performance (i:e: CRR = 100% with EER = 0%).
this paper addresses the problem of reconstruction of specular surfaces using a combination of Dynamic Programming and Markov Random Fields formulation. Unlike traditional methods that require the exact position of en...
详细信息
ISBN:
(纸本)9781479915880
this paper addresses the problem of reconstruction of specular surfaces using a combination of Dynamic Programming and Markov Random Fields formulation. Unlike traditional methods that require the exact position of environment points to be known, our method requires only the relative position of the environment points to be known for computing approximate normals and infer shape from them. We present an approach which estimates the depth from dynamic programming routine and MRF stereo matching and use MRF optimization to fuse the results to get the robust estimate of shape. We used smooth color gradient image as our environment texture so that shape can be recovered using just a single shot. We evaluate our method using synthetic experiments on 3D models like Stanford bunny and show the real experiment results on golden statue and silver coated statue.
We propose that appearance descriptors derived from the complete animacy of an object during its scene presence more comprehensively capture the essence of an object than descriptors that merely encode uncorrelated se...
详细信息
ISBN:
(纸本)9781424442195
We propose that appearance descriptors derived from the complete animacy of an object during its scene presence more comprehensively capture the essence of an object than descriptors that merely encode uncorrelated sets of its instantaneous appearances. During its frame presence, an object presents itself in many poses with differing frequencies, thus generating multiple modes of varying strengths in the appearance feature space. Further we utilize tracking information to extract the set of all appearances of the object, while excluding those intervals where the object is partly or fully occluded by other objects or back-ground entities. this allows for completely unsupervised computation of the descriptors that consist of time-indexed vectors from shape and Haar feature templates which are then clustered to obtain appearance modes. these lead to the construction of object-animacy models as probability distributions over the space of co-occurrent shape and Haar templates. these object models are clustered further in an unsupervised manner by using different spatial clustering algorithms with a Bhattacharya distance metric between object models. Unsupervised categorization results on simple (PETS2000) and complex traffic scenes consisting of a wide variety of objects show robust performance of the proposed approach.
In this paper, an uncompressed domain video watermarking scheme resilient to temporal adaptation is proposed for scalable video coding. In the proposed scheme, each temporal layer has been separately embedded with a d...
详细信息
ISBN:
(纸本)9781467385640
In this paper, an uncompressed domain video watermarking scheme resilient to temporal adaptation is proposed for scalable video coding. In the proposed scheme, each temporal layer has been separately embedded with a different watermark which is generated by DCT domain decomposition of a single watermark image. A zigzag sequence of block wise DCT coefficients of the watermark image is partitioned into non-overlapping sets and each set is embedded separately into different temporal layers. the base layer is embedded withthe first set of DCT coefficient (which includes DC coefficient of each block) and successive layers are embedded with successive nonoverlapping coefficient sets. the coefficients of each set is chosen in such a fashion that uniform energy distribution across all temporal layers can be maintained. Experimental results show that the proposed scheme is robust against temporal scalability and robustness of the watermark increases withthe addition of successive enhancement layers.
暂无评论