ISBN: 9781467385640 (print)
In this paper we address the problem of hole filling in a point cloud of a 3D object. Even with the most popular 3D scanning devices, such as Microsoft Kinect and Time of Flight (ToF) cameras, occlusions during the scanning process result in missing regions, or holes, in the 3D data. We propose a framework for hole filling in a point cloud of a 3D object using the Riemannian metric tensor and Christoffel symbols as a set of geometric features that capture the inherent geometry of the 3D object. The framework involves detection and extraction of the boundary points surrounding the hole, decomposition of the boundary points into basic shapes, and selective surface interpolation to fill the hole. We demonstrate the performance of the proposed method on point clouds of different complexities and sizes, for both synthetically generated holes and real regions missed during capture, on 3D models of heritage sites.
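For reference, the two geometric features named above have standard textbook definitions for a surface parametrized as $\mathbf{r}(u^1, u^2)$. The block below restates those definitions only; the abstract does not say how the authors estimate the derivatives from a discrete point cloud, so the parametrization $\mathbf{r}$ and the coordinates $u^1, u^2$ are assumptions made for illustration.

```latex
% Metric tensor (first fundamental form) of the surface r(u^1, u^2)
g_{ij} = \frac{\partial \mathbf{r}}{\partial u^i} \cdot \frac{\partial \mathbf{r}}{\partial u^j},
\qquad i, j \in \{1, 2\}

% Christoffel symbols of the second kind, with g^{kl} the inverse of g_{kl}
\Gamma^{k}_{ij} = \frac{1}{2}\, g^{kl}
\left(
  \frac{\partial g_{jl}}{\partial u^i}
  + \frac{\partial g_{il}}{\partial u^j}
  - \frac{\partial g_{ij}}{\partial u^l}
\right)
```

The metric tensor encodes local lengths and angles on the surface, and the Christoffel symbols describe how the metric varies from point to point.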
ISBN: 9781424442195 (print)
In most image processing applications there is a limit beyond which an image cannot be compressed without significantly degrading its quality. In this paper, we propose a new watermarking scheme to embed text in images, using an algorithm that finds the optimum bit plane of the image that can be replaced by the character bit plane. The maximum embedding capacity is kept as a function of the minimum expected quality. This watermarking method is experimentally found to be highly robust against many image processing attacks, including JPEG 2000 compression, as it takes into consideration both lossy stages of JPEG 2000 during its implementation.
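The idea of substituting an image bit plane with text bits under a quality floor can be illustrated with a minimal sketch. This is not the authors' algorithm: the function names, the use of PSNR as the quality measure, and the rule "pick the highest plane that still meets the PSNR floor" are all assumptions chosen for illustration, and the sketch does not model the JPEG 2000 stages mentioned in the abstract.

```python
import math


def text_to_bits(text):
    """Turn ASCII text into a flat list of bits, most significant bit first."""
    return [(byte >> k) & 1 for byte in text.encode("ascii") for k in range(7, -1, -1)]


def bits_to_text(bits):
    """Inverse of text_to_bits; trailing bits that do not fill a byte are ignored."""
    chars = []
    for i in range(0, len(bits) - 7, 8):
        value = 0
        for b in bits[i:i + 8]:
            value = (value << 1) | b
        chars.append(value)
    return bytes(chars).decode("ascii")


def embed_bits(pixels, bits, plane):
    """Replace bit `plane` (0 = least significant) of the first len(bits) 8-bit pixels."""
    if not 0 <= plane <= 7:
        raise ValueError("plane must be in 0..7")
    if len(bits) > len(pixels):
        raise ValueError("payload is longer than the cover image")
    mask = 1 << plane
    out = list(pixels)
    for i, b in enumerate(bits):
        out[i] = (out[i] & ~mask) | ((b & 1) << plane)
    return out


def extract_bits(pixels, count, plane):
    """Read back `count` payload bits from bit `plane`."""
    return [(p >> plane) & 1 for p in pixels[:count]]


def psnr(original, modified):
    """Peak signal-to-noise ratio in dB for 8-bit samples."""
    mse = sum((a - b) ** 2 for a, b in zip(original, modified)) / len(original)
    return math.inf if mse == 0 else 10 * math.log10(255 ** 2 / mse)


def choose_plane(pixels, bits, min_psnr):
    """Hypothetical rule: highest bit plane whose substitution keeps PSNR >= min_psnr."""
    for plane in range(7, -1, -1):
        if psnr(pixels, embed_bits(pixels, bits, plane)) >= min_psnr:
            return plane
    return None
```

A typical use would be to call `choose_plane` with the flattened grayscale pixels and the payload bits, embed with the returned plane, and later recover the text with `extract_bits` followed by `bits_to_text`. Higher planes change pixel values more, which is why the quality floor bounds the choice.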
ISBN: 9781467385640 (print)
This paper proposes an approach for detection and tracking of multiple objects in a video. We detect multiple objects in the frames using an improved version of the Viola-Jones face detector, extract Speeded Up Robust Features (SURF) from the detected objects, and initialize an improved version of the Kanade-Lucas-Tomasi (KLT) tracker to track the objects throughout the video. We use the Gradient Weighted Optical Flow (GWOF) feature to detect both static and moving objects. The improvement over the KLT tracker is achieved using the GWOF measure, enabling the tracking system to work on videos with camera shake. The proposed object tracking method is capable of dealing with multiple challenges such as illumination changes, variable and uneven backgrounds, and poor lighting conditions. The efficacy of the proposed approach is tested on challenging datasets such as ALOV++ and Honda/UCSD and compared with state-of-the-art methods.
ISBN: 9781479915880 (print)
Book flipping scanning refers to the process of recording a book while the user flips through its pages. In recent years it has gained much attention because it significantly reduces the workload of book digitization. It is a challenging task because flipping at random speed and direction makes it difficult to identify the distinct open page images (OPI) that represent each page of the book. In this paper, we propose a fast technique for removing duplicate open pages introduced into the video stream by erroneous flipping. We present an algorithm that exploits cues from the edge information of flipping pages. The nature of the cues extracted from the region of interest (ROI) of the frame determines whether a page is in a flipping or an open state, whereas the temporal position of a flipping page determines the direction of the flipping. Combining this information, we decide whether an open page image is a duplicate or not. Experiments performed on video documents recorded with a standard-resolution camera validate the duplicate open page removal algorithm, which achieves 95% accuracy.
ISBN: 9781424442195 (print)
With rapid improvements in performance and programmability, Graphics Processing Units (GPUs) have fostered considerable interest in substantially reducing the running time of compute-intensive problems. The solution to the view-independent mutual point-pair visibility problem (required for inter-reflections in global illumination) would seem to call for the capabilities of GPUs. In this paper, various ways of parallelizing the construction of the Visibility Map (V-map, a description of mutual visibility) are presented, leading to an implementation that achieves a speedup of 11 or more. We evaluate our scheme qualitatively and quantitatively, and conclude that parallelizing the V-map construction algorithm is eminently useful.
ISBN: 9781467385640 (print)
This paper addresses the problem of segmenting handwritten annotations on scientific research papers. The motivation of this work is to geometrically segment complex cases of handwritten annotations, including marks, cuts and special symbols, along with the regular text. Our work particularly focuses on documents that have multi-oriented handwritten annotations [1] rather than annotations in a controlled scenario [2]. Spectral Partitioning is adopted as the segmentation scheme to separate the printed text from the annotations. A new feature, Envelope Straightness, is developed and included in our feature set. This leads to an improvement in accuracy over state-of-the-art features. The experiments are performed on two datasets: 40 documents authored by two writers from the IAM dataset, comprising only printed and handwritten text, and a self-created dataset of 40 scientific papers from various proceedings annotated by a reader, comprising varied types of annotations. In the framework of spectral partitioning, our feature set achieves a recall of 98.39% for printed text and a precision of 85.40% for handwritten annotations on our dataset. On the IAM dataset, our feature set achieves a recall of 81.89% for printed text and a precision of 69.67% for handwritten annotations. The results achieved on both datasets are better than those obtained using [3] and [1].
ISBN: 9781424442195 (print)
Tracking multiple deformable objects simultaneously in a video is a challenging problem. Tracking is more difficult when the objects touch each other while moving, or suddenly appear in or disappear from a frame of the video. In this paper we propose a parametric active membrane that can change its topology to detect and track multiple objects present in the video. The membrane evolves in image space and along the image intensity surface and, if required, splits into multiple membranes to track multiple objects. Tests on real video segments demonstrate the efficacy of the proposed scheme.
ISBN: 9781467385640 (print)
To cheat an iris recognition system and pass the iris liveness test, an attacker can mount a spoofing attack with a semitransparent contact lens. Such a contact lens is transparent around the iris center and has the fake iris texture of another person printed around the outer region. In this paper, such fake iris images are synthetically created so that they can evade detection even when the pupillary light reflex technique is used. Thereafter, a new liveness detection method is proposed to determine the perceptually invisible boundary between the fake and original iris textures. The responses of Gaussian derivative filters with multiple scales and orientations at each pixel location are clustered using K-means to identify regions with different textures. To make the algorithm robust, it is iterated a certain number of times and a threshold mechanism is imposed to find the correct boundary. The proposed method is shown to achieve high liveness detection performance on a set of 600 fake iris images generated from the UPOL iris database.
ISBN: 9781467385640 (print)
The place of articulation obtained by analysis of the speech signal is useful for visual feedback of articulatory efforts in speech training of hearing-impaired children and for improving pronunciation by learners of second languages. Its estimation by direct imaging of the oral cavity is needed for validating the estimate obtained from the speech signal. For such applications, an automated technique is presented for estimating the place of articulation by graphical processing of the upper and lower contours of the oral cavity image. It iteratively estimates the axial curve as an axis of symmetry of the oral cavity, such that the curve approximately bisects the normals to it. The distance between the contours along the normal to the axial curve gives the oral cavity opening, and the position of the smallest opening provides the place of articulation. The values estimated using the automated technique closely matched those obtained by manual marking of the visually estimated place of maximum constriction in oral cavity images of vowels, stops, and fricatives from the XRMB and MRI databases.
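The final step described above, taking the smallest opening along the normals as the place of articulation, can be sketched as follows. This is a simplified illustration, not the authors' implementation: it assumes the axial curve has already been estimated and that `upper[i]` and `lower[i]` are the points where the i-th normal to that curve meets the upper and lower contours. The function names and the input layout are hypothetical.

```python
import math


def openings(upper, lower):
    """Oral cavity opening along each normal: distance between paired contour points."""
    if len(upper) != len(lower):
        raise ValueError("contours must be sampled along the same normals")
    return [math.dist(u, l) for u, l in zip(upper, lower)]


def place_of_articulation(upper, lower):
    """Return (index, opening, midpoint) for the normal with the smallest opening."""
    d = openings(upper, lower)
    idx = min(range(len(d)), key=d.__getitem__)
    (ux, uy), (lx, ly) = upper[idx], lower[idx]
    midpoint = ((ux + lx) / 2, (uy + ly) / 2)
    return idx, d[idx], midpoint
```

The midpoint of the narrowest normal is returned as a convenient location on the axial curve; the iterative estimation of the axial curve itself is outside the scope of this sketch.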
ISBN: 9781467385640 (print)
Multibiometric systems have recently become a preferred option for human identification over unibiometric systems. They increase the recognition rate and the confidence in the final decision, and simultaneously reduce the failure to enroll rate (FER). In identification mode, rank level fusion is a feasible option because the incompatibility and normalization issues present in score level fusion are not prominent at this level, and because, unlike decision level fusion, sufficient information is available to fuse. We propose an improvement to existing rank level fusion techniques using two levels of hierarchy. Series and parallel combinations are proposed to combine the outputs of various rank level fusion techniques. Two formulations of the series and parallel combinations are extensively evaluated on multi-algorithm, multi-instance and multi-modal biometric systems created from three publicly available datasets: (i) the NIST BSSR1 [1] multimodal biometric score database, (ii) Face Recognition Grand Challenge V2.0 [2] and (iii) LG4000 [3] iris images.
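As background for rank level fusion, the sketch below shows two widely used base rules, Borda count (sum of ranks) and highest rank (best rank from any matcher). The abstract does not name which existing techniques are combined, nor does it define the series and parallel formulations, so this is only an assumed illustration of the kind of first-level fusers such a hierarchy could combine. The function names and the dictionary input format are hypothetical.

```python
def _check(rank_lists):
    """Ensure every matcher ranks the same set of enrolled identities."""
    identities = set(rank_lists[0])
    for ranks in rank_lists:
        if set(ranks) != identities:
            raise ValueError("all matchers must rank the same identities")
    return identities


def borda_count(rank_lists):
    """Fuse by summing ranks (1 = best); the lowest total comes first."""
    identities = _check(rank_lists)
    totals = {i: sum(r[i] for r in rank_lists) for i in identities}
    # Ties are broken alphabetically so the output is deterministic.
    return sorted(identities, key=lambda i: (totals[i], i))


def highest_rank(rank_lists):
    """Fuse by taking each identity's best (smallest) rank across matchers."""
    identities = _check(rank_lists)
    best = {i: min(r[i] for r in rank_lists) for i in identities}
    return sorted(identities, key=lambda i: (best[i], i))
```

Each matcher contributes a mapping from identity to rank, for example `{"A": 2, "B": 1, "C": 3}`. Because rank lists need no score normalization, they can be fused directly, which is the property the abstract cites in favor of rank level fusion.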