Hand Gesture Recognition is one of the natural ways of human computer interaction (HCI) which has wide range of technological as well as social applications. A dynamic hand gesture can be characterized by its shape, p...
详细信息
ISBN:
(纸本)9781467385640
Hand Gesture Recognition is one of the natural ways of human computer interaction (HCI) which has wide range of technological as well as social applications. A dynamic hand gesture can be characterized by its shape, position and movement. This paper presents a user independent framework for dynamic hand gesture recognition in which a novel algorithm for extraction of key frames is proposed. This algorithm is based on the change in hand shape and position, to find out the most important and distinguishing frames from the video of the hand gesture, using certain parameters and dynamic threshold. For classification, Multiclass Support Vector Machine (MSVM) is used. Experiments using the videos of hand gestures of indian Sign Language show the effectiveness of the proposed system for various dynamic hand gestures. The use of key frame extraction algorithm speeds up the system by selecting essential frames and therefore eliminating extra computation on redundant frames.
In this paper, a compressed domain blind watermarking scheme is proposed which embeds the watermark by altering the number of nonzero transform co-efficients (NNZ) of 4 x 4 transform blocks of the HEVC video sequence....
详细信息
ISBN:
(纸本)9781467385640
In this paper, a compressed domain blind watermarking scheme is proposed which embeds the watermark by altering the number of nonzero transform co-efficients (NNZ) of 4 x 4 transform blocks of the HEVC video sequence. To embed the watermark, firstly, temporally homogeneous blocks having relatively less motion are selected. In this work, watermark is inserted in the Intra (I) frame and the motion characteristics of the I frame has been determined using the motion information of the Inter (P or B) predicted frames of its close neighborhood. The watermark is embedded by altering the NNZ difference of 4 x 4 transform blocks in the consecutive intra predicted frames. A comprehensive set of experiments is carried out to show that the scheme is robust against re-compression attacks while maintaining a descent visual quality (PSNR), the bit increase rate (BIR) of the watermarked video.
We propose a method to address the problem of Video Summarization, which aims to generate a summarized video by preserving the salient activities of the input video for a user specified time. We model the motion of a ...
详细信息
ISBN:
(纸本)9781467385640
We propose a method to address the problem of Video Summarization, which aims to generate a summarized video by preserving the salient activities of the input video for a user specified time. We model the motion of a feature points as Gaussian Mixture Model (GMM) to select the key feature points, which in-turn estimate the salient frames. The saliency of feature points depends on the contribution of motion in entire video and user specified time duration of summary. We generate a summarized video keeping chronology of salient frames to avoid the viewing ambiguity for the viewers. We demonstrate the proposed method for different stored surveillance videos and achieve retention ratio as 1 for the closest condensation ratio obtained for stroboscopic approach and also demonstrate the proposed GMM method with interactively selected region of interest (ROI) based results.
This paper introduces a computer aided diagnosis (CAD) technique for segmentation of mass in breast ultrasound (BUS) images followed by an efficient classification of the image into benign or malignant one. The presen...
详细信息
ISBN:
(纸本)9781467385640
This paper introduces a computer aided diagnosis (CAD) technique for segmentation of mass in breast ultrasound (BUS) images followed by an efficient classification of the image into benign or malignant one. The presence of speckle noise, low contrast and blurred boundary of mass in a BUS image makes it challenging to determine the mass, which is the region of interest (ROI) in the current work. Detecting an accurate ROI in turn results in efficient feature extraction and classification. In current work, image enhancement and speckle noise reduction are implemented for preprocessing in a simple but efficient way through filtering techniques. The results of the preprocessing stage are as effective as those obtained using traditional speckle reduction anisotropic diffusion (SRAD) algorithm. ROI is then accurately determined on preprocessed image by employing local region based active contour method. BUS images are classified through textural, morphological and histogram oriented feature metrics in this work. The obtained features are dimensionally reduced using principal component analysis (PCA) and classified through support vector machine (SVM) method. The proposed method is tested on several images and found to be very effective having an accuracy of 95.7% with very high specificity and positive predictive value (PPV).
In this paper we address the problem of hole filling in a point cloud of 3D object. Even with most popular 3D scanning devices like Microsoft Kinect and Time of Flight (ToF) cameras, occlusions during the scanning pro...
详细信息
ISBN:
(纸本)9781467385640
In this paper we address the problem of hole filling in a point cloud of 3D object. Even with most popular 3D scanning devices like Microsoft Kinect and Time of Flight (ToF) cameras, occlusions during the scanning process result in occurrence of missing regions or holes in 3D data. We propose a framework for hole filling in a point cloud of 3D object using Riemannian metric tensor and Christoffel symbols as a set of geometric features, which capture the inherent geometry of the 3D object. The framework involves detection and extraction of the boundary points surrounding the hole, decomposition of boundary points into basic shapes and selective surface interpolation to fill the hole. We demonstrate the performance of the proposed method on point clouds with different complexities and sizes for both synthetically generated holes and real missing regions during the capturing process on 3D models of heritage sites.
In this paper, a complete database of handwritten atomic Odia characters is suggested. The first version of the database has been modeled and named OHCSv1.0 (Odia handwritten character set). The database comprises of ...
详细信息
ISBN:
(纸本)9781467385640
In this paper, a complete database of handwritten atomic Odia characters is suggested. The first version of the database has been modeled and named OHCSv1.0 (Odia handwritten character set). The database comprises of 17,100 transcribed characters, each collected twice from 150 unique people at different point of time. Each character has 300 number of occurrences. The character images are standardized to a size of 6 4 x 6 4 pixels. A novel framework for perceiving transcribed Odia characters from this database has also been proposed. The character images are gathered into various groups in view of their shape components utilizing an incremental spectral clustering algorithm. During testing, affinity of probe character to a cluster is first decided. Subsequently, the trained classifier recognizes the character inside the cluster. Suitable simulation has been carried out to validate the scheme.
Development of computer-aided diagnosis (CAD) systems for early detection of the pathological brain is essential to save medical resources. In recent years, a variety of techniques have been proposed to upgrade the sy...
详细信息
ISBN:
(纸本)9781467385640
Development of computer-aided diagnosis (CAD) systems for early detection of the pathological brain is essential to save medical resources. In recent years, a variety of techniques have been proposed to upgrade the system's performance. In this paper, a new automatic CAD system for brain magnetic resonance (MR) image classification is proposed. The method utilizes two-dimensional discrete wavelet transform to extract features from the MR images. The dimension of the features have been reduced using principal component analysis (PCA) and linear discriminant analysis (LDA), to obtain the more significant features. Finally, the reduced set of features are applied to the random forests classifier to determine the normal or pathological brain. A standard dataset, Dataset-255 of 255 images (35 normal and 220 pathological) is used for the validation of the proposed scheme. To improve the generalization capability of the scheme, 5-fold stratified cross-validation procedure is utilized. The results of the experiments reveal that the proposed scheme is superior to other state-of-the-art techniques in terms of classification accuracy with substantially reduced number of features.
Existing approaches on newborn identification focuses on recognizing them using face, inked footprints, and palm prints. While palm and inked footprints are intrusive modalities, face modality suffers from non-coopera...
详细信息
ISBN:
(纸本)9781467385640
Existing approaches on newborn identification focuses on recognizing them using face, inked footprints, and palm prints. While palm and inked footprints are intrusive modalities, face modality suffers from non-cooperative nature of newborns. In this research, we investigate utilization of binocular region for recognizing newborns, as this region is considered to be relatively stable in face biometrics literature. We collect a database consisting of 402 face images pertaining to 50 babies of less than 6 months of age. A set of experiments pertaining to various descriptors, including local binary patterns, dense scale invariant feature transform, and Gabor features, along with subspace learning using principal component analysis, linear discriminant analysis, and independent component analysis. Recognition performance of various approaches are compared with respect to face and binocular modalities. Verification results are reported in terms of Receiver operating characteristics curves respectively. The results show that binocular can outperform face as a modality for newborn recognition.
Place of articulation obtained by analysis of the speech signal is useful for visual feedback of articulatory efforts for speech training of hearing impaired children and for improving pronunciation by learners of sec...
详细信息
ISBN:
(纸本)9781467385640
Place of articulation obtained by analysis of the speech signal is useful for visual feedback of articulatory efforts for speech training of hearing impaired children and for improving pronunciation by learners of second languages. Its estimation by direct imaging of the oral cavity is needed for validating the estimation from the speech signal. For such applications, an automated technique is presented for estimating the place of articulation by graphical processing of the upper and lower contours of the oral cavity image. It iteratively estimates the axial curve as an axis of symmetry of the oral cavity, such that the curve approximately bisects the normals to it. Distance between the contours along the normal to the axial curve gives the oral cavity opening and position of the smallest opening provides the place of articulation. The values estimated using the automated technique closely matched those obtained by manual marking of the visually estimated place of maximum constriction for the oral cavity images of vowels, stops, and fricatives, from the XRMB and MRI databases.
In order to chisel an iris recognition system and pass in the iris liveness test, an attacker can create semitransparent contact lens spoofing. Such a contact lens is transparent around the iris center and has fake ir...
详细信息
ISBN:
(纸本)9781467385640
In order to chisel an iris recognition system and pass in the iris liveness test, an attacker can create semitransparent contact lens spoofing. Such a contact lens is transparent around the iris center and has fake iris texture of another person printed around the outer region. In this paper, such fake iris images are synthetically created which can obviate sleuthing even after using pupillary light reflex technique. Thereafter, new liveness detection is proposed to determine the perceptually invisible boundary between the fake and original iris textures. Response of Gaussian derivative filters with multiple scales and orientations at each pixel location is clustered using K- means to ascertain regions with different textures. To give robustness to the algorithm, it is iterated a certain number of times and a threshold mechanism is imposed to find the correct boundary. The proposed method is shown to achieve high liveness performance by generating a set of 600 fake iris images from the UPOL iris database.
暂无评论