We propose a novel approach of utilizing phenomic traits to automatically quantify stress in plants using machine learning techniques. Moisture deficit conditions cause change in leaf color due to decrease in chloroph...
详细信息
We propose a novel approach of utilizing phenomic traits to automatically quantify stress in plants using machine learning techniques. Moisture deficit conditions cause change in leaf color due to decrease in chlorophyll content as chloroplast is damaged by active oxygen species. Therefore, the proposed technique uses leaf color as the phenomic trait to assess stress levels using Relative water content (RWC) as a quantitative proxy. We extracted the change in leaf color in response to drought stress using the color features obtained using Random forest. A regressor has been modeled to predict the stress level of rice genotypes via RWC by employing colour histogram as a feature vector. The experiment was performed with pot images of different rice genotypes under normal and drought stressed conditions. We report a correlation coefficient of 0.89 obtained using this model demonstrating the capability of the presented technique for stress level predictions.
image dehazing either using single visible image or using visible and near-infrared (NIR) image pair has seen growing interest in last decade for improving visibility in landscape photographs. In this paper, we propos...
详细信息
image dehazing either using single visible image or using visible and near-infrared (NIR) image pair has seen growing interest in last decade for improving visibility in landscape photographs. In this paper, we propose a novel approach for image dehazing scheme using a pair of visible and NIR images. The dehazing mechanism estimates depth map and airlight color using the visible-NIR scene statistics and uses them to form a haze-free image. Experiments on a variety of hazy images demonstrate that our method achieves high degree of detail recovery over the existing image dehazing algorithms. The resultant images exhibit a very good blend of details, contrast and color. The proposed algorithm is less computationally demanding and is fully automatic. The results are superior in both visual as well as quantitative analysis compared to state-of-the-art image dehazing algorithms.
In this paper we represent a new technique to interact with the computer in a non-tangible way. Specifically we have designed a Media Player system controller by Facial Expressions and Gestures (MP-FEG). We detect and...
详细信息
In this paper we represent a new technique to interact with the computer in a non-tangible way. Specifically we have designed a Media Player system controller by Facial Expressions and Gestures (MP-FEG). We detect and track one landmark point on the finger and 18 landmark points on the lips to capture the movement of the finger and the lips of the user. The movement patterns are classified into hand gestures and facial expressions using support vector machine (SVM). We have achieved ~98.65% and ~100% recognition accuracies for hand-gestures and facial expressions respectively. Occurrence of each of these actions (5 hand-gesturs and 3 facial expressions) is associated with a command to control (e.g., to select, play, pause the video) the video player. Perceptional quality analysis by user survey rates the experience of the non-tangible human-computer-interaction facilited by the proposed technique as `good'.
Attribute-based facial image retrieval has wide range of applications, such as in law enforcement, online social networks, etc. The problem becomes more challenging if the images are from different modalities. For exa...
详细信息
Attribute-based facial image retrieval has wide range of applications, such as in law enforcement, online social networks, etc. The problem becomes more challenging if the images are from different modalities. For example, the input is a sketch or a composite image, and the task is to retrieve photo images which have the same facial attributes as the input data. In this work, we propose a learning-based approach, in which two transformations are learnt to transform the training images from the two modalities with associated attribute annotations such that images which have similar attributes move closer to each other, and images with very different attributes move farther from each other in the transformed space. Given a query image, it is first transformed to the learnt space in which the images with similar attributes are retrieved. The same framework works seamlessly if the images to be retrieved are of same or different modality as compared to the query data. The attributes of the query image are also automatically obtained as a byproduct of the algorithm. Extensive experimental evaluation on three datasets shows the effectiveness of the proposed approach.
In this paper, a real time multi-view human activity recognition model using a RGB-D (Red Green Blue-Depth) sensor is proposed. The method receives as input RGB-D data streams in real time from a Kinect for Windows V2...
详细信息
In this paper, a real time multi-view human activity recognition model using a RGB-D (Red Green Blue-Depth) sensor is proposed. The method receives as input RGB-D data streams in real time from a Kinect for Windows V2 sensor. Initially, a skeleton-tracking algorithm is applied which gives 3D joint information of 25 unique joints. The presented approach uses a weighted version of the Fast Dynamic Time Warping that weighs the importance of each skeleton joint towards the Dynamic Time Warping (DTW) similarity cost. To recognize multi-view human activities, the weighted Dynamic Time Warping warps a time sequence of joint positions to reference time sequences and produces a similarity value. Experimental results demonstrate that the proposed method is robust, flexible and efficient with respect to multiple views activity recognition, scale and phase variations activities at different realistic scenes.
Human authentication can now be seen as a crucial social problem. In this paper a multimodal authentication system is presented which is highly reliable and fuses iris, finger-knuckle-print and palmprint image matchin...
详细信息
Human authentication can now be seen as a crucial social problem. In this paper a multimodal authentication system is presented which is highly reliable and fuses iris, finger-knuckle-print and palmprint image matching scores. Segmented ROI are preprocessed using DCP (Differential Code Pattern) to obtain robust corner features. Later they are matched using the GOF (Global Optical Flow) based dissimilarity measure. The proposed system has been tested on Casia Interval and Lamp iris, PolyU finger-knuckle-print and PolyU and Casia palmprint, public databases. The proposed system has shown good performance over all unimodal databases while over multimodal (fusion of all three) databases it has shown perfect performance (i.e. CRR = 100% with EER = 0%).
Place of articulation obtained by analysis of the speech signal is useful for visual feedback of articulatory efforts for speech training of hearing impaired children and for improving pronunciation by learners of sec...
详细信息
Place of articulation obtained by analysis of the speech signal is useful for visual feedback of articulatory efforts for speech training of hearing impaired children and for improving pronunciation by learners of second languages. Its estimation by direct imaging of the oral cavity is needed for validating the estimation from the speech signal. For such applications, an automated technique is presented for estimating the place of articulation by graphical processing of the upper and lower contours of the oral cavity image. It iteratively estimates the axial curve as an axis of symmetry of the oral cavity, such that the curve approximately bisects the normals to it. Distance between the contours along the normal to the axial curve gives the oral cavity opening and position of the smallest opening provides the place of articulation. The values estimated using the automated technique closely matched those obtained by manual marking of the visually estimated place of maximum constriction for the oral cavity images of vowels, stops, and fricatives, from the XRMB and MRI databases.
Lung tumor estimation on imaging modalities is required to assess the extent of the tumor for diagnosis. Segmentation of tumor in Cone-Beam Computed Tomography (CBCT) images is non-trivial due to its imaging artifacts...
详细信息
Lung tumor estimation on imaging modalities is required to assess the extent of the tumor for diagnosis. Segmentation of tumor in Cone-Beam Computed Tomography (CBCT) images is non-trivial due to its imaging artifacts. Here we propose a novel technique for image registration of 18-Fluoro deoxyglucose Positron Emission Tomography (PET) and Computed Tomography(CT) images with CBCT images. The computation is performed in two stages. In the first stage, mutual information based rigid image registration is performed to obtain a rough global alignment of CBCT image with the corresponding PET and CT images. This result is fed to the second stage to perform deformable image registration between a pair of corresponding CBCT volumes of the same patient captures at different time instances using a viscous fluid model. The technique is adapted in both 2D (for slicewise computation) and 3D space (for computing with volume), and a comparative performance is presented with a simulated deformation model.
Automatic and reliable identification of pedestrians from multiple camera views is very important for video surveillance and can save a lot of manual effort. The significant variations in viewpoints, poses, illuminati...
详细信息
Automatic and reliable identification of pedestrians from multiple camera views is very important for video surveillance and can save a lot of manual effort. The significant variations in viewpoints, poses, illumination and occlusions makes this problem very challenging. Most of the existing approaches addressing this problem handle drastic viewpoint change in a supervised way and thus require labelling new training data for a different pair of camera views. In this paper, we present a novel approach for pedestrian re-identification using stereo matching, which does not require any kind of training. The cost of the stereo matching of two images is used for evaluating the similarity of the images, without performing 3-D reconstruction. We show that this cost is robust to the large pose variations observed in the images captured from multiple cameras. The proposed pedestrian re-identification algorithm is built on top of a dynamic programming stereo matching algorithm. Experimental evaluation on the challenging VIPeR dataset shows the effectiveness of the proposed approach.
Steganography is the art of hiding secret data inside a carrier media. Most steganographic techniques suffer from the drawback that they are unable to retain the perceptual quality. Using saliency cues for developing ...
详细信息
Steganography is the art of hiding secret data inside a carrier media. Most steganographic techniques suffer from the drawback that they are unable to retain the perceptual quality. Using saliency cues for developing an adaptive steganographic technique can help to alleviate this problem. In this work, a novel perception driven robust crypto-steganographic algorithm is proposed for embedding secure data in videos. The proposed scheme selects the payload regions based on natural scene statistics. To further strengthen the scheme and ensure intractability of secure data, the encrypted secret data is embedded in a random manner using jumbling sequence generator in the frames. We utilize perceptual hashing to evaluate the number of bit insertions that will not compromise the perceptual quality. A comprehensive performance evaluation of the proposed scheme is provided to detail the effectiveness. We demonstrate that the scheme shows a lot of promise in being robust against statistical and saliency based attacks.
暂无评论