Motivated by deep learning approaches to classify normal and neuro-diseased subjects in functional Magnetic Resonance Imaging (fMRI), we propose stacked autoencoder (SAE) based 2-stage architecture for disease diagnos...
详细信息
ISBN:
(纸本)9781450347532
Motivated by deep learning approaches to classify normal and neuro-diseased subjects in functional Magnetic Resonance Imaging (fMRI), we propose stacked autoencoder (SAE) based 2-stage architecture for disease diagnosis. In the proposed architecture, a separate 4-hidden layer autoencoder is trained in unsupervised manner for feature extraction corresponding to every brain region. thereafter, these trained autoencoders are used to provide features on class-labeled input data for training a binary support vector machine (SVM) based classifier. In order to design a robust classifier, noisy or inactive gray matter voxels are filtered out using a proposed covariance based approach. We applied the proposed methodology on a public dataset, namely, 1000 Functional Connectomes Project Cobre dataset consisting of fMRI data of normal and Schizophrenia subjects. the proposed architecture is able to classify normal and Schizophrenia subjects with 10-fold cross-validation accuracy of 92% that is better compared to the existing methods used on the same dataset.
In quest for an efficient representation schema for activity recognition in video, we employ techniques combining diagrammatic reasoning (DR) with qualitative spatial and temporal reasoning (QSTR). QSTR allows qualita...
详细信息
ISBN:
(纸本)9781450347532
In quest for an efficient representation schema for activity recognition in video, we employ techniques combining diagrammatic reasoning (DR) with qualitative spatial and temporal reasoning (QSTR). QSTR allows qualitative abstraction of spatio-temporal relations among objects of interest;and is often thwart by ambiguous conclusions. 'Diagrams' influence cognitive reasoning by externalizing mental context. Hence, QSTR over diagrams holds promise. We define 'diagrams' as explicit representation of objects of interest and their spatial information on a 2D grid. A sequence of 'key diagrams' is extracted. Inter diagrammatic reasoning operators combine 'key diagrams' to obtain spatio-temporal information. the qualitative spatial and temporal information thus obtained define short-term activity (STA). Several STAs combine to form long-term activities (LTA). Sequence of STAs as a feature vector is used for LTA recognition. We evaluate our approach over six LTAs from the CAVIAR dataset.
Saliency plays a key role in various computervision tasks. Extracting salient regions from images and videos have been a well established problem of computervision. While segmenting salient objects from images depen...
详细信息
We study surface reconstruction using a combination of anisotropic Gaussian filter and image segmentation technique -the minimum surface method. Anisotropic Gaussian filtering allows to manage a contrast between inten...
详细信息
ISBN:
(纸本)9783642159060
We study surface reconstruction using a combination of anisotropic Gaussian filter and image segmentation technique -the minimum surface method. Anisotropic Gaussian filtering allows to manage a contrast between intensities of the discontinuity and the object in a desired direction. the minimum surface method detects properly outer boundaries even affected by boundary leakage in the vicinity of blurred edges. the algorithm is tested on a set of real 3D images of large corrosion cracks in stainless steel that initiated at the surface of the tested samples. Results are presented and discussed.
Over 70% of software development effort is spent in software maintenance comprising bug fixes and version updates. these activities involve fast comprehension of large codebases authored by multiple developers. Develo...
详细信息
this paper presents a new grassy effect generation approach. the grassy effect changes original image as the one behind the grass. A random number generation function is employed to decide the same of block and locati...
详细信息
ISBN:
(纸本)9781479977642
this paper presents a new grassy effect generation approach. the grassy effect changes original image as the one behind the grass. A random number generation function is employed to decide the same of block and location of the pixel. Obtained various block size and pixel location yield unexpected grassy execution to the original image. the subjective and the objective image performances are compared which conclude that the proposed approach successfully crates glassy effect in images.
Video object segmentation aims to segment objects in a video sequence, given some user annotation which indicates the object of interest. Although Convolutional Neural Networks (CNNs) have been used in the recent past...
详细信息
ISBN:
(纸本)9781450366151
Video object segmentation aims to segment objects in a video sequence, given some user annotation which indicates the object of interest. Although Convolutional Neural Networks (CNNs) have been used in the recent past for the purpose of foreground segmentation in videos, adversarial training methods have not been used effectively to solve this problem, in spite of its extensive use for solving many other problems in computervision. Earlier, flow features and motion trajectories have been extensively used to capture the temporal consistency between subsequent frames to segment moving objects in videos. However, we show that our proposed framework of processingthe video frames independently using a deep generative adversarial network (GAN), is able to maintain the temporal coherency across frames without the use of any explicit trajectory based information, to provide superior results. Our main contribution lies in introducing a GAN based framework along withthe incorporation of an Intersection-over-Union score based novel cost function for training the model, to solve the problem of foreground object segmentation in videos. the proposed method, when evaluated on popular real-world video segmentation datasets viz. DAVIS, SegTrack-v2 and YouTube-Objects, exhibits substantial performance gain over the recent state-of-the-art methods.
We address the problem of image denoising for an additive white noise model without placing any restrictions on the statistical distribution of noise. We assume knowledge of only the first-and second-order noise stati...
详细信息
ISBN:
(纸本)9781450347532
We address the problem of image denoising for an additive white noise model without placing any restrictions on the statistical distribution of noise. We assume knowledge of only the first-and second-order noise statistics. In the recent mean-square error (MSE) minimization approaches for image denoising, one considers a particular noise distribution and derives an expression for the unbiased risk estimate of the MSE. For additive white Gaussian noise, an unbiased estimate of the MSE is Stein's unbiased risk estimate (SURE), which relies on Stein's lemma. We derive an unbiased risk estimate without using Stein's lemma or its counterparts for additive white noise model irrespective of the noise distribution. We refer to the MSE estimate as the generic risk estimate (GenRE). We demonstrate the effectiveness of GenRE using shrinkage in the undecimated Haar wavelet transform domain as the denoising function. the estimated peak-signal-to-noise-ratio (PSNR) using GenRE is typically within 1% of the PSNR obtained when optimizing withthe oracle MSE. the performance of the proposed method is on par with SURE for Gaussian noise distribution, and better than SURE-based methods for other noise distributions such as uniform and Laplacian distribution in terms of both PSNR and structural similarity (SSIM).
Rendering of corrosion often requires pain-staking modeling and texturing. On the other hand, there exist techniques for stochastic modeling of corrosion, which can automatically perform simulation and rendering under...
详细信息
ISBN:
(纸本)9781450347532
Rendering of corrosion often requires pain-staking modeling and texturing. On the other hand, there exist techniques for stochastic modeling of corrosion, which can automatically perform simulation and rendering under control of some user-specified parameters. Unfortunately, these parameters are non-intuitive and have a global impact. It is hard to determine the values of these parameters to obtain a desired look. For example, in real life corrosion gets influenced by both internal object-specific geometric factors, like sharp corners and curvatures, and external interventions like scratches, blemishes etc. Further, a graphics designer may want to selectively corrode areas to obtain a particular scene. We present a technique for user guided spread of corrosion. Our framework encapsulates both structural and aesthetic factors. Given the material properties and the surrounding environmental conditions of an object, we employ a physio-chemically based stochastic model to deduce the decay of different points on that object. Our system equips the user with a platform where the imperfections can be provided by either manual or systematic interference on a rendering of the three dimensional object. We demonstrate several user guided characteristic simulations encompassing varied influences including material, object characteristics and environment conditions. Our results are visually validated to understand the impact of imperfections with elapsed time.
In this work, we address the problem of dynamic gesture recognition using a pose based video descriptor. the proposed approach takes as input video frames and extracts pose-specific image regions which are further pro...
详细信息
ISBN:
(纸本)9781450366151
In this work, we address the problem of dynamic gesture recognition using a pose based video descriptor. the proposed approach takes as input video frames and extracts pose-specific image regions which are further processed by a pre-trained Convolutional Neural Network (CNN) to derive a pose-based descriptor for each frame. A Long Short Term Memory (LSTM) network is trained from scratch for dynamic gesture classification by learning long-term spatiotemporal relations among features. We also demonstrate that only using video data (RGB frames and optical flow) one can design an effective model for recognizing dynamic gestures. We utilize ChaLearn multi-modal gesture challenge dataset [13] and Cambridge hand gesture dataset [18] for evaluation of the proposed algorithm achieving an accuracy of 91.27% and 96% respectively using only RGB data.
暂无评论