Color can be a useful feature in many fields of AI that are based on machine vision. Unfortunately, many existing vision system do not use color to its full extent, largely because color-based recognition in outdoor s...
详细信息
ISBN:
(纸本)9781479984459
Color can be a useful feature in many fields of AI that are based on machine vision. Unfortunately, many existing vision system do not use color to its full extent, largely because color-based recognition in outdoor scene is complicated, and existing color machine vision techniques have not been shown to be effective in realistic outdoor images. The problem of color recognition in outdoor is considerable when we are faced with glossy materials like automobiles. There is no powerful method to recognize color of a batch of pixels. Thus, for the first time, we propose a novel method to detect dominant color of a group of pixels. This method has many applications in object color detection especially for glossy objects.
We address the problem of estimating location information of an image using principles from automated representation learning. We pursue a hierarchical sparse coding approach that learns features useful in discriminat...
详细信息
ISBN:
(纸本)9781467369640
We address the problem of estimating location information of an image using principles from automated representation learning. We pursue a hierarchical sparse coding approach that learns features useful in discriminating images across locations, by initializing it with a geometric prior corresponding to transformations between image appearance space and their corresponding location grouping space using the notion, of parallel transport on manifolds. We then extend this approach to account for the availability, of heterogeneous data modalities such as geo-tags and videos pertaining to different locations, and also study a relatively under-addressed problem of transferring knowledge available from certain, locations to infer the grouping of data from novel locations. We evaluate our approach on several standard datasets such as im2gps, San Francisco and MediaEval 2010, and obtain state-of-the-art results.
Accurate segmentation of brain tumor from MR image is crucial for the diagnosis and treatment of brain cancer. We propose a novel automated brain tumor segmentation method based on a probabilistic model combining spar...
详细信息
ISBN:
(纸本)9781467371452
Accurate segmentation of brain tumor from MR image is crucial for the diagnosis and treatment of brain cancer. We propose a novel automated brain tumor segmentation method based on a probabilistic model combining sparse coding and Markov random field (MRF). We formulate the brain tumor segmentation task as a pixel -wise labeling problem with regard to three classes: tumor, edema and healthy tissue. For each class, dictionary learning is performed independently on multi -modality gray scale patches. sparse representation is then extracted based on a joint dictionary which is constructed by combing the three independent dictionaries. Finally, we build the probabilistic model aiming to estimate maximum a posterior (MAP) probability by introducing the sparse representation into likelihood probability and prior probability using the Markov random field (MRF) assumption. Compared with traditional methods, which employed hand-crafted low level features to construct the probabilistic model, our model can better represent the characteristics of a pixel and its relation with neighbors based on the sparse coefficients obtained from the learned dictionary. We validated our method on the MICAAI 2012 BRATS challenge brain MRI dataset and achieved comparable or better results compared with state-ofthe-art methods.
In practice, the training data and testing data are often from different datasets, which have an adverse impact on speech emotion recognition rates. To tackle this problem, in this paper, a novel transfer principal co...
详细信息
ISBN:
(纸本)9783319254173;9783319254166
In practice, the training data and testing data are often from different datasets, which have an adverse impact on speech emotion recognition rates. To tackle this problem, in this paper, a novel transfer principal component analysis (TPCA) and sparse coding based speech emotion recognition method is proposed. The TPCA approach is first presented for feature dimension reduction, then the sparse coding algorithm is introduced to learn the robust feature representations for both labeled source and unlabeled target corpora. To evaluate the performance of our proposed method, the experiments are conducted on two public datasets. Experimental results demonstrate that our proposed approach significantly outperforms the automatic recognition method, and obtains better performance than the state-of-the-art method.
In this paper, we present a novel feature encoding approach called Approximate sparse coding (ASC). ASC computes the sparse codes for a large collection of prototype descriptors in the off-line learning phase with Spa...
详细信息
sparse coding exhibits good performance in many computer vision applications by finding bases which capture high-level semantics of the data and learning sparse coefficients in terms of the bases. However, due to the ...
详细信息
ISBN:
(纸本)9781510600546
sparse coding exhibits good performance in many computer vision applications by finding bases which capture high-level semantics of the data and learning sparse coefficients in terms of the bases. However, due to the fact that bases are non-orthogonal, sparse coding can hardly preserve the samples' similarity, which is important for discrimination. In this paper, a new image representing method called maximum constrained sparse coding (MCSC) is proposed. sparse representation with more active coefficients means more similarity information, and the infinite norm is added to the solution for this purpose. We solve the optimizer by constraining the codes' maximum and releasing the residual to other dictionary atoms. Experimental results on image clustering show that our method can preserve the similarity of adjacent samples and maintain the sparsity of code simultaneously.
Many real-world problems usually deal with high-dimensional data, such as images, videos, text, web documents and so on. In fact, the classification algorithms used to process these high-dimensional data often suffer ...
详细信息
ISBN:
(纸本)9783319240787;9783319240770
Many real-world problems usually deal with high-dimensional data, such as images, videos, text, web documents and so on. In fact, the classification algorithms used to process these high-dimensional data often suffer from the low accuracy and high computational complexity. Therefore, we propose a framework of transforming images from a high-dimensional image space to a low-dimensional target image space, based on learning an orthogonal smooth subspace for the SIFT sparse codes (SC-OSS). It is a two stage framework for subspace learning. Firstly, a sparse coding followed by spatial pyramid max pooling is used to get the image representation. Then, the image descriptor is mapped into an orthonormal and smooth subspace to classify images in low dimension. The proposed algorithm adds the orthogonality and a Laplacian smoothing penalty to constrain the projective function coefficient to be orthogonal and spatially smooth. The experimental results on the public datasets have shown that the proposed algorithm outperforms other subspace methods.
We present a sparse coding based dense feature representation model (a preliminary version of the paper was presented at the SPIE Remote Sensing Conference, Dresden, Germany, 2013) for hyperspectral image (HSI) classi...
详细信息
We present a sparse coding based dense feature representation model (a preliminary version of the paper was presented at the SPIE Remote Sensing Conference, Dresden, Germany, 2013) for hyperspectral image (HSI) classification. The proposed method learns a new representation for each pixel in HSI through the following four steps: sub-band construction, dictionary learning, encoding, and feature selection. The new representation usually has a very high dimensionality requiring a large amount of computational resources. We applied the I-1/I-q regularized multiclass logistic regression technique to reduce the size of the new representation. We integrated the method with a linear support vector machine (SVM) and a composite kernels SVM (CKSVM) to discriminate different types of land cover. We evaluated the proposed algorithm on three well-known HSI datasets and compared our method to four recently developed classification methods: SVM, CKSVM, simultaneous orthogonal matching pursuit, and image fusion and recursive filtering. Experimental results show that the proposed method can achieve better overall and average classification accuracies with a much more compact representation leading to more efficient sparse models for HSI classification. (C) 2015 SPIE and IS&T
Environmental Microorganisms (EMs), such as Epistylis and Rotifera, are very tiny living beings in human environments and decompose pollutants as their nutrition. The classification of EMs plays a fundamental role for...
详细信息
sparse coding algorithms with geometrical constraints have received much attention recently. However, these methods are unsupervised and might lead to less discriminative representations. In this paper, we propose a s...
详细信息
暂无评论