检索结果-内蒙古大学图书馆

supervised sparse patch coding towards misalignment-robust face recognition

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION 2013年第2期24卷 103-110页

作者： Lang, Congyan Feng, Songhe Chen, Bin Yuan, Xiaotong Beijing Jiaotong Univ Dept Comp Sci & Engn Beijing Peoples R China Natl Univ Singapore Dept Elect & Comp Engn Singapore 117548 Singapore

We address the challenging problem of face recognition under the scenarios where both training and test data are possibly contaminated with spatial misalignments. A supervised sparse coding framework is developed in this paper towards a practical solution to misalignment-robust face recognition. Each gallery face image is represented as a set of patches, in both original and misaligned positions and scales, and each given probe face image is then uniformly divided into a set of local patches. We propose to sparsely reconstruct each probe image patch from the patches of all gallery images, and at the same time the reconstructions for all patches of the probe image are regularized by one term towards enforcing sparsity on the subjects of those selected patches. The derived reconstruction coefficients by l(1)-norm minimization are then utilized to fuse the subject information of the patches for identifying the probe face. Such a supervised sparse coding framework provides a unique solution to face recognition with all (Here, we emphasize "all" because some conventional algorithms for face recognition possess partial of these characteristics.) the following four characteristics: (1) the solution is model-free, without the model learning process, (2) the solution is robust to spatial misalignments, (3) the solution is robust to image occlusions, and (4) the solution is effective even when there exist spatial misalignments for gallery images. Extensive face recognition experiments on three benchmark face datasets demonstrate the advantages of the proposed framework over holistic sparse coding and conventional subspace learning based algorithms in terms of robustness to spatial misalignments and image occlusions. (C) 2012 Elsevier Inc. All rights reserved.

关键词： Face recognition Spatial misalignment Image occlusions sparse coding Misalignment robust supervised sparse coding Dual sparsity Collective sparse reconstructions

来源：评论

学校读者我要写书评

暂无评论

Task fMRI data analysis based on supervised stochastic coordinate coding

引用

MEDICAL IMAGE ANALYSIS 2017年 38卷 1-16页

作者： Lv, Jinglei Ling, Binbin Li, Qingyang Zhang, Wei Zhao, Yu Jiang, Xi Guo, Lei Han, Junwei Hu, Xintao Guo, Christine Ye, Jieping Liu, Tianming Univ Georgia Dept Comp Sci Cort Architecture Imaging & Discovery Lab Athens GA 30602 USA Univ Georgia Bioimaging Res Ctr Athens GA 30602 USA QIMR Berghofer Med Res Inst Translat Neurosci Herston Qld Australia Univ Michigan Dept Computat Med & Bioinformat Ann Arbor MI 48109 USA Univ Michigan Dept Elect Engn & Comp Sci Ann Arbor MI 48109 USA Arizona State Univ Dept Comp Sci & Engn Tempe AZ 85287 USA Northwestern Polytech Univ Sch Automat Xian Peoples R China

Task functional magnetic resonance imaging (fMRI) has been widely employed for brain activation detection and brain network analysis. Modeling rich information from spatially-organized collection of fMRI time series is challenging because of the intrinsic complexity. Hypothesis-driven methods, such as the general linear model (GLM), which regress exterior stimulus from voxel-wise functional brain activity, are limited due to overlooking the complexity of brain activities and the diversity of concurrent brain networks. Recently, sparse representation and dictionary learning methods have attracted increasing interests in task fMRI data analysis. The major advantage of this methodology is its promise in reconstructing concurrent brain networks systematically. However, this data-driven strategy is, to some extent, arbitrary and does not sufficiently utilize the prior information of task design and neuroscience knowledge. To bridge this gap, we here propose a novel supervised sparse representation and dictionary learning framework based on stochastic coordinate coding (SCC) algorithm for task fMRI data analysis, in which certain brain networks are learned with known information such as pre-defined temporal patterns and spatial network patterns, and at the same time other networks are learned automatically from data. Our proposed method has been applied to two independent task fMRI datasets, and qualitative and quantitative evaluations have shown that our method provides a new and effective framework for task fMRI data analysis. (C) 2017 Elsevier B.V. All rights reserved.

关键词： Task fMRI supervised sparse coding Brain networks

来源：评论

学校读者我要写书评

暂无评论

A supervised dictionary learning and discriminative weighting model for action recognition

引用

NEUROCOMPUTING 2015年 158卷 246-256页

作者： Dong, Jian Sun, Changyin Yang, Wankou Southeast Univ Sch Automat Nanjing 210096 Jiangsu Peoples R China Southeast Univ Minist Educ Key Lab Measurement & Control Complex Syst Engn Nanjing 210096 Jiangsu Peoples R China Southeast Univ Jiangsu Key Lab Image & Video Understanding Socia Nanjing 210096 Jiangsu Peoples R China

In this paper, we propose a supervised dictionary learning algorithm for action recognition in still images followed by a discriminative weighting model. The dictionary is learned based on Local Fisher Discrimination which takes into account the local manifold structure and discrimination information of local descriptors. The label information of local descriptors is considered in both dictionary learning and sparse coding stage which generates a supervised sparse coding algorithm and makes the coding coefficients discriminative. Instead of using spatial pyramid features, sliding window-based features with max-pooling are computed from coding coefficients. And then a discriminative weighting model combining a max-margin classifier is proposed using the features. Both the weighting coefficients and model parameters can be jointly learned using the same way in Multiple Kernel Learning algorithm. We validate our model on the following action recognition datasets: Willow 7 human actions dataset, People Playing Music Instrument (PPMI) dataset, and Sports dataset. To show the generality of our model, we also validate it on Scene15 dataset. The experiment results show that only with single scale local descriptors, our algorithm is comparable to some state-of-the-art algorithms. (C) 2015 Elsevier B.V. All rights reserved.

关键词： Dictionary learning Local Fisher Discrimination supervised sparse coding Discriminative weighting model Multiple Kernel Learning Action recognition

来源：评论

学校读者我要写书评

暂无评论

Leveraging Human Fixations in sparse coding: Learning a Discriminative Dictionary for Saliency Prediction

Leveraging Human Fixations in Sparse Coding: Learning a Disc...

引用

IEEE International Conference on Systems, Man, and Cybernetics (SMC)

作者： Jiang, Ming Song, Mingli Zhao, Qi Natl Univ Singapore Dept Elect & Comp Engn Singapore 117548 Singapore Zhejiang Univ Coll Comp Sci Hangzhou Zhejiang Peoples R China

ISBN: (纸本)9781479906529

This paper proposes to learn a discriminative dictionary for saliency detection. In addition to the conventional sparse coding mechanism that learns a representational dictionary of natural images for saliency prediction, this work uses supervised information from eye tracking experiments in training to enhance the discriminative power of the learned dictionary. Furthermore, we explicitly model saliency at multi-scale by formulating it as a multi-class problem, and a label consistency term is incorporated into the framework to encourage class (salient vs. non-salient) and scale consistency in the learned sparse codes. K-SVD is employed as the central computational module to efficiently obtain the optimal solution. Experiments demonstrate the superior performance of the proposed algorithm compared with the state-of-the-art in saliency prediction.

关键词： Saliency Visual Attention supervised sparse coding Dictionary Learning K-SVD

来源：评论

学校读者我要写书评

暂无评论

Leveraging Human Fixations in sparse coding: Learning a Discriminative Dictionary for Saliency Prediction (Invited Paper)

Leveraging Human Fixations in Sparse Coding: Learning a Disc...

引用

IEEE International Conference on Systems, Man, and Cybernetics

作者： Ming Jiang Mingli Song Qi Zhao Department of Electrical and Computer Engineering National University of Singapore College of Computer Science Zhejiang University

ISBN: (纸本)9781479906505

关键词： Saliency Visual Attention supervised sparse coding Dictionary Learning K-SVD Visual Attention sparse coding Dictionaries as Topic non-salient fixation multi-class multi-scale

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：