Unsupervised categorization of objects is a fundamental problem in computervision. While appearance-based methods have become popular recently, other important cues like functionality are largely neglected. Motivated...
详细信息
Many state-of-the-art face recognition algorithms use image descriptors based on features known as Local Binary patterns (LBPs). While many variations of LBP exist, so far none of them can automatically adapt to the t...
详细信息
Human action recognition and annotation is an active research topic in computervision. How to model various actions, varying with time resolution, visual appearance, and others, is a challenging task. In this paper, ...
详细信息
Human action recognition and annotation is an active research topic in computervision. How to model various actions, varying with time resolution, visual appearance, and others, is a challenging task. In this paper, we propose a boosted exemplar learning (BEL) approach to model various actions in a weakly supervised manner, i.e., only action bag-level labels are provided but action instance level ones are not. The proposed BEL method can be summarized as three steps. First, for each action category, amount of class-specific candidate exemplars are learned through an optimization formulation considering their discrimination and co-occurrence. Second, each action bag is described as a set of similarities between its instances and candidate exemplars. Instead of simply using a heuristic distance measure, the similarities are decided by the exemplar-based classifiers through the multiple instance learning, in which a positive (or negative) video or image set is deemed as a positive (or negative) action bag and those frames similar to the given exemplar in Euclidean Space as action instances. Third, we formulate the selection of the most discriminative exemplars into a boosted feature selection framework and simultaneously obtain an action bag-based detector. Experimental results on two publicly available datasets: the KTH dataset and Weizmann dataset, demonstrate the validity and effectiveness of the proposed approach for action recognition. We also apply BEL to learn representations of actions by using images collected from the Web and use this knowledge to automatically annotate action in YouTube videos. Results are very impressive, which proves that the proposed algorithm is also practical in unconstraint environments.
In this paper, we propose a novel technique solution towards LED wafer defects automatic full inspection using neural network chip array to assure defect-free outgoing dies. Our research intends to develop an automati...
详细信息
Head pose estimation from images has recently attracted much attention in computervision due to its diverse applications in face recognition, driver monitoring and human computer interaction. Most successful approach...
详细信息
In this paper, we propose a method for detection and tracking of multiple planes in sequences of Time of Flight (ToF) depth images. Our approach extends the recent J-linkage algorithm for estimation of multiple model ...
详细信息
The biologically inspired model (BIM) proposed by Serre et al. presents a promising solution to object categorization. It emulates the process of object recognition in primates' visual cortex by constructing a set...
详细信息
The biologically inspired model (BIM) proposed by Serre et al. presents a promising solution to object categorization. It emulates the process of object recognition in primates' visual cortex by constructing a set of scale- and position-tolerant features whose properties are similar to those of the cells along the ventral stream of visual cortex. However, BIM has potential to be further improved in two aspects: mismatch by dense input and randomly feature selection due to the feedforward framework. To solve or alleviate these limitations, we develop an enhanced BIM (EBIM) in terms of the following two aspects: 1) removing uninformative inputs by imposing sparsity constraints, 2) apply a feedback loop to middle level feature selection. Each aspect is motivated by relevant psychophysical research findings. To show the effectiveness of the EBIM, we apply it to object categorization and conduct empirical studies on four computervision data sets. Experimental results demonstrate that the EBIM outperforms the BIM and is comparable to state-of-the-art approaches in terms of accuracy. Moreover, the new system is about 20 times faster than the BIM.
This book constitutes the refereed proceedings of the Second International Workshop on Machine Learning in Medical Imaging, MLMI 2011, held in conjunction with MICCAI 2011, in Toronto, Canada, in September 2011. The 4...
ISBN:
(数字)9783642243196
ISBN:
(纸本)9783642243189
This book constitutes the refereed proceedings of the Second International Workshop on Machine Learning in Medical Imaging, MLMI 2011, held in conjunction with MICCAI 2011, in Toronto, Canada, in September 2011. The 44 revised full papers presented were carefully reviewed and selected from 74 submissions. The papers focus on major trends in machine learning in medical imaging aiming to identify new cutting-edge techniques and their use in medical imaging.
This paper presents the possibility of using patternrecognition algorithms of infant gaze patterns at six months of age among children at high risk for an autism spectrum disorder (ASD). ASDs, which must be diagnosed...
详细信息
This three-volume proceedings contains revised selected papers from the Second International Conference on Artificial Intelligence and Computational Intelligence, AICI 2011, held in Taiyuan, China, in September 2011. ...
ISBN:
(数字)9783642238963
ISBN:
(纸本)9783642238956
This three-volume proceedings contains revised selected papers from the Second International Conference on Artificial Intelligence and Computational Intelligence, AICI 2011, held in Taiyuan, China, in September 2011. The total of 265 high-quality papers presented were carefully reviewed and selected from 1073 submissions. The topics of Part III covered are: machine vision; natural language processing; nature computation; neural computation; neural networks; particle swarm optimization; patternrecognition; rough set theory; and support vector machine.
暂无评论