ISBN: 9783037850015 (print)
This paper presents a robust structured light technique based on a multi-valued pseudo-random color-encoded pattern. After analyzing the advantages and disadvantages of existing pseudo-random coding patterns in computer vision, a new multi-valued pseudo-random color-encoded pattern is designed that combines feature points with feature lines. With this pattern, feature points are easy to extract, and the problems of missed feature points and pseudo-feature points are greatly reduced. The feature lines also reduce the difficulty and complexity of feature matching.
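The property that makes pseudo-random coded patterns decodable is that every small window of symbols occurs only once, so a locally observed window identifies its position in the projected pattern. The abstract does not give the construction used, so the sketch below substitutes a one-dimensional De Bruijn sequence as a stand-in. The color list and the function names `de_bruijn` and `window_index` are assumptions made for illustration only.

```python
# Hypothetical palette; the abstract does not say which colors or how many values are used.
COLORS = ["red", "green", "blue"]


def de_bruijn(k, n):
    """Return a cyclic De Bruijn sequence over symbols 0..k-1 with window length n.

    Every length-n window (taken cyclically) appears exactly once.
    """
    a = [0] * (k * n)
    sequence = []

    def db(t, p):
        if t > n:
            if n % p == 0:
                sequence.extend(a[1:p + 1])
        else:
            a[t] = a[t - p]
            db(t + 1, p)
            for j in range(a[t - p] + 1, k):
                a[t] = j
                db(t + 1, t)

    db(1, 1)
    return sequence


def window_index(seq, n):
    """Map each cyclic length-n window of seq to the position where it starts."""
    size = len(seq)
    return {
        tuple(seq[(i + j) % size] for j in range(n)): i
        for i in range(size)
    }


if __name__ == "__main__":
    stripes = de_bruijn(len(COLORS), 3)
    lookup = window_index(stripes, 3)
    # Decoding: three adjacent observed colors are enough to locate the stripe.
    observed = tuple(stripes[5:8])
    print([COLORS[s] for s in observed], "starts at stripe", lookup[observed])
```

A two-dimensional pseudo-random array extends the same idea to rectangular windows. How the paper combines this with feature points and feature lines is not described in the abstract.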
Recovering arbitrarily corrupted low-rank matrices arises in computer vision applications, including bioinformatic data analysis and visual tracking. The methods used involve minimizing a combination of nuclear norm a...
ISBN: 9783642250842 (print)
A method is presented, with numerical examples, for classifying clusters of patterns in complex non-convex, disconnected domains using Radial Basis Function Neural Networks (RBFNNs) enhanced with the Rvachev Function Method (RFM). R-functions are used to construct complex pattern cluster domains, whose parameters are applied to the RBFNNs to establish classification boundaries. The error functional is convex and quadratic with respect to the weight functions, which take weight values on the discrete connectors between neurons. The activation function of the RBFNN neurons is the sgn(.) function, so the error function is non-smooth. The delta learning rule is applied during the training phase. Because the discretized error function is not smooth, its sub-gradient is used rather than its gradient. The RFM allows the creation, implementation, and resolution of large heterogeneous NNs capable of solving diverse sets of classification problems with greater accuracy.
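R-functions combine implicit region descriptions (positive inside, negative outside) with real-valued formulas that behave like logical AND and OR. The sketch below uses the standard R0 conjunction and disjunction to build one disconnected and one non-convex domain, then classifies points by sign. The shapes and the function names are assumptions for illustration; the abstract does not describe the paper's domains or network.

```python
import math


def r_and(f, g):
    """R0 conjunction: positive only where both f and g are positive."""
    return f + g - math.sqrt(f * f + g * g)


def r_or(f, g):
    """R0 disjunction: positive where at least one of f, g is positive."""
    return f + g + math.sqrt(f * f + g * g)


def disk(cx, cy, radius):
    """Implicit function of a disk: > 0 inside, < 0 outside."""
    return lambda x, y: radius * radius - (x - cx) ** 2 - (y - cy) ** 2


def two_disks(x, y):
    """Disconnected domain: union of two separate unit disks."""
    return r_or(disk(-2, 0, 1)(x, y), disk(2, 0, 1)(x, y))


def ring(x, y):
    """Non-convex domain: disk of radius 2 with a unit-disk hole removed."""
    return r_and(disk(0, 0, 2)(x, y), -disk(0, 0, 1)(x, y))


def classify(domain, x, y):
    """Sign-based decision: +1 inside the domain, -1 otherwise."""
    return 1 if domain(x, y) > 0 else -1


if __name__ == "__main__":
    print(classify(two_disks, -2.0, 0.0))  # center of the left disk
    print(classify(two_disks, 0.0, 0.0))   # gap between the disks
    print(classify(ring, 1.5, 0.0))        # inside the ring
    print(classify(ring, 0.0, 0.0))        # inside the hole
```

The single function returned by such a composition describes the whole cluster domain, which is the role the abstract assigns to the R-functions before their parameters are passed to the RBFNN.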
ISBN: 9781601321916 (print)
In this paper, subpattern-based approaches are used to solve the age classification problem on facial images. Three subpattern-based approaches, Local Binary Patterns (LBP), subpattern-based Principal Component Analysis (spPCA), and modular Principal Component Analysis (mPCA), are examined to demonstrate age classification performance on female and male facial images with several parameter settings. Age intervals are classified separately for female and male facial images because the aging process differs between women and men. Subpattern-based LBP, spPCA, and mPCA are used for feature extraction on different datasets selected from the FG-NET and MORPH databases. Experimental results demonstrate the superiority of subpattern-based LBP over the spPCA and mPCA techniques. Age classification performance using these three subpattern-based techniques with different parameter settings on the selected datasets is also presented.
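To make the subpattern-based LBP feature concrete, the sketch below computes the basic 3x3 LBP code for each interior pixel and concatenates one 256-bin histogram per non-overlapping block. This is the textbook LBP operator, not necessarily the variant or parameter settings used in the paper. The block size and all function names are assumptions for illustration.

```python
# Neighbor offsets, clockwise starting from the top-left pixel.
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_code(img, r, c):
    """Basic 8-neighbor LBP code of pixel (r, c) in a grayscale image (list of lists)."""
    center = img[r][c]
    code = 0
    for bit, (dr, dc) in enumerate(OFFSETS):
        if img[r + dr][c + dc] >= center:
            code |= 1 << bit
    return code


def lbp_image(img):
    """LBP codes for all interior pixels; the one-pixel border is dropped."""
    rows, cols = len(img), len(img[0])
    return [[lbp_code(img, r, c) for c in range(1, cols - 1)]
            for r in range(1, rows - 1)]


def block_histograms(codes, block_rows, block_cols):
    """Concatenate one 256-bin histogram per non-overlapping block (subpattern)."""
    features = []
    for r0 in range(0, len(codes) - block_rows + 1, block_rows):
        for c0 in range(0, len(codes[0]) - block_cols + 1, block_cols):
            hist = [0] * 256
            for r in range(r0, r0 + block_rows):
                for c in range(c0, c0 + block_cols):
                    hist[codes[r][c]] += 1
            features.extend(hist)
    return features


if __name__ == "__main__":
    # Tiny synthetic 6x6 "image"; a real face image would be loaded from a dataset.
    image = [[(3 * r + 5 * c) % 17 for c in range(6)] for r in range(6)]
    codes = lbp_image(image)
    feature_vector = block_histograms(codes, 2, 2)
    print(len(feature_vector))
```

The concatenated histogram keeps coarse spatial layout (which block a texture came from) while each histogram stays insensitive to small shifts inside its block. The resulting vector would then be passed to a classifier, which the abstract does not name.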
Many computer vision approaches take for granted positive answers to questions such as "Are semantic categories visually separable?" and "Is visual similarity correlated to semantic similarity?". I...
Traditional computer vision and machine learning algorithms have been largely studied in a centralized setting, where all the processing is performed at a single central location. However, a distributed approach might...
A mobile computer vision system is presented that helps visually impaired pedestrians cross roads. The system detects pedestrian lights in the environment and gives feedback about the current phase of the relevant light. For this purpose, the live video stream of a mobile phone is analyzed in four steps: localization, classification, video analysis, and time-based verification. In particular, the temporal analysis alleviates inherent problems such as occlusions (by vehicles) and falsified colors, and further increases decision certainty over a period of time. Because mobile devices have limited resources, very efficient and precise algorithms have to be developed to ensure the reliability and interactivity of the system. A prototype system was implemented on a Nokia N95 mobile phone and tested in a real environment. It was trained to detect German traffic lights. For prototype training and testing, we generated image and video databases that include manually specified ground truth meta-data. The databases described in this paper are publicly available to the research community. A quantitative performance analysis is provided to demonstrate the reliability and interactivity of the prototype system.
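One simple way to realize a time-based verification step is to report a light phase only when it dominates the most recent frames, so a single occluded or miscolored frame cannot flip the decision. The abstract does not describe the paper's actual verification rule; the sliding-window majority vote below, the class name `TemporalVerifier`, and its `window` and `min_ratio` parameters are all assumptions for illustration.

```python
from collections import Counter, deque


class TemporalVerifier:
    """Sliding-window majority vote over per-frame classifications (hypothetical design)."""

    def __init__(self, window=10, min_ratio=0.7):
        self.history = deque(maxlen=window)  # most recent per-frame labels
        self.min_ratio = min_ratio           # share of frames the winner must reach

    def update(self, label):
        """Add one frame result ('red', 'green', or None if nothing was detected).

        Returns the verified phase, or None while the evidence is insufficient.
        """
        self.history.append(label)
        if len(self.history) < self.history.maxlen:
            return None  # not enough frames observed yet
        winner, count = Counter(self.history).most_common(1)[0]
        if winner is not None and count / len(self.history) >= self.min_ratio:
            return winner
        return None


if __name__ == "__main__":
    verifier = TemporalVerifier(window=5, min_ratio=0.5)
    # The None frame stands for a frame where a passing vehicle hides the light.
    for frame_label in ["red", "red", None, "red", "red"]:
        decision = verifier.update(frame_label)
    print(decision)
```

A longer window increases certainty at the cost of slower reaction to a phase change, which matters for a system that must stay interactive on a phone.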
ISBN: 9783642215049 (digital); 9783642215032 (print)
This book constitutes the proceedings of the Second International Conference on Information Processing in Computer-Assisted Interventions, IPCAI 2011, held in Berlin, Germany, on June 22, 2011. The 17 papers presented were carefully reviewed and selected from 29 submissions. The focus of the conference is the use of information technology in interventional medicine, including real-time modeling and analysis, technology, human-machine interfaces, and systems associated with operating rooms and interventional suites. It also covers the overall information flow associated with intervention planning, execution, follow-up, and outcome analysis, as well as training and skill assessment for such procedures.
ISBN: 9781457701214; 9781457701221 (print)
Unsupervised identical object segmentation remains a challenging problem in vision research because of the difficulty of obtaining high-level structural knowledge about the scene. In this paper, we present a level set algorithm with a novel similarity constraint term for identical object segmentation. The key component of the proposed algorithm is to embed the similarity constraint into curve evolution, so that the evolving speed is high in regions of similar appearance and low in areas with distinct content. The algorithm starts with a pair of seed matches (e.g., SIFT) and evolves a small initial circle to form large similar regions under the similarity constraint. The similarity constraint is tied to local alignment, under the assumption that the warp between identical objects is an affine transformation. The correct warp aligns the identical objects and promotes the growth of the similar regions. Alignment and expansion alternate until the curve reaches the boundaries of the similar objects. Experiments on real data validate the efficiency and effectiveness of the proposed algorithm.
Human activity recognition and speech recognition appear to be two loosely related research areas. However, on careful thought, there are several analogies between activity and speech signals with regard to the way ...