Search results - Inner Mongolia University Library

2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR'07

Authors: Niebles, Juan Carlos; Li, Fei-Fei. Affiliations: Universidad del Norte, Colombia; University of Illinois Urbana-Champaign, United States; Princeton University, United States

ISBN: (print) 1424411807

We present a novel model for human action categorization. A video sequence is represented as a collection of spatial and spatial-temporal features by extracting static and dynamic interest points. We propose a hierarchical model that can be characterized as a constellation of bags-of-features and that is able to combine both spatial and spatial-temporal features. Given a novel video sequence, the model is able to categorize human actions on a frame-by-frame basis. We test the model on a publicly available human action dataset [2] and show that our new method performs well on the classification task. We also conducted control experiments to show that the use of the proposed mixture of hierarchical models improves the classification performance over bag-of-features models. An additional experiment shows that using both dynamic and static features provides a richer representation of human actions when compared to the use of a single feature type, as demonstrated by our evaluation in the classification task. © 2007 IEEE.
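As a rough illustration of the plain bag-of-features baseline that the abstract compares against, the sketch below assigns each local descriptor to its nearest codeword and builds a normalized histogram. The codebook, the descriptor values and the function names are hypothetical, and this is not the hierarchical constellation model proposed in the paper.

```python
import math

def nearest_codeword(descriptor, codebook):
    """Return the index of the codeword closest to descriptor (Euclidean)."""
    return min(range(len(codebook)),
               key=lambda k: math.dist(descriptor, codebook[k]))

def bag_of_features(descriptors, codebook):
    """Normalized histogram of codeword assignments for one frame or clip."""
    counts = [0] * len(codebook)
    for d in descriptors:
        counts[nearest_codeword(d, codebook)] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else counts

# Hypothetical 2-D descriptors and a 3-word codebook, for illustration only.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
descriptors = [(0.1, 0.0), (0.9, 0.1), (1.1, -0.1), (0.0, 0.8)]
histogram = bag_of_features(descriptors, codebook)
```

The histogram discards where and when each feature occurred, which is the limitation a constellation-style model is meant to address.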

Keywords: pattern recognition

Vision-based hand gesture recognition for understanding musical time pattern and tempo

33rd Annual Conference of the IEEE Industrial Electronics Society

Authors: Je, Hongmo; Kim, Jiman; Kim, Daijin. Affiliation: POSTECH, Dept CSE, Pohang 790784, South Korea

ISBN: (print) 9781424407835

We introduce a method for understanding four musical time patterns and three tempos that are generated by a human conductor of a robot orchestra or an operator of a computer-based music playing system, using hand gesture recognition. We use only a stereo vision camera with no extra special devices. We suggest a simple and reliable vision-based hand gesture recognition method with two naive features. One is the motion-direction code, which is a quantized code for motion directions. The other is the conducting feature point (CFP), which is a point where the motion changes suddenly. The proposed hand gesture recognition system operates as follows: First, it extracts the human hand region by segmenting the depth information generated by stereo matching of image sequences. Next, it follows the motion of the center of gravity (COG) of the extracted hand region and generates gesture features such as the CFP and the direction code. Finally, we obtain the current timing pattern of the beat and tempo of the music being played by the proposed hand gesture recognition, using either CFP tracking or motion histogram matching. The experimental results on the test data set show that the musical time pattern and tempo recognition rate is over 86.42% for the motion histogram matching, and 79.75% for the CFP tracking.
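The motion-direction code described in the abstract can be sketched as follows. This is a minimal illustration, assuming 8 direction bins and a histogram of codes as the "motion histogram"; the abstract specifies neither, and the function names and sample track are hypothetical.

```python
import math

def direction_codes(trajectory, n_bins=8):
    """Quantize the direction of each step along a 2-D trajectory.

    n_bins = 8 is an assumed value; the abstract does not give the
    number of quantization levels.
    """
    bin_width = 2 * math.pi / n_bins
    codes = []
    for (x0, y0), (x1, y1) in zip(trajectory, trajectory[1:]):
        dx, dy = x1 - x0, y1 - y0
        if dx == 0 and dy == 0:
            continue  # no motion between frames, so no direction to code
        angle = math.atan2(dy, dx) % (2 * math.pi)
        codes.append(int(round(angle / bin_width)) % n_bins)
    return codes

def direction_histogram(codes, n_bins=8):
    """Count how often each direction code occurs."""
    hist = [0] * n_bins
    for c in codes:
        hist[c] += 1
    return hist

# Hypothetical COG positions of the hand region over five frames.
cog_track = [(0, 0), (1, 0), (1, 1), (0, 1), (0, 0)]
codes = direction_codes(cog_track)
```

With image coordinates, where y grows downward, the codes run clockwise instead of counter-clockwise; the quantization itself is unchanged.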

Keywords: Stereo vision

Fast sparse Gaussian processes learning for man-made structure classification

2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR'07

Authors: Hang, Zhou; Suter, David. Affiliation: Institute for Vision Systems Engineering, Dept. Elec. and Comp. Syst. Eng., Monash University, PO Box 35, Clayton, Vic. 3800, Australia

ISBN: (print) 1424411807

The Informative Vector Machine (IVM) is an efficient, fast sparse Gaussian process (GP) method previously suggested for active learning. It greatly reduces the computational cost of GP classification and makes GP learning close to real time. We apply IVM to man-made structure classification (a two-class problem). Our work includes an investigation of the performance of IVM with varied active data points as well as the effects of different choices of GP kernels. Satisfactory results have been obtained, showing that the approach keeps full GP classification performance and yet is significantly faster (by virtue of using a subset of the whole training data points). © 2007 IEEE.

Keywords: Support vector machines

Development of a computer vision system for the automatic quality grading of mandarin segments

3rd Iberian Conference on Pattern Recognition and Image Analysis

Authors: Blasco, Jose; Cubero, Sergio; Arias, Raul; Gomez, Juan; Juste, Florentino; Molto, Enrique. Affiliation: Inst Valenciano Invest Agr, Ctr AgroIngn, Ctra Moncada Naquera Km 5, Valencia 46113, Spain

ISBN: (print) 9783540728481

This work focuses on the development of a computer vision system for the automatic on-line inspection and classification of Satsuma segments. During image acquisition the segments are moving, wet and frequently in contact with other pieces. The segments are transported over six semi-transparent conveyor belts that advance at a speed of 1 m/s. During on-line operation, the system acquires images of the segments using two cameras connected to a single computer and processes the images in less than 50 ms. By extracting morphological features from the objects, the system automatically identifies pieces of skin and raw material and separates entire segments from broken ones, discriminating between those with a slight or a large degree of breakage. Combinations of morphological parameters were employed to decide the quality of each segment, correctly classifying 95% of sound segments.

Keywords: automatic inspection; machine vision

Backoff hierarchical class n-gram language models: effectiveness to model unseen events in speech recognition

Computer Speech and Language, 2007, vol. 21, no. 1, pp. 88-104

Author: Zitouni, Imed. Affiliation: IBM Corp, Thomas J Watson Res Ctr, Multilingual NLP, POB 21820-136, Yorktown Hts, NY 10598, USA

In this paper, we introduce the backoff hierarchical class n-gram language models to better estimate the likelihood of unseen n-gram events. This multi-level class hierarchy language modeling approach generalizes the well-known backoff n-gram language modeling technique. It uses a class hierarchy to define word contexts. Each node in the hierarchy is a class that contains all the words of its descendant nodes. The closer a node is to the root, the more general the class (and context) is. We investigate the effectiveness of the approach to model unseen events in speech recognition. Our results illustrate that the proposed technique outperforms backoff n-gram language models. We also study the effect of the vocabulary size and the depth of the class hierarchy on the performance of the approach. Results are presented on the Wall Street Journal (WSJ) corpus using two vocabulary sets: 5000 words and 20,000 words. Experiments with the 5000-word vocabulary, which contains a small number of unseen events in the test set, show up to 10% improvement of the unseen event perplexity when using the hierarchical class n-gram language models. With a vocabulary of 20,000 words, characterized by a larger number of unseen events, the perplexity of unseen events decreases by 26%, while the word error rate (WER) decreases by 12% when using the hierarchical approach. Our results suggest that the largest gains in performance are obtained when the test set contains a large number of unseen events. (c) 2006 Elsevier Ltd. All rights reserved.
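The class hierarchy described in the abstract can be pictured with a small data-structure sketch. The toy hierarchy, the class names and the function names below are hypothetical, and the sketch shows only one plausible reading of the order in which contexts become more general; it omits the probability estimation and discounting that an actual backoff model requires.

```python
# Hypothetical toy hierarchy: each entry maps a node to its parent class.
PARENT = {
    "monday": "WEEKDAY", "tuesday": "WEEKDAY",
    "january": "MONTH",
    "WEEKDAY": "DATE", "MONTH": "DATE",
    "DATE": "ROOT",
}

def class_chain(word):
    """Classes containing word, ordered from most specific to most general."""
    chain = []
    node = PARENT.get(word)
    while node is not None:
        chain.append(node)
        node = PARENT.get(node)
    return chain

def members(node):
    """All words (leaves) that descend from node."""
    children = [child for child, parent in PARENT.items() if parent == node]
    if not children:
        return {node}  # a node without children is treated as a word
    words = set()
    for child in children:
        words |= members(child)
    return words

def backoff_contexts(word):
    """Contexts to try in order: the word itself, then ever more general classes."""
    return [word] + class_chain(word)
```

For example, backoff_contexts("monday") yields the word followed by WEEKDAY, DATE and ROOT, so an n-gram unseen with the word as context can still be scored through a class that contains it.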

Keywords: speech perception; pattern recognition systems; pattern perception; computer vision

Palmprint recognition under unconstrained scenes

8th Asian Conference on Computer Vision

Authors: Han, Yufei; Sun, Zhenan; Wang, Fei; Tan, Tieniu. Affiliation: Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Ctr Biometr & Secur Res, Beijing 100080, Peoples R China

ISBN: (print) 9783540763895

This paper presents a novel real-time palmprint recognition system for cooperative user applications. This system is the first to achieve non-contact capture and recognition of palmprint images under unconstrained scenes. Its novelties can be described in two aspects. The first is a novel design of the image capturing device. The hardware can reduce the influence of background objects and segment out hand regions efficiently. The second is a process of automatic hand detection and fast palmprint alignment, which aims to obtain normalized palmprint images for subsequent feature extraction. The palmprint recognition algorithm used in the system is based on an accurate ordinal palmprint representation. By integrating the power of the novel imaging device, the palmprint preprocessing approach and the palmprint recognition engine, the proposed system provides a friendly user interface and at the same time achieves good performance under unconstrained scenes.

Keywords: Image recognition

How to find interesting locations in video: A spatiotemporal interest point detector learned from human eye movements

29th Annual Symposium of the German Association for Pattern Recognition

Authors: Kienzle, Wolf; Schoelkopf, Bernhard; Wichmann, Felix A.; Franz, Matthias O. Affiliations: Max Planck Inst Biol Cybernet, Abt Empir Inferenz, Spemannstr 38, D-72076 Tubingen, Germany; Tech Univ Berlin, Fak 4, FB Modellierung Kognitiver, D-10587 Berlin, Germany; Bernstein Ctr Computat Neurosci, D-10115 Berlin, Germany

ISBN: (print) 9783540749332

Interest point detection in still images is a well-studied topic in computer vision. In the spatiotemporal domain, however, it is still unclear which features indicate useful interest points. In this paper we approach the problem by learning a detector from examples: we record eye movements of human subjects watching video sequences and train a neural network to predict which locations are likely to become eye movement targets. We show that our detector outperforms current spatiotemporal interest point architectures on a standard classification dataset.

Keywords: computer vision

Large scale vision-based navigation without an accurate global reconstruction

2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR'07

Authors: Šegvić, Siniša; Remazeilles, Anthony; Diosi, Albert; Chaumette, François. Affiliation: IRISA/INRIA, Campus de Beaulieu, F-35042 Rennes Cedex, France

ISBN: (print) 1424411807

Autonomous cars will likely play an important role in the future. A vision system designed to support outdoor navigation for such vehicles has to deal with large dynamic environments, changing imaging conditions, and temporary occlusions by other moving objects. This paper presents a novel appearance-based navigation framework relying on a single perspective vision sensor, which is aimed at resolving the above issues. The solution is based on a hierarchical environment representation created during a teaching stage, when the robot is controlled by a human operator. At the top level, the representation contains a graph of key-images with extracted 2D features, enabling robust navigation by visual servoing. The information stored at the bottom level makes it possible to efficiently predict the locations of features that are currently not visible, and eventually to (re-)start their tracking. The outstanding property of the proposed framework is that it enables robust and scalable navigation without requiring a globally consistent map, even in interconnected environments. This result has been confirmed by realistic off-line experiments and successful real-time navigation trials in public urban areas. © 2007 IEEE.

Keywords: Image reconstruction

Arithmetic of five-part of leukocytes based on image process

MIPPR 2007: Medical Imaging, Parallel Processing of Images, and Optimization Techniques

Authors: Yian, Li; Guoyou, Wang; Jianguo, Liu. Affiliation: Institute for Pattern Recognition and Artificial Intelligence, Huazhong University of Science and Technology, State Key Lab. for Multispectral Information Processing Technologies, Wuhan, China

ISBN: (print) 9780819469533

This paper applies computer image processing and pattern recognition methods to the problem of automatic classification and counting of leukocytes (white blood cells) in peripheral blood. A new five-part leukocyte algorithm based on image processing and pattern recognition is presented, which realizes automatic classification of leukocytes. The first aim is to detect the leukocytes. A major requirement of the whole system is to classify these leukocytes into 5 classes. The algorithm is based on the visual saliency ("notability") mechanism of the eye: it processes the image sequentially, segments the leukocytes and extracts their features. Using prior knowledge of the cells and image shape information, the algorithm first segments the probable shape of each leukocyte by a new method based on Chamfer and then extracts the detailed features. This can greatly reduce the misjudgment rate and the amount of computation. The algorithm also has a learning function. The paper also presents a new measurement of the shape of the karyon (nucleus), which can provide more accurate information. This algorithm has great application value in clinical blood tests.

Keywords: Medical imaging

Influence of numerical conditioning on the accuracy of relative orientation

2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR'07

Authors: Šegvić, Siniša; Schweighofer, Gerald; Pinz, Axel. Affiliation: Institute of Electrical Measurement and Measurement Signal Processing, Graz University of Technology, Kopernikusgasse 24/IV, 8010 Graz, Austria

ISBN: (print) 1424411807

We study the influence of numerical conditioning on the accuracy of two closed-form solutions to the overconstrained relative orientation problem. We consider the well-known eight-point algorithm and the recent five-point algorithm, and evaluate changes in their performance due to Hartley's normalization and Muehlich's equilibration. The need for numerical conditioning is introduced by explaining the known occurrence of the bias of the eight-point algorithm towards forward motion. Then it is shown how conditioning can be used to improve the results of the recent five-point algorithm. This is not straightforward since the conditioning disturbs the calibration of the input data. The conditioning therefore needs to be reverted before enforcing the internal cubic constraints of the essential matrix. The obtained improvements are less dramatic than in the case of the eight-point algorithm, for which we offer a plausible explanation. The theoretical claims are backed up with extensive experimentation on noisy artificial datasets, under a variety of geometric and imaging parameters. © 2007 IEEE.
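For readers unfamiliar with Hartley's normalization, the sketch below shows its commonly used isotropic form: translate the image points so their centroid is at the origin, then scale them so their mean distance from the origin is sqrt(2). The function name and the sample coordinates are hypothetical, and the paper's own experimental setup may differ in detail.

```python
import math

def hartley_normalize(points):
    """Translate points to zero centroid and scale to mean distance sqrt(2).

    Returns the normalized points and the 3x3 similarity transform T such
    that [x', y', 1]^T = T [x, y, 1]^T. Assumes the points are not all
    identical, so the mean distance is nonzero.
    """
    n = len(points)
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    mean_dist = sum(math.hypot(x - cx, y - cy) for x, y in points) / n
    s = math.sqrt(2) / mean_dist
    T = [[s, 0.0, -s * cx],
         [0.0, s, -s * cy],
         [0.0, 0.0, 1.0]]
    normalized = [(s * (x - cx), s * (y - cy)) for x, y in points]
    return normalized, T

# Hypothetical pixel coordinates in a 640x480 image.
pixels = [(100.0, 120.0), (540.0, 100.0), (320.0, 400.0), (50.0, 300.0)]
normalized, T = hartley_normalize(pixels)
```

Returning T alongside the points matters here: as the abstract notes, the conditioning has to be reverted before the constraints of the essential matrix are enforced, and T is what makes that reversal possible.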

Keywords: computer vision
