With the rapid development of computer science, image processing, and patternrecognition, people are paying more and more attention on research and application of machine vision, which has resulted in excellent resul...
详细信息
ISBN:
(纸本)9783037856932
With the rapid development of computer science, image processing, and patternrecognition, people are paying more and more attention on research and application of machine vision, which has resulted in excellent results in many fields. Depending on the features of untouched, fast-speed, real-time, proper precision, anti-interference, the inspection technology based on machine vision has been researched deeply and comprehensive applied aboard. Recently, the size measurement technology based on machine vision has been applied in the size measurement of the machine processing parts. A case study of typical regular mechanical parts, algorithm of position and size measurement of line, circle center and radius are researched, and good results are got, which established an academic foundation for further measurement of complex parts.
In this work, we propose Bossallova, a novel representation for content-based concept detection in images and videos, which enriches the Bag-of-Words model. Relying on the quantization of highly discriminant local des...
详细信息
In this work, we propose Bossallova, a novel representation for content-based concept detection in images and videos, which enriches the Bag-of-Words model. Relying on the quantization of highly discriminant local descriptors by a codebook, and the aggregation of those quantized descriptors into a single pooled feature vector, the Bag-of-Words model has emerged as the most promising approach for concept detection on visual documents. Bossallova enhances that representation by keeping a histogram of distances between the descriptors found in the image and those in the codebook, preserving thus important information about the distribution of the local descriptors around each codeword. Contrarily to other approaches found in the literature, the non-parametric histogram representation is compact and simple to compute. Bossallova compares well with the state-of-the-art in several standard datasets: MIRFLICKR, ImageCLEF 2011, PASCAL VOC 2007 and 15-Scenes, even without using complex combinations of different local descriptors. It also complements well the cutting-edge Fisher Vector descriptors, showing even better results when employed in combination with them. Bossallova also shows good results in the challenging real-world application of pornography detection. (C) 2012 Elsevier Inc. All rights reserved.
Grammars are widely used to describe string languages such as programming and natural languages and, more recently, biosequences. Moreover, since the 1980s grammars have been used in computervision and related areas....
详细信息
Grammars are widely used to describe string languages such as programming and natural languages and, more recently, biosequences. Moreover, since the 1980s grammars have been used in computervision and related areas. Some factors accountable for this increasing use regard its relatively simple understanding and its ability to represent some semantic pattern models found in images, both spatially and temporally. The objective of this article is to present an overview regarding the use of syntactic patternrecognition methods in image representations in several applications. To achieve this purpose, we used a systematic review process to investigate the main digital libraries in the area and to document the phases of the study in order to allow the auditing and further investigation. The results indicated that in some of the studies retrieved, manually created grammars were used to comply with a particular purpose. Other studies performed a learning process of the grammatical rules. In addition, this article also points out still unexplored research opportunities in the literature.
This work investigates feature-based techniques for component-face recognition on one of the most difficult tasks: recognition of identical twins. The challenge with solving face recognition for identical twins is to ...
详细信息
This work investigates feature-based techniques for component-face recognition on one of the most difficult tasks: recognition of identical twins. The challenge with solving face recognition for identical twins is to find a feature extraction and template formulation approach that sufficiently separates the identical twins in match space. This work extends this premise to investigate which components of the face (eye areas, nose, or mouth) are the most discriminative features. This work uses a patch-based feature extractor and compares it to two well-known texture techniques: LBP and HOG, for the full face and the component face. We show that a single face component, the eye area, nearly matches the performance of the full face and that a simple fusion of the components outperforms the full face face on the recognition task. Further we demonstrate that the proposed feature extractor does not exhibit gender biases as does some face recognition systems, i.e. it performs almost equally on males and females. And, finally we investigate the claims that face recognition becomes easier as the twins grow older. This work adds a final contribution by experimenting on the largest public identical twins corpora available to date: ND/WVU (2009, 2010, 2011) and the CASIA Twins Face Dataset.
Recent work in computervision has addressed zero-shot learning or unseen class detection, which involves categorizing objects without observing any training examples. However, these problems assume that attributes or...
详细信息
ISBN:
(纸本)9781467364102
Recent work in computervision has addressed zero-shot learning or unseen class detection, which involves categorizing objects without observing any training examples. However, these problems assume that attributes or defining characteristics of these unobserved classes are known, leveraging this information at test time to detect an unseen class. We address the more realistic problem of detecting categories that do not appear in the dataset in any form. We denote such a category as an unfamiliar class;it is neither observed at train time, nor do we possess any knowledge regarding its relationships to attributes. This problem is one that has received limited attention within the computervision community. In this work, we propose a novel approach to the unfamiliar class detection task that builds on attribute-based classification methods, and we empirically demonstrate how classification accuracy is impacted by attribute noise and dataset "difficulty," as quantified by the separation of classes in the attribute space. We also present a method for incorporating human users to overcome deficiencies in attribute detection. We demonstrate results superior to existing methods on the challenging CUB-200-2011 dataset.
The proceedings contain 120 papers. The topics discussed include: research on the safety evaluation index of urban rail transit network operation;evaluation information integration model on book purchasing bids;a new ...
ISBN:
(纸本)9780819490261
The proceedings contain 120 papers. The topics discussed include: research on the safety evaluation index of urban rail transit network operation;evaluation information integration model on book purchasing bids;a new distributed systems scheduling algorithm: a swarm intelligence approach;hardware architecture of OFCE-HS for hardware implementation;optimization of knowledge sharing through multi-forum using cloud computing architecture;a parallel encryption algorithm for dual-core processor based on chaotic map;a novel architecture for information retrieval system based on semantic web;investigating the effect of software project type on accuracy of software development effort estimation in COCOMO model;a comprehensive framework for detecting and tracking space objects;design and implementation of multilayer cache strategy of web system based on seam;and a proposed algorithm for solving fuzzy multi-objectives linear programming.
This book summarizes his recent research work on Intelligent Vehicles. The target audience of the book is researchers, professionals and graduate students interested in a broad scientific field that includes not only ...
详细信息
This book summarizes his recent research work on Intelligent Vehicles. The target audience of the book is researchers, professionals and graduate students interested in a broad scientific field that includes not only basic principles, such as signal-image processing and patternrecognition, but also targeted fields of application, such as Intelligent Transportation Systems (ITS) and, more specifically, Intelligent Vehicles (IVs). The field of IVs includes a wide range of technologies spanning from vehicle dynamics to information, communications, hardware, computervision, artificial intelligence, ergonomics and human factors.
Combining multiple low-level visual features is a proven and effective strategy for a range of computervision tasks. However, limited attention has been paid to combining such features with information from other mod...
详细信息
ISBN:
(纸本)9781467312288
Combining multiple low-level visual features is a proven and effective strategy for a range of computervision tasks. However, limited attention has been paid to combining such features with information from other modalities, such as audio and videotext, for large scale analysis of web videos. In our work, we rigorously analyze and combine a large set of low-level features that capture appearance, color, motion, audio and audio-visual co-occurrence patterns in videos. We also evaluate the utility of high-level (i.e., semantic) visual information obtained from detecting scene, object, and action concepts. Further, we exploit multimodal information by analyzing available spoken and videotext content using state-of-the-art automatic speech recognition (ASR) and videotext recognition systems. We combine these diverse features using a two-step strategy employing multiple kernel learning (MKL) and late score level fusion methods. Based on the TRECVID MED 2011 evaluations for detecting 10 events in a large benchmark set of similar to 45000 videos, our system showed the best performance among the 19 international teams.
暂无评论