The recognition of screen-rendered text is a novel task. It is performed e.g. by translation tools which allow users to click on any text on the screen and give a translation. Also some commercial OCR programs start t...
详细信息
The recognition of screen-rendered text is a novel task. It is performed e.g. by translation tools which allow users to click on any text on the screen and give a translation. Also some commercial OCR programs start to address the problem of reading screenshots. Optical character recognition on screen-shot images can be very challenging due to very small and smoothed fonts. In order to build and compare recognition approaches for screen-rendered text, the availability of standard databases is a fundamental prerequisite. In this paper two freely available databases are presented, one that consists of annotated screenshot images of 28080 single characters and another holding 400 words extracted from documents plus 2 400 generated isolated words. Both databases include meta-information such as x-height, font type, style and rendering conditions. At the example of a developed recognition system, it is shown how these databases can serve for training, testing and optimization.
In this paper a new robotized inspection system for PCBs is presented. Electrical and optical tests are carried out by the system, using four mobile probes where two micro CCD cameras are mounted. This new approach be...
详细信息
In this paper a new robotized inspection system for PCBs is presented. Electrical and optical tests are carried out by the system, using four mobile probes where two micro CCD cameras are mounted. This new approach becomes a costless and quick setup alternative to the currently used spiked beds for those applications of prototyping developments where maximum flexibility is demanded for the inspection system. A computervision system provides the capacity for optical inspections where visual tests, like part presence/absence or polarity, can be performed.
Recognizing and annotating the occurrence of team actions in observations of embodied agents has applications in surveillance or in training of military or sport teams. We describe the team actions through a spatio-te...
详细信息
Recognizing and annotating the occurrence of team actions in observations of embodied agents has applications in surveillance or in training of military or sport teams. We describe the team actions through a spatio-temporal correlated pattern of movement, which can be modeled by a hidden Markov model. The hand-crafting of these models is a difficult task of knowledge engineering, even in application domains where explicit, natural language descriptions of the team actions are available. The main contribution of this paper is an approach through which the library of HMM representations can be acquired from a small number of hand annotated, representative samples of the specific movement patterns. A series of experiments, performed on a dataset describing a real-world terrestrial warfare exercise validates our method and shows good recognition accuracy even in the presence of noisy data. The speed of the recognition engine is sufficiently fast to allow real time annotation of incoming observations.
Perceptual surface roughness classification describes how a surface's texture feels haptically in terms of perceptual categories such as smooth, rough, bumpy, etc. computervision and patternrecognition algorithm...
详细信息
Perceptual surface roughness classification describes how a surface's texture feels haptically in terms of perceptual categories such as smooth, rough, bumpy, etc. computervision and patternrecognition algorithms which estimate a surface's perceptual roughness have a wide range of application areas including robotics, assistive devices, telesurgery and teleperception. In this paper, we propose a novel approach to perceptual surface roughness classification that, unlike previous approaches, is designed to handle multiple roughness categories within the same image. The steps of our approach include (1) texton extraction and classification using a multi-class, non-linear Support Vector Machine; (2) segmentation using the Iterated Conditional Modes algorithm; and (3) overall perceptual roughness classification using a Nearest Neighbor classifier. The proposed approach is evaluated using visio-haptic subjective measures of roughness on images of the 3D texture of real world objects.
We consider the problem of estimating parameters of a model described by a system of equations which underlies a wide class of computervision applications. One method to solve such a problem is the fundamental numeri...
详细信息
ISBN:
(纸本)9781424431618;9780769530673
We consider the problem of estimating parameters of a model described by a system of equations which underlies a wide class of computervision applications. One method to solve such a problem is the fundamental numerical scheme (FNS) previously proposed by some of the authors. In this paper, a more stable version of FNS is developed, with better convergence properties than the original version. The improvement in performance is achieved by reducing the original estimation problem to a couple of problems of lower dimension. By way of example, the new algorithm has been applied to the problem of estimating the trifocal tensor relating three views of a scene. Experiments carried out with both synthetic and real images reveal the new estimator to be more stable compared to the original FNS method, and commensurate in accuracy with the Gold Standard maximum likelihood estimator.
In this paper, a method automating the task of document editing is proposed. The methodology involves a new algorithm to recognize hand written symbols and their positions in printed document. Further, a software pack...
详细信息
ISBN:
(纸本)9780769530505;0769530508
In this paper, a method automating the task of document editing is proposed. The methodology involves a new algorithm to recognize hand written symbols and their positions in printed document. Further, a software package is developed to automate the task of incorporating the suggested corrections (on a hard copy) into the soft copy. The package is develop using patternrecognition and image processing techniques.
The following topics are dealt with: machine learning; signal processing; computer graphics; computervision; patternrecognition; security; information assurance; computer networks; P2P networks; embedded systems; sy...
The following topics are dealt with: machine learning; signal processing; computer graphics; computervision; patternrecognition; security; information assurance; computer networks; P2P networks; embedded systems; system architecture; wireless sensor networks; high performance network control; high performance network management; optical networks; database management; information retrieval; document and text processing; data and software engineering; information systems applications; data management and algorithms.
A universal software framework for hierarchical object recognition has been devised based on V. B. Mountcastle's observation (Mountcastle, 1978) that the human cortex consists of the same basic functionality which...
详细信息
A universal software framework for hierarchical object recognition has been devised based on V. B. Mountcastle's observation (Mountcastle, 1978) that the human cortex consists of the same basic functionality which is used to subdivide the complex computations into elementary matching tasks, independently whether auditory, visual, olfactory, haptic or any other sensory information is presented at the input. Combined with a unique vision sensor system that is capable to directly extract contrast magnitude and direction, a powerful combination of hardware and software for real-time pattern-recognition tasks at very low power-consumption levels is presented
Document clustering without any prior knowledge or background information is a challenging problem. In this paper, we propose SS-NMF: a semi-supervised non- negative matrix factorization framework for document cluster...
详细信息
Document clustering without any prior knowledge or background information is a challenging problem. In this paper, we propose SS-NMF: a semi-supervised non- negative matrix factorization framework for document clustering. In SS-NMF, users are able to provide supervision for document clustering in terms of pairwise constraints on a few documents specifying whether they "must" or "cannot" be clustered together. Through an iterative algorithm, we perform symmetric tri-factorization of the document- document similarity matrix to infer the document clusters. Theoretically, we show that SS-NMF provides a general framework for semi-supervised clustering and that existing approaches can be considered as special cases of SS-NMF. Through extensive experiments conducted on publicly available data sets, we demonstrate the superior performance of SS-NMF for clustering documents.
暂无评论