The focus of visual attention is guided by salient signals in the peripheral field of view (bottom-up) as well as by the relevance feedback of a semantic model (top-down). As a result, humans are able to evaluate new ...
详细信息
ISBN:
(纸本)9789898425980
The focus of visual attention is guided by salient signals in the peripheral field of view (bottom-up) as well as by the relevance feedback of a semantic model (top-down). As a result, humans are able to evaluate new situations very fast, with only a view numbers of fixations. In this paper, we present a learned model for the fast prediction of visual attention in video. We consider bottom-up and memory-less top-down mechanisms of visual attention guidance, and apply the model to video playback-speed adaption. The presented visual attention model is based on rectangle features that are fast to compute and capable of describing the known mechanisms of bottom-up processing, such as motion, contrast, color, symmetry, and others as well as topdown cues, such as face and person detectors. We show that the visual attention model outperforms other recent methods in adaption of video playback-speed.
Different scenarios can be found in land classification and segmentation of satellite images. First, when prior knowledge is available, the training data is generally selected by randomly picking samples within classe...
详细信息
ISBN:
(纸本)9789898425980
Different scenarios can be found in land classification and segmentation of satellite images. First, when prior knowledge is available, the training data is generally selected by randomly picking samples within classes. When no prior knowledge is available the system can pick samples at random among all unlabeled data, which is highly unreliable, and ask the expert to label them or it can rely on the expert collaboration to improve progressively the training data applying an active learning function. We suggest a scheme to tackle the lack of prior knowledge without actively involving the expert, whose collaboration may be expensive. The proposed scheme uses a clustering technique to analyze the feature space and find the most representative samples for being labeled. In this case the expert is just involved in labeling once a reliable training data set for being representative of the features space. Once the training set is labeled by the expert, different classifiers may be built to process the rest of samples. Three different approaches are presented in this paper: the result of the clustering process, a distance based classifier, and support vector machines (SVM).
image representation has always been an important and interesting topic in imageprocessing and patternrecognition. In 1999, Bribiesca introduced a new two dimensional chain code scheme called Vertex Chain Code (VCC)...
详细信息
Most of the agricultural loss and yield evaluation methods require separate plant parameters depending on phenological stages. In this study, an automated imageprocessing based phenological stages detection system fo...
详细信息
ISBN:
(纸本)9781467324946
Most of the agricultural loss and yield evaluation methods require separate plant parameters depending on phenological stages. In this study, an automated imageprocessing based phenological stages detection system for cereal plants is explained. TARIT Project (***) in Turkey aims to predict crop yield and loss rates in crop lands over a vast region in South-Eastern Turkey. The image data is acquired in RGB format at 30minutes period while the accompanied meteorological data is sampled at 10 minutes period. We construct a state machine for each field, and states represent the phonological stages. Each state has different transition function for the next stage related logical state. 7 major stages for wheat and barley are considered to determine transition dates although it could be increased in general. The sampled image is enhanced and normalized by using light rate and surface wetness related sensor data. Four different processes are then applied and their outcomes are used to determine the state transition conditions. The first is vegetation area green pixel ratio after RGB to HIS color space conversion. The second is change detection with respect to a virtual reference or recognized pattern for growth rate measurement. The fourth is probabilistic supporting stage transition factors based on agro-meteorological parameters. We set state transition functions depending on combination of these parameters with state dependent thresholds. It has shown that phenological stage transition dates of cereal plants can automatically be assigned less than a week tolerance with respect to expert diagnosis, by using the proposed image based state transition model.
This paper presents an efficient shape matching method based on XML data, we extract the contour of the shape and this one is represented by set of points. Using corner detection method for representing the contour by...
详细信息
This paper proposes a new framework of band selection for object classification in hyperspectral images. Different from traditional approaches where the selected bands are shared from all classes, in this work, differ...
详细信息
ISBN:
(纸本)9781467322164
This paper proposes a new framework of band selection for object classification in hyperspectral images. Different from traditional approaches where the selected bands are shared from all classes, in this work, different subsets of bands are selected for different class pairs. Without prior knowledge of spectral database, we estimate the spectral characteristic of objects with the labeled and unlabeled samples, benefiting from the concept of semi-supervised learning, Under the assumption of Gaussian mixture model (GMM), the vectors of mean values and covariance matrices for each class are estimated. The separabilities for all pairs of classes are thus calculated on each band. The bands with the highest separabilities are then selected. To validate our band selection result, support vector machine (SVM) is employed using a strategy of one against one (OAO). Experiments are conducted on a real data set of hyperspectral image, and the results can validate our algorithm.
Understanding human motions can be posed as a patternrecognition problem. Applications of patternrecognition in information processing problems are diverse ranging from Speech, Handwritten character recognition to m...
详细信息
Understanding human motions can be posed as a patternrecognition problem. Applications of patternrecognition in information processing problems are diverse ranging from Speech, Handwritten character recognition to medical research and astronomy. Humans express time-varying motion patterns (gestures), such as a wave, in order to convey a message to a recipient. If a computer can detect and distinguish these human motion patterns, the desired message can be reconstructed, and the computer can respond appropriately. This paper represents a framework for a human computer interface capable of recognizing gestures from the Indian sign language. The complexity of Indian sign language recognition system increases due to the involvement of both the hands and also the overlapping of the hands. Alphabets and numbers have been recognized successfully. This system can be extended for words and sentences recognition is done with PCA (Principal Component analysis). This paper also proposes recognition with neural networks. Further it is proposed that number of finger tips and the distance of fingertips from the centroid of the hand can be used along with PCA for robustness and efficient results.
Several effective machinelearning and patternrecognition schemes have been developed for medical imaging. Although many classifiers have been used with computer-aided detection (CAD) for computed tomographic colonog...
详细信息
This book constitutes the refereed proceedings of the 4th internationalconference on patternrecognition and machine Intelligence, PReMI 2011, held in Moscow, Russia in June/July 2011. The 65 revised papers presented...
ISBN:
(数字)9783642217869
ISBN:
(纸本)9783642217852
This book constitutes the refereed proceedings of the 4th internationalconference on patternrecognition and machine Intelligence, PReMI 2011, held in Moscow, Russia in June/July 2011. The 65 revised papers presented together with 5 invited talks were carefully reviewed and selected from 140 submissions. The papers are organized in topical sections on patternrecognition and machinelearning; image analysis; image and video information retrieval; natural language processing and text and data mining; watermarking, steganography and biometrics; soft computing and applications; clustering and network analysis; bio and chemo analysis; and document imageprocessing.
This paper aims that analysing neural network method in patternrecognition. A neural network is a processing device, whose design was inspired by the design and functioning of human brain and their components. The pr...
详细信息
ISBN:
(纸本)9783642240362;9783642240379
This paper aims that analysing neural network method in patternrecognition. A neural network is a processing device, whose design was inspired by the design and functioning of human brain and their components. The proposed solutions focus on applying Feature recognition Neural Network model for patternrecognition. The primary function of which is to retrieve in a patternstored in memory, when an incomplete or noisy version of that pattern is presented. An associative memory is a storehouse of associated patterns that are encoded in some form. In auto-association, an input pattern is associated with itself and the states of input and output units coincide. When the storehouse is incited with a given distorted or partial pattern, the associated pattern pair stored in its perfect form is recalled. patternrecognition techniques are associated a symbolic identity with the image of the pattern. This problem of replication of patterns by machines (computers) involves the machine printed patterns. There is no idle memory containing data and programmed, but each neuron is programmed and continuously active.
暂无评论