In this paper, a generic rule induction framework based on trajectory series analysis is proposed to learn the event rules. First the trajectories acquired by a tracking system are mapped into a set of primitive event...
详细信息
In this paper, a novel neural network based manifold learning method(NNBML)[1] recently appeared in the Journal of Science is introduced. It can effectively convert high-dimensional data into low-dimensional codes, wh...
详细信息
ISBN:
(纸本)9781601320438
In this paper, a novel neural network based manifold learning method(NNBML)[1] recently appeared in the Journal of Science is introduced. It can effectively convert high-dimensional data into low-dimensional codes, which are then used for classification. However, it performs not well while dealing with small size face database used for face recognition. We propose a solution generating more samples data based on the existing data. The proposed method is implemented on two well-known face databases, viz. ORL and Yale face databases. The experimental results show that NNBML is able to deal with the task of face recognition after more data samples generated using the proposed method, and also that NNBML outperforms LDA in terms of recognition rate.
The success of forensic identification largely depends on the availability of strong evidence or traces that substantiate the prosecution hypothesis that a certain person is guilty of crime. In light of this, extracti...
详细信息
A natural (human) eye can easily detect large visual patterns or objects emerging from spatially distributed discrete entities. This aspect of pattern analysis has been barely addressed in literature. We propose a bio...
详细信息
ISBN:
(纸本)9783540749332
A natural (human) eye can easily detect large visual patterns or objects emerging from spatially distributed discrete entities. This aspect of pattern analysis has been barely addressed in literature. We propose a biologically inspired approach derived from the concept of visual attention to associate together the distributed pieces of macro level patterns. In contrast to the usual approach practiced by the existing models of visual attention, this paper introduces a short-term excitation on the features and locations related to the current focus of attention in parallel to the spatial inhibition of return. This causes the attention system to fixate on analogous units in the scene that may formulate a meaningful global pattern. It is evident from the results of experiments that the outcome of this process can help in widening the scope of intelligent machine vision.
Video cameras are no ionger being used only in their traditional role of providing "Viewable pixels, but are rapidly becoming sources of intelligent information about the world. More recently 3D cameras are being...
详细信息
ISBN:
(纸本)1424411807
Video cameras are no ionger being used only in their traditional role of providing "Viewable pixels, but are rapidly becoming sources of intelligent information about the world. More recently 3D cameras are being developed to directly provide 3D measurements of objects and scenes. Appearance and geometry of objects and scenes, and the temporal dynamics of objects are the key information bearing sources for deriving visual intelligence. This talk will highlight sensor data analysis techniques for creating intelligent representations from 2D and 3D sensors. Intelligent sensor data analytics can be performed for mobile as well as widely distributed static sensor platforms. Applications ranging from 3D video manipulation, 3D situational awareness, wide area surveillance and tracking to video/3D object recognition and fingerprinting will be used to illustrate the work.
Parametric active contours have been used extensively in computervision for different tasks like segmentation and tracking. However, all parametric contours are known to suffer from the problem of frequent bunching a...
详细信息
As a fundamental problem in patternrecognition, graph matching has found a variety of applications in the field of computervision. In graph matching, patterns are modeled as graphs and patternrecognition amounts to...
详细信息
ISBN:
(纸本)9781424416301
As a fundamental problem in patternrecognition, graph matching has found a variety of applications in the field of computervision. In graph matching, patterns are modeled as graphs and patternrecognition amounts to finding a correspondence between the nodes of different graphs. There are many ways in which the problem has been formulated, but most can be cast in general as a quadratic assignment problem, where a linear term in the objective function encodes node compatibility functions and a quadratic term encodes edge compatibility functions. The main research focus in this theme is about designing efficient algorithms for solving approximately the quadratic assignment problem, since it is NP-hard. In this paper, we turn our attention to the complementary problem: how to estimate compatibility functions such that the solution of the resulting graph matching problem best matches the expected solution that a human would manually provide. We present a method for learning graph matching: the training examples are pairs of graphs and the "labels" are matchings between pairs of graphs. We present experimental results with real image data which give evidence that learning can improve the performance of standard graph matching algorithms. In particular, it turns out that linear assignment with such a learning scheme may improve over state-of-the-art quadratic assignment relaxations. This finding suggests that for a range of problems where quadratic assignment was thought to be essential for securing good results, linear assignment, which is far more efficient, could be just sufficient if learning is performed. This enables speed-ups of graph matching by up to 4 orders of magnitude while retaining state-of-the-art accuracy.
Many vision tasks can be posed as Bayesian inference, and the entropy of the posterior probability is a measure for uncertainty of perception, imperceptibility. In this paper, we studied the imperceptibility of multip...
详细信息
ISBN:
(纸本)9780819469502
Many vision tasks can be posed as Bayesian inference, and the entropy of the posterior probability is a measure for uncertainty of perception, imperceptibility. In this paper, we studied the imperceptibility of multiple object tracking, intrackability. Entropy theory and Bayesian framework are used to represent multiple objects intrackability. Intrackability is computed by different kinds of tracking features. Feature selection is crucial for intrackability computation. An example of umbrellas tracking is shown in this paper. The intrackability which is computed by appearance and shape feature is compared. At last, we use intrackability to guide one application-- Automatic grouping. Objects are dynamically merged and tracked as a group when they come close to each other. Automatic grouping reduces the representation when some details can't be perceived. After the intrackable part of the representation is discarded, the computation is reduced.
In this paper, we propose a new image codec, which called embedded zerotree wavelet coefficients envelop coding (EZWCEC). The coefficients envelop is characterized by describing the global tendency of the significant ...
详细信息
ISBN:
(纸本)9780819469502
In this paper, we propose a new image codec, which called embedded zerotree wavelet coefficients envelop coding (EZWCEC). The coefficients envelop is characterized by describing the global tendency of the significant wavelet coefficients. Based on the empirical analysis and experimental results, our EZWCEC algorithm restores the trend of the significant wavelet coefficients and estimates the magnitude of some insignificant coefficients on the decoder. Unlike other zerotree coding algorithms such as Said and Pearlman's SPIHT using three lists, EZWCEC only uses two lists during encoding and decoding. So the memory requirement for the hardware implementation is reduced significantly. Although PSNR values for EZWCEC are lower than SPIHT's and JPEG2000's, our experiment results have shown that EZWCEC can dramatically improve the visual quality of reconstructed at low bit rates (e.g., below 0.1bpp).
We present a novel model for human action categorization. A video sequence is represented as a collection of spatial and spatial-temporal features by extracting static and dynamic interest points. We propose a hierarc...
详细信息
暂无评论