We present an original approach for non parametric motion analysis in image sequences. It relies on the statistical modeling of distributions of local motion-related measurements computed over image sequences. Contrar...
详细信息
ISBN:
(纸本)0769512720
We present an original approach for non parametric motion analysis in image sequences. It relies on the statistical modeling of distributions of local motion-related measurements computed over image sequences. Contrary to previously proposed methods, the use of temporal multiscale Gibbs models allows us to handle in a unified statistical framework both spatial and temporal aspects of motion content. The important feature of our probabilistic scheme is to make the exact computation of conditional likelihood functions feasible and simple. It enables us to straightforwardly achieve model estimation according to the ML criterion and to benefit from a statistical point of view for classification issues. We have conducted motion recognition experiments over a large set of real image sequences comprising various motion types such as temporal texture samples, human motion examples and rigid motion situations.
A robust approach for super-resolution is, presented, which is especially valuable in the presence of outliers. Such outliers may be due to motion errors, inaccurate blur models, noise, moving objects, motion blur etc...
详细信息
ISBN:
(纸本)0769512720
A robust approach for super-resolution is, presented, which is especially valuable in the presence of outliers. Such outliers may be due to motion errors, inaccurate blur models, noise, moving objects, motion blur etc. This robustness is needed since super-resolution methods are very sensitive to such errors. A robust median estimator is combined in an iterative process to achieve a super resolution algorithm. This process can increase resolution even in regions with outliers, where other super resolution methods actually degrade the image.
We propose a fast and memory efficient encoding strategy for text image compression with the JBIG2 standard. The encoder splits up the input image into horizontal stripes and encodes one stripe at a time. Construction...
详细信息
We propose a fast and memory efficient encoding strategy for text image compression with the JBIG2 standard. The encoder splits up the input image into horizontal stripes and encodes one stripe at a time. Construction of the current dictionary is based on updating dictionaries from previous stripes. We describe separate updating processes for the singleton exclusion dictionary and for the modified-class dictionary. Experiments show that, for both dictionaries, splitting the page into two stripes can save 30% of encoding time and 40% of physical memory with a small loss of about 1.5% in compression. Further gains can be obtained by using more stripes but with diminishing returns. The same updating processes are also applied to compressing multi-page document images and shown to improve compression by 8-10% over coding a multi-page document as a collection of single-page documents.
The problem of binarization of gray level images acquired under nonuniform illumination is reconsidered. Yanowitz and Bruckstein (1989) proposed to use an adaptive threshold surface, determined by interpolation of the...
详细信息
ISBN:
(纸本)0769512720
The problem of binarization of gray level images acquired under nonuniform illumination is reconsidered. Yanowitz and Bruckstein (1989) proposed to use an adaptive threshold surface, determined by interpolation of the image gray levels at points where the image gradient is high. The rationale is that a high image gradient indicates probable object edges, and there the image values are between the object and background gray levels. The threshold surface was determined by successive overrelaxation as the solution of the Laplace equation. This work proposes a different method to determine an adaptive threshold surface. In this new method, inspired by multiresolution approximation, the threshold surface is constructed with considerably lower computational complexity and is smooth, yielding faster image binarizations and better visual performance.
We propose a novel algorithm for efficient and robust pose determination of vehicles in traffic scenes from single monocular intensity images using calibrated cameras. We consider the pose determination process as a s...
详细信息
ISBN:
(纸本)0780367251
We propose a novel algorithm for efficient and robust pose determination of vehicles in traffic scenes from single monocular intensity images using calibrated cameras. We consider the pose determination process as a series of evolutions from initial pose to correct pose in 3D space, which can be decomposed into two independent 3D motions: translation and rotation. The translation parameters are obtained based on point-to-line-segment distance (PLS distance), while the rotation parameters are determined by geometric relationships among a set of specially constructed but imaginary planes. Closed-form solutions to both sub-problems are obtained, thus avoiding the usual shortcoming of relatively high computational cost of traditional 3D-model based approaches. In addition, vertex neighborhood constraint (VNC) is introduced to improve the robustness of the method. Experimental results show that the algorithm works well even under severe occlusion and clutter.
This paper develops an evaluation of the position probability of a point C which is known to be in a direction /spl beta/ with respect to a point B, itself in the direction /spl alpha/ with respect to another point A....
详细信息
This paper develops an evaluation of the position probability of a point C which is known to be in a direction /spl beta/ with respect to a point B, itself in the direction /spl alpha/ with respect to another point A. The obtained results can be used in the problem of inference of directional relationships in the case of spatial reasoning.
In this paper, an algorithm of automated cartridge identification for firearm authentication is proposed. The ejector impression is used to calibrate the cartridge image. Features of the firing pin impression and the ...
详细信息
ISBN:
(纸本)0769512720
In this paper, an algorithm of automated cartridge identification for firearm authentication is proposed. The ejector impression is used to calibrate the cartridge image. Features of the firing pin impression and the breach face impression are extracted using an active snake model and local orientation analysis, respectively. Different features are then integrated to make a final decision using a support vector machine. Experimental results illustrate the effectiveness of our algorithm.
This paper deals with the problem of estimating structure and motion from long continuous image sequences, applying the expectation maximization algorithm based on an extended Kalman smoother to impose time-continuity...
详细信息
ISBN:
(纸本)0769512720
This paper deals with the problem of estimating structure and motion from long continuous image sequences, applying the expectation maximization algorithm based on an extended Kalman smoother to impose time-continuity of the motion parameters. By repeatedly estimating the state transition matrix of the dynamic equation and the parameters of noise processes in dynamic and measurement equations, this optimization gives maximum likelihood estimates of the motion and structure parameters. Practically, this research is essential for dealing with a long video-rate image sequence with partially unknown system equation and noise. The algorithm is implemented and tested for a real image sequence.
Locally parallel dense patterns - sometimes called texture flows define a perceptually coherent structure which is important to image segmentation, edge classification, shading analysis, and shape interpretation. This...
详细信息
ISBN:
(纸本)0769512720
Locally parallel dense patterns - sometimes called texture flows define a perceptually coherent structure which is important to image segmentation, edge classification, shading analysis, and shape interpretation. This paper develops the notion of texture flow from a geometrical point of view to argue that local measurements of such structures must incorporate two curvatures. We show how basic theoretical considerations lead to a unique model for the local behavior of the flow and allow for the specification of consistency constraints between nearby measurements. The computation of globally coherent structure via neighborhood relationships is demonstrated on synthetic and natural images, and is compared to orientation diffusion.
This paper addresses the problem of segmentation of moving objects in image sequences, which is of key importance in content-based applications. We transform the problem into a graph labeling problem over a region adj...
详细信息
ISBN:
(纸本)0769512720
This paper addresses the problem of segmentation of moving objects in image sequences, which is of key importance in content-based applications. We transform the problem into a graph labeling problem over a region adjacency graph (RAG), by introducing a Markov random field (MRF) model based on spatio-temporal information. The initial partition is obtained by fast, color-based watershed segmentation. The motion of each region is estimated and validated in a hierarchical framework. A dynamic memory, based on object tracking, is incorporated into the segmentation process to maintain temporal coherence. The performance of the algorithm is evaluated on several real-world image sequences.
暂无评论