Edge detection is analyzed as a problem in cost minimization. A cost function is formulated that evaluates the quality of edge configurations. A mathematical description of edges is given, and the cost function is ana...
详细信息
Edge detection is analyzed as a problem in cost minimization. A cost function is formulated that evaluates the quality of edge configurations. A mathematical description of edges is given, and the cost function is analyzed in terms of the characteristics of the edges in minimum-cost configurations. The cost function is minimized by the simulated annealing method. A novel set of strategies for generating candidate states and a suitable temperature schedule are presented. Sequential and parallel versions of the annealing algorithm are implemented and compared. Experimental results are presented.< >
Representing images with layers has many important applications, such as video compression, motion analysis, and 3D scene analysis. This paper presents an approach to reliably extracting layers from images by taking a...
详细信息
ISBN:
(纸本)0769512720
Representing images with layers has many important applications, such as video compression, motion analysis, and 3D scene analysis. This paper presents an approach to reliably extracting layers from images by taking advantages of the fact that homographies induced by planar patches in the scene form a low dimensional linear subspace. Layers in the input images will be mapped in the subspace, where it is proven that they form well-defined clusters and can be reliably identified by a simple mean-shift based clustering algorithm. Global optimality is achieved since all valid regions are simultaneously taken into account, and noise can be effectively reduced by enforcing the subspace constraint. Good layer descriptions are shown to be extracted in the experimental results.
A method for detecting and describing the features of faces using deformable templates is described. The feature of interest, an eye for example, is described by a parameterized template. An energy function is defined...
详细信息
A method for detecting and describing the features of faces using deformable templates is described. The feature of interest, an eye for example, is described by a parameterized template. An energy function is defined which links edges, peaks, and valleys in the image intensity to corresponding properties of the template. The template then interacts dynamically with the image, by altering its parameter values to minimize the energy function, thereby deforming itself to find the best fit. The final parameter values can be used as descriptors for the features. This method is demonstrated by showing deformable templates detecting eyes and mouths in real images.< >
The problem of dense optical flow computation is addressed from a variational viewpoint. A new geometric framework is introduced. It unifies previous art and yields new efficient methods. Along with the framework a ne...
详细信息
The problem of dense optical flow computation is addressed from a variational viewpoint. A new geometric framework is introduced. It unifies previous art and yields new efficient methods. Along with the framework a new alignment criterion suggests itself. It is shown that the alignment between the gradients of the optical flow components and between the latter and the intensity gradients is an important measure of the flow’s quality. Adding this criterion as a requirement in the optimization process improves the resulting flow. This is demonstrated in synthetic and real sequences.
We propose a method for deblurring of spatially variant object motion. A principal challenge of this problem is how to estimate the point spread function (PSF) of the spatially variant blur. Based on the projective mo...
详细信息
We propose a method for deblurring of spatially variant object motion. A principal challenge of this problem is how to estimate the point spread function (PSF) of the spatially variant blur. Based on the projective motion blur model of, we present a blur estimation technique that jointly utilizes a coded exposure camera and simple user interactions to recover the PSF. With this spatially variant PSF, objects that exhibit projective motion can be effectively de-blurred. We validate this method with several challenging image examples.
This paper presents a system aimed to serve as the enabling platform for a wearable assistant. The method observes manipulations from a wearable camera and classifies activities from roughly stabilized low resolution ...
详细信息
ISBN:
(纸本)9781424439942
This paper presents a system aimed to serve as the enabling platform for a wearable assistant. The method observes manipulations from a wearable camera and classifies activities from roughly stabilized low resolution images (160×120 pixels) with the help of a 3-level Dynamic Bayesian Network and adapted temporal templates. Our motivation is to explore robust but computationally inexpensive visual methods to perform as much activity inference as possible without resorting to more complex object or hand detectors. The description of the method and results obtained are presented, as well as the motivation for further work in the area of wearable visual sensing.
In particle filter trackers, the object a posteriori distribution is severely distorted under more challenging situations like occlusion. To overcome the problem, this paper proposes a principled manner of augmenting ...
详细信息
ISBN:
(纸本)0769523722
In particle filter trackers, the object a posteriori distribution is severely distorted under more challenging situations like occlusion. To overcome the problem, this paper proposes a principled manner of augmenting the particle filter algorithm with an MRF based representation of the tracked object within a dynamic Bayesian framework, where the object is transformed into a composite of multiple MRF regions. This results in more accurate modeling, thus improving the tracking performance. Additionally, Metropolis based sampling of the regions enhances the tracker with an adaptive ability. Finally, the resultant generative model provides a natural framework to integrate multiple cues. Experiments show good tracking results for challenging situations.
A framework is presented for segmenting shallow structures from their background over a sequence of images. Shallowness is first quantified as affine describability. This is embedded in a tracking system within which ...
详细信息
A framework is presented for segmenting shallow structures from their background over a sequence of images. Shallowness is first quantified as affine describability. This is embedded in a tracking system within which hypothesized model structures undergo a cycle of prediction and model-matching. Structures emerge either as shallow or non-shallow based on their affine trackability. This paper rejects continuity heuristics for purely image motion in favor of temporal continuity defined as the consistency of generic 3-D models, namely shallow structures.< >
In this paper, we describe a real-time vision-based tracking system to help students who are blind or visually impaired (SBVI) to follow instructional discourse that employs graphical illustrations. The vision system ...
详细信息
In this paper, we describe a real-time vision-based tracking system to help students who are blind or visually impaired (SBVI) to follow instructional discourse that employs graphical illustrations. The vision system employs a color model based tracking for both the instructor's pointing behavior and the SBVI's reading behavior, and maps the pointing positions into the same coordinates. Our Haptic Deictic System - HDS system also employs a haptic glove to provide SBVI access to the situated pointing behavior of instructor that is performed in conjunction with speech, and provides the instructor real-time visual feedback on the SBVI's reading actions. Thus, our system supports two-way situated multimodal communication. In this paper, we first introduce our system, and then we discuss studies that show the efficacy of our approach.
We present algorithms for coupling and training hidden Markov models (HMMs) to model interacting processes, and demonstrate their superiority to conventional HMMs in a vision task classifying two-handed actions. HMMs ...
详细信息
We present algorithms for coupling and training hidden Markov models (HMMs) to model interacting processes, and demonstrate their superiority to conventional HMMs in a vision task classifying two-handed actions. HMMs are perhaps the most successful framework in perceptual computing for modeling and classifying dynamic behaviors, popular because they offer dynamic time warping, a training algorithm and a clear Bayesian semantics. However the Markovian framework makes strong restrictive assumptions about the system generating the signal-that it is a single process having a small number of states and an extremely limited state memory. The single-process model is often inappropriate for vision (and speech) applications, resulting in low ceilings on model performance. Coupled HMMs provide an efficient way to resolve many of these problems, and offer superior training speeds, model likelihoods, and robustness to initial conditions.
暂无评论