ISBN: (print) 0818684976
Markov Random Fields (MRF's) can be used for a wide variety of vision problems. In this paper we focus on MRF's with two-valued clique potentials, which form a generalized Potts model. We show that the maximum a posteriori estimate of such an MRF can be obtained by solving a multiway minimum cut problem on a graph. We develop efficient algorithms for computing good approximations to the minimum multiway cut. The visual correspondence problem can be formulated as an MRF in our framework; this yields quite promising results on real data with ground truth. We also apply our techniques to MRF's with linear clique potentials.
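To make the objective in the abstract above concrete, here is a minimal sketch of a generalized Potts energy: each site pays a data cost for its label, and each neighboring pair pays a fixed penalty only when its two labels differ (the "two-valued" potential). The function names, the toy chain, and all numbers are assumptions made for illustration. The exhaustive search is only a stand-in for finding the MAP labeling on a tiny example; it is not the multiway-cut approximation the abstract describes.

```python
from itertools import product


def potts_energy(labels, data_cost, edges):
    """Energy of a labeling under a generalized Potts model (illustrative form).

    labels:    sequence, labels[p] is the label assigned to site p
    data_cost: data_cost[p][l] is the cost of giving site p the label l
    edges:     dict mapping (p, q) -> penalty paid when labels[p] != labels[q]
    """
    unary = sum(data_cost[p][l] for p, l in enumerate(labels))
    pairwise = sum(u for (p, q), u in edges.items() if labels[p] != labels[q])
    return unary + pairwise


def brute_force_map(data_cost, edges, num_labels):
    """Exhaustive minimum-energy search; only feasible for toy problems."""
    n = len(data_cost)
    return min(
        product(range(num_labels), repeat=n),
        key=lambda f: potts_energy(f, data_cost, edges),
    )


# Hypothetical 4-site chain with 3 labels; the costs are made up.
data_cost = [
    [0, 5, 5],
    [4, 1, 5],
    [5, 0, 5],
    [5, 5, 0],
]
edges = {(0, 1): 2, (1, 2): 2, (2, 3): 2}

best = brute_force_map(data_cost, edges, num_labels=3)
print(best, potts_energy(best, data_cost, edges))
```

The exhaustive search visits every one of the `num_labels ** n` labelings, which is why practical methods turn to graph-cut approximations instead.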
We develop an integrated, probabilistic model for the appearance and three-dimensional geometry of cluttered scenes. Object categories are modeled via distributions over the 3D location and appearance of visual featur...
In this paper, we address a novel problem of automatically creating a picture collage from a group of images. Picture collage is a kind of visual image summary - to arrange all input images on a given canvas, allowing...
In this paper we study the role of dynamics in dimensionality reduction problems applied to sequences. We propose a new family of marginal auto-regressive (MAR) models that describe the space of all stable auto-regres...
ISBN: (print) 0818684976
Different instances of a handwritten word consist of the same basic features (humps, cusps, crossings, etc.) arranged in a deformable spatial pattern. Thus, keywords in cursive text can be detected by looking for the appropriate features in the "correct" spatial configuration. A keyword can be modeled hierarchically as a set of word fragments, each of which consists of lower-level features. To allow flexibility, the spatial configuration of keypoints within a fragment is modeled using a Dryden-Mardia (DM) probability density over the shape of the configuration. In a writer-dependent test on a transcription of the Declaration of Independence (about 1300 words, about 7500 characters), the method detected all eleven instances of the keyword "government" with only four false positives.
In this paper, we propose a novel local steerable phase (LSP) feature, extracted from the face image using a steerable filter, for face representation and recognition. A steerable filter is a kind of oriented filter. It is...
Background subtraction is a widely used paradigm to detect moving objects in video taken from a static camera and is used for various important applications such as video surveillance, human motion analysis, etc. Vari...
We describe a method to align ASL video subtitles with a closed-caption transcript. Our alignments are partial, based on spotting words within the video sequence, which consists of joined (rather than isolated) signs ...
Activity recognition is an important issue in building intelligent monitoring systems. We address the recognition of multilevel activities in this paper via a conditional Markov random field (MRF), known as the dynami...