A given (overcomplete) discrete oriented pyramid may be converted into a steerable pyramid by interpolation. We present a technique for deriving the optimal interpolation functions (otherwise called 'steering coef...
详细信息
A given (overcomplete) discrete oriented pyramid may be converted into a steerable pyramid by interpolation. We present a technique for deriving the optimal interpolation functions (otherwise called 'steering coefficients'). The proposed scheme is demonstrated on a computationally efficient oriented pyramid, which is a variation on the Burt and Adelson (1983) pyramid. We apply the generated steerable pyramid to orientation-invariant texture analysis in order to demonstrate its excellent rotational isotropy. High classification rates and precise rotation identification are demonstrated.< >
Human-object interaction (HOI) detection is a core task in computervision. The goal is to localize all human-object pairs and recognize their interactions. An interaction de-fined by a tuple leads to a long-tailed vi...
详细信息
ISBN:
(数字)9781728193601
ISBN:
(纸本)9781728193618
Human-object interaction (HOI) detection is a core task in computervision. The goal is to localize all human-object pairs and recognize their interactions. An interaction de-fined by a tuple leads to a long-tailed visual recognition challenge since many combinations are rarely represented. The performance of the proposed models is limited especially for the tail categories, but little has been done to understand the reason. To that end, in this paper, we propose to diagnose rarity in HOI detection. We propose a three-step strategy, namely Detection, Identification and recognition where we carefully analyse the limiting factors by studying state-of-the-art models. Our findings indicate that detection and identification steps are altered by the interaction signals like occlusion and relative location, as a result limiting the recognition accuracy.
This paper presents three methods for hand gesture detection and recognition that can be applied to online video browsing. These methods aim at recognizing hand signs and positions using a single webcam, which can in ...
详细信息
ISBN:
(纸本)9781424455935
This paper presents three methods for hand gesture detection and recognition that can be applied to online video browsing. These methods aim at recognizing hand signs and positions using a single webcam, which can in turn, be used to control a broadband-enabled HDTV. The hand gesture can be trained to suit the user preference. We first provide an analysis of pattern matching, histogram back projection, and the use of Fourier's descriptors. These methods achieve good reliability and acceptable resource consumption. We compare these methods with a new method based on H.264 motion vectors that directly analyzes video in the compressed domain. It will be shown that this technique provides a faster and accurate way to recognize motion trajectories that may correspond to letters or alphabets. The extracted gesture or trajectory information can then be used for various multimedia applications, including improving human-TV interaction.
Attention allocation in visual search is known to be influenced by low-level image features, visual scene context and top down task constraints. Here, we investigate the role of Contextual priors in guiding visual sea...
详细信息
Attention allocation in visual search is known to be influenced by low-level image features, visual scene context and top down task constraints. Here, we investigate the role of Contextual priors in guiding visual search by monitoring eye movements as participants search very familiar scenes for a target object. The goal of the study is to identify which stage of the visual search benefits from contextual priors. Two groups of participants differed in the expectation of target presence associated with a scene. Stronger priors are established when a scene exemplar is always associated with the presence of the target than when the scene is periodically observed with and without the target. In both cases, overall search performance improves over repeated presentations of scenes. An analytic decomposition of the time course of the effect of contextual priors shows a time benefit to the exploration stage of search (scan time) and a decrease in gaze duration on the target. The strength of the contextual relationship modulates the magnitude of gaze duration gain, while the scan time gain constitutes one half of the overall search performance benefit regardless of the probability (50% or 100%) of target presence. These data are discussed in terms of the implications of contextdependent scene processing and its putative role in various stages of visual search.
rd It is a pleasure and an honour both to organize ICB 2009, the 3 IAPR/ieee Inter- tional conference on Biometrics. This will be held 2–5 June in Alghero, Italy, hosted by the computervision Laboratory, University ...
详细信息
ISBN:
(数字)9783642017933
ISBN:
(纸本)9783642017926
rd It is a pleasure and an honour both to organize ICB 2009, the 3 IAPR/ieee Inter- tional conference on Biometrics. This will be held 2–5 June in Alghero, Italy, hosted by the computervision Laboratory, University of Sassari. The conference series is the premier forum for presenting research in biometrics and its allied technologies: the generation of new ideas, new approaches, new techniques and new evaluations. The ICB series originated in 2006 from joining two highly reputed conferences: Audio and Video Based Personal Authentication (AVBPA) and the International conference on Biometric Authentication (ICBA). Previous conferences were held in Hong Kong and in Korea. This is the first time the ICB conference has been held in Europe, and by Programme Committee, arrangements and by the quality of the papers, ICB 2009 will continue to maintain the high standards set by its predecessors. In total we received around 250 papers for review. Of these, 36 were selected for oral presentationand 93 for poster presentation. These papers are accompanied by the invited speakers: Heinrich H. Bülthoff (Max Planck Institute for Biological Cybernetics, Tüb- gen, Germany) on “What Can Machine vision Learn from Human Perception?”, - daoki Furui (Department of computer Science, Tokyo Institute of Technology) on “40 Years of Progress in Automatic Speaker recognition Technology” and Jean-Christophe Fondeur (SAGEM Security and Morpho, USA) on “Large Scale Deployment of Biom- rics and Border Control”.
Recovery of motion parameters and point correspondences is a fundamental problem in computervision. Although a great deal of research has been done in solving rigid motion, nonrigid motion analysis has only recently ...
详细信息
Recovery of motion parameters and point correspondences is a fundamental problem in computervision. Although a great deal of research has been done in solving rigid motion, nonrigid motion analysis has only recently been addressed and is gaining interest due to its wide range of applications. This paper introduces a novel method for estimating motion parameters and point correspondences between surfaces under small nonrigid deformations. It uses the changes in differential geometric properties of surface under motion. Simulations are performed by generating nonrigid motion on an ellipsoidal data to illustrate performance and accuracy of derived algorithms, Then, the algorithm is tested on the sequence of facial range images. The motion parameters generated by the algorithm has also been used do detect the abnormality in cardiac images.< >
This paper unifies "line-process" approaches for regularization with discontinuities and robust estimation techniques. We generalize the notion of a "line process" to that of an analog "outlie...
详细信息
This paper unifies "line-process" approaches for regularization with discontinuities and robust estimation techniques. We generalize the notion of a "line process" to that of an analog "outlier process" and show that a problem formulated in terms of outlier processes can be viewed in terms of robust statistics. We also characterize a class of robust statistical problems for which an equivalent outlier-process formulation exists and give a straightforward method for converting a robust estimation problem into an outlier-process formulation. This outlier-processes approach provides a general framework which subsumes the traditional line-process approaches as well as a wide class of robust estimation problems. Examples in image reconstruction and optical flow are used to illustrate the approach.< >
We propose an affine framework for perspective views, captured by a single extremely simple equation based on a viewer-centered invariant we call relative affine structure. Via a number of corollaries of our main resu...
详细信息
We propose an affine framework for perspective views, captured by a single extremely simple equation based on a viewer-centered invariant we call relative affine structure. Via a number of corollaries of our main results we show that our framework unifies previous work-including Euclidean, projective and affine-in a natural and simple way. Finally, the main results were applied to a real image sequence for purpose of 3D reconstruction from 2D views.< >
We describe a system for detection and description of buildings in aerial scenes. This is a difficult task as the aerial images contain a variety of objects. Low-level segmentation processes give highly fragmented seg...
详细信息
We describe a system for detection and description of buildings in aerial scenes. This is a difficult task as the aerial images contain a variety of objects. Low-level segmentation processes give highly fragmented segments due to a number of reasons. We use a perceptual grouping approach to collect these fragments and discard those that come from other sources. We use shape properties of the buildings for this. We use shadows to help form and verify the hypotheses generated by the grouping process. This latter step also provides 3-D descriptions of the buildings. Our system has been tested on a number of examples and is able to work with overhead or oblique views.< >
The 9th International conference on Medical Image Computing and computer Assisted Intervention, MICCAI 2006, was held in Copenhagen, Denmark at the Tivoli Concert Hall with satellite workshops and tutorials at the IT ...
详细信息
ISBN:
(数字)9783540447283
ISBN:
(纸本)9783540447276
The 9th International conference on Medical Image Computing and computer Assisted Intervention, MICCAI 2006, was held in Copenhagen, Denmark at the Tivoli Concert Hall with satellite workshops and tutorials at the IT University of Copenhagen, October 1-6, 2006. The conference has become the premier international conference with - depth full length papers in the multidisciplinary ?elds of medical image c- puting, computer-assisted intervention, and medical robotics. The conference brings together clinicians, computer scientists, engineers, physicists, and other researchers and o?ers a forum for the exchange of ideas in a multidisciplinary setting. MICCAI papers are of high standard and have a long lifetime. In this v- ume as well as in the latest journal issues of Medical Image Analysis and ieee Transactions on Medical Imaging papers cite previous MICCAIs including the ?rst MICCAI conference in Cambridge, Massachusetts, 1998. It is obvious that the community requires the MICCAI papers as archive material. Therefore the proceedingsofMICCAIarefrom2005andhenceforthbeing indexedbyMedline. Acarefulreviewandselectionprocesswasexecutedinordertosecurethebest possible program for the MICCAI 2006 conference. We received 578 scienti?c papers from which 39 papers were selected for the oral program and 193 papers for the poster program.
暂无评论