The concept of a "conical panorama" was introduced in an earlier paper, involving the view of an oblique down-look camera rotating about an axis, either directly through or at some distance from the center o...
详细信息
The concept of a "conical panorama" was introduced in an earlier paper, involving the view of an oblique down-look camera rotating about an axis, either directly through or at some distance from the center of projection. This provides the framework 1) to analyze multi-camera realizations developed for various underwater visual tasks, and 2) to develop methods for the computation of visual motion that are more robust relative to processing of images covering a finite field of view. This paper presented a stereo conical imaging system, comprising 12 cameras with relatively large overlap between pairs of neighboring cameras. This panoramic stereovision system is targeted for 3-D visualization and improved scene reconstruction by utilizing correspondences in more than 2 views of the scene at modest distances. The calibration of the system is addressed, and experimental results demonstrate the potential in the use of this system for 3-D mapping.
Acquiring 3-D computer models of objects from images has been an active computervision research topic. A common approach uses overlapping images from different views around the object, and attempts to fuse all the in...
详细信息
Acquiring 3-D computer models of objects from images has been an active computervision research topic. A common approach uses overlapping images from different views around the object, and attempts to fuse all the information to construct the model by volume intersection. Most current methods have in common that the camera poses for these views are known. To achieve this, a monocular camera circles the target object on a robot arm, or equivalently, views a target on a rotating platform (e.g., a turntable), while the images are acquired at known intervals. We are interested in the deployment of a submersible platform equipped with stereo vision to construct 3-D volumetric models of benthic habitat objects, e.g., coral reefs and other seafloor structures. In this application, we would automatically acquire continuous 360-degrees stereo views of object(s) of interest, by processing the images online to estimate and control camera trajectory and views. In particular, we show that the redundancy from binocular cues improves the robustness and accuracy. Upon completion of the image data acquisition, a bundle adjustment formulation is proposed to recompute the trajectory and the stereo rig poses, as well as to generate object model(s).
Accurate and efficient optical flow estimation is a major step in many computational vision problems, including 3-D mapping applications. While processing of gray scale image sequences has been the main approach, some...
详细信息
Accurate and efficient optical flow estimation is a major step in many computational vision problems, including 3-D mapping applications. While processing of gray scale image sequences has been the main approach, some authors have recently investigated and reported the use of color images. The main advantage in the use of color images is to exploit redundant information at each pixel, namely 3 channels, as potential cues for visual motion. Moreover, several spectral-dependent factors including medium attenuation may justify the use of color cues in underwater, calling for a physics-based approach to optical flow analysis. This paper explores a simplified underwater image formation model 1) to provide some preliminary insight on the information encoded in various color channels, 2) draw some general conclusions on the use of color and its various representations, and 3) verify if the results are consistent with the experiments with 7 sets of coral reef video recoded in relatively clear shallow waters of the Bahamas. These results suggest that optical flow algorithms based on the HSV representation can provide improved localization and accuracy.
This paper addresses the problem of calibrating camera lens distortion, which can be significant in medium to wide angle lenses. While almost all existing nonmetric distortion calibration methods need user involvement...
详细信息
This paper addresses the problem of calibrating camera lens distortion, which can be significant in medium to wide angle lenses. While almost all existing nonmetric distortion calibration methods need user involvement in one form or another,we present an approach to distortion calibration based on the robust the-least-median-of-squares (LMedS) estimator. Our approach is thus able to proceed in a ful ly-automatic manner while being less sensitive to erroneous input data such as image curves that are mistakenly considered as projections of 3D linear segments. Our approach uniquely uses fast, closed-form solutions to the distortion coefficients, which serve as an initial point for a non-linear optimization algorithm to straighten imaged lines. Moreover we propose a method for distortion model selection based on geometrical *** experiments to evaluate the performance of this approach on synthetic and real data are reported.
The correspondence problem is of high relevance in the constructionand use of statistical models. Statistical models are used for avariety of medical application, e.g. segmentation, registration and shapeanalysis. In ...
详细信息
The ability to detect shadows Is a critical feature of any Intelligent transportation system (ITS). The Improper handling of shadows can be the cause of erroneous conclusions in traffic analysis. Because vision-based ...
详细信息
A new method based on MLE-OED is proposed for unsupervised image segmentation of multiple objects which have fuzzy edges. It adjusts the parameters of a mixture of Gaussian distributions via minimizing a new loss func...
详细信息
In automatic face recognition, the management of image database has its own importance. At present there are two large databases that are being used for most of the face recognition algorithms, they are FERET and XM2V...
详细信息
ISBN:
(纸本)0970789017
In automatic face recognition, the management of image database has its own importance. At present there are two large databases that are being used for most of the face recognition algorithms, they are FERET and XM2VTS database images. The preparation of image database contains several steps that are described in this paper.
We present a background subtraction method that uses multiple cues to detect objects robustly in adverse conditions. The algorithm consists of three distinct levels, i.e., pixel level, region level and frame level. At...
详细信息
We present a background subtraction method that uses multiple cues to detect objects robustly in adverse conditions. The algorithm consists of three distinct levels, i.e., pixel level, region level and frame level. At the pixel level, statistical models of gradients and color are separately used to classify each pixel as belonging to background or foreground. In the region level, foreground pixels obtained from the color based subtraction are grouped into regions and gradient based subtraction is then used to make inferences about the validity of these regions. Pixel based models are updated based on decisions made at the region level. Finally, frame level analysis is performed to detect global illumination changes. Our method provides the solution to some of the common problems that are not addressed by most background subtraction algorithms, such as fast illumination changes, repositioning of static background objects, and initialization of background model with moving objects present in the scene.
In this article we present a method to remove commercials from talk and game show videos and to segment these videos into host and guest shots. In our approach, we rely mainly on information contained in shot transiti...
详细信息
In this article we present a method to remove commercials from talk and game show videos and to segment these videos into host and guest shots. In our approach, we rely mainly on information contained in shot transitions, rather than analyzing the scene content of individual frames. We utilize the inherent difference in scene structure of commercials and talk shows to differentiate between them. Similarly, we make use of the well-defined structure of talk shows, which can be exploited to classify shots as host or guest shots. The entire show is first segmented into camera shots based on colour histogram. Then we construct a data-structure (shot connectivity graph) that links similar shots over time. Analysis of the shot connectivity graph helps us to automatically separate commercials from program segments. This is done by first detecting stories and then assigning a weight to each story based on its likelihood of being a commercial. Further analysis on stories is done to distinguish shots of the hosts from shots of the guests. We have tested our approach on several full-length shows (including commercials) and have achieved video segmentation with high accuracy. The whole scheme is fast and works even on low-quality video (160×120 pixel images at 5 Hz).
暂无评论