In this paper we present a novel hierarchical framework and effective algorithms for cricket event detection and classification. the proposed scheme performs a top-down video event detection and classification using h...
详细信息
ISBN:
(纸本)9781424442195
In this paper we present a novel hierarchical framework and effective algorithms for cricket event detection and classification. the proposed scheme performs a top-down video event detection and classification using hierarchical tree which avoids shot detection and clustering. In the hierarchy, at level-1, we use audio features, to extract excitement clips from the cricket video. At level-2, we classify excitement clips into real-time and replay segments. At level-3, we classify these segments into field view and non-field view based on dominant grass color ratio. At level-4a, we classify field view into pitch-view, long-view, and boundary view using motion-mask. At level-4b, we classify non-field view into close-up and crowd using edge density feature. At level-5a, we classify close-ups into the three frequently occurring classes batsman, bowler/fielder umpire using jersey color feature. At level-5b, we classify crowd segment into the two frequently occurring classes spectator and players' gathering using color feature. We show promising results, with correctly classified cricket events, enabling structural and temporal analysis, such as highlight extraction, and video skimming.
One of the main goals of anywhere augmentation is the development of automatic algorithms for scene acquisition in augmented reality systems. In this paper, we present Envisor, a system for online construction of envi...
详细信息
ISBN:
(纸本)9781424419715
One of the main goals of anywhere augmentation is the development of automatic algorithms for scene acquisition in augmented reality systems. In this paper, we present Envisor, a system for online construction of environment maps in new locations. To accomplish this, Envisor uses vision-based frame to frame and landmark orientation tracking for long-term, drift-free registration. For additional robustness, a gyroscope / compass orientation unit can optionally be used for hybrid tracking. the tracked video is then projected into a cubemap frame by frame. Feedback is presented to the user to help avoid gaps in the cubemap, while any remaining gaps are filled by texture diffusion. the resulting environment map can be used for a variety of applications, including shading of virtual geometry and remote presence.
the modern remote sensing imaging sensors, like those in the IKONOS and QuickBird satellites, are capable of generating panchromatic images with one meter spatial resolution and multiespectral images with good spectra...
详细信息
ISBN:
(纸本)9781424442195
the modern remote sensing imaging sensors, like those in the IKONOS and QuickBird satellites, are capable of generating panchromatic images with one meter spatial resolution and multiespectral images with good spectral information. the principal objective of fusion in remote sensing is to obtain high-resolution multispectral images that can combine the spectral characteristic of the low-resolution multispectral images withthe spatial information of the high-resolution panchromatic images. Traditional fusion methods, such as IHS, PCA and Brovey, can reach good spatial resolution results, but often cause spectral distortion problems. In the literature, it is possible to find some image fusion methods using frequency domain processing, like wavelet-based fusion methods. Although they preserve good spectral information, their spatial visual effects are not satisfactory. IHS fusion method enhanced by Fourier transform presents good spectral and spatial resolution results, but limits the number of spectral bands used in the fusion process to three. In this paper, a method based on Fourier transform is proposed in order to obtain good spatial and spectral resolutions, without limiting the number of bands. In order to compare the spatial and spectral effects of this new method withthose of IHS, IHS enhanced by Fourier transform and wavelet-based methods, IKONOS panchromatic and multispectral images were used as the test data. Quantitative measurements such as correlation coefficient, discrepancy and Mean Structural Similarity index were applied to evaluate the quality of the fused images. the results have shown that the new method can keep almost the same spatial resolution as the panchromatic images, and its spectral effect is well preserved.
Automatic Target Recognition (ATR) from satellite imagery involves detection of Foreground (FG) objects from the Background (BG). ATR demands higher fidelity, which in turn requires more bitrate, hence a conventional ...
详细信息
ISBN:
(纸本)9781424442195
Automatic Target Recognition (ATR) from satellite imagery involves detection of Foreground (FG) objects from the Background (BG). ATR demands higher fidelity, which in turn requires more bitrate, hence a conventional compression, which does not discriminate targets withthe background results in poor detection rate. Here we propose a mechanism to achieve lower bitrate without compromising the detection efficiency. By allowing the background to be coded with lower fidelity than the regions-of-interest (ROIs), significant gains can be achieved in terms of compression and hence in storage space and transmission times. One interesting feature of the new JPEG-2000 image coding standard is support of ROI coding using maximum shift (MaxShift) method, which allows for arbitrarily shaped ROI image compression without shape coding or explicitly transmitting any shape information to the decoder. We propose a Fuzzy C-Means clustering approach to generating arbitrary shape mask so as to cluster the images into regions of varying homogeneity. Homogenous ROIs can be coded at a lower bitrate than the high detail regions. this ensures that the target recognition process is not affected by the compression process. A validation benchmark using 'Fuzzy Feature Vectors' is proposed which checks the foreground objects for features like Rectangularity, Circularity. Elongatedness, Symmetry, Area etc., as compared to its uncompressed equivalent. the validation is done on Standard JPEG, JPEG-2000 with ROI coding and JPEG-2000 without ROI coding at preset bitrates and compared.
A mammography is a specific type of imaging that uses low-dose x-ray system to examine breasts. this is an efficient means of early detection of breast cancer. High resolution is a common characteristic of such images...
详细信息
Touching characters are major problem of achieving higher recognition rate in Optical Character Recognition (OCR). Present OCR systems do not perform well when adjacent characters touch. If characters are touched in g...
详细信息
ISBN:
(纸本)9781424442195
Touching characters are major problem of achieving higher recognition rate in Optical Character Recognition (OCR). Present OCR systems do not perform well when adjacent characters touch. If characters are touched in graphical documents (e.g. map) then such touching string recognition is more difficult because in such documents touching characters appear in multi-oriented direction. In this paper, we present a scheme towards the recognition of English two-character multi-oriented touching strings. When two or more characters touch, the), generate a big cavity region at the background portion and we used this background information in our scheme. To handle the background information, convex hull is used. In this scheme, at first, a set of initial segmentation points is predicted based on the concave residues of the convex hull of the touching characters. Next, based on the initial points, we select some candidate segmentation lines. Finally the recognition confidence of two sub-images of a touching string, obtained from each candidate segmentation line is computed. the candidate segmentation line from which we get optimum confidence is the actual segmentation line and the corresponding characters in favour of which the two segmentation parts show optimum confidence is the recognition result of the touching string. To compute the recognition confidence, SVM classifier is used. the features used in the SVM are invariant to character orientation. Circular ring and convex hull ring based approach has been used along with angular information of the contour pixels of the character to make the feature rotation invariant. From the experiment we obtained encouraging result.
the traditional bundle adjustment algorithm for structure from motion problem has a computational complexity of O((m + n)3) per iteration and memory requirement of O(mn(m+n)), wheremis the number of cameras and n is t...
详细信息
Depth from stereo is an important research field in computervision due to the wide range of its applications. In this work, we present a stereo matching algorithm based on belief propagation (BP). the algorithm is de...
详细信息
Depth from stereo is an important research field in computervision due to the wide range of its applications. In this work, we present a stereo matching algorithm based on belief propagation (BP). the algorithm is designed to work on sparse images originating from image content adaptive mesh representation techniques. there, an image is approximated with a mesh. the nodes of the mesh are the non-uniform samples which are the ones that form the sparse image. the key issue in the proposed method is to formulate BP such that it matches a sparse left stereo image with a dense right image to obtain a sparse depth map. Moreover, we propose a simple method that recovers the dense disparity map of the scene from the sparse one using the approximating mesh of the image. the results obtained show that the proposed method leads to an average of 40% improvement in the quality of depth maps when compared to existing sparse stereo matching techniques.
We present an approach to decomposing branching volume data into sub-branches. First, a metric is proposed for evaluating local convexities in volumetric data, and it is a criterion for global selection of tip points....
详细信息
We present an approach to decomposing branching volume data into sub-branches. First, a metric is proposed for evaluating local convexities in volumetric data, and it is a criterion for global selection of tip points. Second, a multi-path growing strategy is adopted to segment the volumes based on a DFS transformation starting from the tips. Experiments show that this approach is capable of generating desirable components and reasonable segmentation boundaries of a volume.
In this paper, we propose a new approach of synthesizing novel views in multiview camera configurations. We introduce the semi iV-view & iV-depth framework in order to estimate disparity maps efficiently and corre...
详细信息
In this paper, we propose a new approach of synthesizing novel views in multiview camera configurations. We introduce the semi iV-view & iV-depth framework in order to estimate disparity maps efficiently and correctly. this framework reduces redundancy on disparity estimation by using information from neighboring views. the occlusion problem is handled by using cost functions computed with multiview images. the proposed method provides a 2D/3D freeview video. User can select 2D/3D modes of freeview video and control 3D depth perception by adjusting several parameters in 3D freeview video. Experimental results show that the proposed method yields the accurate disparity maps and provides seamless freeview videos.
暂无评论