Recently, writer identification has been studied and it has a wide variety of applications, more specifically in biometric and forensic science. This paper presents the use of texture features in identifying the write...
详细信息
We present a Bayesian approach for segmenting a sequence of gray-scale images to obtain a binary sketch. We extend a 2-D algorithm to video sequences. The 2-D algorithm is an adaptive thresholding scheme that uses spa...
详细信息
We present a Bayesian approach for segmenting a sequence of gray-scale images to obtain a binary sketch. We extend a 2-D algorithm to video sequences. The 2-D algorithm is an adaptive thresholding scheme that uses spatial constraints and takes into consideration the local intensity characteristics of the image. We model the segmentation distribution as a 3-D Gibbs Random Field. We add temporal constraints and temporal local intensity adaptation to ensure a smooth transition of the segmentation from frame to frame. For computational efficiency as well as performance we use a multi-resolution approach. We also consider several suboptimal implementations to reduce the delay as well as the amount of computation. We tested the performance of the algorithm on head and shoulders video sequences. The algorithm achieves accurate rendering of the lip and eye movements and preserves the main characteristics of the face, so that it is easily recognizable.
image and video data contains abundant, rich information for data miners to explore. On one hand, the rich literature on image and video data analysis will naturally provide many advanced methods that may help mining ...
详细信息
ISBN:
(纸本)9781605580708
image and video data contains abundant, rich information for data miners to explore. On one hand, the rich literature on image and video data analysis will naturally provide many advanced methods that may help mining other kinds of data. On the other hand, recent research on data mining will also provide some new, interesting methods that may benefit image and video data retrieval and analysis. In this talk we explore the latter, and discuss whether the new results obtained in data mining research could be useful in image and video data retrieval and analysis. Our discussion will be focused on the following aspects: (1) how frequent pattern, sequential pattern, and structural pattern analysis methods may help image and video data analysis;(2) how data mining may help construction of effective and efficient indexing and similarity search mechanisms for image and video retrieval;(3) how discriminative pattern-based classification methods may shed new light on image and video classification;and (4) how pattern-based analysis methods may help high-dimensional clustering in image and video analysis. Our goal is to promote collaborative research between these two research communities.
The use of traffic monitoring techniques based on imageprocessing algorithms for supervising urban vehicle flows could be very useful. Classical inductive loops can only compute traffic density on a single lane but a...
详细信息
The use of traffic monitoring techniques based on imageprocessing algorithms for supervising urban vehicle flows could be very useful. Classical inductive loops can only compute traffic density on a single lane but are unable to estimate, for example, the behaviour of the vehicle flow at a crossroad. In this paper a system that estimates the turning rates at an urban crossroad by processing the sequence of images taken by a videocamera is presented. Block matching motion estimation, segmentation, and moving object tracking techniques are used. The good results obtained are presented and their relation to the camera position is discussed.
Feature engineering under the stereoscopic images is widely used for visual discomfort prediction. Due to the complexity of the human visual system, extensive feature representation face major challenges, including a ...
详细信息
In this paper we present a new framework for an accelerated 3D reconstruction of deformable objects within a multi-view setup. It is based on a new memory management and an enhanced algorithm pipeline of the well know...
详细信息
ISBN:
(纸本)9783642217982;9783642217999
In this paper we present a new framework for an accelerated 3D reconstruction of deformable objects within a multi-view setup. It is based on a new memory management and an enhanced algorithm pipeline of the well known image-Based Visual Hull (IBVH) algorithm that enables efficient and fast reconstruction results and opens up new perspectives for the scalability of time consuming computations within larger camera environments. As a result, a significant increase of frame rates for the volumetric reconstruction of deformable objects can be achieved using an optimized CUDA-based implementation on NVIDIA's Fermi-GPUs.
In this paper, we present a new method to speed up the mode decision process using mode prediction. In general, video coding exploits spatial and temporal redundancies between video blocks, in particular temporal redu...
详细信息
ISBN:
(纸本)3540446303
In this paper, we present a new method to speed up the mode decision process using mode prediction. In general, video coding exploits spatial and temporal redundancies between video blocks, in particular temporal redundancy is a crucial key to compress video sequence with little loss of image quality. The proposed method determines the best coding mode of a given macroblock by predicting the mode and its rate-distortion (rd) cost from neighboring MBs in time and space. Compared to the H.264/AVC reference software, the simulation results show that the proposed method can save up to 53% total encoding time with up to 2.4% bit rate increase at the same PSNR.
Two imageprocessing methods are presented and compared for particle detection based on digital holography. The imageprocessing methods are developed based on edge detection and threshold based segmentation. The prop...
详细信息
ISBN:
(纸本)9783038351818
Two imageprocessing methods are presented and compared for particle detection based on digital holography. The imageprocessing methods are developed based on edge detection and threshold based segmentation. The proposed methods have been applied on reconstructed images of holograms of ceramic beads and certified microspheres. Particle size distribution obtained for both methods are found to be in good agreement with the actual size;but the processing speed for threshold based detection is found to be faster in comparison to the edge detection method.
Multimedia and virtual reality systems give the user the illusion of three-dimensional contact. Unsurprisingly, this requires the transmission of more data than the monocular case. To bring participants together in a ...
详细信息
ISBN:
(纸本)0818688211
Multimedia and virtual reality systems give the user the illusion of three-dimensional contact. Unsurprisingly, this requires the transmission of more data than the monocular case. To bring participants together in a virtual space, or to allow a user to "see around" a 3-D object it is necessary to synthesise intermediate views from a stereoscopic image pair or sequence. In a stereo/video coding scheme, a block-based disparity map is usually coded and transmitted. This format is economic in computation and transmission, but likely to cause artifacts an any reconstructed intermediate view. To minimise the distortion of the reconstructed intensities, a nonlinear interpolator, which uses the single disparity available to each block of the image, generates a full-sized map that partially preserves original disparity edge information producing a sharper intermediate view. The identification of occluded and non-occluded areas is also used to aid the intermediate view reconstruction.
Interventional video Tomography (IVT) is a new imaging modality for image Directed Surgery to visualize in real-time intraoperatively the spatial position of surgical instruments relative to the patient39;s anatomy....
详细信息
ISBN:
(纸本)0819417424
Interventional video Tomography (IVT) is a new imaging modality for image Directed Surgery to visualize in real-time intraoperatively the spatial position of surgical instruments relative to the patient's anatomy. The video imaging detector is based on a special camera equipped with an optical viewing and lighting system and electronic 3D sensors. When combined with an endoscope it is used for examining the inside of cavities or hollow organs of the body from many different angles. The surface topography of objects is reconstructed from a sequence of monocular video or endoscopic images. To increase accuracy and speed of the reconstruction the relative movement between objects and endoscope is continuously tracked by electronic sensors. The IVT image sequence represents a 4D data set in stereotactic space and contains image, surface topography and motion data. In ENT surgery an IVT image sequence of the planned and so far accessible surgical path is acquired prior to surgery. To simulate the surgical procedure the cross sectional imaging data is superimposed with the digitally stored IVT image sequence. During surgery the video sequence component of the IVT simulation is substituted by the live video source. The IVT technology makes obsolete the use of 3D digitizing probes for the patient image coordinate transformation. The image fusion of medical imaging data with live video sources is the first practical use of augmented reality in medicine. During surgery a head-up display is used to overlay real-time reformatted cross sectional imaging data with the live videoimage.
暂无评论