This book constitutes the thoroughly refereed joint post-proceedings of the first two International Workshops on Dynamical vision, WDV 2005 and WDV 2006 held in Beijing, China in October 2005 within the scope of ICCV ...
详细信息
ISBN:
(数字)9783540709329
ISBN:
(纸本)9783540709312
This book constitutes the thoroughly refereed joint post-proceedings of the first two International Workshops on Dynamical vision, WDV 2005 and WDV 2006 held in Beijing, China in October 2005 within the scope of ICCV 2005 and in Graz, Austria in May 2006 in the course of ECCV 2006. The 24 revised full papers presented were carefully reviewed and selected from numerous submissions. The papers address a wide range of theoretical and application issues in dynamical vision and are organized in topical sections on motion segmentation and estimation, human motion analysis, dynamic textures, motion tracking, rigid and non-rigid motion analysis, as well as motion filtering and vision-based control.
The endovascular repair of a traumatic rupture of the thoracic aorta - that would otherwise lead to the death of the patient - is performed by delivering a stent-graft into the vessel at the rupture location. The age ...
详细信息
The endovascular repair of a traumatic rupture of the thoracic aorta - that would otherwise lead to the death of the patient - is performed by delivering a stent-graft into the vessel at the rupture location. The age range of the affected patients is large and the stent-graft will stay in the body for the remaining life. The technique is relatively new, and no experience with regard to long-term effects, and durability exists. To predict long-term complications, such as ruptures or destructive interactions with surrounding tissue during the life of the patient, it is important to understand the - rather intense and constant - movement of the stent- graft during the cardiac cycle. A computed tomography with heart gating (gated CT) acquires sequences that show the region of the stent-graft at different time points. We analyze the motion of stent-grafts with a model based approach. Stent-grafts are represented as sparse sets of axis points extracted from the gated CT, and motion patterns are captured by a minimum description length based group-wise registration of the stent-graft at different time points. No parameterization or a priori definition of the topology is necessary, and highly variable elasticity properties in the data volume can by accounted for by the sparse statistical model, that captures correlations and motion components of the stent-graft. We report results for deformation models and registration accuracy for 5 patients.
The ability to detect pedestrians is a first important step in many computervision applications such as video surveillance. This paper presents an experimental study on pedestrian detection using state-of-the-art loc...
详细信息
ISBN:
(纸本)9781424431618
The ability to detect pedestrians is a first important step in many computervision applications such as video surveillance. This paper presents an experimental study on pedestrian detection using state-of-the-art local feature extraction and support vector machine (SVM) classifiers. The performance of pedestrian detection using region covariance, histogram of oriented gradients (HOG) and local receptive fields (LRF) feature descriptors is experimentally evaluated. The experiments are performed on both the benchmarking dataset used in [1] and the MIT CBCL dataset. Both can be publicly accessed. The experimental results show that region covariance features with radial basis function (RBF) kernel SVM and HOG features with quadratic kernel SVM outperform the combination of LRF features with quadratic kernel SVM reported in [1].
Hexagonal structure is another image structure alternative to traditional square image structure for image processing and computervision. The geometrical arrangement of pixels on a hexagonal structure can be describe...
详细信息
ISBN:
(纸本)9781424431618;9780769530673
Hexagonal structure is another image structure alternative to traditional square image structure for image processing and computervision. The geometrical arrangement of pixels on a hexagonal structure can be described as a collection of hexagonal pixels. Because all the existing hardware for capturing image and for displaying image are produced based on square structure, it becomes important to find a proper software approach to mimic hexagonal structure so that images represented on the traditional square structure can be smoothly converted from or to the images on hexagonal structure. For accurate image processing, it is critical to best maintain the image resolution during the image conversion. In this paper, a bilinear interpolation algorithm that is used to convert an image from square structure to hexagonal structure is presented. Based on this, an edge detection method is proposed. Our experimental results show that the bilinear interpolation improves the edge detection accuracy.
Based on the properties of integer wavelet transform and multiresolution analysis and combined with human visual specialties, an improved region of interest (ROI) progressive image transmission algorithm is proposed. ...
详细信息
Based on the properties of integer wavelet transform and multiresolution analysis and combined with human visual specialties, an improved region of interest (ROI) progressive image transmission algorithm is proposed. With no impact on the subjective quality of ROI image, in the first instance, the coefficients in the ROI and the important wavelet coefficients for human vision are transmitted. Moreover, to indicate the relative importance of the BG with respect to the ROI, an expansion factor S for background (BG) image transmission and its formula is presented. Depending on the factor, the transmission of BG image can be controlled at different bit rates, so that the users can achieve the image with better subjective quality. Meanwhile, the algorithm could improve the visual quality of image and enhance the efficiency of the image transmission. The experimental results prove that the proposed algorithm is effective and the expansion factor could control the image transmission.
In the traditional canonical correlation analysis (CCA) based face recognition methods, the size of sample is always smaller than the dimension of sample. This problem is so called the small sample size (SSS) problem....
详细信息
In the traditional canonical correlation analysis (CCA) based face recognition methods, the size of sample is always smaller than the dimension of sample. This problem is so called the small sample size (SSS) problem. In order to solve this problem, a new supervised learning method called two-dimensional CCA (2DCCA) is developed in this paper. Different from traditional CCA method, 2DCCA directly extracts the features from image matrix rather than matrix-to-vector transformation. In practice, the covariance matrix extracted by 2DCCA is always full rank. Hence the small sample size (SSS) problem can be effectively dealt with by this new developed method. The theory foundation of 2DCCA method is firstly developed, and the construction method for the class-membership matrix Y which is used to precisely represent the relationship between samples and classes in the 2DCCA framework is then clarified. Simultaneously, the analytic form of the generalized inverse of such class-membership matrix is derived. From our experiment results on face recognition, we clearly find that not only the SSS problem can be effectively solved, but also better recognition performance than several other CCA based methods has been achieved
Commercial products that support L2-learners with computer assisted pronunciation training usually focus per exercise only on one possible pronunciation mistake that is typical for speakers of the respective L1 group....
详细信息
Commercial products that support L2-learners with computer assisted pronunciation training usually focus per exercise only on one possible pronunciation mistake that is typical for speakers of the respective L1 group. Acoustic models for words with wrong pronunciation are added to the system. In the present paper a more general approach with features that have proved to be widely independent of the learners' mother tongue is proposed. It is able to take various possible mistakes into consideration all at once. High dimensional feature vectors that encode prosodic varieties and differences of reference and recognized sentences are analyzed. With the ADABOOST algorithm those features are found, which contain the most important information to assess German children learning English. With 35 features 89 % of the agreement of experts is achieved.
This paper presents the design of a machine vision based system for real time detection of the counterfeit Bangladeshi bank notes. The proposed system works with the denominations of five hundred and one hundred taka....
详细信息
ISBN:
(纸本)1424415500;1424415519
This paper presents the design of a machine vision based system for real time detection of the counterfeit Bangladeshi bank notes. The proposed system works with the denominations of five hundred and one hundred taka. This system relies on a specific feature of the both five hundred and one hundred taka. The relied feature is not possible to replicate for the counterfeit makers or producers. And there is no foreseeable likelihood that they would be capable to imitate this feature even within a pretty long time. The relied feature is the repeatedly printed ldquoBANGLADESHBANKrdquo on some portions of the notes using microprint technique. The proposed stand alone system captures the portions of the notes with a proprietary scanner called the Grid Scanner [1]. The captured image is then processed by a microcontroller PIC-16F648A or ATMega88 (AVR). The microcontroller then determines the validity of the note based on an OCR technique by looking for the characters dasiaBpsila, dasiaApsila and dasiaNpsila in the scanned image. The success-rate of the counterfeit detection with properly captured image is 100% and the average processing time is 250 milliseconds with above mentioned microcontroller.
This paper proposes an image binarization method based on the human vision system. It firstly gets the image threshold value according to the Euclidean distance, then uses the standard deviation to determine the visua...
详细信息
This paper proposes an image binarization method based on the human vision system. It firstly gets the image threshold value according to the Euclidean distance, then uses the standard deviation to determine the visual sensitivity coefficient of each pixel according to the human visual characteristic, finally performs image binarization according to the gray value, the image threshold and the visual sensitivity coefficient. This method can both effectively eliminate the ghosts, and keep the character stroke well. The experimental results show that this method is more effective than Ostu method and Bernsen method. It is superior to Ostu method and Bernsen method in the entropy. It also provides a new effective way for image binarization.
暂无评论