In this paper, we propose spatial localization of multiple sound sources using a spherical robot head equipped with four microphones. We obtain arrival time differences using phase difference candidates. Based on the ...
详细信息
ISBN:
(纸本)9781424493197
In this paper, we propose spatial localization of multiple sound sources using a spherical robot head equipped with four microphones. We obtain arrival time differences using phase difference candidates. Based on the model of precedence effect, arrival temporal disparities obtained from the zero-crossing point are used to calculate time differences and suppress the influence of echoes in a reverberant environment. To integrate spatial cues of different microphone pairs, we use a mapping method from the correlation between different microphone pairs to a 3D map corresponding to azimuth and elevation of sound sources directions. Experiments indicate that the system provides the distribution of sound sources in azimuth-elevation localization, with the EA model even concurrently in reverberant environments.
Unlike most previous manifold-based data classification algorithms assume that all the data points are on a single manifold, we expect that data from different classes may reside on different manifolds of possible dif...
详细信息
ISBN:
(纸本)9781424475421
Unlike most previous manifold-based data classification algorithms assume that all the data points are on a single manifold, we expect that data from different classes may reside on different manifolds of possible different dimensions. Therefore, better classification accuracy would be achieved by modeling the data by multiple manifolds each corresponding to a class. To this end, a general framework for data classification on multiple manifolds is presented. The manifolds are firstly learned for each class separately, and a stochastic optimization algorithm is then employed to get the near optimal dimensionality of each manifold from the classification viewpoint. Then, classification is performed under a newly defined minimum reconstruction error based classifier. Our method could be easily extended by involving various manifold learning methods and searching strategies. Experiments on both synthetic data and databases of facial expression images show the effectiveness of the proposed multiple manifold based approach.
In order to solve the problem of image degradation caused by dust environments, an image degradation model considering multiple scattering factors caused by dust was first established using the first-order multiple sc...
详细信息
In order to improve the classifier performance in semantic image annotation, we propose a novel method which adopts learning vector quantization (LVQ) technique to optimize low level feature data extracted from given ...
详细信息
In order to improve the classifier performance in semantic image annotation, we propose a novel method which adopts learning vector quantization (LVQ) technique to optimize low level feature data extracted from given image. Some representative vectors are selected with LVQ to train support vector machine (SVM) classifier instead of using all feature data. Performance is compared between the methods with and without feature data optimization when SVM is applied to semantic image annotation. Experiment results show that the proposed method has a better performance than that without using LVQ technique.
In automatic image annotation, it is often extracting low-level visual features from original image for the purpose of mapping to high level image semantic information. In this paper, we propose a novel method which i...
详细信息
In automatic image annotation, it is often extracting low-level visual features from original image for the purpose of mapping to high level image semantic information. In this paper, we propose a novel method which integrates kernel independent component analysis (KICA) and support vector machine (SVM) for analyzing the semantic information of natural images. KICA, which contains a nonlinear kernel mapping component, is adopted to extract low-level features from the original image data. Then these feature vectors are mapped to high-level semantic words using SVM to annotate images with labels in a given semantic label set. Comparative studies have done for the performance of KICA with traditional color histogram and discrete cosine transform features. The experimental results show that the proposed method is capable of extracting the components of images as key features, and with these features to map into semantic categories, higher accuracy is achieved.
In classification of multi-source remote sensing image, it is usually difficult to obtain higher classification accuracy. In the previous work, the modeling technique for the remote sensing image classification based ...
In classification of multi-source remote sensing image, it is usually difficult to obtain higher classification accuracy. In the previous work, the modeling technique for the remote sensing image classification based on the minimum description length (MDL) principle with mixture model is analyzed theoretically. In this work, experimental studies are performed for investigating the modeling technique. With intensive experiments and sophisticated analysis, it is found that the developed modeling technique can build a robust classification system, which can avoid classifier over-fitting training data and make the learning process trade-off between bias and variance. Meanwhile, designed mixture model is more efficient to represent real multi-source remote sensing images compared to single model.
Based on the human auditory system for spatial localization theory, we proposed a spatial localization of multiple sound sources using a spherical robot head. Space sound vectors recorded by a microphone array with sp...
详细信息
Based on the human auditory system for spatial localization theory, we proposed a spatial localization of multiple sound sources using a spherical robot head. Space sound vectors recorded by a microphone array with spatial configuration, are used to estimate the histograms of spatial arrival time difference vectors by solving the simultaneous equations in different frequency bands. The echo avoidance model based on precedence effect is used to reduce the interference of environment reverberations which provide the strong interference for phase vectors especially in small indoor environments. To integrate spatial cues of different microphone pairs, we propose a mapping method from the correlation between different microphone pairs to a 3D map corresponding to azimuth and elevation of sound sources directions. Experiments indicate that the system provides the distribution of sound source in azimuth-elevation localization, even concurrently in reverberant environments.
In this paper, a novel method of fingerprint minutiae extraction on grey-scale images is proposed based on the Gabor phase field. The novelty of our approach is that the proposed algorithm is performed on the transfor...
详细信息
In this paper, a novel method of fingerprint minutiae extraction on grey-scale images is proposed based on the Gabor phase field. The novelty of our approach is that the proposed algorithm is performed on the transform domain, i.e. the Gabor phase field of the fingerprint image. This is different from most existing minutiae extraction methods, in which the minutiae are usually extracted from the binarized and thinned fingerprint image. Experimental results on benchmark data sets demonstrate that the proposed algorithm has promising performances.
As a fundamental biological problem, revealing the protein folding mechanism remains to be one of the most challenging problems in structural bioinformatics. Prediction of protein folding rate is an important step tow...
详细信息
This paper presents a fast connected component labeling algorithm based on line description method and optimized tree Union-Find strategy. The algorithm transforms the pixel-connected issue, which most of proposed alg...
详细信息
This paper presents a fast connected component labeling algorithm based on line description method and optimized tree Union-Find strategy. The algorithm transforms the pixel-connected issue, which most of proposed algorithms focus on, into line-connected issue. This algorithm is comprised of three phrases, line extraction, connected component identification and label assignment. The line description method transforms the connected pixels into line form for reducing the scan time. While the new tree Union-Find strategy diminishes the redundant root compare operations. A comparison analysis is performed with other optimized famous component labeling algorithms. Our algorithm has shown an outstanding performance with respect to the processing time, which achieves 1.1~8 times as fast as the other algorithms in various test cases.
暂无评论