Objective image quality assessment, a fundamental and challenging task in image processing, aims to evaluate image quality automatically and in agreement with human perception. Assuming that any image distortion can be modeled as the difference between the directional projection-based maps of the reference and distorted images, we propose a new full-reference objective quality assessment method based on directional projection. Experimental results show that the proposed metrics are highly consistent with subjective quality scores.
Trip planning is generally very time-consuming because trip requirements are complex and convenient tools to assist planning are lacking. In this paper, we propose an on-site travel path recommendation service based on geo-tagged photos to facilitate tourists' trip planning. The large number of geo-tagged photos publicly available on the web makes this service possible, as geo-tagged photos encode rich travel-related metadata and can be used to mine the travel paths of previous tourists. In this work, about 20 million geo-tagged photos were crawled from ***. A substantial number of travel paths were then mined from the crawled photos. Next, a search system was built to index and search paths, and a Directed Sparse Chamfer Distance is proposed to measure the similarity of two paths in linear time. The proposed service is evaluated on the 20 million geo-tagged photos using both objective and subjective methods and shows promising results.
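To make the idea of a directed path distance concrete, here is a minimal Python sketch of a plain directed Chamfer distance between two geo-tagged paths. It is not the Directed Sparse Chamfer Distance from the abstract, whose definition and linear-time construction are not given here; this naive version costs O(|A|·|B|). The function names, the (latitude, longitude) tuple format, and the use of the haversine distance are all assumptions made for illustration.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, used by the haversine formula


def haversine_km(p, q):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (p[0], p[1], q[0], q[1]))
    a = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def directed_chamfer(path_a, path_b):
    """Mean distance from each point of path_a to its nearest point in path_b.

    Hypothetical helper: a naive directed Chamfer distance, not the
    sparse linear-time variant named in the abstract.
    """
    if not path_a or not path_b:
        raise ValueError("both paths must be non-empty")
    total = sum(min(haversine_km(a, b) for b in path_b) for a in path_a)
    return total / len(path_a)
```

The distance is deliberately asymmetric: a short query path that is fully contained in a longer indexed path scores zero in the query-to-path direction, while the reverse direction is penalized for the extra stops. That property is what makes a directed measure a natural fit for matching a partial, in-progress trip against complete historical paths.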
ISBN: (print) 0769525210
In this paper, we present a new framework for modeling the full covariance matrices of Gaussian components. The framework models the full correlation matrix directly instead of the full covariance matrix, since the correlation matrix directly describes the correlation between feature elements. To model full correlation matrices, we share linear transformations among the components' full correlation matrices, so that each component's full correlation matrix is represented by a shared linear transformation and a component-specific diagonal correlation matrix. The transformation helps the diagonal correlation matrix model the correlation between feature-vector elements more precisely. We evaluate the framework on a Mandarin speaker identification task. Experiments show a reduction of more than 35% in speaker identification error rate compared with the best diagonal covariance models. Furthermore, our algorithm outperforms STC.
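The relationship between a covariance matrix and its correlation matrix can be sketched in a few lines of Python. This example only illustrates the standard decomposition of a covariance matrix into per-dimension standard deviations and a correlation matrix; it does not attempt the shared-transformation scheme of the abstract, whose exact parameterization is not given here. The function names are hypothetical.

```python
import math


def cov_to_corr(cov):
    """Split a covariance matrix into standard deviations and a correlation matrix.

    Uses corr[i][j] = cov[i][j] / (std[i] * std[j]).
    """
    n = len(cov)
    std = [math.sqrt(cov[i][i]) for i in range(n)]
    corr = [[cov[i][j] / (std[i] * std[j]) for j in range(n)] for i in range(n)]
    return std, corr


def corr_to_cov(std, corr):
    """Rebuild the covariance matrix from standard deviations and correlations."""
    n = len(std)
    return [[corr[i][j] * std[i] * std[j] for j in range(n)] for i in range(n)]
```

Separating the two parts means the per-dimension variances can stay component-specific while only the correlation structure, which is scale-free, is a candidate for sharing across components.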
To address the problem of robustness in text-independent speaker identification, the two-stage Wiener filter presented in the ETSI standard is used in this paper. However, the performance of the voice activity detec...
A novel modeling method for the glottal source is proposed for improving the naturalness and quality of synthetic speech. This paper utilizes the high correlation between vocal tract parameters and glottal source to model...
The acoustic mismatch between the training and test environments leads to differences in the statistical characteristics of speech parameters. Since the statistical characteristics of the kurtosis can measure t...
Speech conveys more information than text, as the same word can be uttered in various voices to convey diverse information. Compared to traditional text-to-speech (TTS) methods relying on speech prompts (reference spe...
The accuracy of face alignment greatly affects the performance of a face recognition system. Since face alignment is usually conducted using eye positions, an algorithm for accurate eye localization is essential for accurate face recognition. In this paper, an algorithm is proposed for eye localization. First, an AdaBoost detector is adaptively trained to segment the region based on its characteristic gray-level distribution. After that, a fast radial symmetry operator is used to precisely locate the centers of the eyes. Experimental results show that the method can accurately locate the eyes and is robust to variations in face pose, illumination, expression, and accessories.
In this paper, we present a novel algorithm for object tracking in video sequences based on SURF key-points and superpixels. SURF key-points are very effective for object matching between two images and we can use it to loc...
Online travel destination recommendation keeps track of a user's current travel history to recommend the next destination in real time while the user is traveling. This paper presents an efficient variable...