In this paper we propose a new adaptation technique for improved text-independent speaker verification with limited amounts of training data using Gaussian mixture models (GMMs). The technique, referred to as probabil...
详细信息
In this paper we propose a new adaptation technique for improved text-independent speaker verification with limited amounts of training data using Gaussian mixture models (GMMs). The technique, referred to as probabilistic subspace adaptation (PSA), employs a probabilistic subspace description of how a client?s parametric representation (i.e. GMM) is allowed to vary. Our technique is compared to traditional maximum a posteriori (MAP) adaptation, or relevance adaptation (RA), and maximum likelihood eigen-decomposition (MLED), or subspace adaptation (SA) techniques. Results are given on a subset of the XM2VTS databases for the task of textindependent speaker verification.
This work deals with the development of a computational signal algebra framework for the modeling and simulation of digital image interferometry processing applications, exploiting the rapid prototyping capabilities o...
详细信息
This paper describes a platform-independent system which adds spatial auditory feedback to the icons of a computer interface in order to assist visually impaired individuals during icon location and selection. In this...
详细信息
ISBN:
(纸本)9806560531
This paper describes a platform-independent system which adds spatial auditory feedback to the icons of a computer interface in order to assist visually impaired individuals during icon location and selection. In this enhanced system, icons have 3D sound properties, in addition to their graphical properties. Each icon has a specific sound associated with it. As the cursor is moved throughout the interface, the user is able to hear, spatially, in which direction and at what distance each icon is located, with respect to the cursor.
In direct methods of contrast enhancement, a contrast measure is first defined, which is then modified by a mapping function to generate the pixel value of the enhanced image. Various mapping functions such as the squ...
详细信息
This paper presents a ground truth data collection effort along with its use in evaluating unmixing algorithms. Unmixing algorithms are typically evaluated using synthetic data generated by selecting endmember spectru...
详细信息
ISBN:
(纸本)9780819471574
This paper presents a ground truth data collection effort along with its use in evaluating unmixing algorithms. Unmixing algorithms are typically evaluated using synthetic data generated by selecting endmember spectrums and adding them in different amounts and with added noise. Going from synthetic to real data poses many problems. One of the greatest is the amount of data to be collected. Also, there will be many unmodeled variations in real data. These include greater variation of the endmembers, additional endmembers that are a very small percentage of the image, and nonlinear effects in the data that are not modeled. The data collation effort produced a high resolution class map along with spectral measurements of 153 different sampling sites to validate the map. The methodology for using this high resolution class map for generating the ground truth data for use in the unmixing algorithms is presented. Specifically, a 1m class map is used to generate the endmember abundances for every pixel in a 30m Hyperion image of the Enrique Reef in Southwest Puerto Rico. The results using two unmixing algorithms, one with a sum to one constraint and the other with a non-negative constraint are presented. The unmixing results for each endmember are presented along with a newly developed unmixing parameter called the Correct Unmixing Index (CUI).
Techniques that treat the face holistically as a vector of pixel values, which we refer to as a monolithic representation, are still widely considered state of the art for the task of face verification in literature. ...
详细信息
Optical flow computation may be divided into four processing steps where the first is extraction of image features suitable for flow estimation. Using a generalization of the basic flow constraint it is possible to es...
详细信息
In images and videos of a 3D scene, blur due to camera shake can be a source of depth information. Our objective is to find the shape of the scene from its motion-blurred observations without having to restore the ori...
详细信息
Representing the face as a distribution of freely moving patches, which we refer to as a "free-parts" representation, has recently demonstrated some benefit in the task of face verification. This benefit can...
详细信息
ISBN:
(纸本)1901725294
Representing the face as a distribution of freely moving patches, which we refer to as a "free-parts" representation, has recently demonstrated some benefit in the task of face verification. This benefit can be largely attributed to the representation's natural ability to deal with local appearance variation within the face. Hitherto, a major limitation that has hindered the wider adoption of this type of facial representation, for the task of face verification, has been its poor ability to take advantage of prior knowledge concerning mismatches in context;such as pose. This paper goes some way to alleviating these limitations by making two novel contributions: (i) Demonstrating that free-parts distributions of a client's face for different poses overlap to such a degree that a considerable amount of discrimination is preserved in the intersection. (ii) Through the off-line estimation of subject-independent pose dependent priors, an alternative to the canonical log-likelihood measure can be employed that takes advantage of this intersection and is less sensitive to mismatch in the presence of pose variation.
Digital processing of black and white images has received most attention during the last 25 years, and has led to various algorithms for the enhancement, smoothing, and zooming of images. Due to the decreasing cost an...
详细信息
暂无评论