In an InISAR system, the pixels of two ISAR images derived from corresponding antennas usually do not register properly without prior compensation. A three-dimensional motion compensation method, or 3D focusing, is put forward in this paper. With a multi-antenna-pair configuration in the radar system, the angular motion parameters in both azimuth and pitch are estimated accurately, and phase unwrapping can be avoided when obtaining the unambiguous phase. Simulation data are used to illustrate the accuracy of the proposed method. The method has also been extended to stripmap SAR for the interferometric 3D imaging of moving and/or man-made objects.
ISBN: 9781479928941 (print)
This paper introduces the use of three physiologically-motivated features for speaker identification, Residual Phase Cepstrum Coefficients (RPCC), Glottal Flow Cepstrum Coefficients (GLFCC) and Teager Phase Cepstrum Coefficients (TPCC). These features capture speaker-discriminative characteristics from different aspects of glottal source excitation patterns. The proposed physiologically-driven features give better results with lower model complexities, and also provide complementary information that can improve overall system performance even for larger amounts of data. Results on speaker identification using the YOHO corpus demonstrate that these physiologically-driven features are both more accurate than and complementary to traditional mel-frequency cepstral coefficients (MFCC). In particular, the incorporation of the proposed glottal source features offers significant overall improvement to the robustness and accuracy of speaker identification tasks.
Classifying an unknown object in image retrieval systems using the nearest neighbour classifier can be very time consuming when the associated database contains a large number of objects. Generating a dendrogram using a Hierarchical Agglomerative Clustering (HAC) algorithm and searching the database images from coarse to fine resolutions using image pyramids are two important groups of techniques widely used for dealing with this problem. In this paper, a novel algorithm is proposed by combining these methods within the framework of a face recognition system. The search process is performed in a coarse-to-fine manner using image pyramids. On the bottom level of the pyramid (the finest resolution), a set of dendrograms is formed using the HAC algorithm. Our experimental studies show that the recognition process can be sped up by a factor of around 65 compared to the basic nearest neighbour classifier. Under this condition, however, the recognition rate is slightly reduced.
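The coarse-to-fine part of this idea can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the HAC dendrograms at the finest level are omitted, and the 2x2 averaging pyramid, the Euclidean distance, the `keep` fraction and all function names are assumptions made for illustration.

```python
import numpy as np

def downsample(img):
    """Halve the resolution by averaging 2x2 blocks (assumes even height and width)."""
    h, w = img.shape
    return img.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def build_pyramid(img, levels):
    """Return [finest, ..., coarsest] versions of img."""
    pyramid = [img]
    for _ in range(levels - 1):
        pyramid.append(downsample(pyramid[-1]))
    return pyramid

def coarse_to_fine_search(query, database, levels=3, keep=0.25):
    """Nearest-neighbour search that prunes candidates at coarse resolutions.

    At every level except the finest, only the closest `keep` fraction of the
    remaining candidates survives; the finest level picks the single best match.
    """
    q_pyr = build_pyramid(query, levels)
    db_pyr = [build_pyramid(img, levels) for img in database]
    candidates = np.arange(len(database))
    for level in range(levels - 1, -1, -1):  # coarsest level first
        dists = np.array([
            np.linalg.norm(db_pyr[i][level] - q_pyr[level]) for i in candidates
        ])
        if level == 0:
            return int(candidates[np.argmin(dists)])
        n_keep = max(1, int(np.ceil(keep * len(candidates))))
        candidates = candidates[np.argsort(dists)[:n_keep]]
```

Most comparisons happen on small, low-resolution images, which is where the speed-up comes from. A true nearest neighbour discarded at a coarse level cannot be recovered, which is consistent with the slight loss in recognition rate reported in the abstract.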
This paper introduces the use of two new features for speaker identification, Residual Phase Cepstrum Coefficients (RPCC) and Glottal Flow Cepstrum Coefficients (GLFCC), to capture speaker-specific characteristics from their vocal excitation patterns. Results on a cross-lingual speaker identification task taken from the NIST 2004 SRE demonstrate that these RPCC and GLFCC features are significantly more accurate than traditional mel-frequency cepstral coefficients (MFCC). In particular, these two new features give better results with smaller amounts of training data, due to lower model complexity.
This paper presents the unmixing analysis of hyperspectral images collected at close range of vegetation plots associated with AON-ITEX in Utqiaġvik, AK. The resulting analysis illustrates the potential value of unmixing for the analysis of very high spatial resolution hyperspectral images. Unmixing can extract the signatures of the materials present in the image and capture their spectral variability and spatial distribution to support further analysis up the image chain. A variation of PPI is presented that is suitable for this application and addresses the problem of estimating the number of endmembers from the data.
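For orientation, the standard Pixel Purity Index (PPI) that the paper builds on can be sketched in Python as follows. This shows only the textbook procedure, not the variation proposed in the paper, and it does not estimate the number of endmembers; the function name, the Gaussian skewers and the default parameters are assumptions.

```python
import numpy as np

def pixel_purity_index(pixels, n_skewers=500, seed=0):
    """Count how often each pixel is an extreme point along random directions.

    pixels : array of shape (n_pixels, n_bands), one spectrum per row.
    Returns an integer count per pixel; high counts suggest candidate endmembers.
    """
    rng = np.random.default_rng(seed)
    n_pixels, n_bands = pixels.shape
    centered = pixels - pixels.mean(axis=0)
    counts = np.zeros(n_pixels, dtype=int)
    for _ in range(n_skewers):
        skewer = rng.standard_normal(n_bands)  # random projection direction
        proj = centered @ skewer
        counts[np.argmax(proj)] += 1           # extreme at one end of the skewer
        counts[np.argmin(proj)] += 1           # extreme at the other end
    return counts
```

Under a linear mixing model with pure pixels present, a linear projection attains its extremes at the vertices of the data simplex, so only pure pixels accumulate counts.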
A new method of image coding by autoregressive (AR) synthesis is presented. The physics of image formation suggests that an image may be considered as a power spectrum. Using this formulation, a cosine transform of the sampled image is shown to yield a set of autocorrelations. These are used to find an equivalent AR model whose parameters are encoded for transmission. Compared to conventional cosine transform coding, this method is shown to give superior resolution and to suppress the "block effects" present in block-by-block transform coding methods. The distinction between this method and linear predictive coding (LPC) used for speech data compression is made. Extensions and examples for two-dimensional images are given.
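The chain from "image as power spectrum" to AR parameters can be sketched for a single 1-D row in Python. This is a minimal illustration under stated assumptions, not the paper's coder: the frequency grid, the normalisation, the use of the Levinson-Durbin recursion and both function names are choices made here, and quantisation and encoding of the parameters are left out.

```python
import numpy as np

def autocorr_from_spectrum(samples, max_lag):
    """Cosine-transform nonnegative samples, read as a power spectrum on
    (0, pi), into autocorrelation values r[0..max_lag].

    Assumes samples lie on the midpoint grid w_n = pi * (n + 0.5) / N.
    """
    samples = np.asarray(samples, dtype=float)
    n = samples.size
    w = np.pi * (np.arange(n) + 0.5) / n
    lags = np.arange(max_lag + 1)
    return (np.cos(np.outer(lags, w)) @ samples) / n

def levinson_durbin(r, order):
    """Solve the Yule-Walker equations for an AR model of the given order.

    Returns (a, err) with a[0] == 1, where the prediction-error filter is
    A(z) = 1 + a[1] z^-1 + ... + a[order] z^-order and err is the residual power.
    """
    r = np.asarray(r, dtype=float)
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for m in range(1, order + 1):
        # Reflection coefficient for stage m.
        acc = r[m] + np.dot(a[1:m], r[m - 1:0:-1])
        k = -acc / err
        # Update a[1..m-1] from the previous stage, then append a[m] = k.
        prev = a.copy()
        for i in range(1, m):
            a[i] = prev[i] + k * prev[m - i]
        a[m] = k
        err *= 1.0 - k * k
    return a, err
```

In this sketch the decoder would rebuild the row as err / |A(e^{jw})|^2, so only the AR coefficients and the residual power would need to be transmitted.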
This paper describes a technique for the enhancement of images by FIR filters which compensates for the decreased response of human vision at high spatial frequencies. Because many images contain mainly horizontal and vertical features, and because vision is less acute along diagonals, it is possible to design anisotropic enhancement filters which do not greatly increase the background noise. The anisotropic design methodology also incorporates other results on filter design based on human vision which have recently been reported by the author and coworkers.
In cyclic-prefixed communication systems, if the delay spread of the channel is longer than the cyclic prefix (CP), a channel-shortening equalizer (CSE) can be used to restore the desired operation of such systems. Since a time-varying environment calls for a fast adaptive equalizer with tracking capability, the aim of this paper is to propose RLS-type algorithms for channel shortening. We first propose an RLS-type algorithm to estimate the eigenvector corresponding to the smallest eigenvalue of a matrix and, based on this algorithm, develop an RLS-type blind channel shortener. We also propose, based on the PAST algorithm, an RLS-type update rule to shorten the channel under the MMSE criterion. Simulations show the speed advantage of the proposed algorithms.
ISBN: 9781424413973; 1424414296; 1424413974 (print)
In this paper, the inherent drawbacks of naive lattice decoding for MIMO fading systems are investigated. We show that using naive lattice decoding for MIMO systems has considerable deficiencies in terms of the rate-diversity trade-off. Unlike the case of maximum-likelihood decoding, in this case even the perfect lattice space-time codes, which have the non-vanishing determinant property, cannot achieve the optimal rate-diversity trade-off. Indeed, we show that in the case of naive lattice decoding, all the codes based on full-rate lattices have the same rate-diversity trade-off as V-BLAST. We also derive a lower bound on the symbol error probability of naive lattice decoding for fixed-rate MIMO systems (with equal numbers of receive and transmit antennas). This bound shows that, asymptotically, naive lattice decoding has an unbounded loss in terms of the required SNR compared to maximum-likelihood decoding.
In this work, a novel method of fusing colour information at the feature level is proposed for a face verification system. For this purpose, composite kernels, which have already been used in support vector machine classifiers, are applied within the framework of the generalised discriminant analysis (GDA) algorithm. The performance of the resulting system is evaluated using the XM2VTS face database and its associated experimental protocols. Our experiments show that by combining colour information using the proposed approach, the good classification performance demonstrated by kernel-based methods can be further improved.
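One common way to build a composite kernel over colour channels is a weighted sum of per-channel kernels, sketched below in Python. The abstract does not state which composite form or base kernel the authors use, so the weighted sum, the RBF kernel, the `gamma` parameter and the function names are all assumptions; the GDA step itself is not shown.

```python
import numpy as np

def rbf_kernel(X, Y, gamma):
    """Gaussian (RBF) kernel matrix between the rows of X and the rows of Y."""
    sq_dists = (
        np.sum(X**2, axis=1)[:, None]
        + np.sum(Y**2, axis=1)[None, :]
        - 2.0 * X @ Y.T
    )
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))

def composite_kernel(channels_X, channels_Y, weights, gamma=0.1):
    """Weighted sum of per-channel RBF kernels.

    channels_X, channels_Y : lists of arrays, one (n_samples, n_features)
                             array per colour channel.
    weights                : one nonnegative weight per channel.
    """
    K = 0.0
    for Xc, Yc, w in zip(channels_X, channels_Y, weights):
        K = K + w * rbf_kernel(Xc, Yc, gamma)
    return K
```

A nonnegative weighted sum of valid kernels is itself a valid (positive semidefinite) kernel, so the resulting matrix can be used wherever a kernel method such as GDA or an SVM expects a single kernel.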