A simple space-time coded orthogonal frequency division multiplexing (OFDM) transmitter diversity technique for wireless communications over frequency selective fading channels is presented. The proposed technique uti...
详细信息
A simple space-time coded orthogonal frequency division multiplexing (OFDM) transmitter diversity technique for wireless communications over frequency selective fading channels is presented. The proposed technique utilizes OFDM to transform frequency selective fading channels into multiple flat fading subchannels on which space-time coding is applied. A two-branch transmitter diversity system is implemented without bandwidth expansion and with a small increase in complexity beyond that of a conventional OFDM system. Simulations verify that the two-branch transmitter diversity system achieves a diversity gain equivalent to that of the optimal maximal ratio combining (MRC) receiver diversity system.
This paper describes a method for the unsupervised and gender-independent estimation of the average human vocal tract length from the speech waveform, and reports results obtained on Fant's (1960) X-ray vowel data...
详细信息
This paper describes a method for the unsupervised and gender-independent estimation of the average human vocal tract length from the speech waveform, and reports results obtained on Fant's (1960) X-ray vowel data as well as results from experiments performed on multiple sentence utterances of 86 male and 78 female TIMIT speakers, including correlation analyses between the vocal tract length estimates and given body heights. The investigated error criteria that make non-iterative, closed-form estimator solutions possible are all found to achieve good speaker clustering potential for both male and female subgroups.
For several years, we have been teaching DSP as a first course in electrical and computerengineering at Georgia Tech. Such a dramatic rearrangement of the introductory material requires a new organization of topics a...
详细信息
For several years, we have been teaching DSP as a first course in electrical and computerengineering at Georgia Tech. Such a dramatic rearrangement of the introductory material requires a new organization of topics and courses when teaching circuits and systems. In addition, the use of computer-enhanced course materials has a profound impact on the systems courses, which are quite mathematical and abstract in nature. This paper addresses some of the issues encountered when adopting a signalprocessing first approach.
It is shown that the particular form of the frequency support of raw data and focused imagery obtained from an ultra-wideband, wide beamwidth synthetic aperture radar system can be exploited in nonseparable sampling s...
详细信息
It is shown that the particular form of the frequency support of raw data and focused imagery obtained from an ultra-wideband, wide beamwidth synthetic aperture radar system can be exploited in nonseparable sampling schemes to reduce the overall amount of raw data samples and image pixels that need to be stored and computed. Furthermore, it is demonstrated that the constant integration angle backprojection (CIAB) image former implicitly applies a fan filter that interpolates raw data sampled on a quincunx grid back onto the underlying rectangular grid. This subtle property of the CIAB has not been exploited so far. It leads to higher quality images with less computational complexity.
The sinusoidal transform (ST) provides a sparse representation for speech signals by utilizing several psychoacoustic phenomena. It is well suited to applications in signal enhancement because the signal is represente...
详细信息
The sinusoidal transform (ST) provides a sparse representation for speech signals by utilizing several psychoacoustic phenomena. It is well suited to applications in signal enhancement because the signal is represented in a parametric manner that is easy to manipulate. The multi-resolution sinusoidal transform (MRST) has the additional advantage that it is both particularly well suited to typical speech signals and well matched to the human auditory system. The currently reported work discusses the removal of noise from a noisy signal by applying an adaptive Wiener filter to the MRST parameters and then conditioning the parameters to eliminate "musical noise". In informal tests MRST based noise reduction was found to reduce background noise significantly better than traditional Wiener filtering and to virtually eliminate the "musical noise" often associated with Wiener filtering.
We describe an embedded hidden Markov model (HMM)-based approach for face detection and recognition that uses an efficient set of observation vectors obtained from the 2D-DCT coefficients. The embedded HMM can model t...
详细信息
We describe an embedded hidden Markov model (HMM)-based approach for face detection and recognition that uses an efficient set of observation vectors obtained from the 2D-DCT coefficients. The embedded HMM can model the two dimensional data better than the one-dimensional HMM and is computationally less complex than the two-dimensional HMM. This model is appropriate for face images since it exploits an important facial characteristic: frontal faces preserve the same structure of "super states" from top to bottom, and also the same left-to-right structure of "states" inside each of these "super states".
A compression scheme for diverse speech and audio signals is proposed. In this scheme, signals are analyzed with a 2-band QMF filter bank followed by the application of a modulated lapped biorthogonal transform (MLBT)...
详细信息
A compression scheme for diverse speech and audio signals is proposed. In this scheme, signals are analyzed with a 2-band QMF filter bank followed by the application of a modulated lapped biorthogonal transform (MLBT) to each of the filter bank channels. Subsequent encoding of transform coefficients is performed using Laplacian optimized scalar and vector quantizers, whose rates are determined by an estimated noise threshold, i.e., masking threshold. Listening tests show that the coder achieves a quality at 32 kbits/s that is preferred over the ITU G.722 coder at 64 kbits/s, for speech, music, and more diverse signals consisting of speech in the presence of eventful background sounds. Both the delay of the coder, at 40 ms, and the level of complexity are moderate.
A new directional filter bank for image analysis and classification is proposed. This paper introduces an improved structure in order to visualize subband outputs of the directional filter banks, while retaining the a...
详细信息
A new directional filter bank for image analysis and classification is proposed. This paper introduces an improved structure in order to visualize subband outputs of the directional filter banks, while retaining the attractive properties of the original directional filter banks such as 1-D separable filtering, perfect reconstruction, and maximal decimation. Using this structure, any arbitrary 2/sup n/ band directional filter bank can be implemented by cascading simple directional filter bank blocks, unlike the original structure that needs a parallel structure for visualizing subband outputs. Also, in order to have nondistorted phase information in the subbands for visualization, both FIR and IIR filter prototypes that can be implemented efficiently are provided for linear phase filtering. This paper shows the approach proposed here can be applied to image analysis and classification.
Stereo image pair coding is an important issue in stereo data compression. A wavelet based stereo image pair coding algorithm is proposed in this paper. The wavelet transform is used to decompose the image into an app...
详细信息
Stereo image pair coding is an important issue in stereo data compression. A wavelet based stereo image pair coding algorithm is proposed in this paper. The wavelet transform is used to decompose the image into an approximation image and three edge images. In the wavelet domain, a disparity estimation technique is developed to estimate the disparity field using both approximation image and edge images. To improve the accuracy of estimation of wavelet images produced by the disparity compensation technique, a novel wavelet based subspace projection technique (SPT) is developed. In the SPT, the block dependent subspaces are constructed using block varying basis vectors that are derived from the disparity compensated wavelet images. Experimental results show that the proposed algorithm is efficient to achieve stereo image compression.
An interactive color image segmentation technique is presented for use in applications where the segmented regions correspond to meaningful objects, such as for image retrieval. The proposed technique utilizes the per...
详细信息
暂无评论