We present a low bit rate multimode speech coding algorithm which applies a suitable spectral model and a custom quantization scheme to each frame according to the selected mode. For unvoiced speech, the spectrum is c...
详细信息
In this correspondence, we propose a comprehensive theory for the morphological bounds on order-statistics filters (and their repeated iterations). Conditions are derived for morphological openings and closings to ser...
详细信息
In this correspondence, we propose a comprehensive theory for the morphological bounds on order-statistics filters (and their repeated iterations). Conditions are derived for morphological openings and closings to serve as bounds (lower and upper, respectively) on order-statistics filters (and their repeated iterations). Under various assumptions, morphological open-closings and close-openings are also shown to serve as (tighter) bounds (lower and upper, respectively) on iterations of order-statistics filters. Simulations of the application of the results presented to image restoration are finally provided.
In this correspondence, we propose a comprehensive theory of the convergence and characterization of roots of order-statistics filters. Conditions for the convergence of iterations of order-statistics filters are prop...
详细信息
In this correspondence, we propose a comprehensive theory of the convergence and characterization of roots of order-statistics filters. Conditions for the convergence of iterations of order-statistics filters are proposed. Criteria for the morphological characterization of roots of order-statistics filters are also proposed.
This paper describes modelling of the coefficient domain in wavelet subbands of wideband audio signals for low-bit rate and high-quality compression. The purpose is to develop models of the perception of wideband audi...
详细信息
This paper describes modelling of the coefficient domain in wavelet subbands of wideband audio signals for low-bit rate and high-quality compression. The purpose is to develop models of the perception of wideband audio signals in the wavelet domain. The coefficients in the wavelet subbands are quantized using a scheme that adapts to the subband signal by setting the quantization step size for a particular subband to a size that is inversely proportional to the subband energy, and then, within a subband, by modifying the energy determined step size as inversely proportional to the amplitude probability density of the coefficient. The amplitude probability density of the coefficients in each subband is modelled using learned vector/scalar quantization employing frequency sensitive competitive learning. The source data consists of 1-channel, 16-bit linear data sampled at 44.1 kHz from a CD containing major classical and pop music. Preliminary results show a bit-rate of 150 kbps, rather than 705.6 kbps, with no perceptual loss in quality. The wavelet transform provides better results for representing multifractal signals, such as wide band audio, than do other standard transforms, such as the Fourier transform.
This paper presents a method of generating unique fingerprints of radio transmitter turn-on transients. The fingerprinting system consists of the application of multiresolution wavelet analysis used to characterize th...
详细信息
This paper presents a method of generating unique fingerprints of radio transmitter turn-on transients. The fingerprinting system consists of the application of multiresolution wavelet analysis used to characterize the features contained in the transient followed by the use of a genetic algorithm to extract the wavelet coefficients that represent critical features of the transient. To measure the ability of the system to generate efficient and unique fingerprints, a neural network is used to classify the transients by their fingerprints. To test the noise sensitivity of the system, noisy transients were applied to a trained neural network, the network was able to positively classify noisy transients with 20 dB signal to noise ratios (SNR) and up. Experiments with real radio transients show that the system is able generate uniqiue fingerprints for absolute classification by a neural network for radios of differing model type as well as radios of the same model type.
Summary form only given, as follows. Fractal based data compression has attracted a great deal of interest since Barnsley's introduction of iterated functions systems (IFS), a scheme for compactly representing int...
详细信息
Summary form only given, as follows. Fractal based data compression has attracted a great deal of interest since Barnsley's introduction of iterated functions systems (IFS), a scheme for compactly representing intricate image structures. This paper discusses the incremental development of a block-oriented fractal coding technique for still images based on the work of Jacquin (1990). A brief overview of Jacquin's method is provided, and several of its features are discussed. In particular, the high order of computational complexity associated with the technique is addressed. This paper proposes that a neural network paradigm known as frequency sensitive competitive learning (FSCL) be employed to assist the encoder in locating fractal self-similarity within a source image. A judicious development of the proper neural network size for optimal time performance is provided. Such an optimally-chosen network has the effect of reducing the time complexity of Jacquin's original encoding algorithm from O(n/sup 4/) to O(n/sup 3/). In addition, an efficient distance measure for comparing two image segments independent of mean pixel brightness and variance is developed. This measure, not provided by Jacquin, is essential for determining the fractal block transformations. An implementation of fractal block coding employing FSCL is presented and coding results are compared with other popular image compression techniques. The paper also present the structure of the associated software simulator.
This paper presents a new and promising approach in characterizing speech consonants using a fractal model for speech recognition systems. Characterization of consonants has been a difficult problem because consonant ...
详细信息
This paper presents a new and promising approach in characterizing speech consonants using a fractal model for speech recognition systems. Characterization of consonants has been a difficult problem because consonant waveforms may be indistinguishable in time or frequency domain. The approach views consonant waveforms as coming from a turbulent constriction (excitation) in a human speech production system, and thus exhibiting turbulent and noise like time domain appearance. However, it departs from the usual approach by modeling consonant excitation using chaotic dynamical systems capable of generating turbulent and noise-like excitations. The scheme employs correlation fractal dimension and Takens embedding theorem to measure fractal dimension from time-series observation of the dynamical systems. It uses linear predictive coding (LPC) excitation of twenty-two consonant waveforms as the time series. Furthermore, the correlation fractal dimension is calculated using a fast Grassberger algorithm. A preliminary observation shows encouraging results because every consonant results in a unique trend of fractal dimensions for different embedding dimensions and scales.
This paper is concerned with reducing the rank of the adaptive weight vector in radar array signal processing. The motivation for reducing the rank is that modern space-time processing requires many more weights than ...
详细信息
Very-high-order FIR filters required for the new modulation schemes associated with wireless computer networks and cellular telephones can be implemented in VLSI circuitry using low-power CMOS technology and a novel a...
详细信息
Very-high-order FIR filters required for the new modulation schemes associated with wireless computer networks and cellular telephones can be implemented in VLSI circuitry using low-power CMOS technology and a novel application of Residue Number System (RNS) arithmetic. Through this approach 20-bit equivalent integer arithmetic can be obtained for filters with 8 to 256 taps with only a modest increase in hardware for filters above 8 taps. Simulations indicate that this new technique can increase dramatically the number of taps implemented on a single VLSI chip when compared with an FIR filter generated using FIRGEN.
暂无评论