This paper presents a wavelet transformation (WT) based technique for reducing the size of Cellular Neural Network (CNN) [1] used for the acoustic alarm signals classification system proposed by Osuna ***. [2]. The sy...
详细信息
This paper presents a wavelet transformation (WT) based technique for reducing the size of Cellular Neural Network (CNN) [1] used for the acoustic alarm signals classification system proposed by Osuna ***. [2]. The system of [2] consists of three processing units: i) Transformation of a 1-dimensional (1-D) signal into a sequence of 2-dimensional (2-D) signals, so called images obtained by a low pass filter cascade incorporating with a grid like correlation process, ii) Concentrating an image sequence into a single image by linear threshold template CNN, iii) Classification of the resulting image by discrete-valued perceptrons. In this paper, discrete WT (DWT) incorporating with grid like correlation process has been used for transforming 1-D acoustic signal into an image sequence. All other operations needed for the classification has been performed as done in [2] for the sake of comparison. The WT based technique proposed in this paper gives the possibility of acoustic alarm signal classification by using CNNs of small size, e.g., 13×13. By using WT based technique, CNN of size 13×13 becomes sufficient.
Performance of multiresolution motion estimation schemes in low bit rate video coding are presented in this paper. Video frames are individually wavelet decomposed and motion activity is detected using variable block ...
详细信息
ISBN:
(纸本)0780336798;0780336801
Performance of multiresolution motion estimation schemes in low bit rate video coding are presented in this paper. Video frames are individually wavelet decomposed and motion activity is detected using variable block size multi-resolution motion estimation (MRME) schemes. The residual frames are coded using zero tree quantization,followed by arithmetic entropy coding. MRME schemes are shown to exploit the motion correlation among subbands of different scales. In this paper we show that the MRME schemes behave differently, than as shown in [4], in low bit rate applications employing zero tree quantization. Simulation results are provided for coding of QCIF resolution (176x144 pixels) video frames at 10 frames/sec, coded at various low bit rates. Four different MRME schemes are evaluated and performance comparisons are provided for several low output bit rates.
Our work focuses on the development of finite element models (FEMs) that describe the biomechanics of human joints. Finite element modeling is becoming a standard tool in industrial applications. In highly complex pro...
详细信息
ISBN:
(纸本)0780332598
Our work focuses on the development of finite element models (FEMs) that describe the biomechanics of human joints. Finite element modeling is becoming a standard tool in industrial applications. In highly complex problems such as those found in biomechanics research, however, the full potential of FEMs is just beginning to be explored, due to the absence of precise, high resolution medical data and the difficulties encountered in converting these enormous datasets into a form that is usable in FEMs. With increasing computing speed and memory available, it is now feasible to address these challenges. We address the first by acquiring data with a high resolution X-ray CT scanner and the latter by developing a semi-automated method for generating the volumetric meshes used in the FEM. Issues related to tomographic reconstruction, volume segmentation, the use of extracted surfaces to generate volumetric hexahedral meshes, and applications of the FEM are described.
This thesis is concerned with new approaches to processing and compression of nonstationary signals using wavelets and multifractality, with applications to speech and imales. wavelets allow for simultaneous observati...
This thesis is concerned with new approaches to processing and compression of nonstationary signals using wavelets and multifractality, with applications to speech and imales. wavelets allow for simultaneous observations of time and frequency transients, while multifractality allows fractal dimension characterizations of signal singularities. A scheme is developed to separate stationary and nonstationary parts of speech signals using linear predictive coding (LPC) filters and LPC excitations, respectively, and to design (i) wavelet compact representation of the excitation to achieve 15.33 dB signal-to-noise ratio (SNR) at a 5.5 kbit per second (kbps) rate, and (ii) fractal characterization of 22 consonant excitations resulting in separable features. wavelet representation is also suitable for image compression, where more than 907o truncation of wavelet coeficients still results in good perceptual quality. An optimum scalar quantization of such coefficients results in a peak SNR (PSNR) of 28.06 dB at arafe of 0.15 bit per pixel (bpp), outperforming the classical joint photographic expert group (JPEG) objectively and subjectively. 'wavelet coefficients are also suitable for fast Rice entropy coders. In addition, wavelets and fractality lead to separation of signal from noise, and are used to obtain a 2:1 compression ratio of otherwise incompressible noise contaminated images. wavelets and multifractality not only reveal that nonstationary parts are suffrcient to represent natural images, with wavelet maxima detecting both the regular and singular transients, but also can be used to reconstruct such images. This has led to an edge preserving coding for aerial ortho images, achieving a quality of 21.89 dB PSNR, thus outperforming the edge quality of JPEG at 30 dB. Finally, a hierarchical neural network has been designed to exffact nonstationary features and can classify 22 severe storm events accurately.
A Multiresolution method to match deformed contours is presented [1-2]. Our approach has been inspired by the method due to Duncan and Cohen [3-4]. The innovation of our work is in the incorporation of a new smoothnes...
详细信息
ISBN:
(纸本)7505338900
A Multiresolution method to match deformed contours is presented [1-2]. Our approach has been inspired by the method due to Duncan and Cohen [3-4]. The innovation of our work is in the incorporation of a new smoothness term in the elastic fitting criterion. As a result, the computational complexity is reduced and the equation corresponding to the minimization of the fitting criterion has a simple interpretation. In this paper, we investigate the necessary and sufficient conditions to the minimization of the fitting criterion using variational calculus.
In this paper, we present a new algorithm to recognize a two- dimensional object of general shape which has a closed contour, The algorithm includes two stages. The first stage consists of building a one-dimensional s...
详细信息
ISBN:
(纸本)7505338900
In this paper, we present a new algorithm to recognize a two- dimensional object of general shape which has a closed contour, The algorithm includes two stages. The first stage consists of building a one-dimensional signal representation of the two-dimensional boundary of the object and obtaining the wavelet transform of the representation by using a particular mother wavelet which is the first derivative of a smoothing function. The second stage consists of calculating the rise -fall periods of the wavelet transform and its transfer probability matrix as the feature template, and then completing the matching procedure for objects recognition.
In this paper, a novel dynamic frequency based SIMD linear array processor (DFLAP) for imageprocessingapplications is proposed. The operating clock frequency of the processor is varied dynamically between 400 MHz an...
详细信息
ISBN:
(纸本)0780332598
In this paper, a novel dynamic frequency based SIMD linear array processor (DFLAP) for imageprocessingapplications is proposed. The operating clock frequency of the processor is varied dynamically between 400 MHz and 50 MHz based on the operation performed in order to enhance the processor throughput. An efficient implementation for the dynamic clocking unit (DCU) which enables dynamic switching of clock frequencies is presented. Each processing element in the linear array contains an 8-bit arithmetic/logic unit, an 8×8 single-cycle multiplier, a shifter, a bidirectional neighbor communication unit, a 32×8 dual port SRAM, and a DCU. The architecture was designed and implemented using CADENCE design tools. Several low-level imageprocessing tasks have been mapped onto the architecture to demonstrate the effectiveness of the dynamic frequency based architecture.
We introduce a novel image-adaptive encoding scheme for the baseline JPEG standard that maximizes the decoded image quality without compromising compatibility with current JPEG decoders. Our algorithm jointly optimize...
详细信息
ISBN:
(纸本)0819422355
We introduce a novel image-adaptive encoding scheme for the baseline JPEG standard that maximizes the decoded image quality without compromising compatibility with current JPEG decoders. Our algorithm jointly optimizes quantizer (Q-matrix) selection, coefficient ''thresholding'' and entropy (Huffman) coding within a rate-distortion (RD) framework. It unifies two previous approaches to image-adaptive JPEG encoding: R-D optimized quantizer selection by Wu and Gersho [4], and R-D optimal coefficient thresholding by Ramchandran and Vetterli [3]. By formulating an algorithm which optimizes these two operations jointly, we have obtained performance that is the best in the reported literature (and very close to the optimal performance theoretically attainable) for JPEG-compatible coding. In fact the performance of this JPEG coder is comparable to that of more complex ''state of the art'' image coding schemes: e.g., for the benchmark 512x512 ''Lenna'' image at a coding rate of 1 bit per pixel (8:1 compression), our algorithm achieves a Peak signal to Noise Ratio (PSNR) of 39.6 dB, which represents a gain of 1.7 dB over SPEG using the example Q-matrix with a customized Huffman entropy coder, and even slightly exceeds the published performance of Shapiro's celebrated embedded zerotree wavelet coding scheme. Furthermore, with the choice of appropriate visually-based error metrics, noticeable subjective improvement has been achieved as well. The reason for our algorithm's superior performance can be attributed to its conceptual equivalence to the application of Entropy-Constrained Vector Quantization (ECVQ) design principles to a JPEG-compatible framework. Furthermore, our algorithm may be applied to other systems that use run-length encoding, including intra-frame MPEG and subband or wavelet coding.
We present a new stabilized zero-crossing representation with a salient feature that the signal reconstruction problem reduces to a typical minimum-norm optimization problem, the solution of which is formulated as a l...
详细信息
We present a new stabilized zero-crossing representation with a salient feature that the signal reconstruction problem reduces to a typical minimum-norm optimization problem, the solution of which is formulated as a linear simultaneous equation, and developed an iterative algorithm for signal reconstruction. Moreover, we extend them to the two-dimensional case. With the extended two-dimensional reconstruction algorithm we can almost perfectly reconstruct an original image from the stabilized two-dimensional zero-crossing representation, and after some dozens of iterations the algorithm provides a reconstruction image with subjectively high picture quality. Furthermore, we introduce a threshold operation based on edge intensity to reduce the amount of information in the stabilized zero-crossing representation, and experimentally demonstrate that the threshold operation works well.
Two modifications to a wavelet packet based compression scheme for single lead electrocardiogram (ECG) data have been tested for their effect upon compression performance. First, differential pulse code modulation and...
详细信息
暂无评论