Satellite 8-bit images are processed using a new nonlinear wavelet estimator for image denoising. An equal subband analysis of the images is realized over 3 resolution levels, leading to 64 subbands. Considering the noise as white Gaussian with a standard deviation around unity, the best soft-threshold estimator in the mean-square sense is obtained through simulations using spline orthogonal filters of order 3. In both Wiener and sigmoidal filtering, a statistical test is carried out for each subband to prevent the optimization of the weights when the signal-to-noise ratio is too low.
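For readers unfamiliar with the soft-threshold estimator named in this abstract, the following is a minimal sketch of the standard soft-thresholding rule applied to a list of wavelet coefficients. The function name, the sample coefficients, and the threshold value are illustrative assumptions; the abstract does not give the per-subband thresholds the authors obtained, and the subband decomposition itself is not reproduced here.

```python
def soft_threshold(coeffs, threshold):
    """Standard soft thresholding: shrink each coefficient toward zero by
    `threshold`, and set coefficients with magnitude <= threshold to zero."""
    shrunk = []
    for c in coeffs:
        magnitude = abs(c) - threshold
        if magnitude <= 0:
            shrunk.append(0.0)  # treated as noise
        else:
            shrunk.append(magnitude if c > 0 else -magnitude)
    return shrunk


if __name__ == "__main__":
    # Hypothetical detail coefficients from one subband (not from the paper).
    subband = [3.0, -0.4, 1.2, -2.5, 0.1]
    # Threshold of 1.0 is an assumption chosen only for illustration.
    print(soft_threshold(subband, 1.0))
```

In a full denoiser this rule would be applied to each detail subband with its own threshold before the inverse transform.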
ISBN: 085296692X (print)
This paper provides an overview of a new software framework, TABS, which has been designed to support the rapid development of image processing and image analysis systems and components. Compared to other image manipulation software frameworks, TABS has a number of novel features which make it particularly suitable for use in applications where hypotheses rather than single "hard" results are generated by system components, and symbolic data are manipulated.
In this paper, we present a high-speed VLSI architecture for the Discrete Wavelet Transform (DWT) for MPEG-4. We found a similarity between the computation results of each octave. Exploiting this similarity, the proposed architecture separates the input data into even and odd samples and feeds two samples in parallel. This makes the Discrete Wavelet Transform faster than in other architectures. A conventional architecture computes an N-point DWT in N or 2N cycles, whereas the proposed architecture computes it in N/2 cycles with 100% hardware utilization. Therefore, the proposed architecture can be applied to MPEG-4, image transmission over wireless networks, and digital signal processing applications that require high-speed processing.
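The even/odd input split described in this abstract can be illustrated in software with a minimal sketch. It assumes a Haar filter pair, which the abstract does not specify, and it models only a single octave: each loop iteration stands in for one cycle that consumes one even and one odd sample together, so N input samples take N/2 iterations. The function name and the cycle counter are hypothetical and do not represent the authors' hardware design.

```python
import math


def haar_dwt_level(signal):
    """One octave of a Haar DWT, taking one (even, odd) sample pair per step.

    Returns (approximation, detail, cycles), where `cycles` counts the
    loop iterations and equals len(signal) // 2.
    """
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    evens = signal[0::2]  # samples at even indices
    odds = signal[1::2]   # samples at odd indices
    scale = 1.0 / math.sqrt(2.0)
    approx, detail = [], []
    cycles = 0
    for e, o in zip(evens, odds):
        approx.append((e + o) * scale)  # low-pass output
        detail.append((e - o) * scale)  # high-pass output
        cycles += 1
    return approx, detail, cycles


if __name__ == "__main__":
    samples = [1.0, 1.0, 2.0, 4.0, 3.0, 3.0, 0.0, 2.0]
    approx, detail, cycles = haar_dwt_level(samples)
    print(len(samples), "samples processed in", cycles, "steps")
```

Feeding the approximation output back into the same function would give the next octave; how the paper's architecture schedules the octaves is not stated in the abstract.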
ISBN: 085296692X (print)
We present an optimisation technique to adaptively select a set of control parameters for edge and boundary detection in intensity image data. The method is based on a genetic algorithm, and requires the definition of a performance measure to estimate the quality of the output in the absence of ground truth. Synthetic and real images are used to illustrate the approach.
ISBN: 0201322323 (print)
In this paper, we will present a highly scalable wavelet image codec that also achieves good coding efficiency, both subjectively and in terms of PSNR (Peak Signal-to-Noise Ratio). A proposal based on this codec was recently adopted as the texture coding mode in the MPEG4 Working Draft (WD). This codec supports embedded scalability at many SNR levels and spatial resolutions without sacrificing image quality, and the scalability is supported all the way to lossless compression (compressed once, and decoded at multiple resolutions and qualities according to end users' needs without transcoding). We will introduce in detail the techniques used in the codec. The syntax of the MPEG4 texture coding mode that incorporates this technology will also be introduced. This highly scalable image codec is an enabling technology for many multimedia applications. In this paper, we will describe some of the applications made possible with this technology, such as efficient progressive transmission for web browsing, dynamic rate shaping by network routers, flexible image storage and management in digital cameras, and efficient texture mapping using compressed images in computer graphics.
The proceedings contain 29 papers from the Fourth International Conference on Massively Parallel Processing Using Optical Interconnections. Topics discussed include: parallel optical interconnections; optical clock signal distribution systems; shepherd pulses; optical networks; stack ring networks; electronic systems; Benes network; fiber optic arrays; switch interconnects; Fourier plane patterned mirror array; image sequences; binary trees; equivalent shuffle network; and free space optical interconnections.
ISBN: 085296692X (print)
Recently developed object-based video coding schemes require the estimation of motion in arbitrary-shaped objects. A quad-tree structured variable size block matching technique is described. The inherent motion is represented by significantly fewer blocks than if conventional fixed size block matching had been used to provide a similar quality prediction.
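The general idea of quad-tree structured variable size block matching can be sketched as follows. This is a generic illustration under stated assumptions, not the authors' method: it uses the sum of absolute differences (SAD) as the matching cost, an exhaustive search window, and a mean-error threshold as the split rule, and it ignores object shape masks. All function names and parameters are hypothetical.

```python
def sad(cur, ref, x, y, size, dx, dy):
    """Sum of absolute differences between a block in `cur` at (x, y)
    and the block in `ref` displaced by (dx, dy)."""
    total = 0
    for j in range(size):
        for i in range(size):
            total += abs(cur[y + j][x + i] - ref[y + j + dy][x + i + dx])
    return total


def best_vector(cur, ref, x, y, size, search):
    """Exhaustive search for the displacement with the lowest SAD."""
    height, width = len(ref), len(ref[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            inside = (0 <= x + dx and x + dx + size <= width
                      and 0 <= y + dy and y + dy + size <= height)
            if not inside:
                continue  # displaced block would leave the reference frame
            cost = sad(cur, ref, x, y, size, dx, dy)
            if best is None or cost < best[0]:
                best = (cost, dx, dy)
    return best


def quadtree_match(cur, ref, x, y, size, min_size, threshold, search):
    """Return leaf blocks as (x, y, size, (dx, dy)).

    A block is kept whole when its mean absolute error is within
    `threshold` or it has reached `min_size`; otherwise it is split
    into four quadrants that are matched recursively.
    """
    cost, dx, dy = best_vector(cur, ref, x, y, size, search)
    if cost <= threshold * size * size or size <= min_size:
        return [(x, y, size, (dx, dy))]
    half = size // 2
    leaves = []
    for oy in (0, half):
        for ox in (0, half):
            leaves += quadtree_match(cur, ref, x + ox, y + oy, half,
                                     min_size, threshold, search)
    return leaves


if __name__ == "__main__":
    # Synthetic frames: a ramp reference and a horizontally shifted copy.
    ref = [[r * 8 + c for c in range(8)] for r in range(8)]
    cur = [[r * 8 + c + 1 for c in range(8)] for r in range(8)]
    for leaf in quadtree_match(cur, ref, 2, 2, 4, 2, 0.5, 1):
        print(leaf)
```

Regions with uniform motion stay as one large block with a single vector, which is why such a scheme can describe motion with fewer blocks than a fixed small block size.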
ISBN: 085296692X (print)
The use of discrete cosine transforms (DCT) with a genetic algorithm (GA) for stereo matching is demonstrated. The matching process is implemented by generating, for each image block, sets of DCT coefficients which, when inverse transformed, become the estimated disparity map for that block. The GA performs an optimization process and finds, among a large number of candidates, the best matching DCT coefficients.
ISBN: 085296692X (print)
A complementary metal oxide semiconductor (CMOS) very large scale integration (VLSI)-based design methodology enables application-specific binary sensors of differing array sizes and processing to be designed and laid out quickly and simply using a library of cells. The generic bit-sliced library cells and full-custom logic are slower to produce initially, but their potential for repeated use in many designs makes this penalty worthwhile. It was found that this is compensated by an increase in packing density of bit-sliced cells of typically 16-21% compared to auto-routed logic. Silicon area is also fundamentally minimized by virtue of using the lowest resolution sensing array and optimal processing architecture.
ISBN: 085296692X (print)
Consistently high person recognition accuracy is difficult to attain using a single recognition modality. This paper assesses the fusion of voice and outer lip-margin features for person identification. Feature fusion is investigated in the form of audio-visual feature vector concatenation, principal component analysis (PCA), and linear discriminant analysis (LDA). The paper shows that, under mismatched test and training conditions, audio-visual feature fusion is equivalent to an effective increase in the signal-to-noise ratio of the audio signal. Audio-visual feature vector concatenation is shown to be an effective method for feature combination, and linear discriminant analysis is shown to possess the capability of packing discriminating audio-visual information into fewer coefficients than principal component analysis. The paper reveals a high sensitivity of bimodal person identification to a mismatch between LDA or PCA feature-fusion module and speaker model training noise conditions. Such a mismatch leads to worse identification accuracy than unimodal identification.