A VLSI architecture designed to perform real-time image compression using wavelets is described. The two basic modules of the architecture are a 2-D wavelet transform generator and a coder based on the SPIHT algorithm...
详细信息
ISBN:
(纸本)0780349571
A VLSI architecture designed to perform real-time image compression using wavelets is described. The two basic modules of the architecture are a 2-D wavelet transform generator and a coder based on the SPIHT algorithm for lossy image compression. A folded architecture is proposed for computing the 2-D wavelet transform, The architecture uses 3 parallel computational units and 2 storage units, The hardware for the SPIHT coder uses 2 content addressable memories and 3 random access memories. The designs are modular and can easily be extended for different levels of wavelet decomposition and filter lengths. The derived architecture has been functionally verified for an 8 x 8 image size by simulating its VHDL code using Mentor Graphics.
We introduce and describe MAT2DSP, a MATLAB toolbox, whose function is to estimate the computational load of algorithms specified in the form of a MATLAB program (or programs). This toolbox is aimed at providing resea...
详细信息
ISBN:
(纸本)0780336941
We introduce and describe MAT2DSP, a MATLAB toolbox, whose function is to estimate the computational load of algorithms specified in the form of a MATLAB program (or programs). This toolbox is aimed at providing researches developing advanced signal and imageprocessing algorithms, a quick and convenient way of estimating the implementation requirements of their algorithm on a variety of processors. MAT2DSP analyzes the user program and generates reports on its computational requirements.
Humanistic computing is proposed as a new signalprocessing framework in which the processing apparatus is inextricably intertwined with the natural capabilities of our human body and mind. Rather than trying to emula...
详细信息
Humanistic computing is proposed as a new signalprocessing framework in which the processing apparatus is inextricably intertwined with the natural capabilities of our human body and mind. Rather than trying to emulate human intelligence, humanistic computing recognizes that the human brain is perhaps the best neural network of ifs kind, and that there are many new signalprocessing applications (within the domain of personal technologies) that can make use of this excellent but often overlooked processor. The emphasis of this paper is on personal imaging applications of humanistic computing, to take a first step toward an intelligent wearable camera system that can allow us to effortlessly capture our day-to-day experiences, help us remember and see better, provide us with personal safety through crime reduction, and facilitate new forms of communication through collective connected humanistic computing. The author's wearable signalprocessing hardware, which began as a cumbersome backpack-based photographic apparatus of the 1970's and evolved into a clothing-based apparatus in the early 1980's, currently provides the computational power of a UNIX workstation concealed within ordinary-looking eyeglasses and clothing. Thus it may be worn continuously during all facets of ordinary day-to-day living, so that, through long-term adaptation, it begins to function as a true extension of the mind and body.
In recent years, growing computer networks and multimedia systems require more efficient signalprocessing methods for independent signal separation or noise reducion. Especially, image noise reduction methods are imp...
详细信息
ISBN:
(纸本)0780348605
In recent years, growing computer networks and multimedia systems require more efficient signalprocessing methods for independent signal separation or noise reducion. Especially, image noise reduction methods are important to preserve transparency of network communications as a preprocessing filter of the other methods. Some of these noise reduction methods are based on minimizing the dependence among input signals to separate a noise component, because a noise component is usually independent on the other signals. Under such circumstances, we have developed a new method to separate a noise component which directly minimizes the Kullback-Leibler divergence by a genetic algorithm(GA). The Kullback-Leibler divergence is lower when input signals have lower dependence from each other. Therefore, finding the transformation of input signals which minimizes this measure is equivalent to separate independent noise components from the noise mixed input signals. We have adopted a genetic algorithm to minimize the Kullback-Leibler divergence. GA is one of parallel processing optimization methods, which imitates biological genes and is suitable for random optimization problems. Finally, we have performed computer simulations to evaluate the developed method. Results of simulations show that the method is promised but preliminary.
We have used advanced signalprocessing and innovative imageprocessing methods that are used outside the operating room. The software is written in C++ in a windows environment and can be used on any PC. We will demo...
详细信息
ISBN:
(纸本)0780351649
We have used advanced signalprocessing and innovative imageprocessing methods that are used outside the operating room. The software is written in C++ in a windows environment and can be used on any PC. We will demonstrate some applications on a laptop. Applications to be long with computationalintelligence for diagnostic purposes and as visualization aids inside and demonstrated, include blood cell classification, mammography, evoked potentials, ophthalmology, EEG and field potentials in Parkinson's disease. The latter methods along with 3D reconstruction of MRI images of Parkinsonian patients, are currently used in the operating room for Pallidotomies.
Amateur digital video footage, shot with a hand-held camera, is characterized by visually annoying, jerky movement between successive frames. In this paper a fast and robust image stabilization technique is presented ...
详细信息
ISBN:
(纸本)0780344553
Amateur digital video footage, shot with a hand-held camera, is characterized by visually annoying, jerky movement between successive frames. In this paper a fast and robust image stabilization technique is presented to remove the effects of unwanted camera motion while not effecting true pans or zooms. Unlike other image stabilization techniques, which are typically used as a pre-processing step to more complicated imageprocessing operations, the proposed technique is specifically aimed at providing "visually jerk-free video" for consumer viewing. Empirical evidence suggested that a simple global translational model for each pair of successive images followed by local integration of cumulative motion is sufficient for this purpose. We use a novel technique to estimate the translational motion model which reduces the total computational burden to less than 1 computation/pixel, while providing robust jerk elimination, thus making real-time operation possible on most PC/UNIX platforms. Tests with a number of video sequences have shown the efficacy of the proposed method.
作者:
Venkatesh, YVIndian Inst Sci
Dept Elect Engn Comp Vis & Artificial Intelligence Lab Bangalore 560012 Karnataka India
Generalized Hermite polynomials in two variables are employed for the reconstruction of images from a knowledge of their zero crossing contours. The problem of reconstruction of signals as functions of two variables i...
详细信息
ISBN:
(纸本)0818688211
Generalized Hermite polynomials in two variables are employed for the reconstruction of images from a knowledge of their zero crossing contours. The problem of reconstruction of signals as functions of two variables is not a mere extension of that of a single variable. This is a consequence of the fact that the spatial and spectral characteristics of two-variable functions are quite distinct from what one can expect from their separate projections on to the coordinate axes. One of the results of the paper is that we cannot guarantee uniqueness in reconstruction unless we impose certain constraints on, for instance, space-bandwidth products/ratios in the x - w(x), v - w(y) directions, of the unknown image. Further, a distinguishing feature of the proposed approach is that the standard assumption of bandlimitedness is not invoked. The proposed framework is believed to provide a more unified procedure for signal reconstruction (of uni- and multi-dimensional signals) from partial information than most of the results of the literature. For lack of space, only the main analytical and computational results are presented.
The capabilities of switched networks for parallel and distributed computers are evolving rapidly towards networks with various forms of intelligence in support of parallel execution of programs. This paper presents a...
详细信息
ISBN:
(纸本)081868433X
The capabilities of switched networks for parallel and distributed computers are evolving rapidly towards networks with various forms of intelligence in support of parallel execution of programs. This paper presents a perspective on intelligent networks, including reconfiguration of the network to adapt to the needs of successive computational algorithms being performed as part of an overall problem, for clusters containing a modest number of digital signal processors (DSPs), Scalability of the overall parallel DSP-based computer is achieved by adding these cluster nodes. It is suggested that there are many operating system functions which might be directly integrated into such an intelligent network, including adapting those operating system functions according to the needs of the specific tasks being performed. The viewpoint presented here is based on a reconfigurable system using FPGAs and being constructed for imageprocessing applications to study opportunities for integration of such intelligent networks into future silicon VLSI components, including advanced packaging such as multi-chip modules (MCMs), The packaging limitations present the greatest barrier to aggressive development of such networks, with active substrate MCMs implementing the network function providing the greatest flexibility and performance.
In this work, we use the continuous image sequences generated by an electronic scanning sonar to achieve the aim of obstacle avoidance and visual navigation for an AUV (Autonomous Underwater Vehicle). Using sonar syst...
详细信息
ISBN:
(纸本)0780342739
In this work, we use the continuous image sequences generated by an electronic scanning sonar to achieve the aim of obstacle avoidance and visual navigation for an AUV (Autonomous Underwater Vehicle). Using sonar systems for sensing of unknown underwater environments is the best selection in practice. However, the critical demand for real-time signalprocessing and the uncertainties of AUV's dynamics make on-line detection of obstacles a challenging task. We use the Track-Before-Detect algorithm to extract information contained in image sequences to estimate the dynamics of the AUV, then we apply Dynamic Programming algorithm to solve for the problem of detection. This method reduces the computational cost to meet the real-time demand on obstacle avoidance and navigation of the AUV system.
The categorization of speech sounds by the auditory system has been a subject of intense attention over several decades, reflecting its importance to the scientific study of speech perception and the technological dev...
详细信息
ISBN:
(纸本)0780348605
The categorization of speech sounds by the auditory system has been a subject of intense attention over several decades, reflecting its importance to the scientific study of speech perception and the technological develop ment of more human-like capabilities in automatic speech recognition In previous work, we have firmly established that a two-stage computational model can mimic important aspects of the speech categorization behavior of human and animal listeners. The first stage employs a biologically-motivated 'front-end', modeling the peripheral auditory system, and the second stage is a trainable artificial neural network 'back-end', modeling more central processes. When suitably trained on synthetic stimuli, the two-stage system is able to reproduce the important effects of category formation for the class of initial plosive stop speech sounds, and movement of the category boundary with place of articulation. Appropriate behavior is maintained across a variety of 'back-end' architectures and associated learning algorithms. The behavior is emergent in that it was not explicitly programmed into the model These facts imply that there is something very basic about categorization behavior. Unlike real (human and animal) listeners, a software model can be interrogated to find out the contribution of its component parts to the overall behavior. Replacing the auditory front-end by a more prosaic fast Fourier transform analyzer allows us to focus on the contribution of the acoustic-to-auditory transformation to categorization. We find that the front-end processor is not essential to category formation but plays an important part in the boundary-movement phenomenon, by emphasizing important time-frequency regions of the speech signal.
暂无评论