This paper presents a model of the retina with its properties with respect to sampling, spatiotemporal filtering, color-coding and non-linearity, and their consequences on the processing of visual information. It'...
详细信息
ISBN:
(纸本)9783540730064
This paper presents a model of the retina with its properties with respect to sampling, spatiotemporal filtering, color-coding and non-linearity, and their consequences on the processing of visual information. It's formalism points out the architectural and algorithmic principles of neuromorphic circuits which are known to improve compactness, consumption, robustness and efficiency, leading to direct applications in engineering science. It's biological aspect, strongly based neural and cellular descriptions makes it suitable as an investigation tool for neurobiologists, allowing the simulation of experiences difficult to set up and answering fundamental theoretical questions.
Real-life applications of intelligent systems that use neuralnetworks require a high degree of success, usability and reliability. Power systems applications can benefit from such intelligent systems;particularly for...
详细信息
ISBN:
(纸本)9783540730064
Real-life applications of intelligent systems that use neuralnetworks require a high degree of success, usability and reliability. Power systems applications can benefit from such intelligent systems;particularly for voltage stabilization. Voltage instability in power distribution systems could lead to voltage collapse and thus power blackouts. This paper presents an intelligent system which detects voltage instability and classifies voltage output of an assumed power distribution system (PDS) as: stable, unstable or overload. The novelty of our work is the use of voltage output images as the input patterns to the neural network for training and generalizing purposes, thus providing a faster instability detection system that simulates a trained operator controlling and monitoring the 3-phase voltage output of the assumed PDS. Experimental results suggest that our method performs well and provides a fast and efficient system for voltage instability detection.
Accurate reconstruction of deformable structures in image sequences is a fundamental task in many applications ranging from forecasting by remote sensing to sophisticated medical imaging applications. In this paper we...
详细信息
Accurate reconstruction of deformable structures in image sequences is a fundamental task in many applications ranging from forecasting by remote sensing to sophisticated medical imaging applications. In this paper we report a novel automatic two-stage method for deformable structure reconstruction from 3D image sequences. The first stage of the proposed method is focused on the automatic identification and localization of the deformable structures of interest, by means of fuzzy clustering and temporal regions tracking. The final segmentation is accomplished by a second processing stage, devoted to identify finer details using a Multilevel artificialneural Network. Application to the segmentation of heart left ventricle from MRI sequences are discussed.
Fast development of multimedia technology during the last two decades has brought different approach to the evaluation of image quality. In most of the cases, multimedia technology applications do not rely on the imag...
详细信息
ISBN:
(纸本)9780819468482
Fast development of multimedia technology during the last two decades has brought different approach to the evaluation of image quality. In most of the cases, multimedia technology applications do not rely on the image fidelity criterion but the human impression plays the main role, A model for perceptual assessment of image quality in multimedia technology is presented in this paper. The model exploits properties of the human visual system (HVS) while utilizing steerable pyramidal decomposition. image distortion features are based on Jeffrey divergence (JD) as a metric between probability distributions of original and distorted image signal values in each subband of steerable pyramid. Mean square error (MSE) is also computed. Data preprocessing using mutual information (MI) approach has been used to get a smaller set of objective distortion features describing the perceived image quality with reasonable precision. The impairment feature vector is processed by the radial basis function (RBF) artificialneural network (ANN) to allow simple adaptation of the model in respect to the required mode of operation, fidelity or impressiveness based. Parameters of the ANN are adjusted using mean opinion scores (MOS) obtained from the group of assessors. The presented system mimics an assessment process with human subjects. Model performance is verified comparing predicted quality and scores from human observers.
Inspired by the behaviour of biological receptive fields and the human visual system, a network model based on spiking neurons is proposed to detect edges in a visual image. The structure and the properties of the net...
详细信息
ISBN:
(纸本)9783540742012
Inspired by the behaviour of biological receptive fields and the human visual system, a network model based on spiking neurons is proposed to detect edges in a visual image. The structure and the properties of the network are detailed in this paper. Simulation results show that the network based on spiking neurons is able to perform edge detection within a time interval of 100 ms. This processing time is consistent with the human visual system. A firing rate map recorded in the simulation is comparable to Sobel and Canny edge graphics. In addition, the network can separate different edges using synapse plasticity, and the network provides an attention mechanism in which edges in an attention area can be enhanced.
Detection of moving objects in video streams is the first relevant step of information extraction in many computer vision applications. Aside from the intrinsic usefulness of being able to segment video streams into m...
详细信息
ISBN:
(纸本)9783540755548
Detection of moving objects in video streams is the first relevant step of information extraction in many computer vision applications. Aside from the intrinsic usefulness of being able to segment video streams into moving and background components, detecting moving objects provides a focus of attention for recognition, classification, and activity analysis, making these later steps more efficient. We propose an approach based on self organization through artificialneuralnetworks, widely applied in human imageprocessing systems and more generally in cognitive science. The proposed model allows to capture structural background variation due to periodic-like motion over a long period of time under limited memory. Our method can handle scenes containing moving backgrounds or illumination variations, and it achieves robust detection for different types of videos taken with stationary cameras. We compared our method with other modeling techniques. Experimental results, both in terms of detection accuracy and in terms of processing speed, are presented for color video sequences which represent typical situations critical for vicleo surveillance systems.
Biometric systems are defined as systems exploiting automated methods of personal recognition based on physiological or behavioural characteristics. Among these, fingerprints are very reliable biometric identifiers. T...
详细信息
ISBN:
(纸本)9783540748267
Biometric systems are defined as systems exploiting automated methods of personal recognition based on physiological or behavioural characteristics. Among these, fingerprints are very reliable biometric identifiers. Trying to fasten the imageprocessing step makes the recognition process more efficient, especially concerning embedded systems for real-time authentication. In this work we propose an FPGA-based architecture that efficiently implements the high computationally demanding core of a matching algorithm.
This paper introduces the use of 15 different readability indices as a fingerprint that enables the classification of documents into different categories. While a classification based on such fingerprints alone is not...
详细信息
ISBN:
(纸本)9783540746935
This paper introduces the use of 15 different readability indices as a fingerprint that enables the classification of documents into different categories. While a classification based on such fingerprints alone is not necessarily superior to document categorization based on dedicated dictionaries per se, the document fingerprints can enhance the overall classification rate by applying proper data fusion techniques. For other applications text mining related applications such as language classification, the detection of plagiarism, or author identification, the accuracy of text categorization methods based on readability fingerprints can even exceed a dictionary-based approach. A novel addition to the readability indices is the addition of histo-rams based on the word length of all the dictionary words used in the text and a dictionary of the most common easy words in the English language.
Vehicle license plate recognition has been intensively studied in many countries. Due to the different types of license plates being used, the requirement of an automatic license plate recognition system is different ...
详细信息
ISBN:
(纸本)9780889866294
Vehicle license plate recognition has been intensively studied in many countries. Due to the different types of license plates being used, the requirement of an automatic license plate recognition system is different for each country. In this paper, an automatic license plate recognition system is proposed for Malaysian vehicles with standard license plates based on imageprocessing, clustering, feature extraction and neuralnetworks. The imageprocessing library is developed in-house which referred to as Vision System Development Platform (VSDP). Fixed filter, Minimum filter, Median Filter and Homomorphic Filtering are used in image enhancement process. After applying image enhancement, the image is segmented using blob analysis, horizontal scan line profiles, clustering and run length smoothing algorithm approach to identify the location of the license plate. Thoroughly each image is transformed into blob objects and its important information such as total of blobs, location, height and width, are being analyzed for the purpose of cluster exercising and choosing the best cluster with winner blobs. Here, new algorithm called Cluster Run Length Smoothing Algorithm (CRLSA) approach was applied to locate the license plate at the fight position. CRLSA consisted of two separate new proposed algorithm which applied new edge detector algorithm using 30 kernel masks and 128 grayscale offset plus a new way (3D method) to calculate run length smoothing algorithm (RLSA), which can improve clustering techniques in segmentation phase. Three separate experiments were performed;Cluster and Threshold value 130 (CT130) and CRLSA with Threshold value I (CCTI). From those experiments, analysis of error tables based on segmentation errors were constructed. The prototyped system has an accuracy more than 96% and suggestions to further improve the system are discussed in this paper pertaining to analysis of the error.
Face recognition is a biometric identification method which among the other methods such as, finger print identification, speech recognition, signature and hand written recognition has assigned a special place to itse...
详细信息
ISBN:
(纸本)9789889867140
Face recognition is a biometric identification method which among the other methods such as, finger print identification, speech recognition, signature and hand written recognition has assigned a special place to itself. In principle, the biometric identification methods includes a wide range of sciences such as machine vision, imageprocessing, pattern recognition neuralnetworks and-has various applications in film processing, control access networks and etc. There are several methods for recognition among which appearance based methods is one of them. One of the most important algorithms in appearance based methods is linear discriminant analysis (LDA) which is based on statistical pattern recognition. The great drawback for linear discriminent analysis is, although there are great differences in the images for a single person, because of luminous condition and head rotation, this algorithm only considers the average of the images. In this paper we introduce fractional multiple exemplar discriminant analysis (FMEDA) which is a variation of linear discriminant analysis algorithm and then we will compare it with the other well known algorithms in this field.
暂无评论