Markov/Gibbs random fields have been used for posing a variety of computervision and image processing problems. Many of these problems are then solved using a simulated annealing type of method which involves the var...
详细信息
ISBN:
(纸本)0819407445
Markov/Gibbs random fields have been used for posing a variety of computervision and image processing problems. Many of these problems are then solved using a simulated annealing type of method which involves the varying of the temperature, a scale parameter for the model. In this paper we analyze the effect of temperature on random field texture patterns. We obtain new results relating structure in the texture co-occurrence matrix to temperature. We also show the existence of multiple transition temperatures which delimit regions of different bandwidth in the co-occurrence matrix, and hence can be used to control pattern formation.
Character and handwriting recognition is one of the most difficult problems of pattern recognition and artificial intelligence. Unlike the machine generated character, which is uniform throughout a document and often ...
详细信息
ISBN:
(纸本)0819407453
Character and handwriting recognition is one of the most difficult problems of pattern recognition and artificial intelligence. Unlike the machine generated character, which is uniform throughout a document and often uniform between machines, each human being has a unique style of writing characters. With the infinite number of ways to record a character, it is a wonder that a person can understand his own script, let alone the script of another. Training a computer to recognize human-produced characters is a tremendous task in which researchers are just beginning to achieve some success. Primarily these methods rely on the use of algorithms to determine the similarities of two characters. Neural networks are an alternative technique now being explored. Four separate methods will be discussed in this paper. The first involves normalization, skeletonization, and feature extraction of a handwritten digit before application to a neural network for classification. The second simply applies a normalized digit to the neural net's input, and the network performs a 2-dimensional convolution on it in order to classify the digit. The third method involves a hierarchical network. The final technique incorporates time information into the system while using simple preprocessing and a small number of parameters. Their advantages and disadvantages are compared and discussed.
It is suggested that there exists a minimal set of rules for the perceptual composition of the unending variety of spatio-temporal patterns in our perceptual world. Driven by perceptual discernment of "sudden cha...
详细信息
ISBN:
(纸本)0819407453
It is suggested that there exists a minimal set of rules for the perceptual composition of the unending variety of spatio-temporal patterns in our perceptual world. Driven by perceptual discernment of "sudden change" and "unexpectedness", these rules specify conditions (such as co-linearity and virtual continuation) for perceptual grouping and for recursive compositions of perceptual "modalities" and "signatures". Beginning with a smallset of primitive perceptual elements, selected contextually at some relevant level of abstraction, perceptual compositions can graduate to an unlimited variety of spatiotemporal signatures, scenes and activities. Local discernible elements, often perceptually ambiguous by themselves, may be integrated into spatiotemporal compositions, which generate unambiguous perceptual separations between "figure" and "ground". The definition of computational algorithms for the effective instantiation of the rules of perceptual grouping remains a principal problem. In this paper we present our approach for solving the problem of perceptual recognition within the confines of one-D variational profiles. More specifically, concerning "early" (pre-attentive) recognition, we define the "structural identity of a k-norm, k ∈ K,"--SkID--as a tool for discerning and locating the instantiation of spatiotemporal objects or events. The SkID profile also serves a s a reference coordinate framework for the "perceptual focusing of attention" and the eventual assessment of resemblance. Neural network implementations of pre-attentive and attentive recognition are also discussed briefly. Our principles are exemplified by application to one-D perceptual profiles, which allows simplicity of definitions and of the rules of perceptual composition.
A decision support system is developed for personnel scheduling in a multiple warehouse environment. The system incorporates current manpower level, historical data of workers used, empirical load distributions, and p...
详细信息
ISBN:
(纸本)0819407445
A decision support system is developed for personnel scheduling in a multiple warehouse environment. The system incorporates current manpower level, historical data of workers used, empirical load distributions, and performance standards to generate manpower requirements for a specified planning horizon. The software has been developed to be easily adaptable to varying situational details, therefore is widely applicable in different warehouse settings. The system offers personnel managers a valuable tool for evaluating alternative schedules and making intelligent decisions regarding personnel scheduling in warehouses.
Research is underway to apply computerized tomography (CT) imaging to hardwood log inspection in the forest products industry. For this purpose, an intelligentvision system is being created that is aimed at locating,...
详细信息
ISBN:
(纸本)0819407445
Research is underway to apply computerized tomography (CT) imaging to hardwood log inspection in the forest products industry. For this purpose, an intelligentvision system is being created that is aimed at locating, identifying, and quantifying the internal defects inside logs by analyzing their CT image data. This inspection system is designed to be wood species independent. It is composed of three components: a CT scanner-based data acquisition system;a low-level module for image segmentation;and a high-level module for defect recognition. Defect quantification is attained by computing the volume and orientation of each defect. This paper discusses the problems of segmenting CT image sequence and 3-D object detection by a rule-based expert system approach. Experimental results with real-world images of different hardwood log species are provided to show the usefulness, efficacy, and robustness of the proposed inspection system. This allows solutions to hardwood log inspection, as well as to problems in other nondestructive testing applications where image analysis plays an important role.
This paper presents neural network based lateral and longitudinal motion stereo methods. Lateral motion stereo infers depth information from a lateral motion. Existing lateral motion stereo algorithms use either a Kal...
详细信息
ISBN:
(纸本)0819407453
This paper presents neural network based lateral and longitudinal motion stereo methods. Lateral motion stereo infers depth information from a lateral motion. Existing lateral motion stereo algorithms use either a Kalman filter or recursive least square algorithm to update the disparity values. Due to the unmeasurable estimation error, the estimated disparity values at each recursion are unreliable, yielding a noisy disparity field. Instead of updating the disparity values, we recursively update the bias inputs of the network. The disparity field is then computed by using a neural network. Since the recursive algorithm implements the matching algorithm only once, and the bias input updating scheme can be accomplished in real time, a vision system employing such an algorithm is feasible. For the purpose of handling batch data, we have also designed a batch algorithm. The batch algorithm integrates information from all images by embedding them into the bias inputs of the network. Then a static matching procedure is used to compute the disparity values. Longitudinal motion stereo infers depth information from a forward or backward motion. Existing longitudinal stereo algorithms have some problems associated with the location of the focus of expansion (FOE), and with the camera and surface orientations. Instead, our approach allows the camera to move along its optical axis forward or backward, requires no information on the FOE, and makes no assumption about the object surface. The algorithm uses a Gabor correlation operator to extract image features and employs the neural network to compute the disparity field based on the Gabor features. It produces multiple dense disparity fields and recovers the depth map very efficiently.
A modification of the Hough transform has been devised and tested. It incorporates a post processing stage in which the voting edge points are tested according to perceptual criteria. The perceptual criteria are deriv...
详细信息
ISBN:
(纸本)0819407445
A modification of the Hough transform has been devised and tested. It incorporates a post processing stage in which the voting edge points are tested according to perceptual criteria. The perceptual criteria are derived from the Gestalt psychologists work in characterizing the human vision system and include similarity in intensity, similarity in color, good boundary continuity, etc. Edge points which fail on these criteria are eliminated before the final vote in the Hough transform is taken. The method allows weak, but perceptually significant, information to be retained, even in the presence of noise. The method has been applied to the detection of curved boundaries in images of the human colon.
A neural network pattern classifier is presented. Its decision boundaries are formed from segments of conic sections which allows it to achieve improved performance over piecewise linear neural network classifiers, su...
详细信息
Texture is generally recognized as being fundamental to perception. A taxonomy of problems encountered within the context of texture analysis could be that of classification/discrimination, description, and segmentati...
详细信息
ISBN:
(纸本)0819407453
Texture is generally recognized as being fundamental to perception. A taxonomy of problems encountered within the context of texture analysis could be that of classification/discrimination, description, and segmentation. In this paper we suggest a novel artificial neural network (ANN) architecture for features extraction and texture recognition. There is evidence which suggests that the analysis of stimulus by visual system might involve a set of quasi-independent mechanisms called channels which could be conveniently characterized in the spatial frequency domain. In our model we use an FT feature space with angular and radial bins that characterize spatial domain filters to extract features. The extracted features are then used as input for the recognition stage. In order to evaluate the 2-D FT coefficients we use the Radon transform. The usage of the Radon transform simplifies the ANN model significantly. We suggest an electronic implementation of the ANN model for feature extraction, using a Connected Network Adaptive ProcessorS (CNAPS) chip designed by Adaptive Solutions Inc. We also develop software to simulate the ANN model with the Radon transform. We use a three stage back-propagation network as a classifier. We have used ten different texture patterns to test our ANN model.
A new set of collision checking and obstacle avoidance algorithms has been developed and implemented in both hardware and software. The method allows for unlimited vector checks against an unlimited set of objects. De...
详细信息
ISBN:
(纸本)0819407445
A new set of collision checking and obstacle avoidance algorithms has been developed and implemented in both hardware and software. The method allows for unlimited vector checks against an unlimited set of objects. Dependent upon the application, the single card hardware performance ranges from 1 million line sorts per second to hundreds of millions. Therefore, due to the high algorithm speed, the overall system performance only becomes limited by the choice of processor and the speed of the interface. The hardware is presently configured to process large blocks of objects and data (8K) at a sorting rate of one point against eight objects at a rate of 200 million points per second. The trade-off between choice of algorithm and performance is discussed.
暂无评论