In this paper, we combine image segmentation techniques and face detection methods to extract the human from scenes. Firstly, skin regions are detected and an ellipse fitting method is employed to detect the face regi...
详细信息
In this paper, we combine image segmentation techniques and face detection methods to extract the human from scenes. Firstly, skin regions are detected and an ellipse fitting method is employed to detect the face region and consequently locate the human position. Then we propose an improved automatic seeded region growing algorithm to segment the image. The initial seeds are generated automatically, and the remaining pixels are classified to the nearest region. After the region growing procedure, two neighboring regions with high similarity are merged. The human body is determined by confining semantic human body region in segmented regions, and those belonging to the human face and human body are merged afterward. Lastly, we will detect the human vertical y-coordinate values in the image, and the depths can then be estimated according to the depth look-up tables of the camera.
This paper presents the use of the Izhikevich and Hodgkin Huxley neuron models for image recognition. The former is more biologically accurate than the commonly used integrate and fire neuron model but has similar low...
详细信息
This paper presents the use of the Izhikevich and Hodgkin Huxley neuron models for image recognition. The former is more biologically accurate than the commonly used integrate and fire neuron model but has similar low computational requirements. Brain scale cortex models tend to use the more biological neuron models. The results of this work show that the Izhikevich model can be used for image recognition and would be a good candidate for a large scale visual cortex model. Neural networks based on these models are developed and applied to character recognition. They were able to identify 48 24times24 images and their noisy versions. The networks were accelerated using modern multicore processors and showed significant speedups. Such processors are likely to be used for developing high performance, large scale implementations of these image recognition networks.
This paper presents a local tone mapping method to render high dynamic range images on conventional displays. We adaptively stretch contrast in local regions to reproduce local contrast. In order to avoid halos, we us...
详细信息
This paper presents a local tone mapping method to render high dynamic range images on conventional displays. We adaptively stretch contrast in local regions to reproduce local contrast. In order to avoid halos, we use bilateral filtering to smooth the image prior to the contrast stretching operation. Our method is fast and easy to use, and the experiment results show that the technique can produce good results on a variety of high dynamic range images.
The following topics were dealt with: imageprocessing; biomedical applications; signalprocessing; control systems; sensors; fault detection; neural networks; computationalintelligence techniques; and real-time syst...
The following topics were dealt with: imageprocessing; biomedical applications; signalprocessing; control systems; sensors; fault detection; neural networks; computationalintelligence techniques; and real-time systems.
Similarity of images in content-based image retrieval (CBIR) is a subjective measure varying by the user, and requires tuning according to the user's preference. Another issue in CBIR is the need of partial image ...
详细信息
Similarity of images in content-based image retrieval (CBIR) is a subjective measure varying by the user, and requires tuning according to the user's preference. Another issue in CBIR is the need of partial image matching. Structural modeling of the images can be promising in finding a small query image within a large database image. In this work, a graph-based image modeling which assigns image regions to labeled nodes and their adjacency to weighted edges is used. Also, the image similarity measure is tuned according to the user's evaluation, by way of parameter selection using Particle Swarm Optimization (PSO)[1][2]. In the experiments, a small-scale CBIR system based on graph modeling of images was developed. Using the system, it was confirmed that images including the query image of different size and rotation angle could be successfully retrieved. Also, the user's preference in weighting the different aspects of similarity in the feedback information was found to be successfully incorporated in the retrieval after parameter optimization using PSO.
computational visual attention (CVA) model is one of the methods which focus on finding region of interesting (ROI) in an image or in a scene. Similarity attention is one important task in CVA. If there are many objec...
详细信息
computational visual attention (CVA) model is one of the methods which focus on finding region of interesting (ROI) in an image or in a scene. Similarity attention is one important task in CVA. If there are many objects in a scene, people will pick up the most abnormal one, which perhaps the similar one or dissimilar one, according to the composition objects of the scene. Capability of similarity attention enables human vision to promptly focus on similar or dissimilar regions in a scene. This paper implements this capability in the CVA model by attaching a high-level similarity comparison function to find ROI in the scene. The output of the model simulates the serial search mode and more approach to human visual behavior. Experimental results show that the function of similarity attention can be achieved successfully.
Cellular simultaneous recurrent networks (CSRN)s have been successfully exploited to solve the conventional maze traversing problem. In this work, for the first time, we investigate the use of CSRNs for image registra...
详细信息
Cellular simultaneous recurrent networks (CSRN)s have been successfully exploited to solve the conventional maze traversing problem. In this work, for the first time, we investigate the use of CSRNs for image registration under affine transformations. In our simulations, we consider binary images with in-plane rotations between plusmn20deg. First, we experiment with a readily available CSRN with generalized multilayer perceptrons (GMLP)s as the basic core. We identify performance criteria for such CSRNs in affine correction. We then propose a modified MLP architecture with multi-layered feedback as the core for a CSRN to improve binary image registration performance. Simulation results show that while both the GMLP network and our modified network are able to achieve localized image registration, our modified architecture is more effective in moving pixels for registration. Finally, we use sub-imageprocessing with our modified MLP architecture, to reduce training time and increase global registration accuracy. Overall, both CSRN architectures show promise for correctly registering a binary image.
This paper presents a target-surround feature attention (TSFA) model for constructing attention-based visual tracking algorithm. This model extracts attentive region by distinguishing the color contrast between the in...
详细信息
This paper presents a target-surround feature attention (TSFA) model for constructing attention-based visual tracking algorithm. This model extracts attentive region by distinguishing the color contrast between the interested target and its surround. A preference generator provides online feature transformation to update the target/surround biasing masks that describes the color composition associated with the target and its surround. Output of the TSFA model is a saliency map representing occurrence possibility of the target. Tracker based on the mean shift algorithm is used to lock and locate the target on the saliency map. Experimental results show that visual tacking algorithm with the TSFA model may adapt to noisy images under changing illumination.
The following topics are dealt with: coding theory; speech and audio signalprocessing; wireless personal communication; system on chip; SoC; direct of arrrival estimation; image enhancement and filtering; CDMA; TDMA;...
The following topics are dealt with: coding theory; speech and audio signalprocessing; wireless personal communication; system on chip; SoC; direct of arrrival estimation; image enhancement and filtering; CDMA; TDMA; OFDM; radar tracking; radar detection; image denoising; feature extraction and detection; information theory; spectral estimation; image segmentation; wireless ad hoc and sensor network; FPGA; embedded system; parameter estimation; channel estimation; image coding; analog circuits; digital filters; filter bank; video coding and transmission; MIMO communication; motion estimation; multimedia management; computationalintelligence; image fusion; communication protocols; face detection; PCA; ICA; SVD; optical-signalprocessing; radar signalprocessing.
Graph-based signalprocessing (GSP) is an emerging field that is based on representing a dataset using a discrete signal indexed by a graph. Inspired by the recent success of GSP in imageprocessing and signal filteri...
详细信息
Graph-based signalprocessing (GSP) is an emerging field that is based on representing a dataset using a discrete signal indexed by a graph. Inspired by the recent success of GSP in imageprocessing and signal filtering, in this paper, we demonstrate how GSP can be applied to non-intrusive appliance load monitoring (NALM) due to smoothness of appliance load signatures. NALM refers to disaggregating total energy consumption in the house down to individual appliances used. At low sampling rates, in the order of minutes, NALM is a difficult problem, due to significant random noise, unknown base load, many household appliances that have similar power signatures, and the fact that most domestic appliances (for example, microwave, toaster), have usual operation of just over a minute. In this paper, we proposed a different NALM approach to more traditional approaches, by representing the dataset of active power signatures using a graph signal. We develop a regularization on graph approach where by maximizing smoothness of the underlying graph signal, we are able to perform disaggregation. Simulation results using publicly available REDD dataset demonstrate potential of the GSP for energy disaggregation and competitive performance with respect to more complex Hidden Markov Model-based approaches.
暂无评论