In order to improve the quality of fusion image integrated by infrared and visible images, so that it is more suitable for human and computervision or processing. A two-step fusion method, that is region structure si...
详细信息
ISBN:
(纸本)9781509023776
In order to improve the quality of fusion image integrated by infrared and visible images, so that it is more suitable for human and computervision or processing. A two-step fusion method, that is region structure similarity and Contourlet transform, is presented. With the blocking idea, source images are fused by using region structure similarity, region energy ratio and region clarity ratio to obtain the first fusion image. Then the first fusion image and two source images are decomposed by taking Contourlet transform, and low frequency components use region variance weighting rule, high frequency components adopt multi-scale analysis. Experimental results prove that our proposed method can improve subjective visual effect of the image. Meanwhile objective evaluation indexes entropy, standard deviation, average gradient and mutual information are increased.
Despite the fact that different objects possess distinct class specific features, they also usually share common patterns. Inspired by this observation, we propose a novel method to explicitly and simultaneously learn...
详细信息
ISBN:
(纸本)9781467399616
Despite the fact that different objects possess distinct class specific features, they also usually share common patterns. Inspired by this observation, we propose a novel method to explicitly and simultaneously learn a set of common patterns as well as class-specific features for classification. Our dictionary learning framework is hence characterized by both a shared dictionary and particular (class-specific) dictionaries. For the shared dictionary, we enforce a low rank constraint, i.e. claim that its spanning subspace should have low dimension and the coefficients corresponding to this dictionary should be similar. For the particular dictionaries, we impose on them the well-known constraints stated in the Fisher discrimination dictionary learning (FDDL). Further, we propose a new fast and accurate algorithm to solve the sparse coding problems in the learning step, accelerating its convergence. The said algorithm could also be applied to FDDL and its extensions. Experimental results on widely used image databases establish the advantages of our method over state-of-the-art dictionary learning methods.
Greening has been promoted to improve the living conditions in urban environments. Quantification of greenery is an important issue to identify the criteria for stakeholders in the process of greening. This research f...
详细信息
ISBN:
(纸本)9789881902672
Greening has been promoted to improve the living conditions in urban environments. Quantification of greenery is an important issue to identify the criteria for stakeholders in the process of greening. This research focuses on the quantification of visible greenery ratio which is defined as the amount of greenery in the field of vision. Some measurement methods of visible greenery ratio have been already proposed. However, the quantification process is usually time consuming and prone to human errors due to manual operations by using an imageprocessing software. Therefore, in this research, the authors developed an automated measurement system based on imageprocessing technology for the efficient visible greenery ratio measurement. In the verification experiment, the proposed method achieved similar results for extracted pixels of green areas as the traditional manual method, with decreased calculation time. Furthermore, in addition to measuring the current ratio of greenery, this system can visualize possible future changes in visible greenery by adding planting (landscape) design models in an Augmented Reality (AR) environment. Using the proposed method, an ideal greening environment can be designed and evaluated by end-users, more intuitively. The developed design system is expected to eventually result in increasing the amount of greenery in the urban environment.
Entertainment games provide engaging activities and it would appear that far from waning, interest in games for leisure is still growing. The availability of new consoles, platforms and technologies for the delivery o...
详细信息
ISBN:
(纸本)9789898533524
Entertainment games provide engaging activities and it would appear that far from waning, interest in games for leisure is still growing. The availability of new consoles, platforms and technologies for the delivery of games is an important factor in this continued growth. The aim of this study is to determine characteristic and habits of computer game players in Turkey. In line with this purpose following processes were followed and reported. Study group of the research is chosen by random sampling method through online questionary techniques from 69 women and 252 men, 321 people in total. Two different data collection tools were used in gathering research data. As a result of the study, most of the computer game players choses target shooting games and sportive, action, and fantasy games follow to target shooting games as popularity and more than half of players (%56) prefer to play multiplayer games than single player games. It is reported that players spend approximately 10,1 hours playing computer game per week.
Probabilistic inference is a versatile tool to solve a large variety of pixel-labeling problems in computervision such as stereo matching and image denoising. Belief Propagation (BP) is an effective method for such i...
详细信息
ISBN:
(纸本)9781479999880
Probabilistic inference is a versatile tool to solve a large variety of pixel-labeling problems in computervision such as stereo matching and image denoising. Belief Propagation (BP) is an effective method for such inference tasks, and has also shown attractive error-resilience properties-the ability to converge to usable solutions in the presence of low-level hardware errors. This is of increasing interest, as the looming end of Moore's Law scaling brings with it a vast increase in the statistical variability of nanoscale circuit fabrics. In this work we seek to understand why certain combinations of BP and error-resilience mechanisms work so well in practice. We focus on Algorithmic Noise Tolerance (ANT) techniques for the resilience mechanisms, and Max-Product BP for inference. We analyze the error characteristics of BP in this hardware context, derive novel asymptotic error bounds, and provide theoretical reasoning to explain why ANT works well in this BP context. Experimental results from detailed resilient-BP simulations for various stereo matching tasks offer empirical support for this analysis.
Often the filters learned by Convolutional Neural Networks (CNNs) from different image datasets appear similar. This similarity of filters is often exploited for the purposes of transfer learning. This is also being u...
详细信息
ISBN:
(纸本)9781467399616
Often the filters learned by Convolutional Neural Networks (CNNs) from different image datasets appear similar. This similarity of filters is often exploited for the purposes of transfer learning. This is also being used as an initialization technique for different tasks in the same dataset or for the same task in similar datasets. Off-the-shelf CNN features have capitalized on this idea to promote their networks as best transferable and most general and are used in a cavalier manner in day-to-day computervision tasks. While the filters learned by these CNNs are related to the atomic structures of the images from which they are learnt, all datasets learn similar looking low-level filters. With the understanding that a dataset that contains many such atomic structures learn general filters and are therefore useful to initialize other networks with, we propose a way to analyse and quantify generality. We applied this metric on several popular character recognition, natural image and a medical image dataset, and arrive at some interesting conclusions. On further experimentation we also discovered that particular classes in a dataset themselves are more general than others.
— Watermarking is widely used to protect information authenticity and integrity. In this work we propose to add invisible watermark into one channel of color RGB images in the frequency domain. Using Discrete Cosine ...
详细信息
Current research in computervision and machine learning has demonstrated some great abilities at detecting and recognizing objects in natural images. The promising results in these areas have inspired research toward...
详细信息
ISBN:
(纸本)9781467399616
Current research in computervision and machine learning has demonstrated some great abilities at detecting and recognizing objects in natural images. The promising results in these areas have inspired research towards solving more complex multi-modal learning problems in the image/video domains such as automatic annotation, segmentation, labelling, and generic understanding. Although solutions have been provided for one or more of these problems, their approaches have been application specific. This paper introduces an end-to-end trainable Fusion-based Recurrent Multi-Modal (FRMM) model to address multi-modal applications. FRMM allows each input modality to be independent in terms of architecture, parameters and length of input sequences. FRMM image description models seamlessly blend convolutional neural net work feature descriptors with sequential language data in a recurrent framework. For training and testing we used the Flickr30K and MSCOCO datasets, demonstrating state-of-the-art description results.
Surface printing defects inspection based on machine vision can effectively improve producing speed and quality. However, captured image rotate variance led to high false detection rate and low speed. Considering limi...
详细信息
In computervision, optical camera is often used as the eyes of computer. If we replace camera with synthetic aperture radar (SAR), we will then enter a microwave vision of the world. This paper gives a comparison of ...
详细信息
ISBN:
(纸本)9781510601543
In computervision, optical camera is often used as the eyes of computer. If we replace camera with synthetic aperture radar (SAR), we will then enter a microwave vision of the world. This paper gives a comparison of SAR imaging and camera imaging from the viewpoint of epipolar geometry. The imaging model and epipolar geometry of the two sensors are analyzed in detail. Their difference is illustrated, and their unification is particularly demonstrated. We hope these may benefit researchers in field of computervision or SAR imageprocessing to construct a computer SAR vision, which is dedicated to compensate and improve human vision by electromagnetically perceiving and understanding the images.
暂无评论