The hyperspectral remote sensing is one of the frontier techniques in the remote sensing research fields. Applying the sparse coding model to the hyperspectral remote sensing imageprocessing is a hot topic in hypersp...
详细信息
ISBN:
(纸本)9781467372206
The hyperspectral remote sensing is one of the frontier techniques in the remote sensing research fields. Applying the sparse coding model to the hyperspectral remote sensing imageprocessing is a hot topic in hyperspectral information processing. To improve the accuracy of hyperspectral image classification, we propose a classification method based on the spatial-spectral joint contextual sparse coding. Firstly, a dictionary is obtained by training using samples selected from the ground-truth reference data. Then, the sparse coefficients of each pixel are calculated based on the learned dictionary. Afterward, the sparse coefficients are input to the classifier and the final classification result is obtained. The visible and near-infrared hyperspectral remote sensing image collected by Tiangong-1 in Chaoyang District of Beijing is used to evaluate the performance of the proposed approach. Experimental results show that the proposed method yields the best classification performance with the overall accuracy of 95.74% and the Kappa coefficient of 0.9476 in comparison with other classification methods.
Benefit from tremendous growth of user-generated content, social annotated tags get higher importance in organization and retrieval of large scale image database on Online Sharing Websites (OSW). To obtain high-qualit...
详细信息
ISBN:
(纸本)9781450332743
Benefit from tremendous growth of user-generated content, social annotated tags get higher importance in organization and retrieval of large scale image database on Online Sharing Websites (OSW). To obtain high-quality tags from existing community contributed tags with missing information and noise, tag-based annotation or recommendation methods have been proposed for performance promotion of tag prediction. While images from OSW contain rich social attributes, existing studies only utilize the relations between visual content and tags to construct global information completion models. In this paper, beyond the image-tag relation, we take full advantage of the ubiquitous GPS locations and image-user relationship, to enhance the accuracy of tag prediction and improve the computational efficiency. For GPS locations, we define the popular geo-locations where people tend to take more images as Points of Interests (POI), which are discovered by mean shift approach. For image user relationship, we integrate a localized prior constraint, expecting the completed tag sub-matrix in each POI to maintain consistency with users' tagging behaviors. Based on these two key issues, we propose a unified tag matrix completion framework which learns the image-tag relation within each POI. To solve the proposed model, an efficient proximal sub-gradient descent algorithm is designed. The model optimization can be easily parallelized and distributed to learn the tag sub-matrix for each POI. Extensive experimental results reveal that the learned tag sub-matrix of each POI reflects the major trend of users' tagging results with respect to different POIs and users, and the parallel learning process provides strong support for processing large scale online image database.
The proceedings contain 34 papers. The special focus in this conference is on imageprocessing and Communications. The topics include: Classifier selection uses decision profiles in binary classification task;2DHMM-ba...
ISBN:
(纸本)9783319238135
The proceedings contain 34 papers. The special focus in this conference is on imageprocessing and Communications. The topics include: Classifier selection uses decision profiles in binary classification task;2DHMM-based face recognition method;corner detection based on directional gradients;sensor fusion enhancement for mobile positioning systems;thermal face recognition;feature reduction using similarity measure in object detector learning with haar-like features;assessment of the brain coverage ratio in the postoperative craniosynostosis based on 3D CT scans;combined imaging system for taking facial portraits in visible and thermal spectra;the PUT surveillance database;automatic analysis of vehicle trajectory applied to visual surveillance;algorithmically optimized AVC video encoder with parallelprocessing of data;neural video compression based on SURF scene change detection algorithm;cell detection in corneal endothelial images using directional filters;the method of probabilistic nodes combination in 2D information retrieval, pattern recognition and biometric modeling;3-D reconstruction of real objects using an android device;methods of natural image preprocessing supporting the automatic text recognition using the OCR algorithms;fast machine vision line detection for mobile robot navigation in dark environments;adjustment of viterbi algorithm for line following robots;studentized range for spatio-temporal track-before-detect algorithm;real-time US image enhancement by forward-backward diffusion using GPU;face-based distributed visitor identification system;color space optimization for lacunarity method in analysis of papanicolaou smears and toward texture-based 3D level set image segmentation.
3D modeling of cultural monuments is very crucial issue for preparing restoration projects. However, it has challenges such as data acquisition, preparation and processing. 3D modeling of objects can be time consuming...
详细信息
3D modeling of cultural monuments is very crucial issue for preparing restoration projects. However, it has challenges such as data acquisition, preparation and processing. 3D modeling of objects can be time consuming and may include some difficulties due to the complexity of the structures. 3D terrestrial laser (TLS) scanning technique is one of the reliable and advantageous methods for 3D reconstruction of monuments. This technique is commonly acknowledged due to its accuracy, speed and flexibility. But the suitability and capability of this technique depends on proper usage, and good survey planning. Magnificent developments in high-resolution digital sensor technologies leaded to manufacturing of new camera systems. parallel to these innovations, development of computer systems and imageprocessing techniques made enable to obtain multiple image-based 3D object models. In the presented study, TLS method has been compared to conventional photogrammetric and image-based dense matching methods. Automatic dense point creation has been realized by our developed algorithm and PIXEL-PHOTO software which generates 3D point clouds from stereo images. The reliability and encountered problems during point cloud measurement process have been discussed. The study area has been chosen as historical Byzantine Land Walls of Istanbul, which constitute a remarkable area defining the ancient city's historical peninsula.
An efficient implementation are necessary, as most medical imaging methods are computational expensive, and the amount of medical imaging data is growing. Graphic processing units (GPUs) can solve large data parallel ...
详细信息
An efficient implementation are necessary, as most medical imaging methods are computational expensive, and the amount of medical imaging data is growing. Graphic processing units (GPUs) can solve large data parallel problems at a higher speed than the traditional CPU, while being more affordable and energy efficient than distributed systems. This review investigates the use of GPUs to accelerate medical imaging methods. A set of criteria for efficient use of GPUs are defined. The review concludes that most medical imageprocessingmethods may benefit from GPU processing due to the methods' data parallel structure and high thread count. However, factors such as synchronization, branch divergence and memory usage can limit the speedup.
In recognition that inmodern applications billions of images are stored into distributed databases in different logical or physical locations, we propose a similarity search strategy over the cloud based on the dimens...
详细信息
In recognition that inmodern applications billions of images are stored into distributed databases in different logical or physical locations, we propose a similarity search strategy over the cloud based on the dimensions value cardinalities of image descriptors. Our strategy has low preprocessing requirements by dividing the computational cost of the preprocessing steps into several nodes over the cloud and locating the descriptors with similar dimensions value cardinalities logically close. New images are inserted into the distributed databases over the cloud efficiently, by supporting dynamical update in real-time. The proposed insertion algorithm has low computational complexity, depending exclusively on the dimensionality of descriptors and a small subset of descriptors with similar dimensions value cardinalities. Finally, an efficient query processing algorithm is proposed, where the dimensions of image descriptors are prioritized in the searching strategy, assuming that dimensions of high value cardinalities have more discriminative power than the dimensions of low ones. The computation effort of the query processing algorithm is divided into several nodes over the cloud infrastructure. In our experiments with seven publicly available datasets of image descriptors, we show that the proposed similarity search strategy outperforms competitive methods of single node, parallel and cloud-based architectures, in terms of preprocessing cost, search time and accuracy.
Testing of imageprocessing applications is a challenging job especially, when evaluating the correctness of output image. Generally, output images are evaluated manually by visual inspection carried out by an expert ...
详细信息
Testing of imageprocessing applications is a challenging job especially, when evaluating the correctness of output image. Generally, output images are evaluated manually by visual inspection carried out by an expert tester, which is the main hindrance in automation of testing process. Recently, statistical and metamorphic testing approaches are presented to automate output evaluation of imageprocessing applications. The statistical method is dependent on availability of statistical distribution of output images, whereas metamorphic testing require more research efforts to make it widely used in practice. Metamorphic testing is a well-known technique to alleviate the test oracle problem and eliminates the required manual efforts by using relations of input and output images. Follow-up test cases are generated based on these relations and their expected output is evaluated. This paper addresses test oracle problem for imageprocessing applications and demonstrates how properties of implementation under test can be adopted as metamorphic relations. We have studied general and specific metamorphic relations of morphological image operations such as dilation and erosion. Selection of metamorphic relations and their effectiveness by mutation analysis is demonstrated. The results show that metamorphic testing is useful for evaluation of output images in the absence of a perfect test oracle.
The two-volume set LNAI 9119 and LNAI 9120 constitutes the refereed proceedings of the 14th International Conference on Artificial Intelligence and Soft Computing, ICAISC 2015, held in Zakopane, Poland in June 2015. T...
ISBN:
(数字)9783319193687;9783319193694
ISBN:
(纸本)9783319193687;9783319193694
The two-volume set LNAI 9119 and LNAI 9120 constitutes the refereed proceedings of the 14th International Conference on Artificial Intelligence and Soft Computing, ICAISC 2015, held in Zakopane, Poland in June 2015. The 142 revised full papers presented in the volumes, were carefully reviewed and selected from 322 submissions. These proceedings present both traditional artificial intelligence methods and soft computing techniques. The goal is to bring together scientists representing both areas of research. The first volume covers topics as follows neural networks and their applications, fuzzy systems and their applications, evolutionary algorithms and their applications, classification and estimation, computer vision, image and speech analysis and the workshop: large-scale visual recognition and machine learning. The second volume has the focus on the following subjects: data mining, bioinformatics, biometrics and medical applications, concurrent and parallelprocessing, agent systems, robotics and control, artificial intelligence in modeling and simulation and various problems of artificial intelligence.
暂无评论