In this paper, a hierarchical disaster image classification (HDIC) framework based on multi-source data fusion (MSDF) and multiple correspondence analysis (MCA) is proposed to aid emergency managers in disaster respon...
详细信息
ISBN:
(纸本)9781457709661
In this paper, a hierarchical disaster image classification (HDIC) framework based on multi-source data fusion (MSDF) and multiple correspondence analysis (MCA) is proposed to aid emergency managers in disaster response situations. The HDIC framework classifies images into different disaster categories and sub-categories using a pre-defined semantic hierarchy. In order to effectively fuse different sources (visual and text) of information, a weighting scheme is presented to assign different weights to each data resource depending on the hierarchical structure. The experimental analysis demonstrates that the proposed approach can effectively classify disaster images at each logical layer. In addition, the paper also presents an iPad application developed for situation report management using the proposed HDIC framework.
This paper extends the neighborhood components analysis method (NCA) to learning a mixture of sparse distance metrics for classification and dimensionality reduction. We emphasize two important properties in the recen...
详细信息
ISBN:
(纸本)9781457711022
This paper extends the neighborhood components analysis method (NCA) to learning a mixture of sparse distance metrics for classification and dimensionality reduction. We emphasize two important properties in the recent learning literature, locality and sparsity, and (1) pursue a set of local distance metrics by maximizing a conditional likelihood of observed data;and (2) add l(1)-norm of eigen values of the distance metric to favor low rank matrices of fewer parameters. Experimental results on standard UCI machine learning datasets, face recognition datasets, and image categorization datasets demonstrate the feasibility of our approach for both distance metric learning and dimensionality reduction.
With the widely used of data mining and cluster analysis, cluster validation is attracting increasing attention. In this paper, the concept and development of cluster validation are introduced, then, based on the memb...
详细信息
ISBN:
(纸本)9783642239700
With the widely used of data mining and cluster analysis, cluster validation is attracting increasing attention. In this paper, the concept and development of cluster validation are introduced, then, based on the membership degree, a classification of cluster validity indexes is proposed: cluster validity indexes fit for crisp cluster, cluster validity indexes fit for fuzzy cluster. Based on this, combining with Cluster Validity analysis Platform (CVAP), describing the two most important usages of cluster validation: to find the optimal number of clusters and to find appropriate clustering algorithms to a particular data set. Experiments give visualization representation of clustering validation process.
Solar radiation is an important factor in forecasting outputs of photovoltaic power systems. A method for solar radiation prediction is proposed based on wavelet transform. The data, including solar radiation and pote...
详细信息
This paper describes the practical application of four visualization techniques that have been developed to deal with high-volume, large time-scaled, and high-dimensional data sets that are characteristic of Internet-...
详细信息
An increasing amount of publicly available geo-referenced data enables the identification of patterns of behavior, habits and movements of people. This paper presents results of a case study analysis based on data ser...
详细信息
We present Spine, an efficient algorithm for finding the "backbone" of an influence network. Given a social graph and a log of past propagations, we build an instance of the independent-cascade model that de...
详细信息
ISBN:
(纸本)9781450308137
We present Spine, an efficient algorithm for finding the "backbone" of an influence network. Given a social graph and a log of past propagations, we build an instance of the independent-cascade model that describes the propagations. We aim at reducing the complexity of that model, while preserving most of its accuracy in describing the data. We show that the problem is inapproximable and we present an optimal, dynamic-programming algorithm, whose search space, albeit exponential, is typically much smaller than that of the brute force, exhaustive-search approach. Seeking a practical, scalable approach to sparsification, we devise Spine, a greedy, efficient algorithm with practically little compromise in quality. We claim that sparsification is a fundamental datareduction operation with many applications, ranging from visualization to exploratory and descriptive dataanalysis. As a proof of concept, we use Spine on real-world datasets, revealing the backbone of their influence-propagation networks. Moreover, we apply Spine as a pre-processing step for the influence-maximization problem, showing that computations on sparsified models give up little accuracy, but yield significant improvements in terms of scalability. Copyright 2011 ACM.
We apply visualization techniques to user profiles and repository metadata from the GitHub source code hosting service. Our motivation is to identify patterns within this development community that might otherwise rem...
详细信息
The low-cost and minimum health risks associated with ultrasound (US) have made ultrasonic imaging a widely accepted method to perform diagnostic and image-guided procedures. Despite the existence of 3D ultrasound pro...
详细信息
ISBN:
(纸本)9780819485069
The low-cost and minimum health risks associated with ultrasound (US) have made ultrasonic imaging a widely accepted method to perform diagnostic and image-guided procedures. Despite the existence of 3D ultrasound probes, most analysis and diagnostic procedures are done by studying the B-mode images. Currently, multiple ultrasound probes include 6-DOF sensors that can provide positioning information. Such tracking information can be used to reconstruct a 3D volume from a set of 2D US images. Recent advances in ultrasound imaging have also shown that, directly from the streaming radio frequency (RF) data, it is possible to obtain additional information of the anatomical region under consideration including the elasticity properties. This paper presents a generic framework that takes advantage of current graphics hardware to create a low-latency system to visualize streaming US data while combining multiple tissue attributes into a single illustration. In particular, we introduce a framework that enables real-time reconstruction and interactive visualization of streaming data while enhancing the illustration with elasticity information. The visualization module uses two-dimensional transfer functions (2D TFs) to more effectively fuse and map B-mode and strain values into specific opacity and color values. On commodity hardware, our framework can simultaneously reconstruct, render, and provide user interaction at over 15 fps. Results with phantom and real-world medical datasets show the advantages and effectiveness of our technique with ultrasound data. In particular, our results show how two-dimensional transfer functions can be used to more effectively identify, analyze and visualize lesions in ultrasound images.
The detection of previously unknown, frequently occurring patterns in time series, often called motifs, has been recognized as an important task. However, it is difficult to discover and visualize these motifs as thei...
详细信息
ISBN:
(纸本)9780819484055
The detection of previously unknown, frequently occurring patterns in time series, often called motifs, has been recognized as an important task. However, it is difficult to discover and visualize these motifs as their numbers increase, especially in large multivariate time series. To find frequent motifs, we use several temporal data mining and event encoding techniques to cluster and convert a multivariate time series to a sequence of events. Then we quantify the efficiency of the discovered motifs by linking them with a performance metric. To visualize frequent patterns in a large time series with potentially hundreds of nested motifs on a single display, we introduce three novel visual analytics methods: (1) motif layout, using colored rectangles for visualizing the occurrences and hierarchical relationships of motifs in a multivariate time series, (2) motif distortion, for enlarging or shrinking motifs as appropriate for easy analysis and (3) motif merging, to combine a number of identical adjacent motif instances without cluttering the display. Analysts can interactively optimize the degree of distortion and merging to get the best possible view. A specific motif (e.g., the most efficient or least efficient motif) can be quickly detected from a large time series for further investigation. We have applied these methods to two real-world data sets: data center cooling and oil well production. The results provide important new insights into the recurring patterns.
暂无评论