Microarray time-course data relate to the recorded activity of thousands of genes, in parallel, over multiple discrete points in time during a biological process. Existing techniques that attempt to support the explor...
详细信息
Bioinformatics systems benefit from the use of data mining strategies to locate interesting and pertinent relationships within massive information. For example, data mining methods can ascertain and summarize the set ...
详细信息
ISBN:
(纸本)0780390482
Bioinformatics systems benefit from the use of data mining strategies to locate interesting and pertinent relationships within massive information. For example, data mining methods can ascertain and summarize the set of genes responding to a certain level of stress in an organism. Even a cursory glance through the literature in journals, reveals the persistent role or data mining in experimental biology. Integrating data mining within the context of experimental investigations is central to bioinformatics software. In this paper we describe the framework of Probabilistic Principal Surfaces, a latent variable model which offers a large variety of appealing visualization capabilities and which can be successfully integrated in the context of microarray analysis. A preprocessing phase consisting or a nonlinear PCA neural network which seems to be very useful to deal with noisy and time dependent nature of microarray data has been added to this framework.
data repositories around the world hold many thousands of datasets. A problem for remote dataset users is to browse the repositories and efficiently locate relevant datasts. In this note, we introduce the Iconic Remot...
详细信息
ISBN:
(纸本)081945642X
data repositories around the world hold many thousands of datasets. A problem for remote dataset users is to browse the repositories and efficiently locate relevant datasts. In this note, we introduce the Iconic Remote Visual data Exploration tool (IRVDX), which provides visual browsing for exploring the features of remote and distributed data without the necessity of downloading the entire dataset. IRVDX employs three kinds of visualizations: one provides a reduced representation of the datasets, which we call dataset Icons. These icons show the important statistical characteristics of datasets and help to identify relevant datasets from distributed repositories. Another one is called the Remote dataset Visual Browser that provides visualizations to browse remote data without downloading the complete dataset to identify its content. The final one provides visualizations to show the degree of similarity between two datasets and to visually determine whether a join of two remote datasets will be meaningful. In this paper, we describe the design and implementation of IRVDX in detail. We assess the benefits of our dataset Icons against the traditional text-based interfaces and show the usefulness of IRVDX by conducting experiments with datasets from the UCI KDD Archive.
Scientific visualization serves as a useful technique for rapid analysis of large data sets. The application of visualization techniques to physical simulations executed on high-performance computing hardware offers t...
详细信息
Our goal is to bring a data rich environment to the intelligence analyst's desktop setwithin a detailed 3D view of the real-worldcontext. This paper outlines our effortsleading up to this goal, our current work an...
详细信息
ISBN:
(纸本)0819457639
Our goal is to bring a data rich environment to the intelligence analyst's desktop set
within a detailed 3D view of the real-world
context. This paper outlines our efforts
leading up to this goal, our current work and our path forward.
In this paper, we study how a multidimensional local binary pattern (LBP) texture feature data can be visually explored and analyzed. The goal is to determine how true paper properties can be characterized with local ...
详细信息
ISBN:
(纸本)4901122045
In this paper, we study how a multidimensional local binary pattern (LBP) texture feature data can be visually explored and analyzed. The goal is to determine how true paper properties can be characterized with local texture features from visible light images. We utilize isometric feature mapping (Isomap) for the LBP texture feature data and perform non-linear dimensionality reduction for the data. These 2D projections are then visualized with original images to study data properties. visualization is utilized in the manner of selecting texture models for unlabeled data and analyzing feature performance when building a training set for a classifier. The approach is experimented on with simulated image data illustrating different paper properties and on-line transilluminated paper images taken from a running paper web in the paper mill. The simulated image set is used to acquire quantitative figures on the performance while the analysis of real-world data is an example of semi-supervised learning.
High-throughput experiments, such as gene expression microarrays in the life sciences, result in very large data sets. In response, a wide variety of visualization tools have been created to facilitate dataanalysis. ...
详细信息
High-throughput experiments, such as gene expression microarrays in the life sciences, result in very large data sets. In response, a wide variety of visualization tools have been created to facilitate dataanalysis. A primary purpose of these tools is to provide biologically relevant insight into the data. Typically, visualizations are evaluated in controlled studies that measure user performance on predetermined tasks or using heuristics and expert reviews. To evaluate and rank bioinformatics visualizations based on real-world dataanalysis scenarios, we developed a more relevant evaluation method that focuses on data insight. This paper presents several characteristics of insight that enabled us to recognize and quantify it in open-ended user tests. Using these characteristics, we evaluated five microarray visualization tools on the amount and types of insight they provide and the time it takes to acquire it. The results of the study guide biologists in selecting a visualization tool based on the type of their microarray data, visualization designers on the key role of user interaction techniques, and evaluators on a new approach for evaluating the effectiveness of visualizations for providing insight. Though we used the method to analyze bioinformatics visualizations, it can be applied to other domains.
We present a haptically enabled immersive environment for the segmentation and visualization of volumetric data. The environment uses a client/server design and allows multiple, remotely separated users to work in a s...
详细信息
ISBN:
(纸本)0769523102
We present a haptically enabled immersive environment for the segmentation and visualization of volumetric data. The environment uses a client/server design and allows multiple, remotely separated users to work in a shared visual space. The environment incorporates several distinct segmentation methods that are based on the local application of computationally expensive algorithms and provides the user with continuous fine-grained control. The environment uses haptic feedback to both provide a sense of touch for segmented structures and to provide additional user control over the segmentation algorithms.
With our increasing ability to capture or produce and to store large multivariate data, these data sets are increasing both in size and complexity. Many conventional techniques for visualizing multivariate data suffer...
详细信息
ISBN:
(纸本)076952396X
With our increasing ability to capture or produce and to store large multivariate data, these data sets are increasing both in size and complexity. Many conventional techniques for visualizing multivariate data suffer from problems like cluttered displays since they are not designed to handle these amounts of entries. We present a novel method to overcome this problem by interactively selecting and displaying statistics derived from the data in a separate view. Changes in the display are visually tracked by animation and vector plotting for easy comparison of various measures applied to different subsets of the data.
The study reports an experiment intended to identify parameters that affect the detection of cause and effect relations in graphically displayed data in a visual data mining environment. Accuracy of performance was me...
详细信息
ISBN:
(纸本)081945642X
The study reports an experiment intended to identify parameters that affect the detection of cause and effect relations in graphically displayed data in a visual data mining environment. Accuracy of performance was measured as a function of visual properties of the cause function and information processing styles. People with different styles employ different task-solving-strategies, expressed by tool usage, and by effects of different visual properties of the displayed data. Participants with high analytic cognitive styles were better able to detect cause and effect relations through investigations of visual and more global properties of the displayed data. Visual properties of the data affected users with high analytic and low experiential cognitive styles similarly and had no direct effect on accuracy. The study points to the need for further research to gain a deeper understanding of the effect of user characteristics, display properties and data structure in a visual data mining environment that is based on intensive interaction of the user with complex graphical displays.
暂无评论