It is necessary to integrate and manage data in the Cloud for providing in-depth data services. This paper introduces an intermediate view that provides a highly interactive environment to integrate and manage data di...
详细信息
Stencils are finite-difference algorithms for solving large-scale and high-dimension partial differential equations. Due to the data dependences among the iterative statements in Stencils, traditional Stencil computat...
详细信息
Stencils are finite-difference algorithms for solving large-scale and high-dimension partial differential equations. Due to the data dependences among the iterative statements in Stencils, traditional Stencil computations are be executed serially, rather than in parallel. It's challenging to design an effective and scalable Stencil parallelized method. To address the issue of 3D data space computing, we present a serial execution model based on multi-layers symmetric Stencil method and time skewing techniques. Within this model, the iteration space is divided to multiple tiles based on time skewing, where the executive process is ordered by the sequence of tiles, and the nodes in each individual tile can be swept repeatedly to improve the data locality. In addition, we propose a novel 3D iterative space alternate tiling Stencil parallel method, which subdivides the iteration space along high dimension, and changes the execution sequence of tiles to reduce the data dependency and communication cost, where the partial order of tiles is still guaranteed. Experimental results demonstrate our proposed alternative tiling parallel method achieves better parallel efficiency and scalability compared with the domain-decomposition methods.
How to determine the truthfulness of a piece of information becomes an increasingly urgent need for users. In this paper, we propose a method called MFSV, to determine the truthfulness of fact statements. We first cal...
详细信息
Untruthful information spreads on the web, which may mislead users or have a negative impact on user experience. In this paper, we propose a method called Multi-verifier to determine the truthfulness of a fact stateme...
详细信息
Sensor fusion is the combining of sensory data from disparate sources such that the resulting information is in some sense better than would be possible when these sources were used individually. The natural uncertain...
详细信息
Accurate classification of gene expression data offers great value in understanding the mechanism of tumor and effective clinical treatment. However, in real-world application, people often face a large number of unla...
详细信息
Accurate classification of gene expression data offers great value in understanding the mechanism of tumor and effective clinical treatment. However, in real-world application, people often face a large number of unlabeled samples and meager labeled ones, so semi-supervised learning is applied in cancer classification. In this paper, a Local Reconstruction and Global Preserving Based Semi-Supervised Dimensionality(LRGPSSDR) Method was proposed for cancer classification. LRGPSSDR makes full use of side information, which can set the edge weights of neighborhood graph through minimizing the local reconstruction error and can preserve the global geometric structure of the sampled data set as well as preserving its local geometric structure. Experimental results on five public gene expression datasets show the superior performance of the method.
Although there have been many efforts for management of uncertain data, evaluating probabilistic inference queries, a known NP-hard problem, is still a big challenge, especially for querying data with highly correlati...
详细信息
Wireless Sensor Networks (WSNs) can be viewed as a new type of distributed databases. data management technology is one of the core technologies of WSNs. In this demo we show a Query Processing system based on TinyOS ...
详细信息
A text mining algorithm named HMM-TFM (Hidden Markov Model based transcription factor name mining) is presented. The proposed algorithm does not need a dictionary of transcription factor names. A small verb set is def...
详细信息
Gene set-based microarray analysis allows researchers to better analyze the gene expression data for studying complex diseases like cancer. By transforming gene expression data into another form using gene set informa...
详细信息
Gene set-based microarray analysis allows researchers to better analyze the gene expression data for studying complex diseases like cancer. By transforming gene expression data into another form using gene set information, the biomarkers will have higher discriminative power and should result in more accurate disease classification. This work compares two techniques for applying our previously developed NCFS-i-based method to deal with unlabeled data, i.e. to make predictive diagnosis. Seven cancer datasets that include 4 breast cancer and 3 lung cancer datasets were used in this study. The results show that inferring gene set activity using curated phenotype-correlated genes (PCOGs) sets of training data is a more robust method for applying NCFS-i- based method to work with unlabeled data, providing biologically relevant gene sets.
暂无评论