Search results - Inner Mongolia University Library

Author: Wu, Changjun, Washington State University

Degree level: Ph.D.

Developing high performance computing solutions for modern day biological problems presents a unique set of challenges. The field is experiencing a data revolution due to a rapid introduction of several disruptive experimental technologies. Consequently, computational methods that analyze biological data are currently being put to the test in their capability to scale to massive data sizes. Added to this data-intensiveness is a brand of computation that is quite different in flavor from that in other, perhaps more traditional, scientific computing fields. The problems are dominated by integer arithmetic, string matching, combinatorial space exploration, and graph-theoretic formulations that introduce irregularity in computation and communication patterns. In this thesis, we report on our efforts to bridge the gap between biological data processing and high performance computing solutions. Specifically, we focus on the problem of clustering very large collections of protein sequences on distributed memory supercomputers. Given a set of amino acid sequences, we reduce the problem to one of constructing a sequence homology graph and subsequently detecting arbitrarily-sized dense subgraphs. Our approach efficiently parallelizes this task on a distributed memory machine through a combination of divide-and-conquer and combinatorial pattern matching heuristic techniques. Preliminary tests on an arbitrary collection of 2 million protein sequences from the Global Ocean Sampling project database reveal that our new approach is able to improve sensitivity and recruit more sequences, while considerably reducing the time to solution and memory requirement. The algorithmic techniques developed as part of this research have wider applicability to other applications in computational biology wherever the need for conducting large-scale sequence analysis is the primary bottleneck.

Keywords: Computer Science; Bioinformatics; Computational Biology; Graph Algorithms; Graph Construction; High Performance Computing; Sequence Clustering


Polaritytrust: Measuring trust and reputation in social networks


4th International Conference on Internet Technologies and Applications, ITA 11

Authors: Ortega, F. Javier; Troyano, José A.; Cruz, Fermín L.; De Salamanca, Fernando Enríquez. Department of Computer Languages and Systems, University of Seville, Spain

ISBN: (print) 9780946881680

In this work we tackle the problem of determining the trustworthiness of the users in a social network. Our approach introduces the novelty of taking into account the negative opinions in a social network to obtain the ranking of trust according to the opinions of all the users in the network. We briefly discuss some common attacks that malicious users can perform against a system in order to gain good reputation in the network. The experiments are performed with synthetic graphs, randomly generated to model real social networks according to some common features, and to simulate the attacks previously mentioned. The results show that our approach can deal with these threats, demoting malicious users and minimizing their effects in the final ranking of trust.

Keywords: Graph algorithms; Social networks; Trust and reputation


Bicolored independent sets and bicliques


10th Cologne-Twente Workshop on Graphs and Combinatorial Optimization, CTW 2011

Authors: Couturier, Jean-François; Kratsch, Dieter. Laboratoire D'Informatique Théorique et Appliquée, Université Paul Verlaine, 57045 Metz Cedex 01, France


A topological sorting algorithm for large graphs

ACM Journal of Experimental Algorithmics, 2012, Vol. 17, pp. 3.1–3.21

Authors: Deepak Ajwani; Adan Cosgaya-Lozano; Norbert Zeh. University College Cork, Ireland; Dalhousie University, Canada

We present an I/O-efficient algorithm for topologically sorting directed acyclic graphs, called IterTS. In the worst case, our algorithm is extremely inefficient and performs O(n · sort(m)) I/Os. However, our experiments show that IterTS achieves good performance in practice. To evaluate IterTS, we compared its running time to those of three competitors: PeelTS, an I/O-efficient implementation of the standard strategy of iteratively removing sources and sinks; ReachTS, an I/O-efficient implementation of a recent parallel divide-and-conquer algorithm based on reachability queries; and SeTS, a standard DFS-based topological sorting built on top of a semiexternal DFS algorithm. In our evaluation on various types of input graphs, IterTS consistently outperformed PeelTS and ReachTS by at least an order of magnitude in most cases. SeTS outperformed IterTS on most graphs whose vertex sets fit in memory. However, IterTS often came close to the running time of SeTS on these inputs and, more importantly, SeTS was not able to process graphs whose vertex sets were beyond the size of main memory, while IterTS was able to process such inputs efficiently.

Keywords: External-memory algorithms; Graph algorithms
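The "standard strategy of iteratively removing sources" that the abstract above attributes to PeelTS can be illustrated with a small in-memory sketch. This is not the I/O-efficient algorithm from the paper, and it removes sources only, not sinks. It is the textbook source-removal procedure (often called Kahn's algorithm), and the function name `topological_sort` and the edge-list input format are assumptions made for this illustration.

```python
from collections import deque


def topological_sort(num_vertices, edges):
    """Return one topological order of vertices 0..num_vertices-1.

    In-memory illustration only (hypothetical helper, not IterTS or PeelTS).
    `edges` is an iterable of directed pairs (u, v) meaning u must precede v.
    Raises ValueError if the graph contains a directed cycle.
    """
    successors = [[] for _ in range(num_vertices)]
    in_degree = [0] * num_vertices
    for u, v in edges:
        successors[u].append(v)
        in_degree[v] += 1

    # Sources are the vertices with no incoming edges.
    ready = deque(v for v in range(num_vertices) if in_degree[v] == 0)
    order = []
    while ready:
        u = ready.popleft()
        order.append(u)
        # Removing u may turn some of its successors into new sources.
        for v in successors[u]:
            in_degree[v] -= 1
            if in_degree[v] == 0:
                ready.append(v)

    if len(order) != num_vertices:
        raise ValueError("graph contains a cycle")
    return order
```

Every vertex and edge is touched a constant number of times, so the sketch runs in O(n + m) time when the whole graph fits in memory. The case the paper targets is the one where it does not fit.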


Parallel chip-firing on the complete graph: Devil's staircase and Poincare rotation number

ERGODIC THEORY AND DYNAMICAL SYSTEMS, 2011, Vol. 31, No. 3, pp. 891-910

Author: Levine, Lionel. MIT, Dept Math, Cambridge, MA 02139, USA

We study how parallel chip-firing on the complete graph K_n changes behavior as we vary the total number of chips. Surprisingly, the activity of the system, defined as the average number of firings per time step, does not increase smoothly in the number of chips; instead it remains constant over long intervals, punctuated by sudden jumps. In the large n limit we find a 'devil's staircase' dependence of activity on the number of chips. The proof proceeds by reducing the chip-firing dynamics to iteration of a self-map of the circle S^1, in such a way that the activity of the chip-firing state equals the Poincaré rotation number of the circle map. The stairs of the devil's staircase correspond to periodic chip-firing states of small period.

Keywords: Graph algorithms; Mathematics
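The activity described in the abstract above can be explored numerically with a short simulation. The sketch below assumes the usual chip-firing rule on a simple graph: a vertex of K_n fires when it holds at least n - 1 chips (its degree) and sends one chip to each of the other vertices, with all such vertices firing simultaneously. The paper's exact convention may differ (for example in how the threshold is defined), and the function names `parallel_step` and `activity` are hypothetical.

```python
def parallel_step(chips):
    """Apply one parallel chip-firing update on the complete graph K_n.

    `chips` is a list giving the number of chips at each of the n vertices.
    Assumed rule: a vertex fires if it holds at least n - 1 chips, sending
    one chip to every other vertex. Returns (new_chips, number_of_firings).
    """
    n = len(chips)
    threshold = n - 1
    firing = [c >= threshold for c in chips]
    num_firing = sum(firing)

    new_chips = []
    for c, fires in zip(chips, firing):
        if fires:
            # Loses n - 1 chips, gains one from each *other* firing vertex.
            new_chips.append(c - (n - 1) + (num_firing - 1))
        else:
            # Gains one chip from every firing vertex.
            new_chips.append(c + num_firing)
    return new_chips, num_firing


def activity(chips, steps):
    """Average number of firings per time step over `steps` updates."""
    state = list(chips)
    total_firings = 0
    for _ in range(steps):
        state, fired = parallel_step(state)
        total_firings += fired
    return total_firings / steps
```

Running `activity` over many initial configurations with an increasing total number of chips is one way to look for the plateaus and jumps the abstract describes. Dividing the result by n gives a per-vertex rate.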


A Consensus Tree Approach for Reconstructing Human Evolutionary History and Detecting Population Substructure

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, Vol. 8, No. 4, pp. 918-928

Authors: Tsai, Ming-Chi; Blelloch, Guy; Ravi, R.; Schwartz, Russell. Joint Carnegie Mellon Univ Univ Pittsburgh PhD Pr, Pittsburgh, PA 15213, USA; Lane Ctr Computat Biol, Pittsburgh, PA 15213, USA; Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213, USA; Carnegie Mellon Univ, Tepper Sch Business, Pittsburgh, PA 15213, USA; Carnegie Mellon Univ, Dept Biol Sci, Pittsburgh, PA 15213, USA

The random accumulation of variations in the human genome over time implicitly encodes a history of how human populations have arisen, dispersed, and intermixed since we emerged as a species. Reconstructing that history is a challenging computational and statistical problem but has important applications both to basic research and to the discovery of genotype-phenotype correlations. We present a novel approach to inferring human evolutionary history from genetic variation data. We use the idea of consensus trees, a technique generally used to reconcile species trees from divergent gene trees, adapting it to the problem of finding robust relationships within a set of intraspecies phylogenies derived from local regions of the genome. Validation on both simulated and real data shows the method to be effective in recapitulating known true structure of the data closely matching our best current understanding of human evolutionary history. Additional comparison with results of leading methods for the problem of population substructure assignment verifies that our method provides comparable accuracy in identifying meaningful population subgroups in addition to inferring relationships among them. The consensus tree approach thus provides a promising new model for the robust inference of substructure and ancestry from large-scale genetic variation data.

Keywords: Biology and genetics; Trees; Information theory; Graph algorithms


Computing All Pairs Shortest Paths on Sparse Graphs with Articulation Points

Computer Technology and Application, 2011, Vol. 2, No. 11, pp. 866-883

Authors: Carlos Roberto Arias; Von-Wun Soo. Institute of Information Systems and Applications, National TsingHua University, Hsinchu, Taiwan; Facultad de Ingenierias, Universidad Tecnológica Centroamericana, Tegucigalpa, Honduras; Computer Science Department, National TsingHua University, Hsinchu, Taiwan

In most network analysis tools the computation of the shortest paths between all pairs of nodes is a fundamental step toward the discovery of other properties. One such property is closeness centrality, a measure that shows how central a vertex is in a given network. In this paper, the authors present a method to compute the All Pairs Shortest Paths on graphs that present two characteristics: an abundance of nodes with degree one, and the existence of articulation points along the graph. These characteristics are present in many real-life networks, especially in networks that show a power-law degree distribution, as is the case with biological networks. The authors' method compacts the single nodes to their source, and then, by using the network articulation points, it disconnects the network and computes the shortest paths in the biconnected components. At the final step the authors' proposed method merges the results to provide the shortest paths of the whole network. The authors' method achieves remarkable speedup compared to state-of-the-art methods for computing shortest paths, as much as a 7-fold speedup on artificial graphs and a 3.25-fold speedup on real application graphs. Unlike previous research, the authors' performance improvement does not involve elaborate setups, since the authors' algorithm can process significant instances on a popular workstation.

Keywords: Graph algorithms; All pairs shortest paths; Articulation points
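The abstract above relies on locating articulation points, the vertices whose removal disconnects the graph. The sketch below shows the classic depth-first-search low-link technique for finding them. It is not the authors' code, and it covers only this one step, not the degree-one compaction or the merging of results. The function name `articulation_points` and the adjacency-dictionary input are assumptions, and the graph is assumed to be undirected and simple (no parallel edges or self-loops).

```python
def articulation_points(adjacency):
    """Return the set of articulation points of an undirected simple graph.

    `adjacency` maps each vertex to an iterable of its neighbours.
    Hypothetical helper using the DFS low-link method; recursion depth
    grows with the DFS depth, so very deep graphs may need an iterative
    variant or a raised recursion limit.
    """
    discovery = {}  # DFS discovery time of each visited vertex
    low = {}        # earliest discovery time reachable from the subtree
    result = set()
    counter = 0

    def visit(u, parent):
        nonlocal counter
        discovery[u] = low[u] = counter
        counter += 1
        children = 0
        for v in adjacency[u]:
            if v == parent:
                continue
            if v in discovery:
                # Back edge: u can reach an earlier vertex directly.
                low[u] = min(low[u], discovery[v])
            else:
                children += 1
                visit(v, u)
                low[u] = min(low[u], low[v])
                # A non-root u separates v's subtree if that subtree
                # cannot reach any vertex discovered before u.
                if parent is not None and low[v] >= discovery[u]:
                    result.add(u)
        # The DFS root is an articulation point iff it has 2+ children.
        if parent is None and children > 1:
            result.add(u)

    for start in adjacency:
        if start not in discovery:
            visit(start, None)
    return result
```

For a triangle 0-1-2 with a pendant vertex 3 attached to vertex 2, the function reports vertex 2 as the only articulation point. Vertex 3 is also the kind of degree-one node the abstract describes as being compacted into its source.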


All-Pairs Shortest Paths with a Sublinear Additive Error

ACM TRANSACTIONS ON ALGORITHMS, 2011, Vol. 7, No. 4, p. 45

Authors: Roditty, Liam; Shapira, Asaf. Bar Ilan Univ, Dept Comp Sci, IL-52900 Ramat Gan, Israel; Georgia Inst Technol, Sch Math, Atlanta, GA 30332, USA; Georgia Inst Technol, Coll Comp, Atlanta, GA 30332, USA

We show that, for every 0 <= p <= 1, there is an O(n^(2.575 - p/(7.4 - 2.3p)))-time algorithm that, given a directed graph with small positive integer weights, estimates the length of the shortest path between every pair of vertices u, v in the graph to within an additive error delta^p(u, v), where delta(u, v) is the exact length of the shortest path between u and v. This algorithm runs faster than the fastest algorithm for computing exact shortest paths for any 0 < p <= 1. Previously the only way to "beat" the running time of the exact shortest path algorithms was by applying an algorithm of Zwick [2002] that approximates the shortest path distances within a multiplicative error of (1 + epsilon). Our algorithm thus gives a smooth qualitative and quantitative transition between the fastest exact shortest paths algorithm, and the fastest approximation algorithm with a linear additive error. In fact, the main ingredient we need in order to obtain the above result, which is also interesting in its own right, is an algorithm for computing (1 + epsilon) multiplicative approximations for the shortest paths, whose running time is faster than the running time of Zwick's approximation algorithm when epsilon << 1 and the graph has small integer weights.

Keywords: Graph algorithms; Shortest paths; Matrix multiplication


Breaking the 2^n-barrier for Irredundance: Two lines of attack

JOURNAL OF DISCRETE ALGORITHMS, 2011, Vol. 9, No. 3, pp. 214-230

Authors: Binkele-Raible, Daniel; Brankovic, Ljiljana; Cygan, Marek; Fernau, Henning; Kneis, Joachim; Kratsch, Dieter; Langer, Alexander; Liedloff, Mathieu; Pilipczuk, Marcin; Rossmanith, Peter; Wojtaszczyk, Jakub Onufry. Univ Trier, FB Abt Informat 4, Trier, Germany; Univ Newcastle, Sch Elect Engn & Comp Sci, Callaghan, NSW, Australia; Univ Warsaw, Fac Math Informat & Mech, Warsaw, Poland; Rhein Westfal TH Aachen, Dept Comp Sci, Aachen, Germany; Univ Paul Verlaine Metz, Lab Informat Theor & Appl, Metz, France; Univ Orleans, Lab Informat Fondamentale Orleans, Orleans, France; Polish Acad Sci, Inst Math, Warsaw, Poland

The lower and the upper irredundance numbers of a graph G, denoted ir(G) and IR(G), respectively, are conceptually linked to the domination and independence numbers and have numerous relations to other graph parameters. It has been an open question whether determining these numbers for a graph G on n vertices admits exact algorithms running in time faster than the trivial Θ(2^n · poly(n)) enumeration, also called the 2^n-barrier. The main contributions of this article are exact exponential-time algorithms breaking the 2^n-barrier for irredundance. We establish algorithms with running times of O*(1.99914^n) for computing ir(G) and O*(1.9369^n) for computing IR(G). Both algorithms use polynomial space. The first algorithm uses a parameterized approach to obtain (faster) exact algorithms. The second one is based, in addition, on a reduction to the Maximum Induced Matching problem providing a branch-and-reduce algorithm to solve it. © 2011 Elsevier B.V. All rights reserved.

Keywords: Graph algorithms; Irredundance number


Sensitive detection of pathway perturbations in cancers: extended abstract 11


Proceedings of the 2nd ACM Conference on Bioinformatics, Computational Biology and Biomedicine

Authors: Corban G. Rivera; Brett M. Tyler; T. M. Murali. Virginia Tech, Blacksburg, VA

ISBN: (print) 9781450307963

The normal functioning of a living cell is characterized by complex interaction networks involving many different types of molecules. Associations detected between diseases and perturbations in well-defined pathways within such interaction networks have the potential to illuminate the molecular mechanisms underlying disease progression and response to treatment. In this paper, we present a computational method that compares expression profiles of genes in cancer samples to samples from normal tissues in order to detect perturbations of pre-defined pathways in the cancer. In contrast to many previous methods, our scoring function approach explicitly takes into account the interactions between the gene products in a pathway. Moreover, we compute the sub-pathway that has the highest score, as opposed to merely computing the score for the entire pathway. We use a permutation test to assess the statistical significance of the most perturbed sub-pathway. We apply our method to 20 pathways in the Netpath database and to the Global Cancer Map of gene expression in 18 cancers. We demonstrate that our method yields more sensitive results than alternatives that do not consider interactions or measure the perturbation of a pathway as a whole. We perform a sensitivity analysis to show that our approach is robust to modest changes in the input data. Our method confirms numerous well-known connections between pathways and cancers.

Keywords: Gene expression; Pathways; Cancers; Pathway perturbation; Simulated annealing; Graph algorithms
