检索结果-内蒙古大学图书馆

parallel Burrow-Wheeler transform for genomic data on multi-core computers

Journal of Computational Information Systems 2012年第16期8卷 6935-6942页

作者： Zhao, Zhiheng Yin, Jianping Xiong, Wei Long, Jun School of Computer National University of Defense Technology Changsha 410073 China

As the core of FM-index and compressed suffix array, the Burrows-Wheeler Transform (BWT) plays a key role in indexing genomic sequence data for pattern search. It can run in O (n) bits, typically in total space less than or equal to that of the sequences themselves, and is high efficient in exact pattern matching. However, computing a BWT from a sequence in efficient space and time is challenging. In this article, we present an efficient parallel algorithm for computing BWT on multi-core computers. Our parallel algorithm is based on an incremental idea which is space-efficient. The analysis and experiments show that our algorithm is very efficient on multi-core computers and has good scalability, especially for long sequences. It only takes about a few minutes to compute the BWT of human whole genome on a prevalent 4-core desktop computer. © 2012 Binary Information Press.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

parallelisation of electromagnetic simulation codes

引用

IEEE TRANSACTIONS ON MAGNETICS 1998年第5期34卷 3423-3426页

作者： Janssen, R Dracopoulos, M Parrott, K Slessor, E Alotto, P Molfino, P Nervi, M Simkin, J Philips Res Labs NL-5656 AA Eindhoven Netherlands Univ Oxford Oxford Parallel OUCL Oxford OX1 3QD England Univ Genoa Dipartimento Ingn Elettr I-16145 Genoa Italy Vector Fields Ltd Oxford OX5 1JE England

In this paper results obtained from the parallelisation of existing 3D electromagnetic Finite Element codes within the ESPRIT HPCN project PARTEL are presented. The parallelisation procedure, based on the Bulk Synchronous parallel approach, is outlined and the encouraging results obtained in terms of speed-up on some industrially significant test cases are described and discussed.

关键词： finite element methods electromagnetic simulation software parallel algorithms high performance computing

来源：评论

学校读者我要写书评

暂无评论

parallel bisection mesh refinement algorithm for distributed memory parallel computers

引用

Jisuan Wuli/Chinese Journal of Computational Physics 2005年第5期22卷 399-406页

作者： Liu, Qing-Kai Zhang, Lin-Bo Institute of Computational Mathematics and Scientific/Engineering Computing Academy of Mathematics and Systems Science Chinese Academy of Sciences Beijing 100080 China Graduate School Chinese Academy of Sciences Beijing 100080 China

We present a parallel bisection mesh refinement algorithm based on ALBERT (Adaptive multi-Level finite element toolbox using Bisection refinement and Error control by Residual Techniques). The goal is to develop a parallel adaptive finite element code suitable for distributed memory parallel computers or PC clusters. An overview on the basic strategy for the parallelization of ALBERT is given. Issues on the parallel mesh refinement are addressed. A modified mesh refinement algorithm, which can be implemented efficiently on distributed memory parallel computers, is proposed and its properties are discussed. Numerical experiments with parallel bisection mesh refinement algorithm are shown.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

parallel Symmetry-Breaking in Sparse Graphs: 收藏
分享
引用; SIAM Journal on Discrete Mathematics 1988年第4期1卷 434-446页; 作者： Andrew V. Goldberg Serge A. Plotkin Gregory E. Shannon; This paper describes efficient deterministic techniques for breaking symmetry in parallel. These techniques work well on rooted trees and graphs of constant degree or genus. The primary technique allows us to 3-color ... 详细信息; This paper describes efficient deterministic techniques for breaking symmetry in parallel. These techniques work well on rooted trees and graphs of constant degree or genus. The primary technique allows us to 3-color a rooted tree in $O (\lg^{*} n)$; 关键词： parallel algorithms graph coloring maximal independent set planar graphs; 来源：评论; 学校读者我要写书评

暂无评论

A parallel algorithm for dynamic slicing of distributed Java programs in non-DSM systems

引用

International Journal of Information and Communication Technology 2007年第1期1卷 38-49页

作者： Mohapatra, Durga Prasad Mall, Rajib Kumar, Rajeev Department of CSE National Institute of Technology Rourkela 769008 Orissa India Department of CSE Indian Institute of Technology Kharagpur WB 721302 India

We propose a parallel algorithm for dynamic slicing of distributed Java programs in non-Distributed Shared Memory (DSM) systems. Given a distributed Java program, we first construct an intermediate representation in the form of a Distributed Program Dependence Graph (DPDG). We mark and unmark the edges of the DPDG appropriately as and when dependencies arise and cease during run-time. Our algorithm can run parallely on a network of computers, so that each node in the network contributes to the dynamic slice by computing its local portion of the global slice in a fully distributed fashion. Copyright © 2007 Inderscience Enterprises Ltd.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Implementing Arbitrary/Common Concurrent Writes of CRCW PRAM 21

Implementing Arbitrary/Common Concurrent Writes of CRCW PRAM

引用

50th International Conference on parallel Processing (ICPP)

作者： Ghanim, Fady Elwasif, Wael R. Bernholdt, David E. Oak Ridge Natl Lab Oak Ridge TN 37830 USA

ISBN: (纸本)9781450384414

The parallel Random Access Machines (PRAM) abstraction is the simplest and most elegant algorithmic model for the design and analysis of parallel algorithms. It consists of different models categorized based on the underlying memory access mode used, the most powerful of which is the Concurrent Read Concurrent Write (CRCW) model. A PRAM algorithm describes a series of rounds, each of which consists of a collection of operations that can be executed concurrently within the same time step. However, the lack of support for concurrent memory accesses and the prevalence of asynchronous programming models led to the belief that implementing CRCW PRAM algorithms is unattainable and prompted many to avoid this model except for theoretical studies of optimal performance. In this work, we study the arbitrary and common concurrent writes in the CRCW PRAM model and explore implementation challenges on general-purpose systems. Moreover, we examine current practices for implementing common/arbitrary concurrent writes and propose a new efficient lightweight and thread-safe method to implement concurrent writes through leveraging atomic instructions. To demonstrate the efficacy of our method, we developed OpenMP kernels for classical CRCW PRAM algorithms and provide experimental results and comparisons based on run time performance measured over the x86 multicore architecture. Our results show a performance speedup compared to current practices up to 4.5x across all our benchmarks.

关键词： CRCW PRAM parallel algorithms Arbitrary Concurrent Writes Write-conflict resolution parallel Architectures

来源：评论

学校读者我要写书评

暂无评论

A parallel Algorithm for Production Scheduling

引用

IFAC Proceedings Volumes 1997年第19期30卷 321-325页

作者： José Francisco Ferreira Ribeiro Cassilda Maria Ribeiro Marcio Mattos Borges de Oliveira University of São Paulo Department of Mechanical Engineering fax: 55-16-2749150 phone: 55-16-2743444 13560-970 São Carlos-SP Brazil University of São Paulo Department of Computer Science and Statistics fax: 55-16-2749150 phone: 55-16-2743444 13560-970 São Carlos-SP Brazil University of São Paulo Department of Economy Management and Accounting fax: 55-16-6336133 phone: 55-16-6335617 14040-900 Ribeirão Preto-SP Brazil

A method to schedule and program operations based on manufacture cells and use of parallel processing is presented in this paper. The factory is organized in cells with the aim of decomposing the global problem of scheduling in subproblems of reduced dimension. This decomposition allows a simplification of tasks related to the control and supervision of the factory. Besides, parallel processing enables faster computations and the use of the program in real time.

关键词： Optimization problems Operations research Scheduling algorithms Discrete systems parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Efficient parallel algorithm for shortest paths in planar layered digraphs

引用

Journal of Zhejiang University: Science 2004年第5期5卷 518-527页

作者： Mishra, P.K. Dept. of Appl. Math. Birla Inst. of Technol. Mesra Ranchi 835215 India

This paper presents an efficient parallel algorithm for the shortest path problem in planar layered digraphs that runs in O(log3n) time with n processors. The algorithms uses a divide and conquer approach and is based... 详细信息

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

parallel parsing from recurrence equations

引用

INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS 1996年第3-4期59卷 151-164页

作者： Barsan, C Evans, DJ LOUGHBOROUGH UNIV TECHNOL PARALLEL ALGORITHMS RES CTRLOUGHBOROUGH LE11 3TULEICSENGLAND

In this paper a new framework for parallel parsing is proposed. The parsing problem of context free languages are converted into a system of linear recurrence equations. This presents a new approach for parallel parsing because one can apply VLSI automatic synthesis procedures developed for numerical computation. Two well-known context-free parsing algorithms, the Coke-Younger-Kasamy (CYK) algorithm and Early algorithm are rewritten as systems of linear recurrence equations. The proposed framework can be used as an automatic generation procedure of a parallel parser similar to the sequential parser generators tools like YACC.

关键词： parallel algorithms context-free languages parsing dependency analysis systolic arrays

来源：评论

学校读者我要写书评

暂无评论

parallel adaptive version of the block-based Gauss-Jordan algorithm

Proceedings of the International Parallel Processing Symposi...

引用

Proceedings of the International parallel Processing Symposium, IPPS 1999年 350-354页

作者： Melab, N. Talbi, E.-G. Petiton, S. Universite des Sciences et Technologies de Lille Villeneuve d'Ascq France

This paper presents a parallel adaptive version of the block-based Gauss-Jordan algorithm used in numerical analysis to invert matrices. This version includes a characterization of the workload of processors and a mechanism of its adaptive folding/unfolding. The application is implemented and experimented with MARS in dedicated and non-dedicated environments. The results show that an absolute efficiency of 92% is possible on a cluster of DEC/ALPHA processors interconnected by a Gigaswitch network and an absolute efficiency of 67% can be obtained on an Ethernet network of SUN-Sparc4 workstations. Moreover, the adaptability of the algorithm is experimented on a non-dedicated meta-system including both the two parks of machines.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法