Search Results - Inner Mongolia University Library

GTfold: Enabling parallel RNA secondary structure prediction on multi-core desktops

BMC Research Notes, 2012, Vol. 5, No. 1, pp. 1-6

Authors: Swenson, M Shel; Anderson, Joshua; Ash, Andrew; Gaurav, Prashant; Sükösd, Zsuzsanna; Bader, David A; Harvey, Stephen C; Heitsch, Christine E. Affiliations: School of Mathematics, Georgia Institute of Technology, Atlanta, GA, United States; College of Computing, Georgia Institute of Technology, Atlanta, GA, United States; Interdisciplinary Nanoscience Center, Aarhus University, Aarhus, Denmark; Department of Molecular Biology, Aarhus University, Aarhus, Denmark; School of Biology, Georgia Institute of Technology, Atlanta, GA, United States

Background: Accurate and efficient RNA secondary structure prediction remains an important open problem in computational molecular biology. Historically, advances in computing technology have enabled faster and more accurate RNA secondary structure predictions. Previous parallelized prediction programs achieved significant improvements in runtime, but their implementations were not portable from niche high-performance computers or easily accessible to most RNA researchers. With the increasing prevalence of multi-core desktop machines, a new parallel prediction program is needed to take full advantage of today's computing technology. Findings: We present here the first implementation of RNA secondary structure prediction by thermodynamic optimization for modern multi-core computers. We show that GTfold predicts secondary structure in less time than UNAfold and RNAfold, without sacrificing accuracy, on machines with four or more cores. Conclusions: GTfold supports advances in RNA structural biology by reducing the timescales for secondary structure prediction. The difference will be particularly valuable to researchers working with lengthy RNA sequences, such as RNA viral genomes. © 2012 Swenson et al.; licensee BioMed Central Ltd.

Keywords: Minimum Free Energy; Ribosomal Sequence; Minimum Free Energy Structure; shared memory parallelism; Terminal Mismatch

Column-Segmented Sparse Matrix-Matrix Multiplication on Multicore CPUs

28th Annual IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC)

Authors: An, Xiaojing; Catalyurek, Umit V. Affiliations: Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332, USA; Amazon Web Serv, Seattle, WA, USA

ISBN: (print) 9781665410168

Sparse general matrix-matrix multiplication, SpGEMM, is one of the most fundamental yet challenging sparse computation kernels. Due to its irregular computation pattern, SpGEMM frequently becomes the performance bottleneck in many scientific applications. Many prior state-of-the-art approaches use either dense or sparse accumulators to merge matrix rows as a critical component. Dense accumulators are efficient for small matrices but are infeasible for large or highly sparse matrices, due to high memory use and low cache efficiency. In this work, by segmenting the columns for the second input matrix, we propose a new SpGEMM algorithm that utilizes both a new sparse high-level overview of the matrix and fast and small dense accumulators that would fit in cache. With that, our approach brings the dense accumulator benefits to both large and highly sparse matrices. Our extensive experimental evaluation, carried out on three hardware platforms and on hundreds of sparse matrices from a variety of domains, shows that our algorithm out-performs state-of-the-art SpGEMM implementations.
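
To make the column-segmentation idea in the abstract above concrete, here is a minimal Python sketch. It is not the authors' implementation: rows are assumed to be stored as dictionaries, the names spgemm_column_segmented and seg_width are hypothetical, and both the paper's sparse high-level overview of the matrix and all parallelism are left out. It only shows how splitting the columns of the second matrix lets one small dense accumulator be reused for every segment.

```python
def spgemm_column_segmented(a_rows, b_rows, n_cols, seg_width):
    """Compute C = A * B, processing B one column segment at a time.

    a_rows, b_rows: one dict {column_index: value} per matrix row.
    n_cols: number of columns of B.
    seg_width: columns per segment (hypothetical tuning knob, chosen so
    that the dense accumulator stays small).
    """
    # Split every row of B by column segment, storing segment-local indices.
    n_segs = (n_cols + seg_width - 1) // seg_width
    b_segs = [[[] for _ in b_rows] for _ in range(n_segs)]
    for k, row in enumerate(b_rows):
        for j, v in row.items():
            b_segs[j // seg_width][k].append((j % seg_width, v))

    c_rows = [{} for _ in a_rows]
    acc = [0.0] * seg_width        # small dense accumulator, reused throughout
    seen = [False] * seg_width     # marks which accumulator slots are in use
    for s in range(n_segs):
        base = s * seg_width
        for i, a_row in enumerate(a_rows):
            touched = []
            for k, a_val in a_row.items():
                for local_j, b_val in b_segs[s][k]:
                    if not seen[local_j]:
                        seen[local_j] = True
                        touched.append(local_j)
                    acc[local_j] += a_val * b_val
            # Flush only the slots that were written, then reset them.
            for local_j in touched:
                c_rows[i][base + local_j] = acc[local_j]
                acc[local_j] = 0.0
                seen[local_j] = False
    return c_rows
```

The accumulator has seg_width entries no matter how wide B is, which is the property the abstract attributes to segmenting the columns.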

Keywords: sparse matrix matrix multiplication; shared memory parallelism

Parallel network simplex algorithm for the minimum cost flow problem

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, Vol. 34, No. 4

Authors: Kara, Gokcehan; Ozturan, Can. Affiliation: Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey

In this work, we contribute a parallel implementation of the network simplex algorithm, which is used to solve the minimum cost flow problem. In the network simplex algorithm, finding an entering arc requires searching through many arcs to decide which one should be included in the spanning tree solution on the next iteration. We propose finding the entering arc in parallel, as it often takes the majority of the execution time. A usual strategy is to pick, out of all possible candidates, the arc that violates optimality the most. Scanning all arcs can take quite some time, so it is common to consider only a fixed number of arcs, which is referred to as the block search pivoting rule. Arc scans can easily be done in parallel to find the best candidate, as the calculations are independent of each other. We used shared memory parallelism using OpenMP along with vectorization using AVX instructions. We also tried adjusting block sizes to increase the parallel portion of the algorithm. Our dataset consists of various natural and synthetic graphs with sizes up to a billion arcs. Our experiments show that speedups of up to four are possible, though they are typically lower.

Keywords: block search pivoting rule; minimum cost flow problem; network simplex algorithm; shared memory parallelism; vectorization

Parallelization of a two-dimensional time-area watershed routing

ENVIRONMENTAL MODELLING & SOFTWARE, 2021, Vol. 146, p. 1

Authors: Her, Younggu; Yang, Kwangsoo; Song, Jung-Hun. Affiliations: Univ Florida, Inst Food & Agr Sci, Dept Agr & Biol Engn, Homestead, FL 33031, USA; Univ Florida, Inst Food & Agr Sci, Trop Res & Educ Ctr, Homestead, FL 33031, USA; Florida Atlantic Univ, Dept Elect Engn & Comp Sci, Boca Raton, FL 33431, USA

Grid-based spatially distributed hydrological modeling has become feasible with advances in watershed routing schemes, remote sensing technology, and computing resources. However, the need for long running times on a substantial set of computational resources prevents spatially detailed modeling programs from being widely used, particularly in fine-resolution, large-scale studies. Parallelizing computational tasks can mitigate this difficulty. We propose a novel way to improve the simulation efficiency of direct runoff transport processes by grouping watershed areas based on a time-area routing scheme. The proposed parallelization method was applied to simulating the runoff routing processes of two watersheds of different sizes and landscapes. The method substantially improved the computational efficiency of the time-area routing simulation with common computing resources. The efficiency of the parallelization was not limited by the hierarchical relationship between upstream and downstream catchments along the flow paths, which was made possible by the Lagrangian tracking of the time-area routing method.
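
A generic time-area routing calculation can be sketched in Python as below. This is only a reading of the abstract above, not the authors' model: the function name route_time_area, the whole-step travel times, and the grouping of cells by equal travel time are all assumptions made for illustration. Each cell's runoff is shifted by its own travel time to the outlet, so no zone has to wait for an upstream zone to finish.

```python
from collections import defaultdict


def route_time_area(travel_steps, excess, n_steps):
    """Build the outlet hydrograph from per-cell runoff and travel times.

    travel_steps[c]: whole time steps for runoff from cell c to reach the
    outlet (assumed precomputed).
    excess[c][t]: runoff volume generated in cell c at time step t.
    Returns a list of n_steps outlet volumes.
    """
    # Group cells that share a travel time (an isochrone zone).
    zones = defaultdict(list)
    for cell, lag in enumerate(travel_steps):
        zones[lag].append(cell)

    def route_zone(lag, cells):
        # Uses only this zone's data, so zones could run as separate
        # parallel tasks; they are run one after another here.
        partial = [0.0] * n_steps
        for cell in cells:
            for t, volume in enumerate(excess[cell]):
                if t + lag < n_steps:
                    partial[t + lag] += volume
        return partial

    hydrograph = [0.0] * n_steps
    for lag, cells in zones.items():
        for t, q in enumerate(route_zone(lag, cells)):
            hydrograph[t] += q
    return hydrograph
```

Because the partial hydrographs are simply summed, the order in which zones are processed does not affect the result.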

Keywords: Time-area routing; Distributed watershed model; Parallel computing; shared memory parallelism; Stream network; Grid-based simulation

Flexible control structures for parallelism in OpenMP

Concurrency and Computation: Practice and Experience, 2000, Vol. 12, No. 12

Authors: Sanjiv Shah; Grant Haab; Paul Petersen; Joe Throop. Affiliation: Kuck & Associates Inc., 1906 Fox Drive, Champaign, IL 61820, U.S.A.

OpenMP cannot handle some very common programming idioms like recursive control and list or tree data structures. We present the workqueuing model and show it as a natural, flexible, and easy to use extension of OpenMP that is available as a commercial product. A detailed description of workqueuing is presented, together with performance results and pointers to the source code used. Copyright © 2000 John Wiley & Sons, Ltd.
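
The list-traversal idiom that the abstract above says a loop-oriented model handles poorly can be illustrated with a small Python analogue of a work queue. This is not the workqueuing extension's syntax or the authors' code: Node and process_list are hypothetical names, and Python's standard ThreadPoolExecutor stands in for the task queue. One thread walks the list sequentially and queues one task per node, while worker threads execute the queued tasks.

```python
from concurrent.futures import ThreadPoolExecutor


class Node:
    """Singly linked list node (hypothetical helper for this sketch)."""

    def __init__(self, value, next=None):
        self.value = value
        self.next = next


def process_list(head, work, max_workers=4):
    """Apply `work` to every node value, queuing one task per node."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = []
        node = head
        # The traversal stays sequential because the list length is
        # unknown up front; only the per-node work runs concurrently.
        while node is not None:
            futures.append(pool.submit(work, node.value))
            node = node.next
        # Collect results in list order, whatever order tasks finished in.
        return [f.result() for f in futures]
```

For example, process_list(Node(1, Node(2, Node(3))), lambda x: x * x) returns the squares in list order.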

Keywords: shared memory parallelism; symmetric multiprocessing; OpenMP; irregular parallelism; multithreading; POSIX threads; hierarchical decomposition; task queue; workqueuing
