Search Results - Inner Mongolia University Library

14th IEEE International Symposium on Embedded Multicore/Many-Core Systems-on-Chip (MCSoC)

Authors: Bloch, Aurelien; Brunet, Simone Casale; Mattavelli, Marco (Ecole Polytech Fed Lausanne, SCI MM STI, Lausanne, Switzerland)

ISBN: (print) 9781665438605

Writing and optimizing application software for heterogeneous platforms that include GPUs is a very difficult task: the designer must spend effort and resources on several key elements to obtain good performance. Dataflow programming has been shown to be a good approach to this task because of its portability and the possibility of arbitrarily partitioning a dataflow network across the units of a heterogeneous platform. However, such a design methodology is not sufficient by itself to obtain good performance. The paper describes methodological steps for improving the performance of dataflow programs written in RVC-CAL and synthesized to execute on heterogeneous CPU/GPU co-processing platforms. The steps include optimizing the performance of the communication tasks between processing elements, a strategy for the efficient scheduling of independent GPU partitions, and the introduction of dynamic programming for leveraging the SIMD nature of GPU platforms. The approach is validated qualitatively and quantitatively using example dataflow application programs executed under several partitioning configurations.

Keywords: dynamic dataflow programs RVC-CAL SIMD parallel computing source-to-source compiler GPU programming heterogeneous systems

Evolving AVX512 Parallel C Code Using GP

22nd European Conference on Genetic Programming (EuroGP) Held as Part of EvoStar Conference

Authors: Langdon, William B.; Lorenz, Ronny (UCL, CREST, Comp Sci, London WC1E 6BT, England; Univ Vienna, Inst Theoret Chem, A-1090 Vienna, Austria)

Using 512-bit Advanced Vector Extensions, previous development history, and Intel documentation, BNF-grammar-based genetic improvement automatically ports RNAfold to AVX, giving up to a 1.77-fold speedup. The evolved ...

ISBN: (print) 9783030166694; 9783030166700

Keywords: RNA secondary structure prediction Genetic programming GGGP SIMD parallel computing Software engineering RCS SBSE

Fine-grained parallel implementations for SWAMP+ Smith-Waterman alignment

Parallel Computing, 2013, Vol. 39, No. 12, pp. 819-833

Author: Steinfadt, Shannon (Los Alamos Natl Lab, Los Alamos, NM 87545, USA)

More sensitive than heuristic methods for searching biological databases, the Smith-Waterman algorithm is widely used but has the drawback of a high quadratic running time. The faster approach extends Smith-Waterman using Associative Massive Parallelism (SWAMP+) for three different parallel architectures: ASsociative Computing (ASC), the ClearSpeed coprocessor, and the Convey Computer FPGA coprocessor. We show that parallel versions of Smith-Waterman can be successfully modified to produce multiple BLAST-like sub-alignments while maintaining the original precision. SWAMP+ combines parallelism with the novel extension that produces multiple sub-alignments for pairwise comparisons. Two parallel SWAMP+ implementations, for the ASC model and the ClearSpeed CSX-620, use a wavefront approach. Both perform a full traceback in parallel memory, returning multiple sub-alignments. Results show a linear speedup for the 96 processing elements (PEs) on a single ClearSpeed chip. The third SWAMP+ adaptation uses the non-associative Convey Computer FPGA coprocessor. The hybrid system has a Smith-Waterman algorithm suite designed to produce high-speed, high-throughput alignments, optimized for large databases. The Convey Computer Smith-Waterman algorithm suite was extended to produce the additional SWAMP+ sub-alignments efficiently. The parallel sequence alignment algorithms were designed for three different computer systems, all of which contain extensions to produce multiple, additional sub-alignments. This work creates a speedup while providing a deeper, previously unavailable exploration of the matched query sequences. (C) 2013 Elsevier B.V. All rights reserved.

Keywords: SIMD parallel computing Bioinformatics parallel co-processor FPGAs Sequence alignment Smith-Waterman
