Computing intersections among sets of one-dimensional intervals is a ubiquitous problem in computational geometry with important applications in bioinformatics, where the size of typical inputs is large and it is therefore important to use efficient algorithms. In this paper we propose a parallel algorithm for the 1D intersection-counting problem, that is, the problem of counting the number of intersections between each interval in a given set A and every interval in a set B. Our algorithm is suitable for shared-memory architectures (e.g., multicore CPUs) and GPUs. The algorithm is work-efficient because it performs the same amount of work as the best serial algorithm for this kind of problem. Our algorithm has been implemented in C++ using the Thrust parallel algorithms library, enabling the generation of optimized programs for multicore CPUs and GPUs from the same source code. The performance of our algorithm is evaluated on synthetic and real datasets, showing good scalability on different generations of hardware.
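For illustration, the sketch below shows one standard Thrust-based formulation of 1D intersection counting: sort the start and end points of B, then use vectorized binary searches to count, for each interval in A, how many intervals in B it intersects. This is a minimal sketch and not necessarily the paper's algorithm; the separate lo/hi arrays, closed-interval intersection test, and example data are assumptions made here.

```cpp
// Minimal sketch (not the paper's algorithm): for each interval [a_lo, a_hi] in A,
// count intervals [b_lo, b_hi] in B with b_lo <= a_hi and b_hi >= a_lo, i.e.
//   count = #{b : b_lo <= a_hi} - #{b : b_hi < a_lo}
// using sorted endpoints of B and Thrust's vectorized binary searches.
// The same source compiles for GPUs (CUDA backend) or multicore CPUs (OpenMP/TBB backends).
#include <thrust/device_vector.h>
#include <thrust/host_vector.h>
#include <thrust/sort.h>
#include <thrust/binary_search.h>
#include <thrust/transform.h>
#include <thrust/functional.h>
#include <vector>
#include <iostream>

int main() {
    // Example intervals (closed), stored as separate lo/hi arrays.
    std::vector<int> a_lo_h{0, 5, 10},    a_hi_h{4, 9, 20};
    std::vector<int> b_lo_h{1, 6, 15, 30}, b_hi_h{3, 12, 25, 40};
    thrust::device_vector<int> a_lo(a_lo_h.begin(), a_lo_h.end());
    thrust::device_vector<int> a_hi(a_hi_h.begin(), a_hi_h.end());
    thrust::device_vector<int> b_lo(b_lo_h.begin(), b_lo_h.end());
    thrust::device_vector<int> b_hi(b_hi_h.begin(), b_hi_h.end());

    // Sort B's start points and end points independently.
    thrust::sort(b_lo.begin(), b_lo.end());
    thrust::sort(b_hi.begin(), b_hi.end());

    // starts_leq[i] = #{b : b_lo <= a_hi[i]},  ends_lt[i] = #{b : b_hi < a_lo[i]}
    thrust::device_vector<int> starts_leq(a_lo.size()), ends_lt(a_lo.size()), counts(a_lo.size());
    thrust::upper_bound(b_lo.begin(), b_lo.end(), a_hi.begin(), a_hi.end(), starts_leq.begin());
    thrust::lower_bound(b_hi.begin(), b_hi.end(), a_lo.begin(), a_lo.end(), ends_lt.begin());

    // Per-interval intersection counts: starts_leq - ends_lt.
    thrust::transform(starts_leq.begin(), starts_leq.end(), ends_lt.begin(),
                      counts.begin(), thrust::minus<int>());

    thrust::host_vector<int> h_counts = counts;
    for (int c : h_counts) std::cout << c << ' ';   // expected: 1 1 2
    std::cout << '\n';
    return 0;
}
```

Each query costs two O(log |B|) binary searches after an O(|B| log |B|) sort, and all queries over A run in parallel, which is one way a Thrust implementation can target both CPU and GPU backends from the same source.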
ISBN (print): 9781450388979
The increasing availability of large volumes of traffic data has led to the development of several short-term traffic prediction models. Training these models is a computationally intensive process due to the volume of available traffic data. Therefore, having effective methods for accelerating this process is considered necessary. In this paper, we propose an efficient method for accelerating the training process of multiple short-term traffic prediction models in large-scale traffic networks. In particular, the traffic data is organized into separate files so that the training process for one model is independent of the others. These files are distributed in the cores of a shared-memory multicore processor so as to train multiple models simultaneously. Appropriate measures have been taken to limit the memory footprint of the proposed method, as well as to enhance its load balancing capabilities. The proposed method was applied to five short-term traffic prediction models, and evaluated using large-scale real-world traffic data. Preliminary experimental results indicate that the proposed method exhibits nearly linear speedup for the training process of all models, while maintaining their prediction performance.
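To make the parallel training scheme concrete, the sketch below distributes independent per-model training tasks, one per data file, across the cores of a shared-memory machine using a dynamic work queue. This is a minimal sketch under assumptions made here, not the paper's implementation: train_model() and the file names are hypothetical placeholders, and the load-balancing and memory-limiting measures are reduced to "each worker claims and loads one file at a time".

```cpp
// Minimal sketch (not the paper's implementation): train multiple independent
// short-term traffic prediction models concurrently on a shared-memory machine.
// Dynamic scheduling via an atomic work index provides basic load balancing;
// each worker handles one file at a time, which bounds the memory footprint.
#include <atomic>
#include <string>
#include <thread>
#include <vector>
#include <iostream>

// Hypothetical placeholder: load one data file and train the corresponding model.
void train_model(const std::string& data_file) {
    std::cout << "training model on " + data_file + "\n";
}

int main() {
    // One file per model, so each training task is independent of the others.
    std::vector<std::string> files = {"link_001.csv", "link_002.csv",
                                      "link_003.csv", "link_004.csv"};

    std::atomic<std::size_t> next{0};
    unsigned workers = std::max(1u, std::thread::hardware_concurrency());

    std::vector<std::thread> pool;
    for (unsigned w = 0; w < workers; ++w) {
        pool.emplace_back([&] {
            // Each worker repeatedly claims the next unprocessed file, so
            // faster workers automatically take on more tasks (load balancing).
            for (std::size_t i = next.fetch_add(1); i < files.size();
                 i = next.fetch_add(1)) {
                train_model(files[i]);
            }
        });
    }
    for (auto& t : pool) t.join();
    return 0;
}
```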