Search Results - Inner Mongolia University Library

Authors: Bahig, Hazem M.; Bahig, Hatem M.; Fathy, Khaled A. Affiliations: Computer Science Division, Department of Mathematics, Faculty of Science, Ain Shams University, Cairo 11566, Egypt; Department of Mathematics, Faculty of Science, Al-Azhar University, Cairo, Egypt

Designing efficient parallel algorithms to calculate the product of n numbers when the multipliers are large is a fundamental problem in many applications of computer science, such as cryptography. In this work, we present a new parallel algorithm on the exclusive-read shared-memory model. The performance of the introduced algorithm is measured based on three factors, namely, (1) the number of cores, (2) the size of the array, and (3) the size of the multiplier. The experimental study on a multicore system reveals that the introduced algorithm is faster than the best-known optimal parallel algorithm. The improvement of the proposed algorithm in processing time compared to the best-known parallel algorithm is 80% when the size of the array was 2^20 and the sizes of the multiplier were 1024, 2048, and 4096 bits. Moreover, our algorithm is highly scalable compared with the best-known optimal parallel algorithm. © 2019 John Wiley & Sons, Ltd.

Keywords: parallel algorithms
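The decomposition that the abstract above refers to (a product of n large numbers split across cores) can be illustrated with a minimal sketch. This is not the authors' algorithm, which the abstract does not describe; it only shows the generic pattern of one partial product per worker followed by a final combine. The names `chunk_product`, `parallel_product`, and `workers` are hypothetical. Note also that CPython threads share a global interpreter lock, so this sketch shows the structure of the computation rather than a real speedup.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce
from operator import mul


def chunk_product(chunk):
    """Multiply one contiguous block of the array sequentially."""
    return reduce(mul, chunk, 1)


def parallel_product(values, workers=4):
    """Product of `values`: one partial product per chunk, then combine."""
    if not values:
        return 1
    size = -(-len(values) // workers)  # ceiling division
    chunks = [values[i:i + size] for i in range(0, len(values), size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(chunk_product, chunks))
    # Combine the few partial products sequentially.
    return chunk_product(partials)
```

For example, `parallel_product([2, 3, 4], workers=2)` splits the list into `[2, 3]` and `[4]`, multiplies each block, and combines the two partial results.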

Parallelizing a SAT-based product configurator

27th International Conference on Principles and Practice of Constraint Programming, CP 2021

Authors: Ullmann, Nils Merlin; Balyo, Tomáš; Klein, Michael. Affiliation: CAS Software AG, Karlsruhe, Germany

ISBN: (print) 9783959772112

This paper presents how state-of-the-art parallel algorithms designed to solve the Satisfiability (SAT) problem can be applied in the domain of product configuration. During an interactive configuration process, a user selects features step-by-step to find a suitable configuration that fulfills his desires and the set of product constraints. A configuration system can be used to guide the user through the process by validating the selections and providing feedback. Each validation of a user selection is formulated as a SAT problem. Furthermore, an optimization problem is identified to find solutions with the minimum amount of changes compared to the previous configuration. Another additional constraint is deterministic computation, which is not trivial to achieve in well performing parallel SAT solvers. In the paper we propose five new deterministic parallel algorithms and experimentally compare them. Experiments show that reasonable speedups are achieved by using multiple threads over the sequential counterpart. © Nils Merlin Ullmann, Tomáš Balyo, and Michael Klein.

Keywords: parallel algorithms

Design and Implementation of the Intelligent Parking System for Large Communities

2021 International Conference on Computer, Internet of Things and Control Engineering, CITCE 2021

Authors: Yang, Cun-Wei; Wang, Wei-Qing; Li, Feng-Ying. Affiliations: Southwest University, Department of Computer Science and Technology, Chongqing, China; Southwest University, Department of Management Science and Engineering, Chongqing, China; Nanjing University of Information Science and Technology, Department of Environmental Science, Nanjing, China

ISBN: (print) 9781665421843

Communities are growing in scale, and regional transportation networks are becoming ever more intricate as the population expands. This makes it hard for people, especially guests, to find parking spaces. It also disturbs residents and increases traffic risks. This paper proposes a multi-technology intelligent parking system that manages and guides vehicles in a large community to avoid chaos. The system serves many vehicles in a community concurrently, which places higher demands on its performance, so a parking space allocation algorithm and a parallel path-finding algorithm are introduced. To monitor parking spaces and provide a navigation service, Internet of Things (IoT) devices are deployed, and they are controlled by the server computer. Furthermore, a client/server (C/S) architecture software project is developed, and users are able to access the system via an Android or iOS application. In real-world tests, the performance and robustness of the system are suitable for the given conditions. © 2021 IEEE.

Keywords: parallel algorithms

Improved GPU near neighbours performance for multi-agent simulations

Journal of Parallel and Distributed Computing, 2020, vol. 137, pp. 53-64

Authors: Chisholm, Robert; Maddock, Steve; Richmond, Paul. Affiliation: Univ Sheffield, Dept Comp Sci, 211 Portobello, Sheffield S1 4DP, S Yorkshire, England

Complex systems simulations are well suited to the SIMT paradigm of GPUs, enabling millions of actors to be processed in fractions of a second. At the core of many such simulations, fixed radius near neighbours (FRNN) search provides the actors with spatial awareness of their neighbours. The FRNN search process is frequently the limiting factor of performance, due to the disproportionate level of scattered memory reads demanded by the query stage, leading to FRNN search runtimes exceeding that of simulation logic. In this paper, we propose and evaluate two novel optimisations (Strips and Proportional Bin Width) for improving the performance of uniform spatially partitioned FRNN searches and apply them in combination to demonstrate the impact on the performance of multi-agent simulations. The two approaches aim to reduce latency in search and reduce the amount of data considered (i.e. more efficient searching), respectively. When the two optimisations are combined, the peak obtained speedups observed in a benchmark model are 1.27x and 1.34x in two and three dimensional implementations, respectively. Due to additional non-FRNN search computation, the peak speedup obtained when applied to complex system simulations within FLAMEGPU is 1.21x. (C) 2019 The Authors. Published by Elsevier Inc.

Keywords: GPU; CUDA; parallel algorithms; complex systems
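As background for the abstract above, a uniform spatially partitioned FRNN search can be sketched on the CPU as follows. This is the baseline technique only, in two dimensions; it does not implement the Strips or Proportional Bin Width optimisations, whose details the abstract does not give. The function names and the choice of a cell width equal to the search radius are assumptions made for illustration.

```python
from collections import defaultdict
from math import floor


def build_grid(points, radius):
    """Bin 2D points into square cells whose side equals the search radius."""
    grid = defaultdict(list)
    for idx, (x, y) in enumerate(points):
        grid[(floor(x / radius), floor(y / radius))].append(idx)
    return grid


def neighbours(points, grid, radius, query):
    """Return sorted indices of points within `radius` of `query`."""
    qx, qy = query
    cx, cy = floor(qx / radius), floor(qy / radius)
    found = []
    # Only the 3x3 block of cells around the query cell can hold matches.
    for dx in (-1, 0, 1):
        for dy in (-1, 0, 1):
            for idx in grid.get((cx + dx, cy + dy), ()):
                px, py = points[idx]
                if (px - qx) ** 2 + (py - qy) ** 2 <= radius ** 2:
                    found.append(idx)
    return sorted(found)
```

Because each cell is as wide as the radius, a query never needs to look beyond its own cell and the eight surrounding ones; the scattered reads over those cells are the cost the abstract identifies as the bottleneck on GPUs.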

Design and Implementation of Multi-Threaded Algorithms in Polynomial Algebra

46th International Symposium on Symbolic and Algebraic Computation, ISSAC 2021

Authors: Moreno Maza, Marc. Affiliation: University of Western Ontario, London, ON, Canada

Hybrid CPU-GPU Community Detection in Weighted Networks

IEEE Access, 2020, vol. 8, pp. 57527-57551

Authors: Souravlas, Stavros; Sifaleras, Angelo; Katsavounis, Stefanos. Affiliations: Univ Macedonia, Dept Appl Informat, Thessaloniki 54636, Greece; Democritus Univ Thrace, Dept Prod & Management Engn, Xanthi 67100, Greece

Recently, a new trend has emerged in the field of parallel and high performance computing, the hybrid implementation using CPU-GPU modules. In such implementations, the computational load is shared between the CPU and GPU, in order to improve the computational efficiency. However, the task of sharing the computational load between the two modules is a rather difficult one, with a number of limitations being imposed. This paper extends our recent work on community detection, which is based on transforming a network of nodes into a set of threaded binary trees. In this work, we share the computational load between the two units: the CPU takes specific samples of the network communities and organizes them in the form of threaded binary trees. The GPU takes over the heavy load of reading this data and transforming it into a path-matrix. Finally, this matrix is sent back to the CPU for analysis, community detection and overlaps, as well as network information upgrades. Our simulation results show significant improvement over our previous strategy and other known community detection strategies found in the literature.

Keywords: Community detection; parallel algorithms; binary trees; social circles; GPU-CPU scheduling

ADMM-Based Energy-Reserve Joint Dispatch Considering Cross-Zone Reserve Dispatchability

2021 International Conference on Power System Technology, POWERCON 2021

Authors: Pu, Shutong; Yan, Xinfei; Shen, Mengjun; Zhong, Haiwang

ISBN: (print) 9781665407373

As distributed energy resources (DERs) become widespread in power systems, distributed algorithms are required for economic dispatch. The growth of renewable generation increases the randomness and volatility of the generation side, so a higher proportion of reserves is applied in the system for stability. Zonal-partitioned reserve dispatch is commonly considered in large-scale reserve dispatch cases. This paper solves energy-reserve joint dispatch problems, accounting for cross-zone reserve dispatchability, with a modified ADMM algorithm. The precision and economic effectiveness of the algorithm are validated through case studies based on the IEEE 30-bus system. © 2021 IEEE.

Keywords: parallel algorithms

High-Quality Shared-Memory Graph Partitioning

IEEE Transactions on Parallel and Distributed Systems, 2020, vol. 31, no. 11, pp. 2710-2722

Authors: Akhremtsev, Yaroslav; Sanders, Peter; Schulz, Christian. Affiliations: Google, Zurich, Switzerland; Karlsruhe Inst Technol KIT, D-76131 Karlsruhe, Germany; Univ Vienna, Fac Comp Sci, A-1010 Vienna, Austria

Partitioning graphs into blocks of roughly equal size such that few edges run between blocks is a frequently needed operation in processing graphs. Recently, the size, variety, and structural complexity of these networks have grown dramatically. Unfortunately, previous approaches to parallel graph partitioning have problems in this context since they often show a negative trade-off between speed and quality. We present an approach to multi-level shared-memory parallel graph partitioning that produces balanced solutions, shows high speedups for a variety of large graphs and yields very good quality independently of the number of cores used. For example, in an extensive experimental study, at 79 cores, one of our closest competitors is faster but fails to meet the balance criterion in the majority of cases and another is mostly slower and incurs about 13 percent larger cut size. Important ingredients include parallel label propagation for both coarsening and refinement, parallel initial partitioning, a simple yet effective approach to parallel localized local search, and fast locality preserving hash tables.

Keywords: Partitioning algorithms; Clustering algorithms; Program processors; Complex networks; Contracts; parallel algorithms; parallel graph partitioning; shared-memory parallelism; local search; label propagation

A DISTRIBUTED-MEMORY ALGORITHM FOR COMPUTING A HEAVY-WEIGHT PERFECT MATCHING ON BIPARTITE GRAPHS

SIAM Journal on Scientific Computing, 2020, vol. 42, no. 4, pp. C143-C168

Authors: Azad, Ariful; Buluc, Aydin; Li, Xiaoye S.; Wang, Xinliang; Langguth, Johannes. Affiliations: Indiana Univ, Intelligent Syst Engn, Bloomington, IN 47408, USA; Lawrence Berkeley Natl Lab, Computat Res Div, Berkeley, CA 94720, USA; Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China; Simula Res Lab, N-1364 Fornebu, Norway

We design and implement an efficient parallel algorithm for finding a perfect matching in a weighted bipartite graph such that weights on the edges of the matching are large. This problem differs from the maximum weight matching problem, for which scalable approximation algorithms are known. It is primarily motivated by finding good pivots in scalable sparse direct solvers before factorization. Due to the lack of scalable alternatives, distributed solvers use sequential implementations of maximum weight perfect matching algorithms, such as those available in MC64. To overcome this limitation, we propose a fully parallel distributed memory algorithm that first generates a perfect matching and then iteratively improves the weight of the perfect matching by searching for weight-increasing cycles of length 4 in parallel. For most practical problems the weights of the perfect matchings generated by our algorithm are very close to the optimum. An efficient implementation of the algorithm scales up to 256 nodes (17,408 cores) on a Cray XC40 supercomputer and can solve instances that are too large to be handled by a single node using the sequential algorithm.

Keywords: bipartite graphs; matching; parallel algorithms; graph theory; transversals
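The improvement step mentioned in the abstract above, searching for weight-increasing cycles of length 4, can be illustrated with a small sequential sketch. It is an assumption-laden illustration, not the paper's distributed-memory algorithm: the dense `weights` matrix (with `None` for absent edges), the `mate` list, and the function name `improve_matching` are all hypothetical. A 4-cycle here means two matched pairs (i, a) and (j, b) whose partners are swapped to (i, b) and (j, a) when both swapped edges exist and the total weight goes up.

```python
def improve_matching(weights, mate):
    """Greedily apply weight-increasing 4-cycle swaps until none remain.

    weights[i][j] is the weight of edge (row i, column j), or None if absent.
    mate[i] is the column matched to row i; the list is updated in place.
    """
    n = len(mate)
    improved = True
    while improved:
        improved = False
        for i in range(n):
            for j in range(i + 1, n):
                a, b = mate[i], mate[j]
                if weights[i][b] is None or weights[j][a] is None:
                    continue  # the swapped edges must exist
                gain = (weights[i][b] + weights[j][a]
                        - weights[i][a] - weights[j][b])
                if gain > 0:
                    mate[i], mate[j] = b, a
                    improved = True
    return mate
```

Each accepted swap strictly increases the total weight and keeps the matching perfect, so with exact (for example integer) weights the loop terminates. The result is a local optimum with respect to 4-cycles and is not guaranteed to be a maximum-weight perfect matching.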

Capturing Associations in Graphs

Proceedings of the VLDB Endowment, 2020, vol. 13, no. 11, pp. 1863-1876

Authors: Fan, Wenfei; Jin, Ruochun; Liu, Muyang; Lu, Ping; Tian, Chao; Zhou, Jingren. Affiliations: Univ Edinburgh, Edinburgh, Midlothian, Scotland; Shenzhen Univ, Shenzhen Inst Comp Sci, Shenzhen, Peoples R China; Beijing Adv Innovat Ctr Big Data & Brain Comp, Beijing, Peoples R China; Alibaba Grp, Seattle, WA, USA

This paper proposes a class of graph association rules, denoted by GARs, to specify regularities between entities in graphs. A GAR is a combination of a graph pattern and a dependency; it may take as predicates ML (machine learning) classifiers for link prediction. We show that GARs help us catch incomplete information in schemaless graphs, predict links in social graphs, identify potential customers in digital marketing, and extend graph functional dependencies (GFDs) to capture both missing links and inconsistencies. We formalize association deduction with GARs in terms of the chase, and prove its Church-Rosser property. We show that the satisfiability, implication and association deduction problems for GARs are coNP-complete, NP-complete and NP-complete, respectively, retaining the same complexity bounds as their GFD counterparts, despite the increased expressive power of GARs. The incremental deduction problem is DP-complete for GARs versus coNP-complete for GFDs. In addition, we provide parallel algorithms for association deduction and incremental deduction. Using real-life and synthetic graphs, we experimentally verify the effectiveness, scalability and efficiency of the parallel algorithms.

Keywords: parallel algorithms
