Search results - Inner Mongolia University Library

6th International Conference on Computer, Software and Modeling, ICCSM 2022

Authors: Moussa, Ziggaf Imad, Kissami ENSAO LMCS Complexe Universitaire B.P. 669 Oujda 60000 Morocco MSDA Mohammed VI Polytechnic University Lot 660 Ben Guerir 43150 Morocco Universite Sorbonne Paris Nord LAGA CNRS UMR 7539 Villetaneuse F-93430 France

ISBN: (digital) 9781665454865

ISBN: (print) 9781665454865

A parallel algorithm for solving the 2D shallow water equations coupled with the convection-diffusion equation has been developed, in order to demonstrate the capability and performance of our parallel approach while maintaining very good accuracy of the results obtained. The numerical scheme is written in a non-uniform triangular grid formalism, which accommodates computational domains with complex geometry. This approach is based on both predictor and corrector stages. The predictor stage uses the method of characteristics to reconstruct the numerical fluxes, whereas the corrector stage recovers the conservation equations. Numerical results are presented for pollutant transport in a square cavity. © 2022 IEEE.

Keywords: parallel algorithms

Parallel computation of combinatorial symmetries

29th Annual European Symposium on Algorithms, ESA 2021

Authors: Anders, Markus Schweitzer, Pascal TU Darmstadt Germany

ISBN: (print) 9783959772044

In practice, symmetries of combinatorial structures are computed by transforming the structure into an annotated graph whose automorphisms correspond exactly to the desired symmetries. An automorphism solver is then employed to compute the automorphism group of the constructed graph. Such solvers have been developed for over 50 years, and highly efficient sequential, single-core tools are available. However, no competitive parallel tools are available for the task. We introduce a new parallel randomized algorithm that is based on a modification of the individualization-refinement paradigm used by sequential solvers. The use of randomization crucially enables parallelization. We report extensive benchmark results that show that our solver is competitive with state-of-the-art solvers on a single thread, while scaling remarkably well with the use of more threads. This results in order-of-magnitude improvements on many graph classes over state-of-the-art solvers. In fact, our tool is the first parallel graph automorphism tool that outperforms current sequential tools. © Markus Anders and Pascal Schweitzer; licensed under Creative Commons License CC-BY 4.0

Keywords: Algorithm engineering; Automorphism groups; Graph isomorphism; parallel algorithms

Data-parallel Hashing Techniques for GPU Architectures

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, Vol. 31, No. 1, pp. 237-250

Authors: Lessley, Brenton Childs, Hank Univ Oregon Dept Comp & Informat Sci Eugene OR 97403 USA

Hash tables are a fundamental data structure for effectively storing and accessing sparse data, with widespread usage in domains ranging from computer graphics to machine learning. This study surveys the state-of-the-art research on data-parallel hashing techniques for emerging massively-parallel, many-core GPU architectures. This survey identifies key factors affecting the performance of different techniques and suggests directions for further research.

Keywords: Graphics processors; hash tables; parallel algorithms; search problems

Anchored coreness: efficient reinforcement of social networks

Authors: Linghu, Qingyuan Zhang, Fan Lin, Xuemin Zhang, Wenjie Zhang, Ying University of New South Wales Sydney Australia Guangzhou University Guangzhou China Centre for AI University of Technology Sydney Sydney Australia

The stability of a social network has been widely studied as an important indicator for both the network holders and the participants. Existing works on reinforcing networks focus on a local view, e.g., the anchored k-core problem aims to enlarge the size of the k-core with a fixed input k. Nevertheless, it is more promising to reinforce a social network in a global manner: considering the engagement of every user (vertex) in the network. Since the coreness of a user has been validated as the "best practice" for capturing user engagement, we propose and study the anchored coreness problem in this paper: anchoring a small number of vertices to maximize the coreness gain (the total increment of coreness) of all the vertices in the network. We prove the problem is NP-hard and show it is more challenging than the existing local-view problems. An efficient greedy algorithm is proposed with novel techniques for pruning the search space and reusing intermediate results. The algorithm is also extended to a distributed environment with a novel graph partition strategy to ensure the computational independence of each machine. Extensive experiments on real-life data demonstrate that our model is effective for reinforcing social networks and our algorithms are efficient. © 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

Keywords: parallel algorithms
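
As background for the coreness notion used in the abstract above, the coreness (core number) of every vertex can be computed by repeatedly peeling a vertex of minimum remaining degree. The sketch below is a simple sequential illustration only, not the authors' anchored-coreness algorithm; the function name `coreness` and the adjacency-dictionary input format are assumptions made for this example.

```python
def coreness(adj):
    """Return {vertex: core number} for an undirected graph.

    adj maps each vertex to the set of its neighbours
    (a hypothetical input format chosen for this sketch).
    """
    degree = {v: len(nbrs) for v, nbrs in adj.items()}
    remaining = set(adj)
    core = {}
    k = 0
    while remaining:
        # Peel a vertex of minimum remaining degree.
        v = min(remaining, key=degree.__getitem__)
        # The core number never decreases along the peeling order.
        k = max(k, degree[v])
        core[v] = k
        remaining.remove(v)
        for u in adj[v]:
            if u in remaining:
                degree[u] -= 1
    return core


# A triangle (1, 2, 3) with a pendant vertex 4 attached to vertex 3.
graph = {1: {2, 3}, 2: {1, 3}, 3: {1, 2, 4}, 4: {3}}
print(coreness(graph))
```

This version takes quadratic time because of the repeated minimum search; bucket-based peeling brings it down to linear time in the number of edges.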

NEARLY WORK-EFFICIENT PARALLEL ALGORITHM FOR DIGRAPH REACHABILITY

SIAM JOURNAL ON COMPUTING, 2020, Vol. 49, No. 5, pp. STOC18-500-STOC18-539

Author: Fineman, Jeremy T. Georgetown Univ Dept Comp Sci Washington DC 20057 USA

One of the simplest problems on directed graphs is that of identifying the set of vertices reachable from a designated source vertex. This problem can be solved easily sequentially by performing a graph search, but efficient parallel algorithms have eluded researchers for decades. For sparse high-diameter graphs in particular, there is no known work-efficient parallel algorithm with nontrivial parallelism. This amounts to one of the most fundamental open questions in parallel graph algorithms: Is there a parallel algorithm for digraph reachability with nearly linear work? This article shows that the answer is yes, presenting a randomized parallel algorithm for digraph reachability and related problems with expected work $\tilde{O}(m)$ and span $\tilde{O}(n^{2/3})$, and hence parallelism $\tilde{\Omega}(m/n^{2/3}) = \tilde{\Omega}(n^{1/3})$, on any graph with $n$ vertices and $m$ arcs. This is the first parallel algorithm having both nearly linear work and strongly sublinear span, i.e., span $\tilde{O}(n^{1-\epsilon})$ for any constant $\epsilon > 0$. The algorithm can be extended to produce a directed spanning tree, determine whether the graph is acyclic, topologically sort the strongly connected components of the graph, or produce a directed ear decomposition, all with work $\tilde{O}(m)$ and span $\tilde{O}(n^{2/3})$. The main technical contribution is an efficient Monte Carlo algorithm that, through the addition of $\tilde{O}(n)$ shortcuts, reduces the diameter of the graph to $\tilde{O}(n^{2/3})$ with high probability. While both sequential and parallel algorithms are known with those combinatorial properties, even the sequential algorithms are not efficient, having sequential runtime $\Omega(mn^{\Omega(1)})$. This article presents a surprisingly simple sequential algorithm that achieves the stated diameter reduction and runs in $\tilde{O}(m)$ time. Parallelizing that algorithm yields the main result, but doing so involves overcoming several other challen

Keywords: parallel algorithms; randomized algorithms; graph search; reachability; shortcuts
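
The sequential baseline that the abstract above calls easy, a graph search from the source, can be sketched as a breadth-first search. This illustrates only that baseline, not the article's randomized parallel algorithm; the function name `reachable` and the arc-list input are assumptions made for this example.

```python
from collections import deque


def reachable(arcs, source):
    """Return the set of vertices reachable from source in a digraph.

    arcs is an iterable of (tail, head) pairs (hypothetical input format).
    """
    adj = {}
    for u, v in arcs:
        adj.setdefault(u, []).append(v)

    seen = {source}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj.get(u, ()):
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return seen


arcs = [(0, 1), (1, 2), (3, 0)]
print(reachable(arcs, 0))
```

Each vertex and arc is handled once, so the work is linear in n + m. On a high-diameter graph, however, the search proceeds through a long chain of dependent steps, which is the obstacle to parallelism that the abstract describes.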

A Branch-and-Bound Algorithm for Computing the Reliable Isolated Zeros of Multivariate Polynomial Functions Systems

23rd IEEE International Conference on High Performance Computing and Communications, 7th IEEE International Conference on Data Science and Systems, 19th IEEE International Conference on Smart City and 7th IEEE International Conference on Dependability in Sensor, Cloud and Big Data Systems and Applications, HPCC-DSS-SmartCity-DependSys 2021

Authors: Chen, Cheng Chen, Liangyu Zeng, Zhenbing Lin, Dang East China Normal University Shanghai Key Lab of Trustworthy Computing Shanghai 200062 China Shanghai University Department of Mathematics Shanghai 200444 China

ISBN: (print) 9781665494571

In this paper, we present an algorithm that uses a GPGPU machine to compute the interval solutions of isolated real zeros of multivariate polynomial functions in given ranges. To overcome the state space explosion in the process of searching for zero points, we combine the branch-and-bound method and the Hansen-Sengupta method, and interval arithmetic is used throughout the computation to guarantee the reliability of the results. The computation is implemented on a GPGPU system, and experiments on 55 benchmark problems have been carried out. The results show that our method can produce reliable isolation of real zeros in acceptable time. © 2021 IEEE.

Keywords: Smart cities; Benchmark testing; Explosions; Reliability; parallel algorithms; Optimization; Arithmetic
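
To make the combination of branch-and-bound and interval arithmetic in the abstract above concrete, here is a minimal single-variable sketch: an interval is discarded when the interval enclosure of the polynomial excludes zero, and bisected otherwise. It is not the paper's method: it omits the Hansen-Sengupta step, runs sequentially rather than on a GPGPU, handles one variable only, and does not round outward, so the enclosures are illustrative rather than rigorously reliable. The names `poly_range` and `isolate_zeros` are hypothetical.

```python
def poly_range(coeffs, lo, hi):
    """Enclose p([lo, hi]) with Horner's rule in interval arithmetic.

    coeffs lists the coefficients from the highest degree down.
    Rounding is not directed outward, so this is a sketch only.
    """
    rlo = rhi = 0.0
    for c in coeffs:
        # Interval product [rlo, rhi] * [lo, hi], then add the coefficient.
        products = (rlo * lo, rlo * hi, rhi * lo, rhi * hi)
        rlo, rhi = min(products) + c, max(products) + c
    return rlo, rhi


def isolate_zeros(coeffs, lo, hi, tol=1e-6):
    """Return small intervals that may contain a real zero of p."""
    boxes, stack = [], [(lo, hi)]
    while stack:
        a, b = stack.pop()
        plo, phi = poly_range(coeffs, a, b)
        if plo > 0 or phi < 0:
            continue  # bound: the enclosure excludes 0, so discard the box
        if b - a <= tol:
            boxes.append((a, b))
        else:
            mid = (a + b) / 2  # branch: bisect the box
            stack.extend([(a, mid), (mid, b)])
    return boxes


# Candidate intervals for the zero of x^2 - 2 in [0, 2].
print(isolate_zeros([1.0, 0.0, -2.0], 0.0, 2.0))
```

A box that survives is only a candidate. Certifying that it holds exactly one zero needs an additional test, which is the role an interval Newton-type operator such as Hansen-Sengupta typically plays.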

Parallel Data Distribution Management on Shared-memory Multiprocessors

ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2020, Vol. 30, No. 1, pp. 1–25

Authors: Marzolla, Moreno D'Angelo, Gabriele Univ Bologna Dept Comp Sci & Engn DISI Mura Anteo Zamboni 7 I-90126 Bologna Italy

The problem of identifying intersections between two sets of d-dimensional axis-parallel rectangles appears frequently in the context of agent-based simulation studies. For this reason, the High Level Architecture (HLA) specification, a standard framework for interoperability among simulators, includes a Data Distribution Management (DDM) service whose responsibility is to report all intersections between a set of subscription and update regions. The algorithms at the core of the DDM service are CPU-intensive, and could greatly benefit from the large computing power of modern multi-core processors. In this article, we propose two parallel solutions to the DDM problem that can operate effectively on shared-memory multiprocessors. The first solution is based on a data structure (the interval tree) that allows concurrent computation of intersections between subscription and update regions. The second solution is based on a novel parallel extension of the Sort Based Matching algorithm, whose sequential version is considered among the most efficient solutions to the DDM problem. Extensive experimental evaluation of the proposed algorithms confirms their effectiveness in taking advantage of multiple execution units in a shared-memory architecture.

Keywords: Data distribution management (DDM); parallel and distributed simulation (PADS); high level architecture (HLA); parallel algorithms
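
The matching problem described in the abstract above can be illustrated in one dimension, where each region is a closed interval. The sketch below compares a brute-force check with a sequential endpoint sweep in the spirit of sort-based matching. It is not the authors' parallel algorithm or their exact sequential formulation, and the function names and the `(lo, hi)` tuple format are assumptions made for this example.

```python
def matches_brute_force(subs, upds):
    """All (i, j) such that closed intervals subs[i] and upds[j] overlap."""
    return {(i, j)
            for i, (slo, shi) in enumerate(subs)
            for j, (ulo, uhi) in enumerate(upds)
            if slo <= uhi and ulo <= shi}


def matches_sweep(subs, upds):
    """Same result, found by sorting endpoints and sweeping left to right."""
    events = []
    for i, (lo, hi) in enumerate(subs):
        events.append((lo, 0, "S", i))  # 0 marks an opening endpoint
        events.append((hi, 1, "S", i))  # 1 marks a closing endpoint
    for j, (lo, hi) in enumerate(upds):
        events.append((lo, 0, "U", j))
        events.append((hi, 1, "U", j))
    # At equal coordinates openings sort before closings, so intervals
    # that merely touch are still reported as overlapping.
    events.sort()

    active_s, active_u, out = set(), set(), set()
    for _, closing, kind, idx in events:
        if closing:
            (active_s if kind == "S" else active_u).discard(idx)
        elif kind == "S":
            out.update((idx, j) for j in active_u)
            active_s.add(idx)
        else:
            out.update((i, idx) for i in active_s)
            active_u.add(idx)
    return out


subs = [(0, 5), (10, 12)]
upds = [(4, 11), (20, 30), (5, 6)]
print(sorted(matches_sweep(subs, upds)))
```

For d-dimensional rectangles, one common approach is to run such a pass per dimension and keep only the pairs reported in every dimension.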

NvPD: novel parallel edit distance algorithm, correctness, and performance evaluation

CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, Vol. 23, No. 2, pp. 879-894

Authors: Sadiq, Muhammad Umair Yousaf, Muhammad Murtaza Aslam, Laeeq Aleem, Muhammad Sarwar, Shahzad Jaffry, Syed Waqar Univ Punjab Punjab Univ Coll Informat Technol Lahore Pakistan Univ Punjab Punjab Univ Coll Informat Technol Comp Sci Lahore Turkey Capital Univ Sci & Technol Dept Comp Sci Islamabad Pakistan

Edit distance has applications in many domains such as bioinformatics, spell checking, plagiarism checking, query optimization, speech recognition, and data mining. Traditionally, edit distance is computed by a sequential solution based on dynamic programming, which becomes infeasible for large problems. In this paper, we introduce NvPD, a novel algorithm for parallel edit distance computation that resolves dependencies in the conventional dynamic-programming-based solution. We also establish the correctness of the modified dependencies. NvPD exhibits characteristics such as a balanced workload among processors, less synchronization overhead, and maximum utilization of resources, and it can exploit spatial locality. It requires min(m, n) steps to complete, compared with the diagonal-based approach, which completes in max(m, n). Experimental evaluation using a variety of random and real-life data sets on shared-memory multi-core systems and graphics processing units (GPUs) shows that NvPD outperforms state-of-the-art parallel edit distance algorithms.

Keywords: Edit distance; Dynamic programming; parallel algorithms; Performance evaluation; GPUs; OpenMP
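
The conventional sequential dynamic-programming solution mentioned in the abstract above can be sketched as follows. This is the textbook Levenshtein recurrence, shown only to make the cell dependencies visible. It is not NvPD, and the function name `edit_distance` is an assumption made for this example.

```python
def edit_distance(a, b):
    """Levenshtein distance between sequences a and b, row by row."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty prefix of b
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,           # deletion (cell above)
                            curr[j - 1] + 1,       # insertion (cell to the left)
                            prev[j - 1] + cost))   # match or substitution (diagonal)
        prev = curr
    return prev[-1]


print(edit_distance("kitten", "sitting"))
```

Each cell depends on its left, upper, and upper-left neighbours. The dependency on the left neighbour prevents a row from being filled in parallel as written, and this is the kind of dependency that parallel formulations have to restructure.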

Advances in Asynchronous Parallel and Distributed Optimization

PROCEEDINGS OF THE IEEE, 2020, Vol. 108, No. 11, pp. 2013-2031

Authors: Assran, Mahmoud Aytekin, Arda Feyzmahdavian, Hamid Reza Johansson, Mikael Rabbat, Michael G. McGill Univ Dept Elect & Comp Engn Montreal PQ H3A 0G4 Canada Ericsson AB S-16440 Stockholm Sweden ABB S-72226 Stockholm Sweden KTH Royal Inst Technol S-10044 Stockholm Sweden Facebook Inc Dept AI Res Montreal PQ H2S 3G9 Canada

Motivated by large-scale optimization problems arising in the context of machine learning, there have been several advances in the study of asynchronous parallel and distributed optimization methods during the past decade. Asynchronous methods do not require all processors to maintain a consistent view of the optimization variables. Consequently, they generally can make more efficient use of computational resources than synchronous methods, and they are not sensitive to issues like stragglers (i.e., slow nodes) and unreliable communication links. Mathematical modeling of asynchronous methods involves proper accounting of information delays, which makes their analysis challenging. This article reviews recent developments in the design and analysis of asynchronous optimization methods, covering both centralized methods, where all processors update a master copy of the optimization variables, and decentralized methods, where each processor maintains a local copy of the variables. The analysis provides insights into how the degree of asynchrony impacts convergence rates, especially in stochastic optimization methods.

Keywords: Program processors; Optimization methods; Machine learning; Computational modeling; Convergence; Computational efficiency; Distributed algorithms; machine learning; machine learning algorithms; optimization methods; parallel algorithms

Parallel implementation of the Image Block Representation using OpenMP

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, Vol. 137, pp. 134-147

Authors: Spiliotis, Iraklis M. Bekakos, Michael P. Boutalis, Yiannis S. Democritus Univ Thrace Dept Elect & Comp Engn GR-67100 Xanthi Greece

Herein, a parallel implementation in OpenMP of the Image Block Representation (IBR) for binary images is investigated. The IBR is a region-based image representation scheme that represents the binary image as a set of non-overlapping rectangular areas with object level, called blocks. The IBR permits the execution of operations on image areas instead of image points and therefore leads to a substantial reduction of the required computational complexity. The experimental and the analytically derived results from parallel implementation in OpenMP, on a multicore computer, proved that a very good overall performance can be achieved. (C) 2019 Elsevier Inc. All rights reserved.

Keywords: Image Block Representation; Karp-Flatt metric; parallel computing; parallel algorithms; OpenMP
