检索结果-内蒙古大学图书馆

AAAI Symposium on the 2nd Workshop on Deep Models and Artificial Intelligence for Defense Applications: Potentials, Theories, Practices, Tools, and Risks, DMAI4DA 2020

作者： Ganzfried, Sam Laughlin, Conner Morefield, Charles Ganzfried Research Arctan Inc.

Many real-world domains contain multiple agents behaving strategically with probabilistic transitions and uncertain (potentially infinite) duration. Such settings can be modeled as stochastic games. While algorithms have been developed for solving (i.e., computing a game-theoretic solution concept such as Nash equilibrium) two-player zero-sum stochastic games, research on algorithms for non-zero-sum and multiplayer stochastic games is limited. We present a new algorithm for these settings, which constitutes the first parallel algorithm for multiplayer stochastic games. We present experimental results on a 4-player stochastic game motivated by a naval strategic planning scenario, showing that our algorithm is able to quickly compute strategies constituting Nash equilibrium up to a very small degree of approximation error. © 2020 CEUR-WS. All rights reserved.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

parallel Monte Carlo Simulation of VaR Calculation Based on Intel MIC Architecture

Parallel Monte Carlo Simulation of VaR Calculation Based on ...

引用

2020 International Conference on Big Data and Informatization Education, ICBDIE 2020

作者： Wen, Tiansheng Mao, Rui Tan, Cheng Northwest Af University College of Information Engineering Yangling China

ISBN: (纸本)9781728159003

With regard to the research on financial risk management, Value-at-Risk(VaR) has been widely accepted as a standard approach to financial risk management. There are various ways applicable to calculate VaR, of which Monte Carlo Simulation is regarded as the most effective one to deal with VaR calculation. Nevertheless, the calculation accuracy of the Monte Carlo Simulation method tends to be affected by the scale of simulation. As the scale of simulation increases, the calculation accuracy improves. In the meantime, however, the time cost rises on a continued basis. Therefore, it is difficult for the Monte Carlo Simulation method to be applied in practice. In order to solve this problem, a parallel Monte Carlo Simulation calculation method based on Intel MIC architecture is proposed in this paper to calculate VaR. The simulation process consists of a group of simulation tasks at a time. Due to the absence of data dependency between each simulation task, it is capable of excellent parallelism. In this paper, the OpenMP programming model was applied to assign each simulation task to a different thread for execution. Then, a comparison was performed between the CPU sequential execution scheme, the scheme based on MIC offload mode and the scheme based on MIC native mode. According to the comparative results, the solution based on MIC native mode is superior to that based on MIC offload mode. When the number of times for simulation to be conducted reaches 50,000, the parallel algorithm in MIC native mode achieves a maximum speed-up of 93.2 times compared to CPU sequential calculations, which could make the Monte Carlo Simulation of VaR calculations suitable for practical application. © 2020 IEEE.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

A computational journey in the true north

引用

INTERNATIONAL JOURNAL OF parallel EMERGENT AND DISTRIBUTED SYSTEMS 2020年第2期35卷 132-142页

作者： Akl, Selim G. Queens Univ Sch Comp Kingston ON K7L 3N6 Canada

This paper tells my recollections of half a century of computing in Canada and, briefly earlier, in two other corners of the world.

关键词： parallel algorithms inherently parallel computations unconventional computational problems quantum chess quantum cryptography nonuniversality superlinear performance cloud security homomorphic cryptography

来源：评论

学校读者我要写书评

暂无评论

New parallel algorithms for finding determinants of NxN matrices

New parallel algorithms for finding determinants of NxN matr...

引用

World Congress on Computer and Information Technologies (WCCIT)

作者： Almalki, Sami Alzahrani, Saeed Alabdullatif, Abdullatif King Saud Univ Coll Comp & Informat Sci Riyadh Saudi Arabia

ISBN: (纸本)9781479904624

Determinants has been used intensively in a variety of applications through history. It also influenced many fields of mathematics like linear algebra. Finding the determinants of a squared matrix can be done using a variety of methods, including well-known methods of Leibniz formula and Laplace expansion which calculate the determinant of any NxN matrix in O(n!). However, decomposition methods, such as: LU decomposition, Cholesky decomposition and QR decomposition, have replaced the native methods with a significantly reduced complexity of O(n boolean AND 3). In this paper, we introduce two parallel algorithms for Laplace expansion and LU decomposition. Then, we analyze them and compare them with their perspective sequential algorithms in terms of run time, speed-up and efficiency, where new algorithms provided better results. At maximum, in Laplace expansion, it became 129% faster, whereas in LU Decomposition, it became 44% faster.

关键词： parallel Processing parallel algorithms Laplace Equations Linear algebra Multithreading

来源：评论

学校读者我要写书评

暂无评论

A computational technique for parallel solution of diagonally dominant banded linear systems

A computational technique for parallel solution of diagonall...

引用

International Conference on High Performance Computing

作者： S. Chandra Sekhara Rao Rabia Kamra Indian Institute of Technology Delhi New Delhi India

ISBN: (纸本)9781665410175

In this work, we present a WZ factorization for a nonsingular diagonally dominant banded matrix. With little modifications in the structures of W and Z, we construct a stable parallel algorithm suitable for solving narrow banded nonsingular diagonally dominant linear systems using divide and conquer technique. Partition the coefficient matrix along the main diagonal; also partition the unknown vector and the right hand side vector accordingly. The coefficient matrix of the ‘reduced system’ which is obtained by collecting the first ß and last ß equations from each partition, where ß is semibadwidth of the given banded linear system, is proved to be nonsingular diagonally dominant. The backward error analysis of the algorithm is presented and the algorithm is proved to be numerically stable. Numerical experiments are conducted to check the performance of the parallel algorithm and to compare the present parallel algorithm with the corresponding subroutines of ScaLAPACK. The performance of the parallel algorithm is evaluated in terms of speedup and scalability.

关键词： Linear systems Error analysis Scalability High performance computing Conferences Partitioning algorithms parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Unified Structure and parallel algorithms for FBMC Transmitter and Receiver

Unified Structure and Parallel Algorithms for FBMC Transmitt...

引用

IEEE 24th International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)

作者： Zeng, Yonghong Liang, Ying-Chang Chia, Meng Wah Peh, Edward Chu Yeow ASTAR Inst Infocomm Res Singapore 138632 Singapore

ISBN: (纸本)9781467362351

In recent years, filter bank multicarrier (FBMC) has recaptured widespread interests for its possible applications in cognitive radio and dynamic spectrum access. A distinctive feature for cognitive radio is its adaptivity to environment. When environment changes, a cognitive radio will change its parameters to optimize the transmission and receiving. Thus it is desirable to design a unified structure and algorithm for FBMC that needs little change for different parameters. In this paper, we propose a unified structure and parallel algorithms to implement the FBMC. The FBMC system and parallel algorithms are constructed based on the normalized prototype filter. The coefficients of the normalized prototype filter can be pre-computed and stored. The proposed parallel algorithms have the same structure for various choices of time duration, subcarrier spacing and bandwidth. Combined with known parallel algorithms for the fast Fourier transform (FFT), the proposed algorithms fully parallelize the computations for the transmitter and receiver, which can run much faster than conventional serial algorithms as modern processors usually have massive parallel capability.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Exact parallel Algorithm for the Knapsack Sharing Problem

Exact Parallel Algorithm for the Knapsack Sharing Problem

引用

Electrical, Computer and Energy Technologies (ICECET), International Conference on

作者： Hedi Mhalla Yacine Laalaoui College of Engineering and Technology American University of the Middle East Kuwait College of Computers & Information Technology Taif University Taif KSA

ISBN: (纸本)9781665442329

In the last two decades we observed a considerably improvement in algorithms' performances and their ability to solve hard combinatorial optimization problems. One of these problems is the knapsack sharing problem (KSP). The latter problem is a challenging variant of the well-known NP-hard single knapsack problem. In fact, we can find in the literature several exact and heuristic resolution approaches to solve the (KSP). We mainly propose here an improvement and an adaptation to parallel computing of one of the existing and most recent algorithm in the literature. The approach is a constructive tree search that runs in two phases: the initial solution construction phase and the second phase where we build the optimal solution through a customized branch and bound. We applied a parallel computing on this second phase in order to improve the overall computational time. Finally we present a comparative study on instances from literature to show the positive effect of parallel processing on the computing time.

关键词： Instruction sets Heuristic algorithms Energy resolution parallel algorithms Optimization

来源：评论

学校读者我要写书评

暂无评论

parallel Wideband MLFMA for Analysis of Electrically Large, Nonuniform, Multiscale Structures

引用

IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION 2019年第2期67卷 1094-1107页

作者： Hughey, Stephen Aktulga, H. M. Vikram, Melapudi Lu, Mingyu Shanker, Balasubramaniam Michielssen, Eric Michigan State Univ Dept Elect & Comp Engn E Lansing MI 48824 USA Michigan State Univ Dept Comp Sci E Lansing MI 48824 USA GE Global Res Ctr Bengaluru 560066 India West Virginia Univ Inst Technol Dept Elect & Comp Engn Beckley WV 25801 USA Univ Michigan Dept Elect Engn & Comp Sci Ann Arbor MI 48109 USA

Electromagnetic scattering from electrically large objects with multiscale features is an increasingly important problem in computational electromagnetics. A conventional approach is to use an integral equation-based solver that is then augmented with an accelerator, a popular choice being a parallel multilevel fast multipole algorithm (MLFMA). One consequence of multiscale features is locally dense discretization, which leads to low-frequency breakdown and requires nonuniform trees. To the authors' knowledge, the literature on parallel MLFMA for such multiscale distributions capable of arbitrary accuracy is sparse;this paper aims to fill this niche. We prescribe an algorithm that overcomes this bottleneck. We demonstrate the accuracy (with respect to analytical data) and performance of the algorithm for both PEC scatterers and point clouds as large as 755 lambda with several hundred million unknowns and nonuniform trees as deep as 16 levels.

关键词： Adaptive algorithms computational electromagnetics method of moments (MoM) multilevel fast multipole algorithm (MLFMA) parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Theoretically and Practically Efficient parallel Nucleus Decomposition

arXiv

引用

arXiv 2021年

作者： Shi, Jessica Dhulipala, Laxman Shun, Julian MIT CSAIL United States

This paper studies the nucleus decomposition problem, which has been shown to be useful in finding dense substructures in graphs. We present a novel parallel algorithm that is efficient both in theory and in practice. Our algorithm achieves a work complexity matching the best sequential algorithm while also having low depth (parallel running time), which significantly improves upon the only existing parallel nucleus decomposition algorithm (Sariyüce et al., PVLDB 2018). The key to the theoretical efficiency of our algorithm is the use of a theoretically-efficient parallel algorithms for clique listing and bucketing. We introduce several new practical optimizations, including a new multi-level hash table structure to store information on cliques space-efficiently and a technique for traversing this structure cache-efficiently. On a 30-core machine with two-way hyper-threading on real-world graphs, we achieve up to a 55x speedup over the state-of-the-art parallel nucleus decomposition algorithm by Sariyüce et al., and up to a 40x self-relative parallel speedup. We are able to efficiently compute larger nucleus decompositions than prior work on several million-scale graphs for the first time. Copyright © 2021, The Authors. All rights reserved.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Work Efficient parallel algorithms for Large Graph Exploration

Work Efficient Parallel Algorithms for Large Graph Explorati...

引用

20th International Conference on High Performance Computing (HiPC)

作者： Banerjee, Dip Sankar Sharma, Shashank Kothapalli, Kishore Int Inst Informat Technol Hyderabad 500032 Andhra Pradesh India

ISBN: (纸本)9781479907298

Graph algorithms play a prominent role in several fields of sciences and engineering. Notable among them are graph traversal, finding the connected components of a graph, and computing shortest paths. There are several efficient implementations of the above problems on a variety of modern multiprocessor architectures. It can be noticed in recent times that the size of the graphs that correspond to real world data sets has been increasing. parallelism offers only a limited succor to this situation as current parallel architectures have severe short-comings when deployed for most graph algorithms. At the same time, these graphs are also getting very sparse in nature. This calls for particular work efficient solutions aimed at processing large, sparse graphs on modern parallel architectures. In this paper, we introduce graph pruning as a technique that aims to reduce the size of the graph. Certain elements of the graph can be pruned depending on the nature of the computation. Once a solution is obtained for the pruned graph, the solution is extended to the entire graph. We apply the above technique on three fundamental graph algorithms: breadth first search (BFS), Connected Components (CC), and All Pairs Shortest Paths (APSP). To validate our technique, we implement our algorithms on a heterogeneous platform consisting of a multicore CPU and a GPU. On this platform, we achieve an average of 35% improvement compared to state-of-the-art solutions. Such an improvement has the potential to speed up other applications that rely on these algorithms.

关键词： MULTIPROCESSOR ARCHITECTURE Line graph EFFICIENT parallel ALGORITHM parallel algorithms Graph algorithms Exploration connected component shortest path

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：