检索结果-内蒙古大学图书馆

MPCS 1994 - 1st international conference on massively parallel computing systems: The Challenges of General-Purpose and Special-Purpose computing

MPCS 1994 - 1st International Conference on Massively Parall...

引用

1st international conference on massively parallel computing systems: The Challenges of General-Purpose and Special-Purpose computing, MPCS 1994

ISBN: (纸本)0818663227

The proceedings contain 76 papers. The topics discussed include: list scheduling: alone, with foresight, and with lookahead;massively parallel computing systems with real time constraints the 'algorithm architecture adequation' methodology;systolic-type implementation of matrix computations based on the Faddeev algorithm;architecture and realization of the modular expandable multiprocessor system MEMSY;communications is more than I/O;avoiding memory contention on tightly coupled multiprocessors;cache coherence in a multiport memory environment;stochastic modeling of multiprocessor reliability;comparing architectures using throughput-versus-cost modeling;optimal triple modular redundancy embeddings in the hypercube;and general purpose massively parallel systems: the role of programming environments.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Proceedings of the First international conference on massively parallel computing systems (MPCS) The Challenges of General-Purpose and Special-Purpose computing

Proceedings of the First International Conference on Massive...

引用

international conference on massively parallel computing systems

Presents the front cover of the proceedings record.

关键词：

来源：评论

学校读者我要写书评

暂无评论

PAC: computing Join Queries with Semi-Covers 28

PAC: Computing Join Queries with Semi-Covers

引用

28th international conference on Database Theory, ICDT 2025

作者： Aamer, Heba Ketsman, Bas Vrije Universiteit Brussel Belgium

ISBN: (纸本)9783959773645

An increased and growing interest in large-scale data processing has triggered a demand for specialized algorithms that thrive in massively parallel shared-nothing systems. To answer the question of how to efficiently compute join queries in this setting, a rich line of research has emerged specifically for the massively parallel Communication (MPC) model. In the MPC model, algorithms are executed in rounds, with each round consisting of a synchronized communication phase and a separate local computation phase. The main cost measure is the load of the algorithm, defined as the maximum number of messages received by any server in any round. We study worst-case optimal algorithms for the join query evaluation problem in the constant-round MPC model. In the single-round variant of MPC, the worst-case optimal load for this problem is well understood and algorithms exist that guarantee this load for any join query. In the constant-round variant of MPC, queries can often be computed with a lower load compared to the single-round variant, but the worst-case optimal load is only known for specific classes of join queries, including graph-like and acyclic join queries, and the associated algorithms use very different techniques. In this paper, we propose a new constant-round MPC algorithm for computing join queries. Our algorithm is correct for every join query and its load matches (up to a polylog factor) the worst-case optimal load for at least all join queries that are acyclic or graph-like. © Heba Aamer and Bas Ketsman.

关键词： Structured Query Language

来源：评论

学校读者我要写书评

暂无评论

Enhancing trusted computing operation efficiency based on parallel file verification techniques 4

Enhancing trusted computing operation efficiency based on pa...

引用

4th international conference on Network Communication and Information Security, ICNCIS 2024

作者： Zhang, Zhilong Pan, Fengyan Hou, Haibo Li, Yuan Shanghai Jiaotong University Shanghai China Beijing Chenguang Rongxin Technology Co. Beijing China

ISBN: (数字)9781510688261

ISBN: (纸本)9781510688254

Trusted computing technology represents a significant element of cyber security systems, serving to guarantee the integrity and accessibility of data and systems. The incorporation of Trusted computing introduces a series of security detection and verification mechanisms to the host system, consequently impacting the performance of host system startup and application operation. In this paper, following an analysis of the verification mechanisms of Trusted computing, a parallel verification mechanism is proposed. The objective is to facilitate the parallel operation of the host system program and the TPM-based file verification process, thereby enhancing the operational efficiency of the trusted computing system. Through the configurable and micro-intervention design scheme, flexible expansion and configuration within the existing Trusted computing technology framework can be achieved. © 2025 SPIE.

关键词： computing systems Design Computer security Process control systems modeling Mathematical optimization Operating systems

来源：评论

学校读者我要写书评

暂无评论

Fast Deterministic massively parallel Ruling Sets Algorithms 25

Fast Deterministic Massively Parallel Ruling Sets Algorithms

引用

26th international conference on Distributed computing and Networking, ICDCN 2025

作者： Ji, Hongyan Kothapalli, Kishore Pemmaraju, Sriram V Singh, Ajitanshu Department of Computer Science The University of Iowa Iowa CityIA United States Center for Security Theory and Algorithmic Research IIIT Hyderabad Telangana Hyderabad India

ISBN: (纸本)9798400710629

In this paper, we present a deterministic Õ(log1/3)-round algorithm for the 2-ruling set problem in the sublinear massively parallel Computation (MPC) model. This improves upon the fastest known deterministic 2-ruling set algorithm for this model, which is the Õ(√log;n)-round algorithm by Giliberti and Parsaeian (PODC 2024). Our result is obtained by derandomizing the "sample-and-gather"approach of Kothapalli, Pai, and Pemmaraju (FSTTCS 2020). The "sample-and-gather"approach involves making random sampling decisions, not just for the current iteration, but a batch of future iterations. Thus, derandomizing this approach requires the "fixing"of randomness for a batch of future iterations. We further extend our results to show that a β-ruling set for β ≥ 2 can be obtained in Õ(log1/2β - 1 Δ) deterministic rounds in the sublinear MPC model. Additionally, we present a deterministic β-ruling set algorithms for sparse graphs (i.e., bounded arboricity graphs) where β ≥ 2, which runs in Õ(log1/2β - 1 λ) rounds for arboricity-λ graphs in the sublinear MPC model. © 2025 Copyright held by the owner/author(s).

关键词： Consensus algorithm

来源：评论

学校读者我要写书评

暂无评论

Innovations in mathematical modeling, AI, and optimization techniques

引用

JOURNAL OF SUPERcomputing 2025年第1期81卷 1-4页

作者： Ohue, Masahito Yasuo, Nobuaki Takata, Masami Inst Sci Tokyo Sch Comp Dept Comp Sci Yokohama Kanagawa 2268501 Japan Inst Sci Tokyo Acad Convergence Mat & Informat TAC MI Tokyo 1528550 Japan Nara Womens Univ Res Grp Informat & Commun Technol Life Nara 6308506 Japan

This special issue is dedicated to examining the rapidly evolving fields of artificial intelligence, mathematical modeling, and optimization, with particular emphasis on their growing importance in computational science. It features the most notable papers from the "Mathematical Modeling and Problem Solving" workshop at PDPTA'24, the 30th international conference on parallel and Distributed Processing Techniques and Applications. The issue showcases pioneering research in areas such as natural language processing, system optimization, and high-performance computing. The nine selected studies include novel AI-driven methods for chemical compound generation, historical text recognition, and music recommendation, along with advancements in hardware optimization through reconfigurable accelerators and vector register sharing. Additionally, evolutionary and hyper-heuristic algorithms are explored for sophisticated problem-solving in engineering design, and innovative techniques are introduced for high-speed numerical methods in large-scale systems. Collectively, these contributions demonstrate the significance of AI, supercomputing, and advanced algorithms in driving the next generation of scientific discovery.

关键词： Mathematical modeling Artificial intelligence parallel and distributed computing Reconfigurable computing Drug discovery

来源：评论

学校读者我要写书评

暂无评论

Introducing Novel parallel computing Using Orbital Data 37th

Introducing Novel Parallel Computing Using Orbital Data

引用

37th international conference on Computer Applications in Industry and Engineering

作者： Mekhiel, Nagi Toronto Metropolitan Univ Dept Elect Comp & Biomed Engn Toronto ON Canada

ISBN: (纸本)9783031762727;9783031762734

We propose a novel parallel computing that allows processors to access data in predictable time without the need to access it from different locations in memory using addresses. It uses orbital data that is mapped to time and is made available to multiple processors at the same time in multiple different orbits and at a specific predictable time in each orbit. This allows processors in different orbits to share the same data, eliminating the problem of sharing data at the same time among multiple processors. It provides processors with the ability to hide the waiting time when accessing shared data by overlapping it with useful work on another data while allowing other processors to work on the shared data in another orbit. The performance of this novel method shows significant improvements in scalability compared to that of conventional parallel computing.

关键词： Data accesses Computer models parallel Computers Memory systems Scalability of parallel computing

来源：评论

学校读者我要写书评

暂无评论

massively parallel Maximum Coverage Revisited 50th

Massively Parallel Maximum Coverage Revisited

引用

50th international conference on Current Trends in Theory and Practice of Computer Science, SOFSEM 2025

作者： Bui, Thai Vu, Hoa T. San Diego State University San DiegoCA92182 United States

ISBN: (纸本)9783031826696

We study the maximum set coverage problem in the massively parallel model. In this setting, m sets that are subsets of a universe of n elements are distributed among m machines. In each round, these machines can communicate with each other, subject to the memory constraint that no machine may use more than O~n memory. The objective is to find the k sets whose coverage is maximized. We consider the regime where k=Ω(m) (i.e., k=m/100), m=O(n), and each machine has O~n memory1. Maximum coverage is a special case of the submodular maximization problem subject to a cardinality constraint. This problem can be approximated to within a 1-1/e factor using the greedy algorithm, but this approach is not directly applicable to parallel and distributed models. When k=Ω(m), to obtain a 1-1/e-ϵ approximation, previous work either requires O~mn memory per machine which is not interesting compared to the trivial algorithm that sends the entire input to a single machine, or requires 2O(1/ϵ)n memory per machine which is prohibitively expensive even for a moderately small value ϵ. Our result is a randomized (1-1/e-ϵ)-approximation algorithm that uses O(1/ϵ3·logm·(log(1/ϵ)+logm))rounds. Our algorithm involves solving a slightly transformed linear program of the maximum coverage problem using the multiplicative weights update method, classic techniques in parallel computing such as parallel prefix, and various combinatorial arguments. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Approximation algorithms

来源：评论

学校读者我要写书评

暂无评论

Enhancing QR Decomposition: A GPU-Based Approach to parallelizing the Householder Algorithm with CUDA Streams 21st

Enhancing QR Decomposition: A GPU-Based Approach to Parallel...

引用

21st international conference on Distributed computing and Intelligent Technology

作者： Eshwar, Uppu Chatterjee, Soumyajit Peri, Sathya Indian Inst Technol Hyderabad Hyderabad 502285 Telangana India

ISBN: (纸本)9783031814037;9783031814044

Linear algebra algorithms, such as the Householder QR decomposition, are pivotal in various applications including signal processing, optimization, and numerical solutions to systems of linear equations. Traditional sequential implementations of the Householder algorithm face significant limitations in terms of performance and scalability when applied to large matrices. To overcome these constraints, this paper explores the parallelization of the Householder QR algorithm on Graphics Processing Units (GPUs) using CUDA, a parallel computing platform and programming model developed by NVIDIA. Our method ensures the availability of critical intermediate data, distinguishing it from standard libraries like cuSOLVER, which modify the processing order and often discard important intermediate computations. By leveraging CUDA streams, we achieve enhanced parallelism without compromising the integrity of the algorithm's sequence or the accessibility of intermediate data. Our performance analysis reveals that our implementation achieves efficiency comparable to cuSOLVER, making it a viable option. This study not only presents a novel implementation but also extends the potential for GPU-accelerated linear algebra procedures to benefit a wider range of scientific and engineering applications.

关键词： QR Decomposition parallel computing General Purpose GPU programming CUDA streams

来源：评论

学校读者我要写书评

暂无评论

A Framework for Reproducible parallel DNA String Matching

A Framework for Reproducible Parallel DNA String Matching

引用

18th international Joint conference on Biomedical Engineering systems and Technologies, BIOSTEC 2025

作者： Chaves, Ricardo Regis Cavalcante Alves de Melo, Alba Cristina Magalhaes Dep. of Computer Science Campus Universitario Darcy Ribeiro University of Brasilia Brasilia Brazil

In this paper, we propose an output reproducible framework that executes parallel sequence comparison algo rithms, computing the edit distance. The framework generates tables/graphics and linear regressions that can be used to predict the execution times. We also propose parallel OpenMP versions of serial algorithms (DP and UK)used to compute the edit distance. Our parallel DP is antidiagonal block-based, where the blocks that belong to the same set of antidiagonals are assigned to different threads, which compute them simultaneously. Due to data dependencies presented by UK, we opted to compute each antidiagonal in parallel. Our results with synthetic and real sequences show that the parallel UK version presents the best execution times in most cases. We also show that the linear regressions generated by our tool have errors below 10%, on average. © 2025 by SCITEPRESS– Science and Technology Publications, Lda.

关键词： Linear regression

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：