Attack graphs (AGs) are graphical tools to analyze the security of computer networks. By connecting the exploitation of individual vulnerabilities, AGs expose possible multi-step attacks against target networks, allow...
ISBN: (Digital) 9783982633619
Modeling the crucial dynamic properties of tissue simulations is computationally expensive due to the complex interactions between cells and their surrounding extracellular matrix (ECM). This work extends the Cellular Potts Model (CPM) with a dynamic ECM model to simulate viscoelastic mechanics. Our implementation leverages the NAStJA framework, a highly scalable system for distributed simulations. We assess the performance of our implementations on different HPC setups and analyze the scalability challenges of large-scale tissue simulations. GPU-accelerated simulations significantly reduce computation time but are limited by host-device communication.
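The core of a CPM simulation is a Metropolis loop that proposes copying a neighboring site's cell ID and accepts or rejects the copy based on the energy change. A minimal sketch, assuming an adhesion-only Hamiltonian on a small 2D lattice; the actual NAStJA implementation also includes volume constraints and the dynamic ECM coupling described above, and all names here are illustrative:

```python
import math
import random

def neighbors(i, j, n):
    """4-neighborhood of site (i, j), clamped to the n x n lattice."""
    return [(x, y) for x, y in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1))
            if 0 <= x < n and 0 <= y < n]

def adhesion_energy(lattice, n, J=1.0):
    """Boundary energy: J per pair of neighboring sites with different cell IDs."""
    e = 0.0
    for i in range(n):
        for j in range(n):
            for x, y in neighbors(i, j, n):
                if lattice[i][j] != lattice[x][y]:
                    e += J / 2  # each unordered pair is visited twice
    return e

def metropolis_step(lattice, n, T=1.0):
    """Attempt one spin copy: copy a random neighbor's cell ID into a random site.
    For clarity this recomputes the global energy; real codes compute a local delta."""
    i, j = random.randrange(n), random.randrange(n)
    x, y = random.choice(neighbors(i, j, n))
    if lattice[i][j] == lattice[x][y]:
        return False
    e_before = adhesion_energy(lattice, n)
    old = lattice[i][j]
    lattice[i][j] = lattice[x][y]
    d_e = adhesion_energy(lattice, n) - e_before
    if d_e <= 0 or random.random() < math.exp(-d_e / T):
        return True          # accept the copy
    lattice[i][j] = old      # reject: revert
    return False
```

Parallelizing this loop is what makes large-scale CPM hard: neighboring copy attempts conflict, which is why distributed implementations such as NAStJA partition the lattice into blocks updated with halo exchange.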
ISBN: (Print) 9783319698342
This book presents the latest, innovative research findings on P2P, parallel, Grid, Cloud, and Internet computing. It gathers the Proceedings of the 12th International Conference on P2P, Parallel, Grid, Cloud and Internet Computing, held on November 8-10, 2017 in Barcelona, Spain. These computing technologies have rapidly established themselves as breakthrough paradigms for solving complex problems by enabling the aggregation and sharing of an increasing variety of distributed computational resources at large scale. Grid computing originated as a paradigm for high-performance computing, offering an alternative to expensive supercomputers through different forms of large-scale distributed computing, while P2P computing emerged as a new paradigm after client-server and web-based computing and has proven useful in the development of social networking, B2B (Business to Business), B2C (Business to Consumer), B2G (Business to Government), B2E (Business to Employee), and so on. Cloud computing has been defined as a computing paradigm where the boundaries of computing are determined by economic rationale rather than technical limits. Cloud computing has quickly been adopted in a broad range of application domains and provides utility computing at large scale. Lastly, Internet computing is the basis of any large-scale distributed computing paradigm; it has very rapidly developed into a flourishing field with an enormous impact on today's information societies, serving as a universal platform comprising a large variety of computing forms such as Grid, P2P, Cloud and Mobile computing. The aim of the book Advances on P2P, Parallel, Grid, Cloud and Internet Computing is to provide the latest findings, methods and development techniques from both theoretical and practical perspectives, and to reveal synergies between these large-scale computing paradigms.
ISBN: (Digital) 9798331524937
ISBN: (Print) 9798331524944
Flow models over flat porous surfaces have applications in natural and industrial processes, such as material, food, and chemical processing, or mountain mudflow simulations. Simplified analytical or numerical models can predict characteristics such as velocity, pressure, deviation length, and even temperature of such flows for geophysical and engineering purposes. In this context, there is considerable interest in theoretical and experimental models, and mathematical models representing such phenomena in fluid mechanics have continuously been developed and implemented. Given this, we propose a mathematical and simulation model to describe a free flow parallel to a porous material and its transition zone. The objective is to analyze the influence of the porous matrix on the flow under different matrix properties. We implement a Computational Fluid Dynamics scheme using the Finite Volume Method to simulate and compute the numerical solutions for case studies. However, computational applications of this type demand high performance, requiring parallel execution techniques, which makes it necessary to modify the sequential version of the code. We therefore propose a methodology describing the steps required to adapt and improve the code. This approach decreases the execution time of the sequential version by 5.3%. Next, we adopt OpenMP for the parallel versions and instantiate parallel code flows and executions on multi-core processors, achieving a speedup of 10.4 with 12 threads. The paper provides simulations that support the correct understanding, modeling, and construction of abrupt transitions between free flow and porous media. The process presented here could extend to simulations of other porous-media problems. Furthermore, customized simulations require little processing time, thanks to parallel processing.
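The reported figures imply a parallel efficiency of roughly 87% on 12 threads. A short sketch of that arithmetic, using a hypothetical baseline time since the abstract reports only relative numbers:

```python
def speedup(t_seq, t_par):
    """Classic speedup: sequential time over parallel time."""
    return t_seq / t_par

def parallel_efficiency(s, n_threads):
    """Fraction of ideal linear scaling achieved."""
    return s / n_threads

# Hypothetical 1000 s baseline for illustration; absolute times are not reported.
t0 = 1000.0
t_opt = t0 * (1 - 0.053)   # sequential version improved by 5.3%
t_par = t_opt / 10.4       # reported OpenMP speedup of 10.4 on 12 threads

eff = parallel_efficiency(10.4, 12)   # ~= 0.87
combined = t0 / t_par                 # ~= 11x over the unoptimized sequential code
```

The combined figure shows why the sequential tuning step matters: the 5.3% reduction compounds with the OpenMP speedup.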
ISBN: (Print) 9798400710735
Sparse General Matrix Multiply (SpGEMM) is key for various High-Performance Computing (HPC) applications such as genomics and graph analytics. Using the semiring abstraction, many algorithms can be formulated as SpGEMM, allowing redefinition of addition, multiplication, and numeric types. Today, large input matrices require distributed-memory parallelism to avoid disk I/O, and modern HPC machines with GPUs can greatly accelerate linear algebra. In this paper, we implement a GPU-based distributed-memory SpGEMM routine on top of the CombBLAS library. Our implementation achieves a speedup of over 2× compared to the CPU-only CombBLAS implementation and up to 3× compared to PETSc for large input matrices. Furthermore, we note that communication between processes can be optimized by either direct host-to-host or device-to-device communication, depending on the message size. To exploit this, we introduce a hybrid communication scheme that dynamically switches data paths depending on the message size, thus improving runtimes in communication-bound scenarios.
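The semiring abstraction means an SpGEMM kernel only needs "add" and "multiply" as parameters. A compact, illustrative Python sketch of this idea (not the CombBLAS API, which operates on distributed 2D-partitioned matrices):

```python
def spgemm(A, B, add=lambda x, y: x + y, mul=lambda x, y: x * y):
    """Sparse-sparse multiply over a user-supplied semiring.
    A and B are row-major dicts: {row: {col: value}}."""
    C = {}
    for i, row in A.items():
        acc = {}
        for k, a in row.items():              # nonzeros of row i of A
            for j, b in B.get(k, {}).items(): # matching nonzeros of row k of B
                prod = mul(a, b)
                acc[j] = add(acc[j], prod) if j in acc else prod
        if acc:
            C[i] = acc
    return C

# Usage: the (min, +) semiring turns SpGEMM into one relaxation step of
# all-pairs shortest paths over the adjacency matrix A.
A = {0: {1: 2.0}, 1: {2: 3.0}}
two_hop = spgemm(A, A, add=min, mul=lambda x, y: x + y)
# two_hop[0][2] is 5.0: the length of the two-hop path 0 -> 1 -> 2
```

Swapping the `add`/`mul` callables is exactly what lets graph algorithms reuse one tuned SpGEMM kernel.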
ISBN: (Digital) 9798331531195
ISBN: (Print) 9798331531201
Edge computing has transformed machine learning by moving computation closer to the data sources, thereby reducing latency. The ever-increasing volume of data has necessitated forming clusters of edge devices, possibly with heterogeneous capabilities. Managing heterogeneous resources such as computation and memory remains challenging, and given the capabilities of edge devices, a simple technique suitable for an Edge computing cluster is needed. We introduce a scheduling mechanism that leverages Integer Linear Programming (ILP) to optimize the overall computation time of ML-based tasks. Our scheduling mechanism efficiently allocates resources, ensuring tasks are executed in parallel across the cores of edge devices in a cluster while minimizing computation time. For tasks too large to fit on any single core, we leverage distributed learning to train the model in pieces and later combine them. We employ our ILP-based scheduler for efficient task allocation and compare its performance with a simple Greedy approach based on a best-fit technique. We evaluate our approach on three sets of tasks spanning a spectrum of uniformity and size. Our results demonstrate a two-fold speed gain for the ILP-based approach over Greedy for the category with the least uniformity and largest size.
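The scheduling objective can be illustrated with a toy makespan minimization: the ILP seeks the assignment of task runtimes to cores that minimizes the maximum core load, while a greedy baseline places each task on the currently least-loaded core. A small sketch with an exhaustive search standing in for the ILP solver (all names and numbers are illustrative, not from the paper):

```python
from itertools import product

def makespan(assignment, tasks, n_cores):
    """Maximum total runtime on any core under the given task-to-core assignment."""
    loads = [0.0] * n_cores
    for core, t in zip(assignment, tasks):
        loads[core] += t
    return max(loads)

def optimal_schedule(tasks, n_cores):
    """Exhaustive stand-in for the ILP: minimize makespan over all assignments."""
    best = min(product(range(n_cores), repeat=len(tasks)),
               key=lambda a: makespan(a, tasks, n_cores))
    return best, makespan(best, tasks, n_cores)

def greedy_schedule(tasks, n_cores):
    """Best-fit baseline: place each task (largest first) on the least-loaded core."""
    loads = [0.0] * n_cores
    for t in sorted(tasks, reverse=True):
        loads[loads.index(min(loads))] += t
    return max(loads)
```

For example, tasks of [5, 4, 3, 3, 3] on 2 cores admit a perfect 9/9 split that the exhaustive search finds, while largest-first best-fit ends at a makespan of 10; this is the gap an ILP formulation closes, at the cost of solver runtime.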
The proceedings contain 147 papers from the 16th IASTED International Conference on Parallel and Distributed Computing and Systems. The topics discussed include: a grid simulation infrastructure supporting advance reservation; auction-based resource allocation protocols in grids; effectiveness of grid configurations on application performance; a constant-time shortest-path routing algorithm for pyramid networks; wormhole routers for network-on-chip; communication optimization on broadcast-based clusters; a localization algorithm extension for the evolvable sensor network; and migration algorithms for automated load balancing.
ISBN: (Digital) 9798331509859
ISBN: (Print) 9798331509866
Over the past few years, large language models (LLMs) have evolved to enable a wide range of applications, from natural language understanding to real-time conversational agents. However, deploying LLMs in production presents significant challenges, especially with regard to the low-latency responses that real-time interactions require. This work investigates multi-node inference architectures for optimized deployment using open-source frameworks, with scalability, flexibility, and cost-effectiveness in mind. We investigate methods such as microbatching, tensor and pipeline parallelism, and sophisticated load balancing that effectively distribute inference workloads across multiple nodes. We conduct extensive evaluations using popular open-source tools such as Kubernetes, Ray, and Envoy to benchmark the performance of these architectures in terms of latency, throughput, and resource utilization under diverse workloads. We also analyze the trade-offs between model replication and model partitioning, giving insights into the most appropriate configuration for various deployment scenarios. As our results show, a well-orchestrated multi-node setup can greatly reduce inference latency while preserving high throughput, enabling the deployment of sophisticated LLMs in latency-sensitive applications. This paper provides a detailed analysis of multi-node inference strategies and their integration into open-source ecosystems, and serves as a practical guide for practitioners seeking to deploy LLMs at scale. In summary, this work underlines how distributed architectures overcome some of the inherent limitations of single-node deployments and are crucial for achieving more efficient and responsive AI-driven services.
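Microbatching, one of the techniques evaluated above, reduces to splitting a request queue into bounded batches that can be dispatched to replicas independently. A minimal, illustrative sketch; production serving stacks such as Ray Serve add timeouts and dynamic batch sizing on top of this idea:

```python
from collections import deque

def microbatch(requests, max_batch):
    """Yield batches of at most max_batch requests from the queue, in order,
    so each batch can be dispatched without head-of-line blocking."""
    queue = deque(requests)
    while queue:
        batch = [queue.popleft() for _ in range(min(max_batch, len(queue)))]
        yield batch
```

With pipeline parallelism, keeping `max_batch` small is what lets successive microbatches overlap across pipeline stages instead of idling them.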