检索结果-内蒙古大学图书馆

30th international conference on High Performance computing, Data, and Analytics (HiPC)

作者： Sayed, Zainul Abideen Zola, Jaroslaw Univ Buffalo Buffalo NY 14260 USA

ISBN: (纸本)9798350383225

We propose SCoOL, a programming model and its corresponding parallel runtime systems for implementing optimization problem solvers. In SCoOL, users specify what task is performed for a point in a given search space, and what global information should be maintained during the search. The resulting optimization program is then efficiently executed in a BSP-style on a shared or distributed memory computers by a parallel runtime provided with the model. In the paper, we show details of our scalable runtime for distributed memory clusters, including algorithms for work stealing and tasks rebalancing. To benchmark the platform, we implement solutions to several optimization problems and provide performance analysis for Quadratic Assignment Problem, Parent Set Assignment, and Bayesian Networks Structure Learning. Our solvers show strong scaling on a cluster with 1,280 cores, significantly outperforming the current state-of-the-art solvers in Bayesian networks learning.

关键词： parallel programming model runtime system optimization problems

来源：评论

学校读者我要写书评

暂无评论

Tangram: High-resolution Video Analytics on Serverless Platform with SLO-aware Batching 44

Tangram: High-resolution Video Analytics on Serverless Platf...

引用

44th IEEE international conference on distributed computing Systems (ICDCS)

作者： Peng, Haosong Zhan, Yufeng Li, Peng Xia, Yuanqing Beijing Inst Technol Sch Automat Beijing Peoples R China Univ Aizu Sch Comp Sci & Engn Aizu Wakamatsu Fukushima Japan

ISBN: (纸本)9798350386066;9798350386059

Cloud-edge collaborative computing paradigm is a promising solution to high-resolution video analytics systems. The key lies in reducing redundant data and managing fluctuating inference workloads effectively. Previous work has focused on extracting regions of interest (RoIs) from videos and transmitting them to the cloud for processing. However, a naive Infrastructure as a Service (IaaS) resource configuration falls short in handling highly fluctuating workloads, leading to violations of Service Level Objectives (SLOs) and inefficient resource utilization. Besides, these methods neglect the potential benefits of RoIs batching to leverage parallel processing. In this work, we introduce Tangram, an efficient serverless cloud-edge video analytics system fully optimized for both communication and computation. Tangram adaptively aligns the RoIs into patches and transmits them to the scheduler in the cloud. The system employs a unique "stitching" method to batch the patches with various sizes from the edge cameras. Additionally, we develop an online SLO-aware batching algorithm that judiciously determines the optimal invoking time of the serverless function. Experiments on our prototype reveal that Tangram can reduce bandwidth consumption and computation cost up to 74.30% and 66.35%, respectively, while maintaining SLO violations within 5% and the accuracy loss negligible.

关键词： video analytics batching inference serverless computing

来源：评论

学校读者我要写书评

暂无评论

FlexRoute: A Fast, Flexible and Priority-Aware Packet-Processing Design 32

FlexRoute: A Fast, Flexible and Priority-Aware Packet-Proces...

引用

32nd Euromicro international conference on parallel, distributed and Network-Based Processing (PDP)

作者： Zyla, Klajd Liess, Marco Wild, Thomas Herkersdorf, Andreas Tech Univ Munich Chair Integrated Syst Munich Germany

ISBN: (纸本)9798350363074;9798350363081

As the world becomes more connected and new digital services emerge at a fast pace, the amount of network traffic increases rapidly. Consequently, processing requirements become more varied and drive the need for flexible packet-processing designs, especially as in-network computing gains traction. Traditional approaches deploy hardware accelerators in a pipeline in the sequence that the associated tasks are supposed to be executed. Hence, they do not accommodate flows with different processing requirements and provide no possibility to remap flows to task sequences in runtime. In order to address these limitations, we propose FlexRoute, a fast, flexible and priority-aware packet-processing design that can process network traffic at a rate of over 100 Gbit/s on FPGAs. Our design consists of a reconfigurable parser and several processing engines that are arranged in a pipeline. The processing engines are equipped with processing units that execute specific tasks, flexible forwarding logic and priority-aware queuing/scheduling logic. We implement a prototype of FlexRoute in Verilog and evaluate it via cycle-accurate register-transfer level simulations. We also synthesize and implement our design on the Alveo U55C High Performance Compute Card and show its resource usage. The evaluation results demonstrate that FlexRoute can process packets of arbitrary size with different processing requirements at a traffic rate of about 70 Gbit/s significantly faster than two state-of-the-art flexible packet-processing designs.

关键词： In-network computing Packet processing Flexibility Scheduling SDN

来源：评论

学校读者我要写书评

暂无评论

Research on Power Big Data Fusion Method Based on Graph Convolutional Neural Network 1

Research on Power Big Data Fusion Method Based on Graph Conv...

引用

1st international Symposium on parallel computing and distributed Systems, PCDS 2024

作者： Ji, Runyang State Grid Nantong Power Supply Company Jiangsu Nantong China

ISBN: (纸本)9798350349658

In recent years, in order to accelerate the construction of new power systems, power companies have carried out a series of information system construction, generating a massive amount of power related business data, which brings problems such as low query efficiency and inaccurate results to the real-time query performance of the system. Therefore, this article proposes a power big data fusion method based on graph convolutional neural networks, using association rule algorithms to horizontally connect various fragmented basic wide tables, indicators, and labels related to the perspective of power business, and aggregate them to form a logical business view. The association integrates more data information of power business objects, providing data support for power big data analysis, mining, and inference. © 2024 IEEE.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

Research on distributed streaming parallel computing of large scale wind DFIGs from the perspective of Ecological Marxism

引用

ENERGY REPORTS 2022年 8卷 304-312页

作者： Xue, He Beijing Foreign Studies Univ Sch Marxism Beijing 100089 Peoples R China Beijing Foreign Studies Univ Res Ctr Cooperat Innovat Socialist Theory Chinese Beijing 100089 Peoples R China

Extreme climate change is the major ecological crisis which mankind encounters at present. The core of coping with climate change is to reduce greenhouse gas emissions, among which is mainly carbon dioxide emissions from fossil energy combustion. Energy substitution of large-scale new energy power generation is an effective solution to the problems. How to simulate and calculate such a large number of wind power generations for planning or stability analysis is a technical challenge, especially the detailed research on the interactive characteristics for hundreds of models with IGBT converter. In this technical field, inspired by the decentralized and distributed ideas of Ecological Marxism theory, decoupling and grid division calculation is carried out for the electromagnetic transient model of a large number of DFIGs. Through the design and development of the data-driven large-scale parallel computing framework, multi-stage high-speed task pipelined parallel calculation is realized. It effectively solves the calculation difficulties in matrix dimension disaster caused by the number increase of wind turbines, theoretically breaks the ceiling effect of fine-grain simulation for large-scale wind power generations, and also waveforms comparison verifies the feasibility and correctness. (C) 2022 The Author(s). Published by Elsevier Ltd.

关键词： Streaming parallel computing distributed wind generation Ecological Marxism

来源：评论

学校读者我要写书评

暂无评论

Multilevel Load Balancing Algorithm for Domestic Heterogeneous Manycore Architecture 22

Multilevel Load Balancing Algorithm for Domestic Heterogeneo...

引用

22nd IEEE international Symposium on parallel and distributed Processing with Applications, ISPA 2024

作者： Ma, Yi Chen, Xin Guo, Heng Li, Fang Liu, Xin Tsinghua University Beijing China National Research Center of Parallel Computer Engineering and Technology Beijing China

ISBN: (纸本)9798331509712

Load imbalance often occurs in particle-in-cell simulations on parallel computing, which seriously affects the efficiency of applications. Due to the characteristics of multilevel parallelism and communication asymmetry of compute nodes in domestic heterogeneous manycore architecture, the impact of load imbalance is more prominent. The paper proposes a multilevel load-balancing algorithm for domestic heterogeneous manycore architecture. Inside the supernode, computing tasks are redivided based on manycore acceleration. Between the supernodes, a greedy-based communication mode is designed to minimize communication across supernodes. The experimental results show that the proposed algorithm achieves almost ideal dynamic load balance, and improves the performance of the evaporation module in two-phase flow simulation by 10.9-19.7 times for the 50 million-sized grid. © 2024 IEEE.

关键词： parallel architectures

来源：评论

学校读者我要写书评

暂无评论

Research on Micro grid Optimization based on Multi-Agent Reinforcement Learning Algorithm

Research on Micro Grid Optimization based on Multi-Agent Rei...

引用

2024 international conference on Optimization computing and Wireless Communication, ICOCWC 2024

作者： Zhang, Bin Zhou, Zongchuan Han, Yiming Hu, Zhibin Ma, Junxian Economic and Technological Research Institute of State Grid Ningxia Electric Power Co. Ltd Yinchuan750004 China

ISBN: (纸本)9798350383348

In modern society, the energy problem has become increasingly prominent. In order to achieve sustainable and efficient energy utilization, microgrid technology came into being. Microgrid is a small power system with autonomous control, protection and management functions, which can realize flexible scheduling among distributed energy sources, energy storage devices and loads © 2024 IEEE.

关键词： micro grid multi-agent reinforcement learning algorithm Optimized Reinforcement learning theory Smart

来源：评论

学校读者我要写书评

暂无评论

Distributing Compilation to Enable High Throughput Scalable Quantum Workloads 5

Distributing Compilation to Enable High Throughput Scalable ...

引用

2024 international conference on Quantum computing and Engineering

作者： Waring, Harry J. Oxford Quantum Circuits Reading Berks England

ISBN: (纸本)9798331541378

The increasing quality and availability of Quantum Processing Units (QPUs) is fueling a growing interest in quantum computing across many technological areas. The resulting increase in demand for QPU resources necessitates Quantum computing as a Service (QCaaS) providers to support a high throughput of quantum workloads. A major runtime bottleneck in current QCaaS software stacks is the computationally-intensive compilation step which requires significant compute. To address this, Oxford Quantum Circuits has introduced distributed compilation whereby quantum programs are compiled in parallel and stored until the QPU is available. This has replaced our previous serial compilation approach where each program was compiled immediately prior to execution. From experiments using our production compilers and a simulated backend representing the QPU, we show that distributed compilation has resulted in a 78% reduction in processing time as compared to serial compilation. This demonstrates that there are sizeable performance gains to program throughput attainable through the introduction of distributed compilation into a QCaaS architecture. We posit that the usefulness of this feature will only grow given the increasing complexity of quantum programs and the growing popularity of quantum -classical hybrid algorithms.

关键词： distributed Compilation Quantum computing Quantum computing as a Service Quantum Systems Software

来源：评论

学校读者我要写书评

暂无评论

A Motion Trace Decomposition-based overset grid method for parallel CFD simulations with moving boundaries 24

A Motion Trace Decomposition-based overset grid method for p...

引用

53rd international conference on parallel Processing (ICPP)

作者： Zhao, Ran Li, Chao Guo, Xiaowei Zhang, Sen Yang, Xi Tang, Tao Yang, Canqun Natl Univ Def Technol Coll Comp Sci & Technol Changsha Hunan Peoples R China

ISBN: (纸本)9798400717932

The overset grid method is widely employed to solve moving boundary problems in numerical simulations. However, the heavy and inevitable communication resulting from boundary movements severely impedes the improvement of parallel efficiency. This paper proposes a Motion Trace Decomposition (MTD) method to alleviate this issue. The MTD method minimizes communication overhead between processors by decomposing sub-grids and distributing them according to the object motion trajectory, negating the need to reproduce communication areas when boundaries move. Various tests were conducted to evaluate the MTD method, incorporating diverse motion types, such as displacement and rotation. Results from experimental simulations with 1.9 x 10(6) grid cells indicate that the proposed method enhances the parallel efficiency of the assembly process by up to 20.35% using 72 processors. These findings showcase the significant potential of the MTD method in alleviating communication challenges associated with simulating moving boundary problems using overset grids.

关键词： Overset grid method Motion Trace Decomposition Communication reduction parallel computing

来源：评论

学校读者我要写书评

暂无评论

Fuzzy Based Decision-Making of Energy Management for Regional Energy Systems with Renewable Generation and Installed Storage Systems 3

Fuzzy Based Decision-Making of Energy Management for Regiona...

引用

3rd IEEE international conference on distributed computing and Electrical Circuits and Electronics, ICDCECE 2024

作者： Cao, Weijie Jiang, Hao Lv, Da Qing, Hua Jiang, Zhenghong Luo, Yaogang State Grid Ninghai Power Supply Company Ningbo China State Grid Ningbo Electric Power Supply Company Ningbo China

ISBN: (纸本)9798350318609

The electricity system with penetration of a massive number of renewable generation sources needs to consider various demands, such as power generation efficiency, the service life of energy storage devices, and its impact on the main grid. Therefore, the control of various distributed units in microgrids is a comprehensive multi-objective problem. Taking into account the natural indirectness of new energy, the randomness of predicted loads, and the time-varying nature of electricity prices, this paper proposes an optimized fuzzy control (OFC) system to achieve power flow optimization of microgrids and maximize comprehensive benefits. Simulation experiments show that the proposed optimized fuzzy control strategy can better achieve microgrid power flow control and meet the requirements of multi-objective optimization. © 2024 IEEE.

关键词： Energy management

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：