检索结果-内蒙古大学图书馆

8th IEEE international conference on Big Data (Big Data)

作者： Goetz, Markus Debus, Charlotte Coquelin, Daniel Krajsek, Kai Comito, Claudia Knechtges, Philipp Hagemeier, Bjorn Tarnawa, Michael Hanselmann, Simon Siggel, Martin Basermann, Achim Streit, Achim German Aerosp Ctr Inst Software Technol SC Cologne Germany Forschungszentrum Julich Inst Rio & Geosci Agrosphere IBG 3 Julich Germany Forschungszentrum Julich FZJ Julich Supercomp Ctr JSC Julich Germany Karlsruhe Inst Technol KIT Steinbuch Ctr Comp SCC Karlsruhe Germany

ISBN: (纸本)9781728162515

To cope with the rapid growth in available data, the efficiency of data analysis and machine learning libraries has recently received increased attention. Although great advancements have been made in traditional array-based computations, most are limited by the resources available on a single computation node. Consequently, novel approaches must be made to exploit distributed resources, e.g. distributed memory architectures. To this end, we introduce IleAT, an array-based numerical programming framework for large-scale parallel processing with an easy-to-use NumPy-like API. HeAT utilizes PyTorch as a node-local eager execution engine and distributes the workload on arbitrarily large high-performance computing systems via MPI. It provides both low-level array computations, as well as assorted higher-level algorithms. With HeAT, it is possible for a NumPy user to take full advantage of their available resources, significantly I owering the bartier to distributed data analysis. When compared to similar frameworks, HeAT achieves speedups of up to two orders of magnitude.

关键词： IleAT Tensor Framework High-performance computing PyTorch NumPy Message Passing Interface CPU Rig Data Analytics Machine Learning Dask Model parallelism parallel Application Frameworks

来源：评论

学校读者我要写书评

暂无评论

An efficient bi-objective optimization workflow using the distributed quasi-Newton method and its application to field development optimization

An efficient bi-objective optimization workflow using the di...

引用

SPE Reservoir Simulation conference 2021, RSC 2021

作者： Wang, Yixuan Alpak, Faruk Gao, Guohua Chen, Chaohui Vink, Jeroen Wells, Terence Saaf, Fredrik Shell Exploration and Production Company Shell International Exploration and Production Co. Shell Global Solution US Inc. Shell Global Solutions International BV

ISBN: (纸本)9781613997475

Although it is possible to apply traditional optimization algorithms to determine the Pareto front of a multiobjective optimization problem, the computational cost is extremely high, when the objective function evaluation requires solving a complex reservoir simulation problem and optimization cannot benefit from adjoint-based gradients. This paper proposes a novel workflow to solve bi-objective optimization problems using the distributed quasi-Newton (DQN) method, which is a well-parallelized and derivative-free optimization (DFO) method. Numerical tests confirm that the DQN method performs efficiently and robustly. The efficiency of the DQN optimizer stems from a distributed computing mechanism which effectively shares the available information discovered in prior iterations. Rather than performing multiple quasi-Newton optimization tasks in isolation, simulation results are shared among distinct DQN optimization tasks or threads. In this paper, the DQN method is applied to the optimization of a weighted average of two objectives, using different weighting factors for different optimization threads. In each iteration, the DQN optimizer generates an ensemble of search points (or simulation cases) in parallel and a set of non-dominated points is updated accordingly. Different DQN optimization threads, which use the same set of simulation results but different weighting factors in their objective functions, converge to different optima of the weighted average objective function. The non-dominated points found in the last iteration form a set of Pareto optimal solutions. Robustness as well as efficiency of the DQN optimizer originates from reliance on a large, shared set of intermediate search points. On the one hand, this set of searching points is (much) smaller than the combined sets needed if all optimizations with different weighting factors would be executed separately;on the other hand, the size of this set produces a high fault tolerance. Even if some simulati

关键词： Efficiency

来源：评论

学校读者我要写书评

暂无评论

Data-Adapted parallel Merge Sort 25th

Data-Adapted Parallel Merge Sort

引用

25th international conference on parallel and distributed computing (Euro-Par)

作者： Holke, Johannes Ruettgers, Alexander Klitz, Margrit Basermann, Achim German Aerosp Ctr DLR Dept High Performance Comp Inst Software Technol D-51147 Cologne Germany

ISBN: (纸本)9783030483401;9783030483395

In the aerospace sciences we produce huge amounts of data. This data must be arranged in a meaningful order, so that we can analyze or visualize it. In this paper we focus on data that is distributed among computer processes and then needs to be sorted by a single root process for further analysis. We assume that the memory on the root process is too small to hold all sorted data at once, so that we have to perform the sorting and processing of data chunk-wise. We prove the efficiency of our approach in weak scaling tests, where we achieve a near constant bandwidth. Additionally, we obtain a considerable speed up compared to the standard parallel external sort. We also demonstrate the usefulness of our algorithm in a real-life aviation application.

关键词： parallel sorting High-performance computing Merge sort Data analysis Aerospace sciences

来源：评论

学校读者我要写书评

暂无评论

Load Balance-Centric distributed parallel Routing for Large-Scale FPGAs

Load Balance-Centric Distributed Parallel Routing for Large-...

引用

international conference on Field Programmable Logic and Applications

作者： Minghua Shen Nong Xiao School of Computer Science and Engineering Sun Yat-sen University Guangzhou China

Routing is one of the most time-consuming stages in the FPGA design flow. parallelization can accelerate the routing process but suffering from load imbalance, further resulting in a low scalability. In this paper, we propose a load balance-centric parallel router in a distributed computing environment. First, we explore regular and irregular region partitioning so that routing tasks are assigned to different cores for static load balance before parallel routing. Second, we explore message propagation and task migration between underloaded and overloaded cores so that load balance can be dynamically maintained at parallel routing runtime. Finally, we demonstrate the effectiveness of the parallel router using large-scale Titan designs. Experimental results show that our parallel router achieves about 17 × speedup on average using 32 cores, compared with VTR 8 router.

关键词： Degradation Runtime Scalability Routing Task analysis distributed computing Field programmable gate arrays

来源：评论

学校读者我要写书评

暂无评论

Research and Design of Virtual Terminal for New Generation Electricity Information Collection System

Research and Design of Virtual Terminal for New Generation E...

引用

Electronics and Devices, Computational Science (ICEDCS), international conference on

作者： Yukun Xu Feng Huang Chao Jiang Shuang Xiao State Grid Shanghai Electric Power Shanghai China

ISBN: (数字)9781665455411

ISBN: (纸本)9781665455428

Design a power consumption information acquisition system simulation electric energy meter, use communication technology, computer technology and automatic control technology to monitor and manage the power load comprehensive system, collect, process and real-time monitor the power consumption information of power users, realize the use of Automatic collection of electrical information, monitoring of abnormal metering, power quality monitoring, power consumption analysis and management, related information release, distributed energy monitoring, information exchange of intelligent electrical equipment and other functions. The system user interface integrates application logic through services, improves the data coordination ability of the system, effectively reduces the data access load, and improves the system expansion ability through load balancing.

关键词： Meters Power demand Scientific computing User interfaces Planning Systems simulation Resource management

来源：评论

学校读者我要写书评

暂无评论

New challenges for distributed computing at the CMS experiment

引用

JOURNAL OF INSTRUMENTATION 2020年第7期15卷

作者： Krammer, N. Austrian Acad Sci Inst High Energy Phys Vienna Austria

The Large Hadron Collider (LHC) experiments soon step into the next period of run-3 data taking with an increased data rate and high pileup requiring an excellent working computing infrastructure. In the future High-Luminosity LHC (HL-LHC) data-taking period, the compute, storage and network facilities have to be further extended by large factors and flexible and sophisticated computing models are essential. New techniques of modern state-of-the-art methods in physics analysis and data science, Deep Learning and Big Data tools, are crucial to handle high-dimensional and more complex problems. Beside flexible cloud computing technologies the usage of High Performance computing (HPC) at the LHC experiments are explored. In this presentation, I will discuss the LHC run-3 and future HL-LHC runs computing technologies and the utilization of modern physics analysis and data science methods for the increasing and complex demands of large-scale scientific computing.

关键词： computing (architecture, farms, grid for recording, storage, archiving, and distribution of data) Analysis and statistical methods Data processing methods

来源：评论

学校读者我要写书评

暂无评论

Aggregation Control for distributed Energy Storage in Distribution Network

Aggregation Control for Distributed Energy Storage in Distri...

引用

2020 international conference on Intelligent computing, Automation and Systems, ICICAS 2020

作者： Li, Ping Wang, Wende Sun, Feng Zhang, Xiaotong Wang, Zihe Wang, Shiyuan Ye, Peng State Grid Liaoning Electric Power Co. Ltd. Electric Power Research Institue Shenyang110006 China State Grid Liaoning Electric Power Co. Ltd. Shenyang110006 China Shenyang Institute of Engineering Shenyang110006 China

ISBN: (纸本)9781728191461

At present, large amount distributed energy storages (DESs) connected to the distribution network lack of effective scheduling methods. An centralized control strategy of DESs with random access and output can be utilized to realize the aggregation control of large amount DESs, which can improve the stability, efficiency and economy of the distribution network. Due to the application of DESs have not yet been configured in large amounts, the research on the aggregation of DES is few. Based on the aggregation mechanism of DES, this paper proposes an aggregation control algorithm of DESs. The simulation explains the specific method of the proposed aggregation control strategy. © 2020 IEEE.

关键词： Energy storage

来源：评论

学校读者我要写书评

暂无评论

GRAPHTM: An Efficient Framework for Supporting Transactional Memory in a distributed Environment 20

GRAPHTM: An Efficient Framework for Supporting Transactional...

引用

21st international conference on distributed computing and Networking (ICDCN)

作者： Poudel, Pavan Sharma, Gokarna Kent State Univ Kent OH 44242 USA

ISBN: (纸本)9781450377515

In this paper, we present GRAPHTM, an efficient and scalable framework for processing transactions in a distributed environment. The distributed environment is modeled as a graph where each node of the graph is a processing node that issues transactions. The objects that transactions use to execute are also on the graph nodes (the initial placement may be arbitrary). The transactions execute on the nodes which issue them after collecting all the objects that they need following the data-flow model of computation. This collection is done by issuing the requests for the objects as soon as transaction starts and wait until all required objects for the transaction come to the requesting node. The challenge is on how to schedule the transactions so that two crucial performance metrics, namely (i) total execution time to commit all the transactions, and (ii) total communication cost involved in moving the objects to the requesting nodes, are minimized. We implemented GRAPHTM in Java and assessed its performance through 3 micro-benchmarks and 5 complex benchmarks from STAMP benchmark suite on 5 different network topologies, namely, clique, line, grid, cluster, and star, that make an underlying communication network for a representative set of distributed systems commonly used in practice. The results show the efficiency and scalability of our approach.

关键词： distributed system transactional memory scheduling execution time communication cost conflicts waiting time

来源：评论

学校读者我要写书评

暂无评论

(Poster) parallel and distributed Operation of Smart grid with Pulsed Power Transmission 22

(Poster) Parallel and Distributed Operation of Smart Grid wi...

引用

22nd IEEE international conference on Computational Science and Engineering (IEEE CSE) / 17th IEEE international conference on Embedded and Ubiquitous computing (IEEE EUC)

作者： Sugiyama, Hisayoshi Osaka City Univ Dept Phys Elect & Informat Osaka Japan

ISBN: (纸本)9781728116648

parallel and distributed operation of pulsed power network with potential gradient method is confirmed with moderately large scale simulation model. The pulsed power network is already proposed for seamless integration of distributed generations. PG method brings the scalability on the network. To confirm the scalability of this power grid and autonomous clustering around the generations, computer simulations are executed.

关键词： Internet of energy Smart grid distributed operation Pulsed power network

来源：评论

学校读者我要写书评

暂无评论

Towards Scalable Resource Management for Supercomputers

Towards Scalable Resource Management for Supercomputers

引用

Supercomputing conference

作者： Yiqin Dai Yong Dong Kai Lu Ruibo Wang Wei Zhang Juan Chen Mingtian Shao Zheng Wang National University of Defense Technology Changsha China University of Leeds Leeds United Kingdom

ISBN: (纸本)9781665454452

Today's supercomputers offer massive computation resources to execute a large number of user jobs. Effectively managing such large-scale hardware parallelism and workloads is essential for supercomputers. However, existing HPC resource management (RM) systems fail to capitalize on the hardware parallelism by following a centralized design used decades ago. They give poor scalability and inefficient performance on today's supercomputers, which will worsen in exascale computing. We present ESlurm, a better RM for supercomputers. As a departure from existing HPC RMs, ESlurm implements a distributed communication structure. It employs a new communication tree strategy and uses job runtime estimation to improve communications and job scheduling efficiency. ESlurm is deployed into production in a real supercomputer. We evaluate ESlurm on up to 20K nodes. Compared to state-of-the-art RM solutions, ESlurm exhibits better scalability, significantly reducing the resource usage of master nodes and improving data transfer and job scheduling efficiency by a large margin.

关键词： Runtime Processor scheduling Scalability Estimation Production parallel processing Supercomputers

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：