检索结果-内蒙古大学图书馆

A tool for efficient execution of SPMD applications on multicore clusters

Procedia computer Science 2010年第1期1卷 2599-2608页

作者： Ronal Muresano Dolores Rexachs Emilio Luque Universitat Autonoma de Barcelona Computer Architecture and Operating System Department Barcelona Spain C.P 08293

A challenge for parallel programmers is to efficiently execute traditional MPI applications, designed to be run in a cluster of single core nodes, on a multicore cluster. Multicore clusters include communication heterogeneities which have to be handled carefully to improve efficiency and speedup. This research presents an execution tool developed for SPMD applications which is focused on managing communications heterogeneities, distributing the workload among cores and enhancing parallel performance on multicore clusters. Our tool has been designed through using an execution methodology which includes mapping and scheduling strategies. The tool integrates five modules which give programmers a method to execute their applications efficiently. This tool is centered on improving SPMD applications designed to use MPI for communications. These applications were selected because they are the most commonly used in parallel computing. Also, these applications are chosen due to their data synchronization and communications volumes which can generate communication imbalance issues. The novel contribution of this tool is to permit programmers to find a minimum execution time, while the efficiency level is maintained over a defined threshold. Our tool has been tested in different multicore clusters and with a set of scientific applications. The results obtained show a considerable improvement in the applications efficiency when the tool is applied.

关键词： Multicore cluster Efficient execution Methodology Performance SPMD Applications

来源：评论

学校读者我要写书评

暂无评论

Learning parallel programming: a challenge for university students

引用

Procedia computer Science 2010年第1期1卷 875-883页

作者： Ronal Muresano Dolores Rexachs Emilio Luque Universitat Autonoma de Barcelona Computer Architecture and Operating System Department Barcelona SPAIN C.P 08293

Currently, the need to learn parallel applications topics in students has become an important issue due to the rapid growth in the parallel computing field. In fact, this topic has been included in computer Science curriculum, but students present difficulties to design MPI parallel applications efficiently. We present a novel methodology for teaching parallel programming centered on improving parallel applications written by students through their experiences obtained during classes. The methodology integrates theoretical and practical sections which are focused on teaching two parallel paradigms, master/Worker and SPMD. These paradigms were selected due to their different communication and computation behaviors, which generate challenges for students when they wish to improve performance application metrics. Our methodology allows students to discover their own errors and how to correct them. In addition, students analyze the issues and advantages in the application designed in order to enhance the performance metrics. Applying this methodology gave us a significant progress in parallel applications designed by students, where we have observed an improvement of around 47% in the students’ skill about parallel programming when they design parallel applications.

关键词： Active learning Parallel programming Methodology Performance metrics

来源：评论

学校读者我要写书评

暂无评论

Extraction of Parallel Application Signatures for Performance Prediction

Extraction of Parallel Application Signatures for Performanc...

引用

IEEE International Conference on High Performance Computing and Communications (HPCC)

作者： Alvaro Wong Dolores Rexachs Emilio Luque Department of Computer Architecture and Operating System (CAOS) University Autonoma de Barcelona Barcelona Spain

Predicting performance of parallel applications is becoming increasingly complex and the best performance predictor is the application itself, but the time required to run it thoroughly is a onerous requirement. We seek to characterize the behavior of message-passing applications on different systems by extracting a signature which will allow us to predict what system will allow the application to perform best. To achieve this goal, we have developed a method we called Parallel Application Signatures for Performance Prediction (PAS2P) that strives to describe an application based on its behavior. Based on the application's message-passing activity, we have been able to identify and extract representative phases, with which we created a Parallel Application Signature that has allowed us to predict the application's performance. We have experimented with different signature-extraction algorithms and found a reduction in the prediction error using different scientific applications on different clusters. We were able to predict execution times with an average accuracy of over 98%.

关键词： Instruments Prediction algorithms Computational modeling Synchronization Parallel machines Message systems Instruction sets

来源：评论

学校读者我要写书评

暂无评论

Software Probes: A Method for Quickly Characterizing Applications' Performance on Heterogeneous Environments

Software Probes: A Method for Quickly Characterizing Applica...

引用

International Conference on Parallel Processing Workshops (ICPPW)

作者： Alexandre Otto Strube Dolores Rexachs Emilio Luque Computer Architecture and Operating System Department (CAOS) Universitat Autònoma Barcelona Barcelona Spain

This work describes ongoing work for measuring the performance of an application running on a machine, where this measurement takes a fraction of the time required to run the application itself thoroughly. We call it Performance Software Probe. The objective is to have knowledge of this machine/application performance previous to the execution, and without the need to even install this application on the machine to characterize. Our goal is to enhance efficiency of master/worker applications on highly heterogeneous multiclusters, where the available machines - and their respective performance indexes - are not known until the time we have them available for execution.

关键词： Software performance Probes Application software Equations Throughput Software measurement Time measurement Performance analysis computer performance Parallel processing

来源：评论

学校读者我要写书评

暂无评论

Dynamic on demand virtual clusters in grid

Dynamic on demand virtual clusters in grid

引用

Workshops on Parallel Processing, Euro-Par 2008: VHPC 2008, UNICORE 2008, HPPC 2008, SGS 2008, PROPER 2008, ROIA 2008, and DPA 2008

作者： Bertogna, Mario Leandro Grosclaude, Eduardo Naiouf, Marcelo De Giusti, Armando Luque, Emilio Department of Computer Science Universidad Nacional del Comahue C.P. 8300 Buenos Aires 1400 Argentina Informatic Research Institute LIDI Universidad Nacional de La Plata Argentina Computer Architecture and Operating System Department Universidad Autónoma de Barcelona Spain

ISBN: (纸本)3642009549

In Grid environments, many different resources are intended to work in a coordinated manner, each resource having its own features and complexity. As the number of resources grows, simplifying automation and management is among the most important issues to address. This paper's contribution lies on the extension and implementation of a grid metascheduler that dynamically discovers, creates and manages on-demand virtual clusters. The first module selects the clusters using graph heuristics. The algorithm then tries to find a solution by searching a set of clusters, mapped to the graph, that achieve the best performance for a given task. The second module, one per-grid node, monitors and manages physical and virtual machines. When a new task arrives, these modules modify virtual machine's configuration or use live migration to dynamically adapt resource distribution at the clusters, obtaining maximum utilization. Metascheduler components and local administrator modules work together to make decisions at run time to balance and optimize system throughput. This implementation results in performance improvement of 20% on the total computing time, with machines and clusters processing 100% of their working time. These results allow us to conclude that this solution is feasible to be implemented on Grid environments, where automation and self-management are key to attain effective resource usage. © Springer-Verlag Berlin Heidelberg 2009.

关键词： Virtual machine

来源：评论

学校读者我要写书评

暂无评论

On the relevance of network topologies in distributed video-on-demand servers

On the relevance of network topologies in distributed video-...

引用

Euromicro Conference on Parallel, Distributed and Network-Based Processing

作者： L. Souza A. Ripoll X.Y. Yang E. Luque F. Cores Computer Architecture & Operating System Department Universitat Authnòma de Barcelona Spain Computer Science & Industrial Engineering Universitat de Lleida Spain

Distributed video-on-demand servers (DVS) are proposed as a solution to the limited streaming capacity and null scalability of large-scale centralized systems. Server interconnection topology plays an important role in video-on-demand systems' performance. This paper presents an analysis of different topologies and their influence over storage management and distribution, delivery policies performance, refusing requests occurrence, network consumption and scalability. To accomplish the proposal study, we have designed a complete simulation framework for DVS systems. Experimental results obtained under different workload conditions allow us to draw two important conclusions: First, a better connectivity implies a lower mean request service distance and lesser network requirements, improving multicast policies efficiency. Second, topology regularity is essential, as it allows a greater traffic balancing and provides more alternative routing paths. The analysis of global results shows that hypercube presents the best trade-off among all the evaluated metrics, providing a gradual and unlimited scalability for the DVS system.

关键词： Network topology Network servers Voltage control Scalability Streaming media Large-scale systems Performance analysis Proposals Telecommunication traffic Traffic control

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：