检索结果-内蒙古大学图书馆

A Simple Yet Effective Graph Partition Model for GPU computing 18

A Simple Yet Effective Graph Partition Model for GPU Computi...

47th international conference on parallel Processing (ICPP) / international Workshop on Embedded Multicore Systems (EMS)

作者： Zhang, Eddy Z. Rutgers State Univ New Brunswick NJ 08854 USA

ISBN: (纸本)9781450365239

Graph edge partition models have recently become an appealing alternative to graph vertex partition models for parallel and distributed computing due to their flexibility in balancing loads and their performance in reducing communication cost [1, 3]. In this paper, we introduce a simple yet effective graph edge partitioning model for GPU computing. In practice, our model yields high partition quality (better than or the same as the state-of-the-art edge partition approaches, at least for power-law graphs) with low partition overhead. In theory, previous work [1] showed that an approximation factor of O(d(max) root logn log k) apply to the graphs with m = O(k(2)) edges (k is the number of partitions). Our model extends this result to all graphs. We demonstrate how graph edge partition model can be applied to GPU computing. We draw our examples from GPU program for locality enhancement both over time and (processor) space. For the first time, we demonstrate the effectiveness of edge partition for modeling data reuse in a many-core processors, both in theory and in practice.

关键词： Graph partition model GPU data reuse locality

来源：评论

学校读者我要写书评

暂无评论

A case for economy grid architecture for service oriented grid computing 15

A case for economy grid architecture for service oriented gr...

引用

15th international parallel and distributed Processing Symposium, IPDPS 2001

作者： Buyya, R. Abramson, D. Giddy, J. School of Computer Science and Software Engineering Monash University Caulfield Campus Melbourne Australia CRC for Enterprise Distributed Systems Technology Monash University Caulfield Campus Melbourne Australia

ISBN: (纸本)0769509908

Computational grids are a promising platform for executing large-scale resource intensive applications. However, resource management and scheduling in the grid environment is a complex undertaking as resources are (geographically) distributed, heterogeneous in nature, owned by different individuals or organizations with their own policies, have different access and cost models, and have dynamically varying loads and availability. This introduces a number of challenging issues such as site autonomy, heterogeneous interaction, policy extensibility, resource allocation or co-allocation, online control, scalability, transparency, resource brokering, and "computational economy". A number of grid systems (such as Globus and Legion) have addressed many of these issues with exception of a computational economy. We argue that a computational economy is required in order to create a real world scalable grid because it provides a mechanism for regulating the grid resources demand and supply. It offers incentive for resource owners to be part of the grid and encourages consumers to optimally utilize resources and balance timeframe and access costs. We propose a 'computational economy framework' that builds on the existing grid middleware systems and offers an infrastructure for resource management and trading in the grid environment. We discuss the usage economic models for resource trading in the Nimrod/G resource broker and present deadline and cost-based scheduling experimental results on the grid. © 2001 IEEE.

关键词： grid computing

来源：评论

学校读者我要写书评

暂无评论

Enabling Multi task computation on Galaxy-based Gateways using Swift

Enabling Multi task computation on Galaxy-based Gateways usi...

引用

15th IEEE international conference on Cluster computing (CLUSTER)

作者： Maheshwari, Ketan Rodriguez, Alex Kelly, David Madduri, Ravi Wozniak, Justin Wilde, Michael Foster, Ian Argonne Natl Lab MCS Div 9700 S Cass Ave Argonne IL 60439 USA Univ Chicago Inst Computat Argonne Natl Lab Chicago IL 60637 USA

ISBN: (纸本)9781479908981

The Galaxy science portal is a popular gateway to data analysis and computational tools for a broad range of life sciences communities. While Galaxy enables users to overcome the complexities of integrating diverse tools into unified workflows, it has only limited capabilities to execute those tools on the parallel and often distributed high-performance resources that the life sciences fields increasingly requires. We outline here an approach to meet this pressing requirement with the Swift parallel scripting language and its distributed runtime system. Swift's model of computation - implicitly parallel functional dataflow - is an elemental abstraction to which the core computing model of Galaxy maps very closely. We describe an integration between Galaxy and Swift that is transforming Galaxy into a much more powerful science gateway, retaining its user-friendly nature while extending its power to execute highly scalable workflows on diverse parallel environments.

关键词： authoring languages biology computing data analysis parallel processing

来源：评论

学校读者我要写书评

暂无评论

COMPARISON OF VECTOR AND parallel IMPLEMENTATIONS OF THE SIMULATED ANNEALING ALGORITHM

引用

FUTURE GENERATION COMPUTER SYSTEMS-THE international JOURNAL OF grid computing AND ESCIENCE 1995年第4-5期11卷 467-475页

作者： VOOGD, JM SLOOT, PMA VANDANTZIG, R NIKHEF H 1009 DB AMSTERDAM NETHERLANDS

In this paper we describe a vector and a parallel implementation of a stochastic simulation method to solve optimization problems in the field of many particle systems. We use a case-study where the (energetically) optimal distribution of particles on a closed surface is studied, Crystallization on a closed surface is an interesting sub-domain since such a topology causes lattice defects. To obtain the optimal distribution of particles on a sphere we use the simulated annealing algorithm. Simulated annealing is an application of the Markov chain simulation method which, in principle, guarantees that the minimum in energy of our system of particles is found. However, the time for the algorithm to converge increases rapidly with system size. In order to find the best performing implementation we have made vectorized and parallelized implementations. We parallelize the simulated annealing method in several ways. Here we use two types of parallelization in conjunction, a systolic decomposition of the Markov chains and a functional decomposition of the energy calculations. The sequential nature of the simulated annealing algorithm is hard to parallelize and is therefore an important research topic to study the functional differences between parallel and sequential implementations. Results show that the parallelization influences the accuracy of the iterative process. In this paper we give a comparison between the vectorized and the parallelized implementation. It is shown that the current parallel implementation on the Parsytec GC parallel transputer platform is not capable of outruning our vector implementation on the GRAY Y-MP.

关键词： parallel VECTOR OPTIMIZATION

来源：评论

学校读者我要写书评

暂无评论

Preferential load balancing for distributed internet servers

Preferential load balancing for distributed internet servers

引用

1st IEEE/AMC international Symposium on Cluster computing and the grid

作者： Rumsewicz, M Dwyer, M Ericsson Australia Pty Ltd Melbourne Vic 3001 Australia

ISBN: (纸本)0769510116

In this paper we describe a preferential admission control and load balancing algorithm for distributed Internet server clusters. Clients initiate sessions consisting of a series of transactions. The clients may be aware of the various clusters of the distributed server and have preferences as to which cluster should process their session request. The scheme consists of a dispatcher which receives session requests and either admits or rejects those requests. Admitted requests are routed to their preferred cluster when the cluster is not congested. The algorithm also handles the case where a number of clients served have no cluster preference. We describe simulation results which demonstrate that the algorithm provides effective session admission control and load balancing, while maximizing the number of clients preferred by their most preferred cluster.

关键词： Clustering algorithms

来源：评论

学校读者我要写书评

暂无评论

The griddLeS data replication service

The GriddLeS data replication service

引用

1st international conference on e-Science and grid computing

作者： Ho, T Abramson, D Monash Univ Sch Comp Sci & Software Engn Calufield E 3145 Australia

ISBN: (纸本)0769524486

The grid provides inftastructure that allows an arbitrary application to be executed on a range of different computational resources. When input files are very large, or when fault tolerance is important, the data may be replicated Existing grid data replication middleware suffers from two shortcomings. First, it typically requires modification to existing applications. Second, there is limited support on automatic resource selection and a user usually chooses the replica manually to optimize the performance of the system. In this paper we discuss a middleware layer called the griddLeS Replication Service (GRS) that sits above existing replication services, solving both of these shortcomings. Two case studies are presented that illustrate the effectiveness of the approach.

关键词： distributed computer systems

来源：评论

学校读者我要写书评

暂无评论

parallel computing of a large scale spatially distributed model using the Soil and Water Assessment Tool (SWAT)

Parallel computing of a large scale spatially distributed mo...

引用

5th Biennial conference of the international Environmental Modelling and Software Society: Modelling for Environment's Sake, iEMSs 2010

作者： Yalew, S.G. Van Griensven, A. Kokoszkiewicz, L. UNESCO-IHE Institute for Water Education Department of Hydroinformatics and Knowledge Management 2601 DA Delft Netherlands CERN - European Organization for Nuclear Research Switzerland

ISBN: (纸本)9788890357411

The Soil and Water Assessment Tool (SWAT) has been used widely for large scale applications, reaching entire continents. Within the EU funded Envirogrids project, a detailed application of SWAT on the Black Sea Basin is envisaged using high resolution data. In order to support the computation, the model is run on a computer grid. The use of the SWAT allowed for such computations with little adaptations to the source. A 3-step procedure is needed. In the first step, a program is run in order to split the model into several sub-models. Afterwards, the sub-models are run in parallel. In a last step, the outputs of the sub-basins are collected at a central computer and the routing is performed. High computations are also needed when simulations have to be repeated, such as for sensitivity, calibration and uncertainty analysis. In these cases, the simulations are repeated for different parameter sets. In this paper, we discuss the gridification of the algorithm "LH-OAT" that performs sensitivity analysis and has been linked to the SWAT model. The results show a clear improvement in calculation time. Nevertheless, it is concluded that the parallel computing of a distributed model is mainly beneficial for large scale applications with high resolution, while running the sensitivity analysis algorithm has more general and obvious benefit. In a next step, the gridification will be optimised depending on the application and the overheads that are due to submission and receiving of files, as well as potential waiting times for executions on the grid.

关键词： SWAT

来源：评论

学校读者我要写书评

暂无评论

Automatic service deployment using virtualisation

Automatic service deployment using virtualisation

引用

16th Euromicro international conference on parallel, distributed and Network-Based Processing

作者： Kecskemeti, Gabor Kacsuk, Peter Terstyanszky, Gabor Kiss, Tamas Delaitre, Thierry MTA SZTAKI Lab Parallel & Distributed Syst POB 63 H-1518 Budapest Hungary Univ Westminster Ctr Parallel Comp London W1W 6UW England

ISBN: (纸本)9780769530895

Manual deployment of the application usually requires expertise both about the underlying system and the application. Automatic service deployment can improve deployment significantly by using on-demand deployment and self-healing services. To support these features this paper describes an extension the Globus Workspace Service [10]. This extension includes creating virtual appliances for grid services, service deployment from a repository, and influencing the service schedules by altering execution planning services, candidate set generators or information systems.

关键词： Virtualization

来源：评论

学校读者我要写书评

暂无评论

Garbage Collection for Service Oriented distributed Reliable Environment D-ReServE

Garbage Collection for Service Oriented Distributed Reliable...

引用

13th international conference on parallel and distributed computing, Applications, and Technologies (PDCAT)

作者： Brzezinski, Jerzy Danilecki, Arkadiusz Holenko, Mateusz Kobusinska, Anna Zierhoffer, Piotr Poznan Univ Tech Inst Comp Sci Poznan Poland

ISBN: (纸本)9780769548791

D-ReServE increases reliability of SOA-based systems in case of failure occurrence. The fault-tolerant information in D-ReServE is stored in the Stable Storage, which available space depletes with time. Thus, in this paper we propose a garbage collection protocol for D-ReServE that allows the periodic purging of the Stable Storage, and discuss the challenges of garbage collection due to the nature of SOA systems.

关键词： SOA fault tolerance reliability rollback-recovery garbage-collection

来源：评论

学校读者我要写书评

暂无评论

Model-driven simulation of grid scheduling strategies

Model-driven simulation of grid scheduling strategies

引用

3rd IEEE international conference on e-Science and grid computing

作者： Li, Hui Buyya, Rajkumar Leiden Univ Leiden Inst Adv Comp Sci POB 9512 NL-2333 CA Leiden Netherlands Univ Melbourne Dept CSSE Grid Comp & Distributed Syst Lab Melbourne Vic 3010 Australia

ISBN: (纸本)9780769530642

Simulation studies of grid scheduling strategies require representative workloads to produce dependable results. Real production grid workloads have shown diverse correlation structures and scaling behavior, which are different than the characteristics of the available supercomputer workloads and cannot be captured by Poisson or simple distribution-based models. We present models that are able to reproduce various correlation structures, including pseudo-periodicity and long range dependence. By conducting model-driven simulation, we quantitatively evaluate the performance impacts of workload correlations in grid scheduling. The results indicate that autocorrelations in workloads result in worse system performance, both at the local and the grid level. It is shown that realistic workload modeling is not only possible, but also necessary to enable dependable grid scheduling studies.

关键词： Supercomputers

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：