Search results - Inner Mongolia University Library

Hypercube sandwich approach to conferencing

JOURNAL OF SUPERCOMPUTING, 1996, vol. 10, no. 3, pp. 271-283

Authors: Houlahan, JF; Cowen, LJ; Masson, GM

Affiliation: Computer Science Department, The Johns Hopkins University, Baltimore, USA

This paper presents a novel cascaded conference network that provides distributed processing and signal transmission among members of disjoint sets of generic send/receive devices called conferees. It assumes an online request model in which idle groups of conferees may request the formation of a conference interconnection. Once a conference is established, all conferees remain connected until the entire conference is dissolved. The Hypercube Sandwich Network (HSN) consists of two components. A bidirectional permutation network is used for routing purposes to and from a hypercube of special processing elements for the purpose of conference formation. The HSN achieves strictly nonblocking performance for N conferees using $O(N \sqrt{\log N})$ processing elements, and this is shown to be tight to within a $\log^{1/4} N$ factor. Previous constructions required a quadratic number of processing elements for strictly nonblocking performance or could only provide wide-sense nonblocking conferencing. If the stronger requirement is made that the communication delay is logarithmic in the conference size, a simple algorithm is presented for wide-sense nonblocking conferencing in an HSN with $O(N \log N)$ processing elements.

Keywords: hypercube conference nonblocking dynamic networks

Distributed logging in Java with variable leveling

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications

Authors: Varghese, Sunil Brown; Andresen, Daniel

Affiliation: Dept. of Comp. and Information Sci., Kansas State University, 234 Nichols Hall, Manhattan, KS 66506, United States

ISBN (print): 1892512416

This paper defines a distributed logging architecture and extends the Java Logging APIs to support such a framework with variable leveling capabilities. In its current form, the Java Logging API has minimal support for logging in a distributed environment. Rather than using the available SocketHandler class, an RMI solution is considered so as to maintain the integrity of log messages on systems that may not be the points of message generation. Finally, a configurable RMI Server is presented to complement the RMI Handler.

Keywords: Software engineering

Monte Carlo and quasi Monte Carlo in parallel computing

2008 International Conference on Parallel and Distributed Processing Techniques and Applications, PDPTA 2008

Authors: Wang, Zizhong J.

Affiliation: Department of Mathematics and Computer Science, Virginia Wesleyan College, Norfolk, VA 23502, United States

ISBN (print): 1601320841

The study of Monte Carlo and quasi Monte Carlo methods has been one of the most interesting topics in computational science in the past decades. In this paper, we report on our study of the two schemes, including an application to the computation of invariant measures for dynamical systems. The fundamental ideas can be easily applied to other cases of parallel computing.

Keywords: Dynamical systems
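As a rough illustration of the two schemes named in the abstract above, the following sketch compares a pseudo-random Monte Carlo estimate with a quasi Monte Carlo estimate that uses a two-dimensional Halton sequence. The target (estimating pi from the area of a quarter circle) is a standard toy problem chosen here for brevity; it is an assumption of this example and is not the invariant-measure computation the paper reports. The names `van_der_corput`, `estimate_pi`, and `N` are hypothetical.

```python
import math
import random


def van_der_corput(index, base):
    """Radical inverse of a positive integer index in the given base."""
    result, denom = 0.0, 1.0
    while index > 0:
        index, digit = divmod(index, base)
        denom *= base
        result += digit / denom
    return result


def estimate_pi(points):
    """Estimate pi as 4 times the fraction of points inside the unit quarter circle."""
    inside = sum(1 for x, y in points if x * x + y * y <= 1.0)
    return 4.0 * inside / len(points)


N = 4096  # sample count, chosen arbitrarily for the illustration

# Monte Carlo: independent pseudo-random points in the unit square.
rng = random.Random(42)
mc_points = [(rng.random(), rng.random()) for _ in range(N)]

# Quasi Monte Carlo: Halton points built from bases 2 and 3.
qmc_points = [(van_der_corput(i, 2), van_der_corput(i, 3)) for i in range(1, N + 1)]

mc_estimate = estimate_pi(mc_points)
qmc_estimate = estimate_pi(qmc_points)

print("Monte Carlo error:      ", abs(mc_estimate - math.pi))
print("quasi Monte Carlo error:", abs(qmc_estimate - math.pi))
```

Both point sets can be generated independently per index range, which is one reason such estimators are commonly split across parallel workers.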

Designing Bit-Reproducible Portable High-Performance Applications

IEEE 28th International Parallel & Distributed Processing Symposium (IPDPS)

Authors: Arteaga, Andrea; Fuhrer, Oliver; Hoefler, Torsten

Affiliation: Swiss Fed Inst Technol, Zurich, Switzerland; Fed Off Meteorol & Climatol MeteoSwiss, Zurich, Switzerland

ISBN (print): 9780769552071

Bit-reproducibility has many advantages in the context of high-performance computing. Besides simplifying and making more accurate the process of debugging and testing the code, it can allow the deployment of applications on heterogeneous systems, maintaining the consistency of the computations. In this work we analyze the basic operations performed by scientific applications and identify the possible sources of non-reproducibility. In particular, we consider the tasks of evaluating transcendental functions and performing reductions using non-associative operators. We present a set of techniques to achieve reproducibility and we propose improvements over existing algorithms to perform reproducible computations in a portable way, at the same time obtaining good performance and accuracy. By applying these techniques to more complex tasks we show that bit-reproducibility can be achieved on a broad range of scientific applications.

Keywords: determinism reproducibility parallelism IEEE-754 standard
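The abstract above names reductions with non-associative operators as a source of non-reproducibility. A minimal sketch of that effect for floating-point addition follows. It uses Python's `math.fsum`, which returns a correctly rounded sum and is therefore independent of summation order; this is a stand-in for illustration and is not claimed to be the technique the paper proposes. The helper `left_to_right_sum` and the test data are hypothetical.

```python
import math
import random


def left_to_right_sum(values):
    """Plain sequential reduction; the result depends on the order of the operands."""
    total = 0.0
    for v in values:
        total += v
    return total


# Floating-point addition is not associative: regrouping changes the result.
a = (0.1 + 0.2) + 0.3
b = 0.1 + (0.2 + 0.3)
print(a == b)  # False

# The same values reduced in two different orders, as could happen when
# the work is split differently across processes.
rng = random.Random(0)
values = [rng.uniform(-1e6, 1e6) for _ in range(10000)]
shuffled = values[:]
rng.shuffle(shuffled)

print("naive sums identical:", left_to_right_sum(values) == left_to_right_sum(shuffled))
print("fsum results identical:", math.fsum(values) == math.fsum(shuffled))
```

The order-independent sum costs more per element than the naive loop, which hints at the trade-off between reproducibility and performance that the abstract raises.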

Runtime locality optimizations of distributed Java applications

16th Euromicro International Conference on Parallel, Distributed and Network-Based Processing

Authors: Huetter, Christian; Moschny, Thomas

Affiliation: Univ Karlsruhe, Karlsruhe, Germany

ISBN (print): 9780769530895

In distributed Java environments, locality of objects and threads is crucial for the performance of parallel applications. We introduce dynamic locality optimizations in the context of JavaParty, a programming and runtime environment for parallel Java applications. Until now, an optimal distribution of the individual objects of an application has to be found manually, which has several drawbacks. Based on a former static approach, we develop a dynamic methodology for automatic locality optimizations. By measuring processing and communication times of remote method calls at runtime, a placement strategy can be computed that maps each object of the distributed system to its optimal virtual machine. Objects then are migrated between the processing nodes in order to realize this placement strategy. We evaluate our approach by comparing the performance of two benchmark applications with manually distributed versions. It is shown that our approach is particularly suitable for dynamic applications where the optimal object distribution varies at runtime.

Keywords: Benchmarking

Hierarchical partitioning techniques for Structured Adaptive Mesh Refinement (SAMR) applications

31st International Conference on Parallel Processing (ICPP 2002)

Authors: Li, XL; Ramanathan, S; Parashar, M

Affiliation: Rutgers State Univ, Appl Software Syst Lab, Dept Elect & Comp Engn, Piscataway, NJ 08854, USA

ISBN (print): 0769516807

This paper presents the design and preliminary evaluation of hierarchical partitioning and load-balancing techniques for distributed Structured Adaptive Mesh Refinement (SAMR) applications. The overall goal of these techniques is to enable the load distribution to reflect the state of the adaptive grid hierarchy and exploit it to reduce synchronization requirements, improve load-balance, and enable concurrent communications and incremental redistribution. The hierarchical partitioning algorithm (HPA) partitions the computational domain into subdomains and assigns them to hierarchical processor groups. Two variants of HPA are presented in this paper. The Static Hierarchical Partitioning Algorithm (SHPA) assigns portions of overall load to processor groups. In SHPA, the group size and the number of processors in each group are set up during initialization and remain unchanged during application execution. It is experimentally shown that SHPA reduces communication costs as compared to the Non-HPA scheme, and reduces overall application execution time by up to 41%. The Adaptive Hierarchical Partitioning Algorithm (AHPA) dynamically partitions the processor pool into hierarchical groups that match the structure of the adaptive grid hierarchy. Initial evaluations of AHPA show that it can reduce communication costs by up to 70%.

Keywords: dynamic load balancing hierarchical partitioning algorithm distributed computing structured adaptive mesh refinement

A novel metric for evaluation of computer system heterogeneity

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications

Authors: Branco, Kalinka Regina Lucas Jaquie Castelo; Santana, Marcos José; Santana, Regina Helena C.

Affiliation: 400 - Centro - Cx. Postal 668, Sao Carlos - Sao Paulo, Brazil

ISBN (print): 1892512416

This paper discusses some metrics and models that aim to quantify the heterogeneity level of distributed computing systems. Many of the metrics proposed in previous works are not entirely suitable for supporting both load and process scheduling mechanisms, since practical results show that the metrics are not as general as suggested in the literature. A novel metric, constructed from a new approach using the standard deviation concept, is proposed in this paper. This metric is shown to be adequate for all the case studies adopted, and it has the potential to support most of the load and process scheduling mechanisms used in parallel/distributed computing.

Keywords: distributed computer systems

Symbolic partitioning and scheduling of parameterized task graphs

6th International Conference on Parallel and Distributed Systems (ICPADS98)

Authors: Cosnard, M; Jeannot, E; Yang, T

Affiliation: INRIA Lorraine, LORIA, F-54602 Villers Les Nancy, France

ISBN (print): 0818686030

The DAG-based task graph model has been found effective in scheduling for performance prediction and optimization of parallel applications. However, the scheduling complexity and solution normally depend on the problem size. In this paper we propose a symbolic scheduling scheme for a parameterized task graph which models coarse-grain DAG parallelism independent of the problem size. The algorithm first derives symbolic clusters to group tasks in order to minimize communication while preserving parallelism, and then it evenly assigns task clusters to processors. The run-time system executes clusters on each processor in a multithreaded fashion. This paper also presents preliminary experimental results to demonstrate the effectiveness of our techniques.

Keywords: parallel processing systems

An Efficient Scheduling Algorithm for Energy Consumption Constrained Parallel Applications on Heterogeneous Distributed Systems

15th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA) / 16th IEEE International Conference on Ubiquitous Computing and Communications (IUCC)

Authors: Song, Jinlin; Xie, Guoqi; Li, Renfa; Chen, Xiaoming

Affiliation: Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Hunan, Peoples R China; Key Lab Embedded & Network Comp Hunan Prov, Changsha, Hunan, Peoples R China

ISBN (print): 9781538637906

With the explosive growth of energy consumption in current heterogeneous distributed systems, the energy consumption constraint has become one of the primary design issues. Minimizing the schedule length while satisfying the energy consumption constraint of parallel applications is one of the most important problems studied recently. Previous studies have proposed a preassignment approach that presupposes the minimum energy consumption assignment for unassigned tasks to solve the problem based on the dynamic voltage and frequency scaling (DVFS) technique. However, the preassignment of unassigned tasks with the minimum energy consumption does not necessarily lead to the minimization of the schedule length. In this study, we propose an efficient scheduling algorithm using a relative average assignment for tasks. The results of experiments on two real parallel applications validate that the proposed algorithm can obtain a shorter schedule length while satisfying the energy consumption constraint compared with the state-of-the-art methods in various situations.

Keywords: dynamic voltage and frequency scaling (DVFS) energy consumption constraint heterogeneous distributed systems parallel application schedule length

Scheduling Task-Parallel Applications in Dynamically Asymmetric Environments

49th International Conference on Parallel Processing (ICPP)

Authors: Chen, Jing; Soomro, Pirah Noor; Abduljabbar, Mustafa; Manivannan, Madhavan; Pericas, Miquel

Affiliation: Chalmers Univ Technol, Gothenburg, Sweden

ISBN (print): 9781450388689

Shared resource interference is observed by applications as dynamic performance asymmetry. Prior art has developed approaches to reduce the impact of performance asymmetry mainly at the operating system and architectural levels. In this work, we study how application-level scheduling techniques can leverage moldability (i.e. flexibility to work as either single-threaded or multithreaded task) and explicit knowledge on task criticality to handle scenarios in which system performance is not only unknown but also changing over time. Our proposed task scheduler dynamically learns the performance characteristics of the underlying platform and uses this knowledge to devise better schedules aware of dynamic performance asymmetry, hence reducing the impact of interference. Our evaluation shows that both criticality-aware scheduling and parallelism tuning are effective schemes to address interference in both shared and distributed memory applications.

Keywords: Interference awareness Task scheduling Asymmetry
