Search Results - Inner Mongolia University Library

8th International Conference on Parallel Processing and Applied Mathematics

Authors: Olas, Tomasz; Lesniak, Robert; Wyrzykowski, Roman; Gepner, Pawel. Affiliations: Czestochowa Tech Univ, Dabrowskiego 73, PL-42201 Czestochowa, Poland; Intel Corp, Santa Clara, CA, USA

ISBN: (print) 9783642143892

Numerical modeling of 3D thermomechanical problems is a complex and time-consuming issue. Adaptive techniques are powerful tools for performing such modeling efficiently with FEM analysis. During adaptation, computational workloads change unpredictably at runtime; therefore, dynamic load balancing is required. This paper presents new developments in the parallel FEM package NuscaS; they extend its functionality and increase its performance. In particular, by including dynamic load balancing capabilities, this package allows us to efficiently solve adaptive FEM problems with 3D unstructured meshes on distributed-memory parallel computers such as PC clusters. For solving sparse systems of equations, NuscaS uses the message-passing paradigm to implement the PCG iterative method with geometric multigrid as a preconditioner. The implementation of load balancing is based on the proposed performance model.

Keywords: Iterative methods

The fault tolerant parallel algorithm: The parallel recomputing based failure recovery

16th International Conference on Parallel Architecture and Compilation Techniques, PACT 2007

Authors: Yang, Xuejun; Du, Yunfei; Wang, Panfeng; Fu, Hongyi; Jia, Jia; Wang, Zhiyuan; Suo, Guang. Affiliation: National Laboratory for Paralleling and Distributed Processing, School of Computer, National University of Defense Technology, Changsha, Hunan 410073, China

ISBN: (print) 0769529445

This paper addresses the issue of fault tolerance in parallel computing and proposes a new method named parallel recomputing. This method achieves fault recovery automatically by using surviving processes to recompute the workload of failed processes in parallel. The paper first defines the fault tolerant parallel algorithm (FTPA) as a parallel algorithm that tolerates failures by parallel recomputing. Furthermore, the paper proposes an inter-process definition-use relationship analysis method, based on conventional definition-use analysis, for revealing the relationship of variables in different processes. Under the guidance of this new method, principles of fault tolerant parallel algorithm design are given. Finally, the authors present the design of FTPAs for matrix-matrix multiplication and NPB kernels, and evaluate them by experiments on a cluster system. The experimental results show that the overhead of FTPA is less than the overhead of checkpointing. © 2007 IEEE.

Keywords: Parallel algorithms

A hybrid scheduling approach for a two-stage flexible flow shop with batch processing machines

Journal of Scheduling, 2018, vol. 21, no. 2, pp. 209-226

Authors: Tan, Yi; Moench, Lars; Fowler, John W. Affiliations: Univ Hagen, Dept Math & Comp Sci, D-58097 Hagen, Germany; Arizona State Univ, Dept Supply Chain Management, Tempe, AZ 85287, USA

In this paper, we discuss a flexible flow shop scheduling problem with batch processing machines at each stage and with jobs that have unequal ready times. Scheduling problems of this type can be found in semiconductor wafer fabrication facilities (wafer fabs). We are interested in minimizing the total weighted tardiness of the jobs. We present a mixed integer programming formulation. The batch scheduling problem is NP-hard. Therefore, an iterative stage-based decomposition approach is proposed that is hybridized with neighborhood search techniques. The decomposition scheme provides internal due dates and ready times for the jobs on the first and second stage, respectively. Each of the resulting parallel machine batch scheduling problems is solved by variable neighborhood search in each iteration. Based on the schedules of the subproblems, the internal due dates and ready times are updated. We present the results of designed computational experiments that also consider the number of machines assigned to each stage as a design factor. It turns out that the proposed hybrid approach outperforms an iterative decomposition scheme where a fairly simple heuristic based on time window decomposition and the apparent tardiness cost dispatching rule is used to solve the subproblems. Recommendations for the design of the two stages with respect to the number of parallel machines on each stage are given.

Keywords: Two-stage flexible flow shop; Batching; Decomposition; Variable neighborhood search; Computational experiments

Selfish Neighbor Selection in Peer-to-Peer Backup and Storage Applications

15th International Euro-Par Conference on Parallel Computing

Authors: Michiardi, Pietro; Toka, Laszlo. Affiliation: EURECOM, France

ISBN: (print) 9783642038686

In this work we tackle the problem of on-line backup with a peer-to-peer approach. In contrast to current peer-to-peer architectures that build upon distributed hash tables, we investigate whether an uncoordinated approach to data placement would prove effective in providing embedded incentives for users to offer local resources to the system. By modeling peers as selfish entities striving to minimize their cost of participating in the system, we analyze equilibrium topologies that materialize from the process of peer selection, whereby peers establish bilateral links that involve storing data in a symmetric way. System stratification, that is, the emergence of clusters gathering peers with similar contribution efforts, is an essential outcome of the peer selection process: peers are lured to improve the "quality" of local resources they provide to access clusters with lower operational costs. Our results are corroborated by a numerical evaluation of the system that builds upon a polynomial-time best-response algorithm to the selfish neighbor selection game.

Keywords: Peer-to-peer networks

On-line performance modeling for MPI applications

14th International Euro-Par Conference on Parallel Computing

Authors: Morajko, Oleg; Morajko, Anna; Margalef, Tomas; Luque, Emilio. Affiliation: Univ Autonoma Barcelona, Dept Comp Sci, Bellaterra 08193, Spain

ISBN: (print) 9783540854500

To develop an efficient parallel application is not an easy task. Applications rarely achieve good performance immediately; therefore, careful performance analysis and optimization are crucial. These tasks are difficult and require a thorough understanding of the program's behavior. In this paper, we propose an on-line performance modeling technique, which enables the automated discovery of causal execution flows, composed of communication and computational activities, in MPI parallel programs. Our model reflects an application's behavior and is made up of elements correlated with high-level program structures, such as loops and communication operations. Moreover, our approach enables an assortment of on-line diagnosis techniques which may further automate the performance understanding process.

Keywords: Message passing

2009 International Conference on Advances in Computational Tools for Engineering Applications, ACTEA 2009

ISBN: (print) 9781424438341

The proceedings contain 129 papers. The topics discussed include: coding for two-user MIMO cooperative systems using matrix-Alamouti techniques; the adaptive RBFNN equalizer for nonlinear time-varying UMTS channel; centralized and distributed LTE uplink scheduling in a distributed base station scenario; parameter exploration in parallel for dynamic vehicular network efficiency; neuro-control of an inverted pendulum using genetic algorithm; design and development of a hybrid feedback control system for an RF remote-controlled robot; nonlinear global dynamic analysis of reinforced slopes stability under seismic loading; application of reliability analysis on seismic slope stability; concrete compressive strength obtained on uncontrolled construction sites in Lebanon; analysis of an isotropic plate containing three identical circular holes arranged in a triangular configuration; and robust proposal distribution for adaptive visual tracking in a particle filtering framework.

Supporting Malleability in Parallel Architectures with Dynamic CPUSETs Mapping and Dynamic MPI

11th International Conference on Distributed Computing and Networking

Authors: Cera, Marcia C.; Georgiou, Yiannis; Richard, Olivier; Maillard, Nicolas; Navaux, Philippe O. A. Affiliations: Univ Fed Rio Grande do Sul, BR-90046900 Porto Alegre, RS, Brazil; Lab Informati Grenoble, St Martin Dheres, France

ISBN: (print) 9783642113215

Current parallel architectures take advantage of new hardware evolution, like the use of multicore machines in clusters and grids. The availability of such resources may also be dynamic. Therefore, some kind of adaptation is required by the applications and the resource manager to achieve good resource utilization. Malleable applications can provide a certain flexibility, adapting themselves on-the-fly according to variations in the amount of available resources. However, to enable the execution of this kind of application, some support from the resource manager is required, thus introducing important complexities like special allocation and scheduling policies. In this context, we investigate some techniques to provide malleable behavior in MPI applications and the impact of this support upon a resource manager. Our study deals with two approaches to obtain malleability: dynamic CPUSETs mapping and dynamic MPI, using the OAR resource manager. The validation experiments were conducted on the Grid5000 platform. The testbed associates the charge of real workload traces and the execution of MPI benchmarks. Our results show that a dynamic approach using malleable jobs can lead to an improvement of almost 25% in resource utilization when compared to a non-dynamic approach. Furthermore, the complexity of the malleability support, for the resource manager, seems to be outweighed by the improvement reached.

Keywords: Parallel architectures

Parallel Processing and Applied Mathematics 1

Series: Lecture Notes in Computer Science

Year: 1000

Authors: Roman Wyrzykowski; Ewa Deelman; Jack Dongarra; Konrad Karczewski; Jacek Kitowski; Kazimierz Wiatr

ISBN: (electronic) 9783319321493

ISBN: (print) 9783319321486

This two-volume set, LNCS 9573 and LNCS 9574, constitutes the refereed proceedings of the 11th International Conference on Parallel Processing and Applied Mathematics, PPAM 2015, held in Krakow, Poland, in September 2015. The 111 revised full papers presented in both volumes were carefully reviewed and selected from 196 submissions. The focus of PPAM 2015 was on models, algorithms, and software tools which facilitate efficient and convenient utilization of modern parallel and distributed computing architectures, as well as on large-scale applications, including big data problems.

Keywords: Software Engineering; Algorithm Analysis and Problem Complexity; Information Systems Applications (incl. Internet); Programming Techniques; Computer Communication Networks; Mathematics of Computing

Complex Event Processing over distributed probabilistic event streams

2012 9th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2012

Authors: Wang, Yongheng; Zhang, Xiaoming. Affiliation: College of Information Science and Engineering, HuNan University, Changsha, China

ISBN: (print) 9781467300223

With the rapid development of the Internet of Things (IoT), enormous numbers of events are produced every day. Complex Event Processing (CEP) is a key part of IoT middleware. Since current hardware and wireless communication techniques cannot provide data with 100% confidence, a CEP engine that can report confidence for processed complex events over uncertain data is needed. Most current studies of complex event processing have given little consideration to processing complex events over distributed probabilistic event streams and large sliding windows. In this paper, a high-performance complex event processing method over distributed probabilistic event streams is proposed. This method uses a probabilistic Nondeterministic Finite Automaton and Active Instance Stacks to process complex events in a single probabilistic event stream. Multiple processes can run in parallel to improve performance. A query-plan-based method using a tree data structure is used to process hierarchical complex events from distributed event streams. Query plan optimization is proposed based on the query optimization technology of probabilistic databases. The experimental study shows that this method processes complex events over distributed probabilistic event streams efficiently. © 2012 IEEE.

Keywords: Internet of things

A performance-based parallel loop self-scheduling on grid computing environments

IFIP International Conference on Network and Parallel Computing, NPC 2005

Authors: Shih, Wen-Chung; Yang, Chao-Tung; Tseng, Shian-Shyong. Affiliations: Department of Computer and Information Science, National Chiao Tung University, Hsinchu 300, Taiwan; High-Performance Computing Laboratory, Department of Computer Science and Information Engineering, Tunghai University, Taichung 407, Taiwan; Department of Information Science and Applications, Asia University, Taichung 413, Taiwan

ISBN: (electronic) 9783540322467

ISBN: (print) 354029810X

Efficient loop scheduling on parallel and distributed systems depends mostly on load balancing, especially in heterogeneous PC-based cluster and grid computing environments. In this paper, a general approach, named Performance-Based Parallel Loop Self-Scheduling (PPLSS), is given to partition workload according to the performance of grid nodes. This approach was applied to three types of application programs, which were executed on a testbed grid. Experimental results showed that our approach could execute efficiently for most scheduling parameters when the estimation of node performance was accurate. © IFIP International Federation for Information Processing 2005.
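The idea of partitioning workload according to node performance can be illustrated with a minimal sketch. It is not the PPLSS algorithm from the paper, whose details the abstract does not give; it only shows a static split of loop iterations in proportion to per-node performance scores. The function name `partition_iterations` and the example scores are hypothetical.

```python
from fractions import Fraction
from math import floor


def partition_iterations(total_iterations, node_performance):
    """Split loop iterations among nodes in proportion to performance scores.

    Illustrative only: a static proportional split with largest-remainder
    rounding, not the self-scheduling scheme described in the paper.
    """
    if total_iterations < 0:
        raise ValueError("total_iterations must be non-negative")
    if not node_performance or any(p <= 0 for p in node_performance):
        raise ValueError("node_performance must hold positive scores")

    # Exact rational arithmetic avoids floating-point rounding surprises.
    weights = [Fraction(p) for p in node_performance]
    total_weight = sum(weights)
    exact = [total_iterations * w / total_weight for w in weights]

    # Give every node the integer part of its ideal share.
    shares = [floor(x) for x in exact]

    # Hand the remaining iterations to the nodes with the largest
    # fractional remainders (ties go to the earlier node).
    leftover = total_iterations - sum(shares)
    by_remainder = sorted(
        range(len(shares)),
        key=lambda i: exact[i] - shares[i],
        reverse=True,
    )
    for i in by_remainder[:leftover]:
        shares[i] += 1
    return shares


if __name__ == "__main__":
    # Three hypothetical nodes with relative performance 1 : 2 : 5.
    print(partition_iterations(100, [1, 2, 5]))  # [13, 25, 62]
```

As the abstract notes, a split of this kind is only as good as the performance estimates it is given; a self-scheduling scheme would hand out chunks at run time instead of fixing every share in advance.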

Keywords: Parallel processing systems
