检索结果-内蒙古大学图书馆

An efficient implementation of Term Rewriting System on a distributed memory architecture

IEICE TRANSACTIONS ON INFORMATION AND systems 1997年第4期E80D卷 510-517页

作者： Hachisu, Y Yamamoto, S Hamaguchi, T Agusa, K School of Engineering Nagoya University Nagoya-shi 464-01 Japan

Term Rewriting System (TRS) is a model of computation and it is used in various application such as algebraic specification. TRS has an inherent concurrency and it is suitable for parallel computing. We have already proposed BOB (Bundle Of Branches), which is a mechanism of data management for parallel rewriting. We have proposed a model of parallel rewriting using BOB and implemented a TRS simulator based on this model on a shared memory parallel computer. Because it fully depends on the feature of a shared memory architecture, that is, a process can access any memory element, it is hard to transport it on a distributed memory parallel computer. In this paper, we propose autonomous BOB model. This model is suitable for a distributed memory architecture since a process uses message passing protocol and the method of load balancing is provided. We implement a TRS simulator using this model on a distributed memory architecture and it runs about 30 times faster on 64 processors than on a single processor.

关键词： TRS parallel rewriting BOB

来源：评论

学校读者我要写书评

暂无评论

An Efficient Algorithm for Solving Eigenproblem 13

An Efficient Algorithm for Solving Eigenproblem

引用

13th international symposium on distributed Computing and Applications to Business, engineering and Science (DCABES)

作者： Zhang, Huirong Cao, Jianwen Chinese Acad Sci Grad Univ Lab Parallel Software & Computat Sci Software Inst Software Beijing Peoples R China Chinese Acad Sci Inst Software Lab Parallel Software & Computat Sci Software Beijing Peoples R China

ISBN: (纸本)9781479941698

In this paper, we consider second order elliptic ODE eigenproblems on general grids. We construct an efficient algorithm for computing the eigenvalue by using weighted mean combination of the linear finite element method and corresponding 2nd-order finite difference method. We first take the arithmetic mean of the two methods. Then we compute the quasi-optimal combined parameters for different eigenvalues to improve our efficient algorithm. The algorithm we construct convergence faster and have higher accuracy than the linear finite element method and corresponding 2nd-order finite difference method. Some numerical examples tested on both uniform meshes and nonuniform meshes are given to illustrate the computational cost of different numerical methods for solving eigenvalue problems. For efficiency, all the matrices use sparse storage in our algorithm.

关键词： Efficient algorithm combinatorial quasi-optimal eigenproblem

来源：评论

学校读者我要写书评

暂无评论

Exploiting idle cycles to execute data mining applications on clusters of PCs

引用

JOURNAL OF systems AND software 2007年第5期80卷 778-790页

作者： Senger, Hermes Hruschka, Eduardo R. Silva, Fabricio A. B. Sato, Liria M. Bianchini, Calebe P. Jerosch, Bruno F. Univ Catolica Santos BR-11070906 Santos SP Brazil Univ Sao Paulo Escola Politecn BR-05508900 Sao Paulo Brazil

In this paper we present and evaluate Inhambu, a distributed object-oriented system that supports the execution of data mining applications on clusters of PCs and workstations. This system provides a resource management layer, built on the top of Java/RMI, that supports the execution of the data mining tool called Weka. We evaluate the performance of Inhambu by means of several experiments in homogeneous, heterogeneous and non-dedicated clusters. The obtained results are compared with those achieved by a similar system named Weka-parallel. Inhambu outperforms its counterpart for coarse grain applications, mainly for heterogeneous and non-dedicated clusters. Also, our system provides additional advantages such as application checkpointing, support for dynamic aggregation of hosts to the cluster, automatic restarting of failed tasks, and a more effective usage of the cluster. Therefore, Inhambu is a promising tool for efficiently executing real-world data mining applications. The software is delivered at the project's web site available at http://***/projects/inhambu/. (c) 2006 Elsevier Inc. All rights reserved.

关键词： parallel and distributed data mining commodity clusters load sharing

来源：评论

学校读者我要写书评

暂无评论

The Implementation and Comparison of Two Kinds of parallel Genetic Algorithm Using Matlab

The Implementation and Comparison of Two Kinds of Parallel G...

引用

9th international symposium on distributed Computing and Applications to Business, engineering and Science (DCABES 2010)

作者： Li Nan Gao Pengdong Lu Yongquan Yu Wenhua Commun Univ China Ctr High Performance Comp Beijing 100024 Peoples R China

ISBN: (纸本)9780769541105

Two kinds of parallel genetic algorithm (PGA) are implemented in this paper based on the MATLAB (R) parallel Computing Toolbox (TM) and distributed Computing Server T software. parallel for-loops, SPMD (Single Program Multiple Data) block and co-distributed arrays, three basic parallel programming modes in MATLAB are employed to accomplish the global and coarse-grained PGAs. To validate and compare our implementation, both PGAs are applied to run the problem of range image registration. A set of experiments have illustrated that it is convenient and effective to use MATLAB to parallelize the existing algorithms. At the same time, a higher speed-up and performance enhancement can be obtained obviously.

关键词： parallel genetic algorithm MATLAB distributed computing parallel programming

来源：评论

学校读者我要写书评

暂无评论

A few of the most popular models for heterogeneous parallel programming 16

A few of the most popular models for heterogeneous parallel ...

引用

16th international symposium on distributed Computing and Applications to Business, engineering and Science (DCABES)

作者： Xie, Gang Zhang, Ya-lin China Acad Engn Phys Inst Comp Applicat MianYang Peoples R China

ISBN: (纸本)9781538621622

In this paper we consider the problem of programming for heterogeneous computer systems consist of CPUs and various accelerating devices such as GPUs. We introduce a few of the most popular models for heterogeneous parallel programming, including OpenCL (Open Computing Language), CUDA (Compute Unified Device Architecture), OpenACC, OpenHMPP (Hybrid Multicore parallel Programming), C++ AMP (accelerated massive parallelism), HPL (Heterogeneous Programming Library), etc.

关键词： Heterogeneous computer systems GPU OpenCL CUDA OpenACC OpenHMPP C plus plus AMP

来源：评论

学校读者我要写书评

暂无评论

KEREP: Experience in Extracting Knowledge on distributed System Behavior through Request Execution Path 29

KEREP: Experience in Extracting Knowledge on Distributed Sys...

引用

29th IEEE international symposium on software Reliability engineering (ISSRE)

作者： Gu, Jing Wang, Long Yang, Yong Li, Ying Peking Univ Sch Software & Microelect Beijing Peoples R China IBM Corp IBM Watson Armonk NY 10504 USA Peking Univ Natl Engn Ctr Software Engn Beijing Peoples R China

ISBN: (纸本)9781538694435

Expertise on distributed systems is critical for system maintenance and improvement. However, it is challenging to keep the up-to-date knowledge from distributed systems due to the complexity and continuous updates. Hence, computing platform providers study on how to extract knowledge directly from system behavior. In this paper, we propose a methodology called KEREP to automatically extract knowledge on distributed system behavior through request execution path. Technologies are devised to construct component structures, to depict the in-depth dynamic behavior and to identify the heartbeat mechanisms of target distributed systems. Experiments on two real-world distributed systems show the KEREP methodology extracts accurate knowledge of request processing and discovers undocumented features with good execution performance.

关键词： execution path knowledge extraction distributed system heartbeat trace discovery job request service request

来源：评论

学校读者我要写书评

暂无评论

Thalweg: A Framework For Programming 1,000 Machines With 1,000 Cores

Thalweg: A Framework For Programming 1,000 Machines With 1,0...

引用

23rd IEEE international parallel and distributed Processing symposium

作者： Beberg, Adam L. Pande, Vijay S. Stanford Univ Dept Comp Sci Stanford CA 94305 USA Stanford Univ Dept Chem Stanford CA 94305 USA

ISBN: (纸本)9781424437511

While modern large-scale computing tasks have grown to span many machines, each with many cores, traditional programming models have not kept up with these advancements, resulting in difficulty exploiting these computing resources with only modest programmer effort. Thalweg seeks to address this breakdown in several ways. It provides a model for designing algorithms that have the potential to scale to multiple cores and machines, with subsequent optimization by software engineers. Based on this concept, Thalweg presents an API for handling these algorithms, for transferring data to and from nodes and coprocessors, and for verifying the correct operation of the hardware. Finally, Thalweg presents a set of concepts and a laboratory framework for pedagogical use that will educate the next generation of software engineers to operate in a world in which multi-core and distributed computing are everywhere.

关键词： distributed computer systems

来源：评论

学校读者我要写书评

暂无评论

Scheduling Closed-Nested Transactions in distributed Transactional Memory

Scheduling Closed-Nested Transactions in Distributed Transac...

引用

26th IEEE international parallel and distributed Processing symposium (IPDPS) / Workshop on High Performance Data Intensive Computing

作者： Kim, Junwhan Ravindran, Binoy Virginia Tech ECE Dept Blacksburg VA 24061 USA

ISBN: (纸本)9780769546759

distributed software transactional memory (D-STM) is an emerging, alternative concurrency control model for distributed systems that promises to alleviate the difficulties of lock-based distributed synchronization-e.g., distributed deadlocks, livelocks, and lock convoying. We consider Herlihy and Sun's dataflow D-STM model, where objects are migrated to invoking transactions, and the closed nesting model of managing inner (distributed) transactions. We present a transactional scheduler called, reactive transactional scheduler (or RTS) to boost the throughput of closed-nested transactions. RTS determines whether a conflicting parent transaction must be aborted or enqueued according to the level of contention. If a transaction is enqueued, its nested inner transactions do not have to retrieve objects again, resulting in reduced communication delays. Our implementation of RTS in the HyFlow D-STM framework and experimental evaluations reveal that RTS improves throughput over D-STM without RTS, by as much as 88%.

关键词： software Transactional Memory Closed-Nested Transactions Transactional Scheduling distributed systems

来源：评论

学校读者我要写书评

暂无评论

Automated generation of explicit connectors for component based hardware/software interaction in embedded real-time systems

Automated generation of explicit connectors for component ba...

引用

10th Workshop on Advances in parallel and distributed Computational Models/22nd IEEE international parallel and distributed Processing symposium

作者： Forster, Wolfgang Kutschera, Christof Steinilnger, Andreas Goeschka, Karl M. Vienna Univ Technol Karlspl 13 A-1040 Vienna Austria Univ Appl Sci Tech Vienna Dept Embedded Syst A-1200 Vienna Austria

ISBN: (纸本)9781424416936

The complexity of today's embedded real-time systems is continuously growing with high demands on dependability, resource-efficiency, and reusability Two solution approaches address these needs: First, in the component based software engineering (CBSE) paradigm, software is decomposed into self-contained components with explicit interactions and context dependencies. Connectors represent the abstraction of interactions between these components. Second, components can be shifted from software to reconfigurable hardware, typically field programmable gate arrays (FPGAs), in order to meet real-time constraints. This paper proposes a component-based concept to support efficient hardware/software co-design: A hardware component together with the hardware/soflware connector can seamlessly replace a software component with the same functionality, while the particularities of the alternative interaction are encapsulated in the component connector. Our approach provides for tools that can generate all necessary interaction mechanisms between hardware and software components. A proof-of-concept application demonstrates the advantages of our concept: Rapid change and comparison of different partitioning decisions due to automated and faultless generation of the hardware/software connectors.

关键词： HW/SW interaction CBSE embedded real-time systems automated design flow

来源：评论

学校读者我要写书评

暂无评论

PB-VSS. software version selection system based on logical programming

PB-VSS. Software version selection system based on logical p...

引用

Proceedings of the IMACS/IFAC international symposium on parallel and distributed Computing in engineering systems

作者： Vescoukis, V.C. Psaraniligos, J. Skordalakis, E.

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：