检索结果-内蒙古大学图书馆

parallel and systolic solution of normalized explicit approximate inverse preconditioning

JOURNAL OF SUPERCOMPUTING 2004年第2期30卷 77-96页

作者： Gravvanis, GA Giannoutakis, KM Bekakos, MP Efremides, OB Hellen Open Univ Dept Comp Sci Patras Greece Univ Athens Dept Informat & Telecommun GR-15784 Athens Greece

A new class of normalized approximate inverse matrix techniques, based on the concept of sparse normalized approximate factorization procedures are introduced for solving sparse linear systems derived from the finite difference discretization of partial differential equations. Normalized explicit preconditioned conjugate gradient type methods in conjunction with normalized approximate inverse matrix techniques are presented for the efficient solution of sparse linear systems. Theoretical results on the rate of convergence of the normalized explicit preconditioned conjugate gradient scheme and estimates of the required computational work are presented. Application of the new proposed methods on two dimensional initial/boundary value problems is discussed and numerical results are given. The parallel and systolic implementation of the dominant computational part is also investigated.

关键词： finite difference systems normalized approximate factorization normalized approximate inverses preconditioning rate of convergence and complexity parallel iterative methods parallel computations

来源：评论

学校读者我要写书评

暂无评论

Increasing efficiency in parallel programming teaching 26

Increasing efficiency in parallel programming teaching

引用

26th Euromicro international conference on parallel, distributed, and Network-Based processing (PDP)

作者： Danelutto, Marco Torquati, Massimo Univ Pisa Dept Comp Sci Pisa Italy

ISBN: (纸本)9781538649756

The ability to teach parallel programming principles and techniques is becoming fundamental to prepare a new generation of programmers able to master the pervasive parallelism made available by hardware vendors. Classical parallel programming courses leverage either low-level programming frameworks (e.g. those based on Pthreads) or higher level frameworks such as OpenMP or MPI. We discuss our teaching experience within the Master in "Computer Science and networking" where parallel programming is taught leveraging structured parallel programming principles and frameworks. The paper summarizes the results achieved in eight years of experience and shows how the adoption of a structured parallel programming approach improves the efficiency of the teaching process.

关键词： parallel programming teaching techniques parallel design patterns algorithmic skeletons

来源：评论

学校读者我要写书评

暂无评论

Evaluations of parallel double divide and conquer on a 16-core computer

Evaluations of parallel double divide and conquer on a 16-co...

引用

2008 international conference on parallel and distributed processing techniques and applications, PDPTA 2008

作者： Nakamura, Yoshimasa Konda, Taro Toyokawa, Hiroki Department of Applied Mathematics and Physics Graduate School of Informatics Kyoto University Yoshida Honmachi Sakyo-ku Kyoto Japan SORST JST Japan

ISBN: (纸本)1601320841

For bidiagonal SVD, double Divide and Conquer was proposed. It first computes singular values by a compact version of Divide and Conquer. The corresponding singular vectors are then computed by twisted factorization. The speed and accuracy of double Divide and Conquer are as good or even better than standard algorithms such as QR and the original Divide and Conquer. Moreover, it shows high scalability even on a PC cluster, distributed memory architecture. This paper presents evaluations of parallel double Divide and Conquer for singular value decomposition on a 16-core architecture.

关键词： Singular value decomposition

来源：评论

学校读者我要写书评

暂无评论

QoS Manager for Energy Efficient Many-Core Operating Systems

QoS Manager for Energy Efficient Many-Core Operating Systems

引用

21st Euromicro international conference on parallel, distributed, and Network-Based processing (PDP)

作者： Holmbacka, Simon Agren, Dag Lafond, Sebastien Lilius, Johan Abo Akad Univ Dept Informat Technol FIN-20520 Turku Finland

ISBN: (纸本)9780769549392;9781467353212

The oncoming many-core platforms is a hot topic these days, and this next generation hardware sets new focus on energy and thermal awareness. With a more and more dense packing of transistors, the system must be made energy aware to not suffer from overheating and energy waste. As a step towards increased energy efficiency, we intend to add the notion of QoS handling to the OS level and to applications. We suggest the design of a QoS manager as a plug-in OS extension capable of providing applications with the necessary resources leading to better energy efficiency.

关键词： QoS distributed Operating Systems Many-Core Systems Energy Efficiency

来源：评论

学校读者我要写书评

暂无评论

Blockchain-based Security Architecture for distributed Cloud Storage 15

Blockchain-based Security Architecture for Distributed Cloud...

引用

15th IEEE international Symposium on parallel and distributed processing with applications (ISPA) / 16th IEEE international conference on Ubiquitous Computing and Communications (IUCC)

作者： Li, Jiaxing Liu, Zhusong Chen, Long Chen, Pinghua Wu, Jigang Guangdong Univ Technol Sch Comp Sci & Technol Guangzhou Guangdong Peoples R China

ISBN: (纸本)9781538637906

With the development of ICT industry, the volume of produced data is experiencing tremendous growth, which motivates more demands of storage capacity. Because of the limited storage capacity of users' terminals, more and more applications prefer to upload data to cloud platforms. However, it is well known that security should not be neglected in existing cloud storage architectures. Motivated by the increasing popularity of emerging blockchain technology, we propose a blockchain-based security architecture for distributed cloud storage. Moreover, we customize a genetic algorithm to solve the file block replica placement problem between multiple users and multiple data centers in the distributed cloud storage environment. Numerical experimental results show that the proposed architecture outperforms the traditional cloud storage architectures in terms of security, with acceptable network transmission delay.

关键词： cloud storage security blockchain architecture distributed

来源：评论

学校读者我要写书评

暂无评论

Reliable distributed lookup service based on Jini and JGroups

Reliable distributed lookup service based on Jini and JGroup...

引用

2005 international conference on parallel and distributed processing techniques and applications, PDPTA'05

作者： Aldaoud, Omar Guduru, Krishnapriya Malluhi, Qutaibah Computer Science Dept. Jackson State University Jackson MS United States

ISBN: (纸本)9781932415605

distributed systems that are deployed using Jini technology employ the concept of dynamic registration, discovery and utilization of distributed services. Jini uses a central registry (lookup service), which is the primary means for service providers to advertise their services and allows clients to locate and enlist the help of those services. As a central component of Jini's runtime infrastructure, reliability and fault tolerance of this lookup service becomes an essential requirement. This paper presents the design and evaluation of a fault-tolerant distributed Jini lookup service that utilizes group communication systems. The proposed design enhances the reliability and performance of the Jini lookup service. In addition, the paper presents experimental results that evaluate the performance and reliability of the proposed distributed lookup service.

关键词： Fault tolerance

来源：评论

学校读者我要写书评

暂无评论

IEEE 1394: Another low cost viable alternative interconnect for high performance computing

IEEE 1394: Another low cost viable alternative interconnect ...

引用

2005 international conference on parallel and distributed processing techniques and applications, PDPTA'05

作者： Gill, Joseph Torsoo, Charles Burge, Legand Li, Jiang Systems and Computer Science Howard University Washington DC 20059 United States

ISBN: (纸本)9781932415605

With the advent of IP over IEEE 1394 as a network technology, a new contender for a low cost next-generation cluster interconnect is on the horizon. In this paper, we benchmark IEEE 1394 and compare it to other cluster interconnects, namely Fast Ethernet, and Gigabit Ethernet. For a meaningful comparison, benchmark experiments are carried out at three levels: TCP/IP networking and MPI parallel programming, parallel application benchmarks. Using high-end PCs (Pentium IV 800 MHz) and standard system software (Linux and MPICH), our results show that IEEE 1394 is a viable alternative.

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

Two applications of parallel processing in power system computation

引用

IEEE TRANSACTIONS ON POWER SYSTEMS 1996年第1期11卷 246-253页

作者： Lemaitre, C Thomas, B Research and Development Division Electricité de France Clamart France

This paper discusses performance improvements achieved in two power system software modules through the use of parallel processing techniques. The first software module, EVARISTE, outputs a voltage stability indicator for various power system situations. This module was designed for extended real-rime use and is therefore required to give guaranteed response times. The second module, MEXICO, assesses power system reliability and operating costs by simulating a large number of contingencies for generation and transmission equipment. This module, used for power system planning purposes, uses a Monte-Carlo method to build the various system states, and makes heavy demands on CPU time for running simulations. Like many power system computation packages, both software modules are well-suited to coarse-grain parallel processing. The first module was parallelized on a distributed-memory machine and the second on a shared-memory machine. In this paper, we start by a description of the parallelization process used in these two cases, then go on to give details on the performance levels achieved, discussing aspects of programming, parameter selection (number of situations processed, number of processors), and machine characteristics (limitations due to interprocessor communications network, for instance).

关键词： parallel processing Power systems Concurrent computing Power system stability Power system reliability Power system simulation Power system planning Computational modeling Application software Software performance

来源：评论

学校读者我要写书评

暂无评论

MatrixMap: Programming Abstraction and Implementation of Matrix Computation for Big Data applications 21

MatrixMap: Programming Abstraction and Implementation of Mat...

引用

21st IEEE international conference on parallel and distributed Systems ICPADS

作者： Huangfu, Yaguang Cao, Jiannong Lu, Hongliang Liang, Guanqing Hong Kong Polytech Univ Dept Comp Hong Kong Hong Kong Peoples R China Natl Univ Def Technol Parallel & Distributed Proc Lab Changsha Hunan Peoples R China

ISBN: (纸本)9780769557854

The computation core of many big data applications can be expressed as general matrix computations, including linear algebra operations and irregular matrix operations. However, existing parallel programming systems such as Spark do not have programming abstraction and efficient implementation for general matrix computations. In this paper, we present MatrixMap, a unified and efficient data-parallel system for general matrix computations. MatrixMap provides powerful yet simple abstraction, consisting of a distributed data structure called bulk key matrix and a computation interface defined by matrix patterns. Users can easily load data into bulk key matrices and program algorithms into parallel matrix patterns. MatrixMap outperforms current state-of-the-art systems by employing three key techniques: matrix patterns with lambda functions for irregular and linear algebra matrix operations, asynchronous computation pipeline with optimized data shuffling strategies for specific matrix patterns and in-memory data structure reusing data in iterations. Moreover, it can automatically handle the parallelization and distribute execution of programs on a large cluster. The experiment results show that MatrixMap is 12 times faster than Spark.

关键词： Big Data parallel Programming Matrix Computation Machine Learning Graph processing

来源：评论

学校读者我要写书评

暂无评论

Using page access behavior for load sharing on software distributed shared memory system

Using page access behavior for load sharing on software dist...

引用

Proceedings of the international conference on parallel and distributed processing techniques and applications

作者： Chua, Elaine Jane College of Computer Studies De La Salle University Professional Schools 2401 Taft Avenue Manila 1004 Philippines

ISBN: (纸本)1892512416

Performance of a software distributed shared memory (DSM) system can be improved if load sharing is employed. However, traditional load sharing algorithms are not directly suitable for DSM systems since they do not consider the memory access patterns of tasks. This paper presents a load sharing algorithm that takes into account memory access patterns as well as individual processor load information to distribute tasks in a DSM environment. A vector that keeps track of the frequency of page accesses by tasks is used to determine the processor with the best locality of access. The general idea is to minimize the amount of remote page accesses. Simulation results are presented to illustrate the behavior of the algorithm.

关键词： Multiprocessing systems

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：