Search results - Inner Mongolia University Library

Degree of scalability: scalable reconfigurable mesh algorithms for multiple addition and matrix-vector multiplication

PARALLEL COMPUTING, 2003, vol. 29, no. 1, pp. 95-109

Authors: Vaidyanathan, R; Trahan, JL; Lu, CM. Louisiana State Univ, Dept Elect & Comp Engn, Baton Rouge, LA 70803, USA; Certware Technol, Sterling, VA, USA

The usual concern when scaling an algorithm on a parallel model of computation is preserving efficiency while increasing or decreasing the number of processors. Many algorithms for reconfigurable models, however, attain constant time at the expense of an inefficient algorithm. For these algorithms, scaling down the number of processors while preserving inefficiency is no benefit once constant time execution is lost. In fact, one can often accelerate the efficiency of these algorithms while reducing the number of processors. To quantify this improvement in efficiency, this paper introduces the measure of degree of scalability to complement the insight obtained from efficiency for such algorithms. Demonstrating the utility of this measure, we present new reconfigurable mesh (R-Mesh) algorithms for multiple addition and matrix-vector multiplication, improving both the number of processors and the degree of scalability compared to previous algorithms. We also extend these results to floating point number operands, which have previously received little attention on the R-Mesh.

Keywords: reconfigurable mesh; parallel algorithms; scalability; arithmetic algorithms; reconfigurable models

List-ranking on interconnection networks

INFORMATION AND COMPUTATION, 2003, vol. 181, no. 2, pp. 75-87

Author: Sibeyn, JF. Univ Halle Wittenberg, Inst Informat, D-06120 Halle Saale, Germany

The list-ranking problem is considered for parallel computers which communicate through an interconnection network. Each PU holds k nodes of a set of linked lists. A novel randomized algorithm gives a considerable improvement over earlier ones: for a large class of networks and sufficiently large k, it takes only twice the number of steps required by a k-k routing. For hypercubes the condition is k = ω(log² N). Even better results are achieved for d-dimensional meshes: we show that the ranking time exceeds the routing time only by lower-order terms for all k = ω(d²). We also show that list-ranking requires at least the time required for k-k routing. Thus, the results are within a factor of two of optimal, and those for meshes even match the lower bound up to lower-order terms. (C) 2002 Elsevier Science (USA). All rights reserved.

Keywords: parallel algorithms; interconnection networks; list-ranking; randomization; meshes; hypercubes
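
For readers unfamiliar with the list-ranking problem named in this record, the following is a minimal sketch of what is being computed: for every node, the number of links from that node to the end of its list. It uses classic pointer jumping, simulated sequentially, and is not the randomized network algorithm of the paper. The function name `list_rank` and the `succ` array representation are assumptions made for illustration.

```python
def list_rank(succ):
    """Return rank[i] = number of links from node i to the end of its list.

    succ[i] is the index of node i's successor, or None for the last node
    of a list. The lists are assumed to be acyclic.
    """
    n = len(succ)
    nxt = list(succ)
    # A node with a successor is one link away from it; a tail has rank 0.
    rank = [0 if s is None else 1 for s in succ]

    changed = True
    while changed:
        changed = False
        # Build the next round from the previous round's values only,
        # mimicking one synchronous parallel step.
        new_rank = list(rank)
        new_nxt = list(nxt)
        for i in range(n):
            j = nxt[i]
            if j is not None:
                new_rank[i] = rank[i] + rank[j]  # absorb j's distance
                new_nxt[i] = nxt[j]              # jump over j
                changed = True
        rank, nxt = new_rank, new_nxt
    return rank


if __name__ == "__main__":
    # Two lists stored in one array: 0 -> 2 -> 1 and the single node 3.
    print(list_rank([2, None, 1, None]))
```

Each round doubles the distance a pointer spans, so the loop runs a number of rounds logarithmic in the longest list; on a real network each round would cost communication, which is the cost the abstract compares against k-k routing.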

Evolving a model of transaction management with embedded concurrency control for mobile database systems

INFORMATION AND SOFTWARE TECHNOLOGY, 2003, vol. 45, no. 9, pp. 587-596

Author: Bhalla, S. Univ Aizu, Data Syst Lab, Fukushima 9658580, Japan

Transactions within a mobile database management system face many restrictions. They cannot afford unlimited delays, nor can they participate in multiple retry attempts for execution. The proposed embedded concurrency control (ECC) techniques provide support on three counts, namely, to enhance concurrency, to overcome problems due to heterogeneity, and to allocate priority to transactions that originate from mobile hosts. These proposed ECC techniques can be used to enhance the server capabilities within a mobile database management system. Adoption of the techniques can be beneficial in general, and for other special cases of transaction management in distributed real-time database management systems. The proposed model can be applied to other similar problems related to synchronization, such as the generation of a backup copy of an operational database system. (C) 2003 Elsevier Science B.V. All rights reserved.

Keywords: distributed algorithms; mobile database systems; non-blocking protocols; parallel algorithms; serializability

Fault-tolerant computations over replicated finite rings

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-FUNDAMENTAL THEORY AND APPLICATIONS, 2003, vol. 50, no. 7, pp. 858-864

Authors: Imbert, L; Dimitrov, VS; Jullien, GA. CNRS, Lab Informat Robot & Microelect, F-34392 Montpellier 5, France; Univ Calgary, Dept Elect & Comp Engn, ATIPS Lab, Calgary, AB T2N 1N4, Canada

This paper presents a fault-tolerant technique based on the modulus replication residue number system (MRRNS), which allows for modular arithmetic computations over identical channels. In this system, fault tolerance is provided by adding extra computational channels that can be used to redundantly compute the mapped output. An algebraic technique is used to determine the error position in the mapped outputs and provide corrections. We also show that by taking advantage of some elementary polynomial properties we obtain the same level of fault tolerance with about a 30% decrease in the number of channels. This new system is referred to as the symmetric MRRNS (SMRRNS).

Keywords: fault-tolerant computation; modulus replication; parallel algorithms; residue arithmetic; symmetric modulus replication residue number system (SMRRNS)
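
As background for the residue arithmetic this record builds on, here is a minimal sketch of a plain residue number system: a value is split into residues, each residue is processed in its own independent channel, and the result is recovered with the Chinese Remainder Theorem. This is ordinary RNS, not the MRRNS or SMRRNS scheme of the paper, and it performs no fault detection or correction. The moduli and all function names are assumptions chosen for illustration; `math.prod` and `pow(x, -1, m)` require Python 3.8 or later.

```python
from math import prod

# Hypothetical pairwise-coprime channel moduli (not taken from the paper).
MODULI = (7, 11, 13)


def to_rns(x):
    """Map an integer to its tuple of residues, one per channel."""
    return tuple(x % m for m in MODULI)


def rns_mul(a, b):
    """Multiply channel by channel; no channel needs data from another."""
    return tuple((x * y) % m for x, y, m in zip(a, b, MODULI))


def from_rns(residues):
    """Recover the integer in [0, prod(MODULI)) via the Chinese Remainder Theorem."""
    big_m = prod(MODULI)
    total = 0
    for r, m in zip(residues, MODULI):
        m_i = big_m // m
        # pow(m_i, -1, m) is the modular inverse of m_i modulo m.
        total += r * m_i * pow(m_i, -1, m)
    return total % big_m


if __name__ == "__main__":
    product = rns_mul(to_rns(12), to_rns(34))
    print(from_rns(product))  # valid because 12 * 34 < 7 * 11 * 13
```

The result is only meaningful while the true value stays below the product of the moduli; fault-tolerant schemes such as the one in the abstract add redundant channels on top of this basic structure.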

Design and implementation of a distributed evolutionary computing software

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2003, vol. 33, no. 3, pp. 325-338

Authors: Tan, KC; Tay, A; Cai, J. Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117576, Singapore

Although the evolutionary algorithm is a powerful optimization tool, its computational cost in terms of time and hardware resources increases as the size or complexity of the problem increases. One promising approach to overcome this limitation is to exploit the inherent parallelism of evolutionary algorithms by creating the infrastructure necessary to support distributed evolutionary computing using existing Internet and hardware resources. This paper presents a Java-based distributed evolutionary computing software (Paladin-DEC), which enhances the concurrent processing and performance of evolutionary algorithms by allowing inter-communication of subpopulations among various computers over the Internet. Such a distributed system enables individuals to migrate periodically among multiple subpopulations according to some patterns to induce diversity of elite individuals, in a way that simulates how species evolve in a natural environment. The Paladin-DEC software is capable of keeping data integrity throughout the computation, and incorporates the features of robustness, security, fault tolerance, and work balancing. The effectiveness and advantages of Paladin-DEC are illustrated with two case studies: drug scheduling in cancer chemotherapy and searching probe sets of the yeast genome.

Keywords: distributed systems; evolutionary algorithms; parallel algorithms

Motion-compensated wavelet packet zerotree video coding on multicomputers

JOURNAL OF SYSTEMS ARCHITECTURE, 2003, vol. 49, no. 3, pp. 75-87

Authors: Feil, M; Uhl, A. Salzburg Univ, Dept Comp Sci, A-5020 Salzburg, Austria; Salzburg Univ, RIST, A-5020 Salzburg, Austria

In this work we describe and analyze algorithms for advanced video coding on distributed memory MIMD architectures. In particular, we consider a wavelet packet based codec using the concept of zerotree encoding. The main contribution of this work is the design of a parallel motion-compensated video coder composed of a wavelet packet decomposition in conjunction with the best basis algorithm followed by zerotree coding. Whereas two sensible parallelization techniques can be employed for the wavelet packet decomposition (subband based partitioning and stripe partitioning), the zerotree coding and motion compensation stages only allow one reasonable parallelization method (stripe partitioning). We investigate the advantages and drawbacks of the resulting different overall data distribution strategies and show experimental results obtained on a Siemens hpcLine cluster and a Cray T3E. (C) 2003 Elsevier B.V. All rights reserved.

Keywords: wavelet packets; video coding; MIMD architectures; data distribution; parallel algorithms; motion compensation

An improved parallel algorithm for certain Toeplitz cyclic tridiagonal systems on distributed-memory multicomputer

5th International Workshop on Advanced Parallel Processing Technologies

Authors: Zhang, XB; Luo, ZG; Li, XM. Coll Equipment Command & Technol, Beijing 101416, Peoples R China; Natl Lab Parallel & Distributed Proc, Changsha 410073, Peoples R China

ISBN: (print) 3540200541

Based on Luo's parallel algorithm [4] for certain Toeplitz cyclic tridiagonal systems on distributed-memory multicomputers, we present an improved algorithm. Its communication mechanism is simple, and its redundant computation is small when solving very large systems. Numerical experiments show that the parallel efficiency of the improved algorithm is higher than that of Luo's algorithm [4].

Keywords: parallel algorithms

On the numerical evaluation of linear recurrences

JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2003, vol. 150, no. 1, pp. 71-86

Authors: Barrio, R; Melendo, B; Serrano, S. Univ Zaragoza, Grp Mecan Espacial, E-50009 Zaragoza, Spain; Univ Zaragoza, Dpt Matemat Aplicada, E-50009 Zaragoza, Spain; Univ Zaragoza, Dpt Matemat Aplicada, E-50015 Zaragoza, Spain

We present some remarks on the numerical evaluation of recurrence relations. Rounding error bounds for the numerical scheme are presented and some numerical examples are given. In particular, we analyse conversion recurrences from different families of orthogonal polynomials, the limit case of Jacobi-Sobolev polynomials, random recurrences and perturbed Gegenbauer polynomials. In all these examples the theoretical bounds give sharp relative rounding error estimations. The parallel evaluation of recurrences is also considered and numerical tests on a Cray T3D are presented. (C) 2002 Elsevier Science B.V. All rights reserved.

Keywords: linear recurrence relations; rounding errors; parallel algorithms
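
To make concrete what numerically evaluating a linear recurrence involves, the sketch below runs a three-term recurrence forward in floating point and compares it with a closed form. The Chebyshev recurrence T_{n+1}(x) = 2x T_n(x) - T_{n-1}(x) is an assumption chosen because its closed form is simple; it is not one of the examples listed in the abstract, and the code does not reproduce the paper's error bounds. The function names are hypothetical.

```python
import math


def chebyshev_t(n, x):
    """Evaluate T_n(x) by running the three-term recurrence forward."""
    if n == 0:
        return 1.0
    t_prev, t_curr = 1.0, x  # T_0(x) and T_1(x)
    for _ in range(n - 1):
        # Each step reuses rounded values, so rounding errors propagate.
        t_prev, t_curr = t_curr, 2.0 * x * t_curr - t_prev
    return t_curr


def chebyshev_closed_form(n, x):
    """Reference value T_n(x) = cos(n * arccos(x)), valid for |x| <= 1."""
    return math.cos(n * math.acos(x))


if __name__ == "__main__":
    for n in (5, 50, 500):
        diff = abs(chebyshev_t(n, 0.3) - chebyshev_closed_form(n, 0.3))
        print(n, diff)
```

The printed differences give a rough feel for how error accumulates with the recurrence length, which is the quantity that rounding error bounds of the kind described in the abstract aim to control.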

A bandwidth latency tradeoff for broadcast and reduction

INFORMATION PROCESSING LETTERS, 2003, vol. 86, no. 1, pp. 33-38

Authors: Sanders, P; Sibeyn, JF. Max Planck Inst Informat, D-66123 Saarbrucken, Germany; Umea Univ, Dept Comp Sci, S-90187 Umea, Sweden

The "fractional tree" algorithm for broadcasting and reduction is introduced. Its communication pattern interpolates between two well-known patterns: sequential pipeline and pipelined binary tree. The speedup over the best of these simple methods can approach two for large systems and messages of intermediate size. For networks which are not very densely connected, the new algorithm seems to be the best known method for the important case that each processor has only a single (possibly bidirectional) channel into the communication network. (C) 2002 Elsevier Science B.V. All rights reserved.

Keywords: collective communication; broadcast; reduction; tree; single ported; half-duplex; full-duplex; parallel algorithms; mesh; hierarchical crossbar

Accuracy-based sampling and reconstruction with adaptive grid for parallel hierarchical tetrahedrization

2003 Eurographics/IEEE TVCG Workshop on Volume Graphics, VG '03

Authors: Tanaka, Hiromi T.; Takama, Yasufumi; Wakabayashi, Hiroki. Computer Vision Laboratory, Department of Computer Science, Ritsumeikan University, Noji-Higashi 1chome 1-1, Kusatsu, Shiga 525-8577, Japan

ISBN: (print) 1581137451

Recent advances in volume scanning techniques have made the task of acquiring volume data of 3-D objects easier and more accurate. Since the quantity of such acquired data is generally very large, a volume model capable of compressing data while maintaining a specified accuracy is required. The objective of this work is to construct a multiresolution tetrahedral representation of input volume data. This representation adapts to local field properties while preserving their discontinuities. In this paper, we present an accuracy-based adaptive sampling and reconstruction technique, which we call an adaptive grid, for hierarchical tetrahedrization of C1 continuous volume data. We have developed a parallel algorithm for adaptive grid generation that recursively bisects tetrahedral grid elements by increasing the number of grid nodes, according to local field properties such as orientation and curvature of isosurfaces, until the entire volume has been approximated within a specified level of view-invariant accuracy. We have also developed a parallel algorithm that detects and preserves both C0 and C1 discontinuities of field values, without the formation of cracks which normally occur during independent subdivision. Experimental results demonstrate the validity and effectiveness of the proposed approach. © The Eurographics Association 2003.

Keywords: parallel algorithms
