Search results - Inner Mongolia University Library

International Symposium on Parallel and Distributed Processing (IPDPS)

Authors: D. Cachera; P. Quinton; S. Rajopadhye; T. Risset (Campus de Beaulieu, IRISA, France; École normale supérieure de Cachan, France)


An evaluation of adaptive numerical integration algorithms on parallel systems

Parallel Algorithms and Applications, 2003, vol. 18, no. 1-2, pp. 27-47

Authors: Schürer, Rudolf; Uhl, Andreas (Department of Mathematics, University of Salzburg, Hellbrunner Str. 34, A-5020 Salzburg, Austria; Department of Scientific Computing, University of Salzburg, Salzburg, Austria)

Parallel adaptive algorithms for the approximation of a multi-dimensional integral over a hyper-rectangular region are described. Algorithms with centralized global region collection are compared to algorithms using local region collections. The latter algorithms should result in better scalability since global communication is avoided. Both types of algorithms are compared to quasi-Monte Carlo integration. Tests are performed using Genz's test functions and speed-up results are given.

Keywords: parallel algorithms
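As a rough illustration of the "global region collection" idea in the abstract above, the following sequential Python sketch keeps all subregions in one priority queue and always refines the region with the largest error estimate. The midpoint rule, the bisection-based error estimate, and every function name here are assumptions made for this sketch; they are not taken from the paper, and no parallel communication is modelled.

```python
import heapq
import math
from itertools import count

def midpoint(f, lo, hi):
    """Midpoint-rule estimate of the integral of f over the box [lo, hi]."""
    centre = [(a + b) / 2 for a, b in zip(lo, hi)]
    volume = math.prod(b - a for a, b in zip(lo, hi))
    return volume * f(centre)

def split(lo, hi):
    """Bisect the box along its widest axis and return the two halves."""
    axis = max(range(len(lo)), key=lambda k: hi[k] - lo[k])
    mid = (lo[axis] + hi[axis]) / 2
    left_hi, right_lo = list(hi), list(lo)
    left_hi[axis] = mid
    right_lo[axis] = mid
    return (list(lo), left_hi), (right_lo, list(hi))

def evaluate(f, lo, hi):
    """Return (estimate, error estimate) for one box.

    The error estimate is the difference between the one-box and the
    two-box midpoint results (an assumption of this sketch).
    """
    coarse = midpoint(f, lo, hi)
    fine = sum(midpoint(f, a, b) for a, b in split(lo, hi))
    return fine, abs(fine - coarse)

def adaptive_integrate(f, lo, hi, tol=1e-6, max_regions=20000):
    """Refine the region with the largest error until the total error <= tol."""
    tie = count()  # tie-breaker so the heap never compares boxes
    est, err = evaluate(f, lo, hi)
    heap = [(-err, next(tie), est, list(lo), list(hi))]  # the global collection
    total, total_err = est, err
    while total_err > tol and len(heap) < max_regions:
        neg_err, _, est, lo, hi = heapq.heappop(heap)
        total -= est
        total_err += neg_err  # neg_err is the negated error of the popped box
        for a, b in split(lo, hi):
            child_est, child_err = evaluate(f, a, b)
            heapq.heappush(heap, (-child_err, next(tie), child_est, a, b))
            total += child_est
            total_err += child_err
    return total, total_err
```

In a parallel variant with a centralized collection, workers would take regions from this single queue and return the refined halves; with local collections, each worker would keep its own queue, which is the design the abstract expects to scale better.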


Parallel interior-point solver for structured linear programs

MATHEMATICAL PROGRAMMING, 2003, vol. 96, no. 3, pp. 561-584

Authors: Gondzio, J; Sarkissian, R (Univ Edinburgh, Dept Math & Stat, Edinburgh EH9 3JZ, Midlothian, Scotland)

Issues of implementation of an object-oriented library for parallel interior-point methods are addressed. The solver can easily exploit any special structure of the underlying optimization problem. In particular, it allows a nested embedding of structures and by this means very complicated real-life optimization problems can be modelled. The efficiency of the solver is illustrated on several problems arising in the optimization of networks. The sequential implementation outperforms the state-of-the-art commercial optimization software. The parallel implementation achieves speed-ups of about 3.1-3.9 on 4-processor parallel systems and speed-ups of about 10-12 on 16-processor parallel systems.

Keywords: parallel algorithms; linear programming; computer software; mathematical optimization; mathematics


Parallel stored-integral and semidirect Hartree-Fock and DFT methods with data compression

JOURNAL OF COMPUTATIONAL CHEMISTRY, 2003, vol. 24, no. 2, pp. 154-160

Authors: Mitin, AV; Baker, J; Wolinski, K; Pulay, P (Univ Arkansas, Dept Chem & Biochem, Fayetteville, AR 72701, USA; Parallel Quantum Solut, Fayetteville, AR 72703, USA; Marie Curie Sklodowska Univ, Lublin, Poland)

Recent developments in magnetic disk technology have made stored-integral techniques competitive with the currently more widely used direct methods, which involve the recalculation of the basic two-electron integrals. We present efficient conventional (all integrals stored) and semidirect Hartree-Fock and DFT algorithms with data compression for single-processor and distributed memory parallel computers, and compare them with the corresponding direct algorithms. On inexpensive modern personal computer-based hardware, the stored integral method is up to three times more efficient than the direct method in terms of total elapsed job time. (C) 2002 Wiley Periodicals, Inc.

Keywords: Hartree-Fock method; DFT method; integral compression; index compression; parallel algorithms


Derivation of a parallel string matching algorithm

INFORMATION PROCESSING LETTERS, 2003, vol. 85, no. 5, pp. 255-260

Authors: Misra, J (Univ Texas, Austin, TX 78712, USA)

We derive an efficient parallel algorithm to find all occurrences of a pattern string in a subject string in O(log n) time, where n is the length of the subject string. The number of processors employed is of the order of the product of the two string lengths. The theory of powerlists [J. Kornerup, PhD Thesis, 1997; J. Misra, ACM Trans. Programming Languages Systems 16 (16) (1994) 1737-1740] is central to the development of the algorithm and its algebraic manipulations. (C) 2002 Elsevier Science B.V. All rights reserved.

Keywords: parallel algorithms; string; data structures; powerlist
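To make the processor and time bounds in the abstract above more concrete, here is a minimal Python sketch that simulates a data-parallel matcher sequentially. It assumes one comparison per (position, pattern index) pair, followed by a pairwise AND-reduction. This is not the powerlist derivation from the paper; the function name and the round counter are hypothetical.

```python
def find_occurrences(subject, pattern):
    """Return (match positions, number of reduction rounds).

    Each list comprehension below stands for one parallel step in which
    every element would be computed by its own processor.
    """
    n, m = len(subject), len(pattern)
    if m == 0 or m > n:
        return [], 0
    # Step 1: one comparison per (i, j) pair, about n * m processors.
    rows = [[subject[i + j] == pattern[j] for j in range(m)]
            for i in range(n - m + 1)]
    # Step 2: AND-reduce every row by combining neighbours pairwise;
    # a row of length m shrinks to length 1 in ceil(log2(m)) rounds.
    rounds = 0
    while len(rows[0]) > 1:
        rows = [[all(row[k:k + 2]) for k in range(0, len(row), 2)]
                for row in rows]
        rounds += 1
    return [i for i, row in enumerate(rows) if row[0]], rounds

print(find_occurrences("abracadabra", "abra"))  # ([0, 7], 2)
```

Because the pattern is no longer than the subject, the number of rounds is at most ceil(log2(n)), which is consistent with the O(log n) bound quoted in the abstract.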


Solving awari with parallel retrograde analysis

COMPUTER, 2003, vol. 36, no. 10, pp. 26-+

Authors: Romein, JW; Bal, HE (Free Univ Amsterdam, Amsterdam, Netherlands)

In awari, a two-person game of pure skill, players sow stones into pits on a board. The game's rules define how to capture stones, and the player who captures the most wins the game. For more than a decade, researchers have studied computerized techniques to play awari. The authors have now solved the game by determining the score of 889,063,398,406 board positions and storing them in databases. They performed the necessary computations on a 144-processor parallel computer with 72 gigabytes of main memory and a fast Myrinet interconnect.

Keywords: concurrent computing; databases; Africa; parallel algorithms; clustering algorithms; clocks


Testing parallel random number generators

PARALLEL COMPUTING, 2003, vol. 29, no. 1, pp. 69-94

Authors: Srinivasan, A; Mascagni, M; Ceperley, D (Florida State Univ, Dept Comp Sci, Tallahassee, FL 32308, USA; Univ Illinois, Natl Ctr Supercomp Applicat, Urbana, IL 61801, USA)

Monte Carlo computations are considered easy to parallelize. However, the results can be adversely affected by defects in the parallel pseudorandom number generator used. A parallel pseudorandom number generator must be tested for two types of correlation: (i) intra-stream correlation, as for any sequential generator, and (ii) inter-stream correlation, that is, correlation between random number streams on different processes. Since bounds on these correlations are difficult to prove mathematically, large and thorough empirical tests are necessary. Many of the popular pseudorandom number generators in use today were tested when computational power was much lower, and hence they were evaluated with much smaller test sizes. This paper describes several tests of pseudorandom number generators, both statistical and application-based. We show defects in several popular generators. We describe the implementation of these tests in the SPRNG [ACM Trans. Math. Software 26 (2000) 436; SPRNG, scalable parallel random number generators: SPRNG 1.0, http://www.ncsa.uiuc.edu/Apps/SPRNG; SPRNG 2.0, http://sprng.cs.fsu.edu] test suite and also present results for the tests conducted on the SPRNG generators. These generators have passed some of the largest empirical random number tests. (C) 2002 Elsevier Science B.V. All rights reserved.

Keywords: parallel random number generators; random number tests; parallel algorithms; random number software
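The two kinds of correlation named in the abstract above can be illustrated with a toy check in Python. The sketch below uses the standard library's `random.Random` as a stand-in generator and seeds one stream per simulated process with `base_seed + rank`; both choices are assumptions for illustration only. It does not use SPRNG and is far smaller and weaker than the tests the paper describes.

```python
import random

def pearson(xs, ys):
    """Sample Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5

def make_streams(num_streams, length, base_seed=12345):
    """One stream per simulated process (seeding scheme is hypothetical)."""
    generators = [random.Random(base_seed + rank) for rank in range(num_streams)]
    return [[rng.random() for _ in range(length)] for rng in generators]

def lag1_autocorrelation(stream):
    """Intra-stream check: correlation between consecutive values."""
    return pearson(stream[:-1], stream[1:])

def max_inter_stream_correlation(streams):
    """Inter-stream check: largest |r| over all pairs of streams."""
    return max(abs(pearson(a, b))
               for i, a in enumerate(streams) for b in streams[i + 1:])

streams = make_streams(num_streams=4, length=5000)
print("lag-1 autocorrelation of stream 0:", lag1_autocorrelation(streams[0]))
print("max inter-stream correlation:", max_inter_stream_correlation(streams))
```

A defective setup, such as two processes sharing the same seed, shows up immediately as an inter-stream correlation of 1. Subtler defects need the much larger statistical and application-based tests discussed in the paper.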


An efficient parallel algorithm with application to computational fluid dynamics

COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2003, vol. 45, no. 1-3, pp. 165-188

Authors: Rivera, W; Zhu, JP; Huddleston, D (Univ Akron, Dept Theoret & Appl Math, Akron, OH 44224, USA; Univ Puerto Rico, Dept Elect & Comp Engn, Mayaguez, PR 00680, USA; Mississippi State Univ, Dept Civil Engn, Mississippi State, MS 39762, USA)

When solving time-dependent partial differential equations on parallel computers using the nonoverlapping domain decomposition method, one often needs numerical boundary conditions on the boundaries between subdomains. These numerical boundary conditions can significantly affect the stability and accuracy of the final algorithm. In this paper, a stability and accuracy analysis of the existing methods for generating numerical boundary conditions will be presented, and a new approach based on explicit predictors and implicit correctors will be used to solve convection-diffusion equations on parallel computers, with application to aerospace engineering for the solution of Euler equations in computational fluid dynamics simulations. Both theoretical analyses and numerical results demonstrate significant improvement in stability and accuracy by using the new approach. (C) 2003 Elsevier Science Ltd. All rights reserved.

Keywords: time lagging; explicit predictor; domain decomposition; parallel algorithms; partial differential equations


Constructing H4, a fast depth-size optimal parallel prefix circuit

JOURNAL OF SUPERCOMPUTING, 2003, vol. 24, no. 3, pp. 279-304

Authors: Lin, YC; Hsu, YH; Liu, CK (Natl Taiwan Univ Sci & Technol, Dept Comp Sci & Informat Engn, Taipei 106, Taiwan; Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei 106, Taiwan)

Given n values x_1, x_2, ..., x_n and an associative binary operation ⊗, the prefix problem is to compute x_1 ⊗ x_2 ⊗ ... ⊗ x_i for 1 ≤ i ≤ n. Prefix circuits are combinational circuits for solving the prefix problem. For any n-input prefix circuit D with depth d and size s, if d + s = 2n - 2, then D is depth-size optimal. In general, a prefix circuit with a small depth is faster than one with a large depth. For prefix circuits with the same depth, a prefix circuit with a smaller fan-out occupies less area and is faster in VLSI implementation. This paper is on constructing parallel prefix circuits that are depth-size optimal with small depth and small fan-out. We construct a depth-size optimal prefix circuit H4 with fan-out 4. It has the smallest depth among all known depth-size optimal prefix circuits with a constant fan-out; furthermore, when n ≥ 136, its depth is less than, or equal to, those of all known depth-size optimal prefix circuits with unlimited fan-out. A size lower bound of prefix circuits is also derived. Some properties related to depth-size optimality and size optimality are introduced; they are used to prove that H4 is depth-size optimal.

Keywords: depth; depth-size optimal; fan-out; parallel algorithms; prefix circuits; size optimal
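The depth-size condition in the abstract above (depth plus size equal to 2n - 2) can be checked on two textbook circuits with a short Python sketch. The level-list representation and the function names are assumptions of this sketch, and neither circuit is the H4 construction from the paper: the serial chain meets the condition with maximal depth, while a Kogge-Stone style circuit trades extra operation nodes for logarithmic depth.

```python
from operator import add

def serial_circuit(n):
    """Chain of n - 1 operation nodes, one per level."""
    return [[(i - 1, i)] for i in range(1, n)]

def kogge_stone_circuit(n):
    """Level k combines each position i with position i - 2**k."""
    levels, dist = [], 1
    while dist < n:
        levels.append([(i - dist, i) for i in range(dist, n)])
        dist *= 2
    return levels

def run(levels, xs, op=add):
    """Evaluate a circuit; all nodes of a level read the previous level's values."""
    vals = list(xs)
    for level in levels:
        prev = list(vals)
        for src, dst in level:
            vals[dst] = op(prev[src], prev[dst])  # src < dst keeps operand order
    return vals

def depth_and_size(levels):
    """Depth is the number of levels (valid for the two circuits above,
    where every level depends on the one before); size counts the nodes."""
    return len(levels), sum(len(level) for level in levels)

n = 8
xs = list(range(1, n + 1))
for name, circuit in [("serial", serial_circuit(n)),
                      ("Kogge-Stone", kogge_stone_circuit(n))]:
    d, s = depth_and_size(circuit)
    print(name, run(circuit, xs), "d + s =", d + s, "vs 2n - 2 =", 2 * n - 2)
```

For n = 8 the serial circuit has depth 7 and size 7, so d + s = 14 = 2n - 2 and it is depth-size optimal. The Kogge-Stone style circuit has depth 3 but size 17, so d + s = 20 and it is not.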


Optimal parallel prefix on the postal model

JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2003, vol. 19, no. 1, pp. 75-83

Authors: Lin, YC; Yeh, CS (Natl Taiwan Univ Sci & Technol, Dept Comp Sci & Informat Engn, Taipei 106, Taiwan; Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei 106, Taiwan)

This paper explores the prefix operation on a message-passing fully connected multicomputer with multiport postal communication. We present an exact communication lower bound for the prefix operation on the model. Two efficient parallel prefix algorithms are also presented; they are optimal in terms of the number of communication steps. For an input of size n, one of the algorithms using n processors is also time-optimal; the other algorithm using p < n processors can be cost-optimal and can achieve linear speedup.

Keywords: exact communication lower bound; message-passing multicomputer; parallel algorithms; postal model; prefix operation
