检索结果-内蒙古大学图书馆

IEEE International conference on Algorithms and Architectures for parallel Processing (ICAP)

作者： R. Wong R. Topor Hong Shen School of Computing and Information Technology Griffith University Nathan QLD Australia

This paper presents an efficient parallel algorithm for computing the mutual range-join of N sets of numbers on shared-nothing hypercube computers. The algorithm iteratively joins each set to the mutual range-join of the preceding sets. Each join is performed on all processors of the hypercube in parallel. The algorithm uses a global sorting method to distribute the elements of the first set evenly across all processors in increasing order, a new data balancing technique to distribute the elements of subsequent sets to match the intermediate set at each processor and to compensate for join skew, and a new efficient local range-join procedure. We analyse the performance of this algorithm and demonstrate that it improves on the best previously published algorithm for this problem when the join selectivity factor is small. The method can also be applied to similar problems such as band-join and equi-join.

关键词： Hypercubes Concurrent computing distributed computing Costs Computer architecture Iterative algorithms Application software Spatial databases Calculus Relational databases

来源：评论

学校读者我要写书评

暂无评论

Operator design pattern for data parallel computation

Operator design pattern for data parallel computation

引用

Technology of Object-Oriented Languages and Systems (TOOLS)

作者： J.-L. Pacherie I. R. I. S. A. Campus de Beaulieu Rennes France

We present the Operator design pattern which can be used for the design of both sequential and data parallel applications. To reach this goal, we show how the participants of this pattern can be implemented either for a sequential or a parallel execution. Besides, reusing the sequential design for a parallel application decreases the cost of parallelization by allowing the maintenance of a unique application for the two execution environments. The proposed approach for parallel programming does not require any dedicated compiler or code pre-processing. Nothing but object oriented features, such as inheritance and polymorphism, is used to provide the distributed behaviour of the parallel participants of the pattern. The Operator pattern can help to solve many design issues in relation with the development of reusable software components for data collection processing. Moreover, we are confident that many programmers who want to migrate their applications towards parallelism can find it helpful.

关键词： Concurrent computing Electrical capacitance tomography Message passing parallel processing distributed computing Guidelines Programming profession Computer applications

来源：评论

学校读者我要写书评

暂无评论

A comparison of data-parallel collective communication performance and its application

A comparison of data-parallel collective communication perfo...

引用

International conference on High Performance computing in the Asia-Pacific Region

作者： Y. Tanaka K. Kubota M. Matsuda M. Sato S. Sekiguchi Real World Computing Partnership Tsukuba Ibaraki Japan Electro Technical Laboratory Tsukuba Ibaraki Japan

Collective communications such as broadcast and reduction are commonly used in data parallel programs. It is important to understand the performance of such primitive communications to characterize parallel systems and analyze the performance of parallel applications running on specific parallel systems. We measured the performance of collective communication operations on several multiprocessor systems. In this paper, we report experimental results for collective communication performance on distributed memory systems. We also describe the performance prediction of data parallel programs using the performance of the primitives.

关键词： Workstations Switches Network topology Broadcasting Performance analysis Databases Communication system control Ethernet networks Concurrent computing Laboratories

来源：评论

学校读者我要写书评

暂无评论

Performances of the PS/sup 2/ parallel storage and processing system for tomographic image visualization

Performances of the PS/sup 2/ parallel storage and processin...

引用

International conference on parallel and distributed Systems (ICPADS)

作者： V. Messerli B. Gennart R.D. Hersch Ecole Polytechnique Federale de Lausanne Lausanne Switzerland

We propose a new approach for developing parallel I/O- and compute-intensive applications. At a high level of abstraction, a macro data flow description describes how processing and disk access operations are combined. This high-level description (CAP) is precompiled into compilable and executable C++ source language. parallel file system components specified by CAP are offered as reusable CAP operations. Low-level parallel file system components can, thanks to the CAP formalism, be combined with processing operations in order to yield efficient pipelined parallel I/O and compute intensive programs. The underlying parallel system is based on commodity components (PentiumPro processors, Fast Ethernet) and runs on top of WindowsNT. The CAP-based parallel program development approach is applied to the development of an I/O and processing intensive tomographic 3D image visualization application. Configurations range from a single PentiumPro I-disk system to a four PentiumPro 27-disk system. We show that performances scale well when increasing the number of processors and disks. With the largest configuration, the system is able to extract in parallel and project into the display space between three and four 512/spl times/512 images per second. The images may have any orientation and are extracted from a 100 MByte 3D tomographic image striped over the available set of disks.

关键词： Image storage Tomography File systems Yarn Concurrent computing Data visualization Ethernet networks Computer displays Computer applications parallel processing

来源：评论

学校读者我要写书评

暂无评论

Interpretive performance prediction for high performance application development

Interpretive performance prediction for high performance app...

引用

Annual Hawaii International conference on System Sciences (HICSS)

作者： M. Parashar S. Hariri Department of Computer Sciences University of Texas Austin Austin TX USA NPAC & Department of Computer Engineering Syracuse University Syracuse NY USA

Software development for high-performance (parallel/distributed) computing (HPC) is a non-trivial process; its complexity can be primarily attributed to the increased degrees of freedom that have to be resolved and tuned in such an environment. Performance prediction tools enable a developer to evaluate various available design alternatives and can assist in HPC application software development. In this paper, we first present a novel "interpretive" approach for accurate and cost-effective performance prediction. The approach has been used to develop an interpretive HPF/Fortran 90D application performance prediction framework. The accuracy and usability of the performance prediction framework are experimentally validated. We then outline the stages typically encountered during application software development for parallel/distributed HPC and highlight the significance and requirements of a performance prediction tool at the relevant stages. Numerical results using benchmarking kernels and application codes are presented to demonstrate the application of the interpretive performance prediction framework at different stages of the software development process.

关键词： Application software High performance computing Programming Concurrent computing distributed computing Hardware Usability Kernel Software algorithms Data visualization

来源：评论

学校读者我要写书评

暂无评论

Trace-driven analysis of migration-based gang scheduling policies for parallel computers

Trace-driven analysis of migration-based gang scheduling pol...

引用

International conference on parallel Processing (ICPP)

作者： S.K. Setia Computer Science Department George Mason University Fairfax VA USA

Gang scheduling is a job scheduling policy for parallel computers that combines elements of space-sharing and time-sharing. In this paper we analyze the performance of gang scheduling policies that allow the remapping of an executing job to a new set of processors. Most previously proposed gang-scheduling policies do not allow such job remapping under the assumption that it is prohibitively expensive. Through a detailed trace-driven simulation, we analyze the tradeoff between the benefits and overheads of such job relocation. Our results show that gang-scheduling policies that support such job relocation offer significant performance gains over policies that do not use remapping.

关键词： Processor scheduling Concurrent computing Time sharing computer systems Performance analysis Analytical models Costs Bandwidth Computer science Performance gain distributed control

来源：评论

学校读者我要写书评

暂无评论

Certification reports: supporting transactions in wireless systems

Certification reports: supporting transactions in wireless s...

引用

International conference on distributed computing Systems

作者： D. Barbara Bell Communications Research Inc. Morristown NJ USA

The emergence of small portable computers and the advances in wireless networking have made mobile computing today a reality. Information systems and databases are among the applications that make mobile computing attractive. While the topic of querying data in wireless and mobile systems has received a lot of attention, techniques to efficiently update data in these systems while providing transaction semantics are not fully developed. We present a novel protocol that uses the broadcast facility to help mobile units do some of the work of verifying if the transactions being run by them need to be aborted. Only when the mobile unit cannot detect any conflict is the server involved in completing the verification. Of course, if the transaction can commit, the server will install the valves in the central database and notify the mobile units (again, using the broadcast channel). The protocol uses a modified version of optimistic control. We study the performance of the protocol by means of a detailed simulation.

关键词： Certification Protocols Portable computers Mobile computing Transaction databases Broadcasting Computer networks Information systems Application software Valves

来源：评论

学校读者我要写书评

暂无评论

parallel data cube construction for high performance on-line analytical processing

Parallel data cube construction for high performance on-line...

引用

International conference on High Performance computing

作者： S. Goil A. Choudhary Department of Electrical and Computer Engineering Northwestern University Evanston IL USA

Decision support systems use online analytical processing (OLAP) to analyze data by posing complex queries that require different views of data. Traditionally, a relational approach (ROLAP) has been taken to build such systems. More recently, multi-dimensional database techniques (MOLAP) have been applied to decision-support applications. Data is stored in multi-dimensional arrays, which is a natural way to express the multi-dimensionality of the enterprise and is more suited for analysis. Precomputed aggregate calculations in a data cube can provide efficient query processing for OLAP applications. In this paper, we present algorithms and results for in-memory data cube construction on distributed-memory machines.

关键词： Performance analysis Aggregates Data analysis Algorithm design and analysis Concurrent computing distributed computing Decision support systems Relational databases Query processing Memory management

来源：评论

学校读者我要写书评

暂无评论

An efficient and authenticated group-oriented cryptoscheme based on a geometric method in Internet environments

An efficient and authenticated group-oriented cryptoscheme b...

引用

International conference on parallel and distributed Systems (ICPADS)

作者： Woei-Jiunn Tsaur Shi-Jinn Horng Shung-Shing Lee Ruey-Chang Tsai Tllepartment of Electrical Engineering National Taiwan University of Science and Technology Taipei Taiwan Department of Electronic Engineering Fu Shin Institute of Technology and Commerce I-Lan Taiwan

ISBN: (纸本)0818682272

Based on a geometric method, this paper presents an efficient and authenticated cryptoscheme for establishing secure group-oriented data communications in Internet environments. We assume that the Internet environments consist of many hosts, and each host has many users attached to it. The secure group-oriented communication scheme proposed in this paper incorporates the public-key distribution and the trigonometry concepts as the basic theory. Since this scheme does not need any trusted key distribution center to distribute the common secret session key between two parties and can reduce the computation time needed for securely sending messages to a group of receivers by using multiplication operations instead of modular exponentiation, it is quite suitable to be used in Internet environments so that the key distribution is convenient, time-saving and reparable. Furthermore, an authentication protocol is also proposed. Such a protocol can not only identify both the sender and the receiver of a group correctly, but can also make sure the transmitted message has reached its destination safely.

关键词： Internet Protocols Public key cryptography Data communication distributed computing IP networks Public key Authentication Data security Computer networks

来源：评论

学校读者我要写书评

暂无评论

Design of a processing element of a SIMD computer for genetic algorithms

Proceedings of the Conference on High Performance Computing ...

引用

proceedings of the conference on High Performance computing on the Information Superhighway, HPC Asia'97 1997年 688-691页

作者： Inoue, Tomio Sano, Masahiko Takahashi, Yoshizo Univ of Tokushima Tokushima Japan

We have been investigating the efficiency of Genetic Algorithms (GA) for solving for a variety of real problems. During our investigations we have concluded that the large amount computational time required to find GA based solutions on conventional computers is restrictive. We are therefore developing an innovative new computer architecture, suitable for the solution of large scale problems using GAs. In this paper we introduce the SIMD-GA (Single Instruction stream Multiple Data stream Genetic Algorithm), and discuss its hardware design and implementation. By taking advantage of the recent advances is HDLs (Hardware Description Language) and FPGA's (Field Programmable Gate Array) we have been able to quickly develop and prototype a PE (Processing Element) for a SIMD-GA. This approach allows us to build a cost-effective parallel processing architecture to overcome the problem of the computational time required for traditional sequential GA implementation.

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：