检索结果-内蒙古大学图书馆

Distribution-insensitive parallel external sorting on PC clusters

5th international symposium on High Performance computing/3rd international Workshop on OpenMP: Experiences and Implementations (WOMPEI 2003)

作者： Jeon, M Kim, D Korea Univ Dept Elect Engn Seoul 136701 South Korea

ISBN: (纸本)3540203591

there have been many parallel external sorting algorithms reported such as NOW-Sort, SPsort, and hill sort, etc. they are for sorting large-scale data stored in the disk, but they differ in the speed, throughput, and cost-effectiveness. Mostly they deal with data that are uniformly distributed in their value range. Few research results have been yet reported for parallel external sort for data with arbitrary distribution. In this paper, we present two distribution-insensitive parallel external sorting algorithms that use sampling technique and histogram counts to achieve even distribution of data among processors, which eventually contribute to achieve superb performance. Experimental results on a cluster of Linux workstations show up to 63% reduction in the execution time compared to previous NOW-sort.

关键词： Sorting

来源：评论

学校读者我要写书评

暂无评论

Cqserver: An example of applying a distributed object infrastructure for heterogeneous enterprise computation of continual queries 5

Cqserver: An example of applying a distributed object infras...

引用

5th international Conference on Enterprise Information Systems, ICEIS 2003

作者： Leopold, Jennifer Palmer, Tyler Department of Computer Science University of Missouri RollaMO65409 United States Cyntergy Technology LLC TulsaOK74119 United States

ISBN: (纸本)9729881618

the revolution in computing brought about by the Internet is changing the nature of computing from a personalized computing environment to a ubiquitous computing environment in which both data and computational resources are network-distributed. Client-server communications protocols permit parallel ad hoc queries of frequently-updated databases, but they do not provide the functionality to automatically perform continual queries to track changes in those data sources through time. the lack of persistence of the state of data resources requires users to repeatedly query databases and manually compare the results of searches through time. To date, continual query systems have lacked both external and internal scalability. Herein we describe CQServer, a scalable, platform- and implementation-independent system that uses a distributed object infrastructure for heterogeneous enterprise computation of both content- and time-based continual queries.

关键词： Ubiquitous computing

来源：评论

学校读者我要写书评

暂无评论

A decomposition algorithm of VHDL-AMS simulation solver 5

A decomposition algorithm of VHDL-AMS simulation solver

引用

5th international Conference on ASIC

作者： Wang, JF Ye, YZ Heilongjiang Univ Sch Comp Sci & Technol Harbin 150080 Peoples R China

ISBN: (纸本)078037889X

the decomposition issue of tasks for VLSI simulation oil distributed memory, multi computers is discussed in this paper. Mathematical and physical analyses are given for exploiting the parallelisms of these operations. An efficient decomposition algorithm is proposed. Using this algorithm, we can decompose a large-scale circuit into N sub-circuits of similar size while keeping the interconnect set of nodes to a minimum, which is beneficial to dynamic load distribution and balance later. this algorithm can be implemented in a parallel environment processing. Some experimental results of this decomposition algorithm are presented. Finally, the conclusion and future work are included.

关键词： VHDL-AMS simulation solver decomposition algorithm parallel processing block bordered equations LU factorization

来源：评论

学校读者我要写书评

暂无评论

An improved parallel algorithm for certain Toeplitz cyclic tridiagonal systems on distributed-memory multicomputer

引用

5th international Workshop on Advanced parallel Processing Technologies

作者： Zhang, XB Luo, ZG Li, XM Coll Equipment Command & Technol Beijing 101416 Peoples R China Natl Lab Parallel & Distributed Proc Changsha 410073 Peoples R China

ISBN: (纸本)3540200541

Based on Luo's parallel algorithm [4] for certain Toeplitz cyclic tridiagonal systems on distributed-memory multicomputer, we present an improved algorithm. Its communication mechanism is simple and redundant computing is small for solving massively systems. the numerical experiments show that the parallel efficiency of the improved algorithm is higher than Luo's algorithm [4].

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

the security architecture of the Java operating system JX -: A security architecture for distributed parallel computing

引用

5th international Workshop on Advanced parallel Processing Technologies

作者： Wawersich, C Felser, M Golm, M Kleinöder, J Dept Comp Sci 4 D-91058 Erlangen Germany Siemens AG Corp Technol CT SE 2 D-81730 Munich Germany

ISBN: (纸本)3540200541

Using the unneeded computation power in the internet for distributed computing is getting more and more eligible. To increase the willingness to provide unneeded computing power, a secure platform is needed for the execution of untrusted code. We present the architecture of the JX operating system, which can be used to safely execute untrusted code. the problem of erroneous agents crashing the system is solved by using Java - a typesafe language - as implementation language. the resource consumption of the agents is controlled by a security manager, that inspects every interaction between an agent and a system service. If the security policy does not approve the use of a system service, the access can be denied. An agent execution system build upon JX is presented to illustrate the security problems occurring and the solutions provided by the operating system JX.

关键词： computing power

来源：评论

学校读者我要写书评

暂无评论

Online remote trace analysis of parallel applications on high-performance clusters

引用

5th international symposium on High Performance computing, ISHPC 2003

作者： Brunst, Holger Malony, Allen D. Shende, Sameer S. Bell, Robert Department for Computer and Information Science University of Oregon Eugene United States Center for High Performance Computing Dresden University of Technology Germany

ISBN: (纸本)3540203591

the paper presents the design and development of an online remote trace measurement and analysis system. the work combines the strengths of the TAU performance system with that of the VNG distributed parallel trace analyzer. Issues associated with online tracing are discussed and the problems encountered in system implementation are analyzed in detail. Our approach should port well to parallel platforms. Future work includes testing the performance of the system on largescale machines. © Springer-Verlag Berlin Heidelberg 2003.

关键词： Trace analysis

来源：评论

学校读者我要写书评

暂无评论

DNS of fully turbulent flow in a LPT passage

引用

international JOURNAL OF HEAT AND FLUID FLOW 2003年第4期24卷 636-644页

作者： Kalitzin, G Wu, XH Durbin, PA Stanford Univ Dept Mech Engn Ctr Integrated Turbulence Simulat Stanford CA 94305 USA

this work addresses the pattern of turbulent kinetic energy generated by distortion and the effect of external disturbances on boundary layer transition. this is investigated with direct numerical simulation of grid turbulence convected through a linear turbine blade cascade. Comparisons are made with results from earlier computations of flow through the same cascade with a turbulence free inflow and an inflow with migrating wakes. the distribution of turbulence in the passage strongly depends on the mean flow field and can partly be explained by the travel time needed for the inlet turbulence to drift to a certain location. this results in a local amplification of turbulence near the leading edge stagnation region and in the passage on the pressure side near the trailing edge. the penetration of disturbances into the blade boundary layers induces bypass transition. In particular, the transition pattern on the suction side of the blade differs significantly for the three types of inflow. (C) 2003 Elsevier Science Inc. All rights reserved.

关键词： DNS fully turbulent flow migrating wakes low pressure turbine blade passage OpenMP parallel computing

来源：评论

学校读者我要写书评

暂无评论

parallel matrix multiplication and LU factorization on ethernet-based clusters

引用

5th international symposium on High Performance computing, ISHPC 2003

作者： Tinetti, Fernando G. Denham, Mónica de Giusti, Armando UNLP 50 y 115 La Plata1900 Argentina

ISBN: (纸本)3540203591

this work presents a simple but effective approach for two representative linear algebra operations to be solved in parallel on Ethernet- based clusters: matrix multiplication and LU matrix factorization. the main objectives of this approach are: simplicity and performance optimization. the approach is completed at a lower level by including a broadcast routine based directly on top of UDP to take advantage of the Ethernet physical broadcast facility. the performance of the proposed algorithms implemented on Ethernet-based clusters is compared with the performance obtained with the ScaLAPACK library, which is taken as having highly optimized algorithms for distributed memory parallel computers in general and clusters in particular. © Springer-Verlag Berlin Heidelberg 2003.

关键词： Cluster computing

来源：评论

学校读者我要写书评

暂无评论

distributed location of shared resources and its application to the load sharing problem in heterogeneous distributed systems

引用

5th international symposium on High Performance computing/3rd international Workshop on OpenMP: Experiences and Implementations (WOMPEI 2003)

作者： Fujita, S Tagashira, S Hiroshima Univ Grad Sch Engn Dept Informat Engn Hiroshima 730 Japan

ISBN: (纸本)3540203591

In this paper, we propose a distributed algorithm for solving a resource location problem in distributed systems. the proposed algorithm is fully distributed in the sense that it assumes no centralized control, and has a remarkable property such that it can always find a target node satisfying a certain property, if any. the result of simulations implies that: (1) the performance of the underlying load sharing scheme can be significantly improved by increasing the preciseness of a node location, and (2) in the proposed scheme, the average number of inquiries per location is bounded by a very small value (e.g., only two inquiries are enough even when the underlying system consists of 100 nodes).

关键词： Location

来源：评论

学校读者我要写书评

暂无评论

parallel LU-decomposition on pentium streaming SIMD extensions

引用

5th international symposium on High Performance computing/3rd international Workshop on OpenMP: Experiences and Implementations (WOMPEI 2003)

作者： Takahashi, A Soliman, M Sedukhin, S Univ Aizu Fukushima 9658580 Japan

ISBN: (纸本)3540203591

Solving systems of linear equations is central in scientific computation. In this paper, we focus on using Intel's Pentium Streaming SIMD Extensions (SSE) for parallel implementation of LU-decomposition algorithm. Two implementations (non-SSE and SSE) of LU-decomposition are compared. Moreover, two different variants of the algorithm for the SSE version are also compared. Our results demonstrate an average performance of 2.25 times faster than the non-SSE version. this speedup is higher than 1.74 times the speedup of Intel's SSE implementation. the source of the speedup is highly reusing of loaded data by efficiently organizing SSE instructions.

关键词： Gaussian elimination streaming SIMD parallel processing performance evaluation data reusing

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：