Search results - Inner Mongolia University Library

MorphoSys: A coarse grain reconfigurable architecture for multimedia applications

8th international Euro-Par conference on parallel processing

Authors: Parizi, H; Niktash, A; Bagherzadeh, N; Kurdahi, F. Univ Calif Irvine, Dept Elect & Comp Engn, Irvine, CA 92697, USA

ISBN: (print) 3540440496

MorphoSys is a reconfigurable architecture for computation intensive applications. It combines both coarse grain and fine grain reconfiguration techniques to optimize hardware, based on the application domain. M2, the current implementation, is developed as an IP core. It is synthesized based on the TSMC 0.13 micron technology. Experimental results show that for multimedia applications MorphoSys has a performance comparable to ASICs with the added benefit of being able to be reconfigured for different applications in one clock cycle.

Keywords: Reconfigurable architectures

Parallel solving symmetric eigenproblems

5th international conference on algorithms and architectures for parallel processing, ICA3PP 2002

Authors: Cao, Xing-Qin; Chi, Xue-Bin; Gu, Ming. Department of Computer Science, Huazhong University of Science and Technology, China; Supercomputing Center, Computer Network Information Center, Chinese Academy of Sciences, Beijing 100080, China; Department of Mathematics, University of California, Berkeley, CA, United States

ISBN: (print) 0769515126

This paper discusses the parallel solution of symmetric eigenproblems, covering both standard and generalized eigenvalue problems. The standard eigenvalue problem and the tridiagonal eigenvalue problem are not the main focus. For the symmetric-definite generalized eigenvalue problem, which arises in many practical applications, we give a new parallel computational method for reducing the generalized eigenproblem to a standard one. The parallel algorithm is designed to reduce communication. Numerical measurements are reported on the SGI/Cray T3E and Hitachi SR2201, and some computational results are compared with ScaLAPACK. © 2002 IEEE.

Keywords: Eigenvalues and eigenfunctions
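
For readers unfamiliar with the reduction mentioned in the abstract above, the following is a minimal serial sketch of the classical Cholesky-based reduction of a symmetric-definite generalized eigenproblem A x = λ B x to a standard one. It is not the parallel method proposed in the paper, whose details the abstract does not give; the function names `reduce_to_standard` and `generalized_eigh` and the small test matrices are assumptions made for illustration.

```python
import numpy as np

def reduce_to_standard(A, B):
    """Reduce A x = lambda B x to C y = lambda y.

    Assumes A is symmetric and B is symmetric positive definite.
    With B = L L^T (Cholesky), C = L^{-1} A L^{-T} and y = L^T x.
    """
    L = np.linalg.cholesky(B)          # lower-triangular factor of B
    X = np.linalg.solve(L, A)          # X = L^{-1} A
    C = np.linalg.solve(L, X.T).T      # C = L^{-1} A L^{-T} (uses A = A^T)
    C = 0.5 * (C + C.T)                # remove round-off asymmetry
    return C, L

def generalized_eigh(A, B):
    """Eigenpairs of A x = lambda B x via the reduction above."""
    C, L = reduce_to_standard(A, B)
    w, Y = np.linalg.eigh(C)           # standard symmetric eigenproblem
    X = np.linalg.solve(L.T, Y)        # back-transform: x = L^{-T} y
    return w, X

# Hypothetical 2x2 example data.
A = np.array([[2.0, 1.0], [1.0, 3.0]])
B = np.array([[4.0, 1.0], [1.0, 2.0]])
w, X = generalized_eigh(A, B)
# Each column x of X should satisfy A x = lambda B x.
residual = np.abs(A @ X - (B @ X) * w).max()
```

The explicit triangular solves are written with a general solver for brevity; a production code would use dedicated triangular routines.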


Creating portable and automatically scalable parallel software using the PARSA™ programming methodology

5th international conference on algorithms and architectures for parallel processing

Authors: Murthi, V; Levine, D; Marquis, J; Shirazi, B. Univ Texas, Dept Comp Engn & Sci, Arlington, TX 76019, USA

ISBN: (print) 0769515126

Creating portable and automatically scalable parallel software has been a goal for researchers and practitioners since the advent of parallel computing. In this paper we present a programming methodology that reduces parallel programming complexity while creating portable and automatically scalable parallel software. To support this methodology, two separate tools have been developed: the PARSA Software Development Environment and an accompanying thread manager. The development environment addresses programming issues via an object-based graphical programming methodology that automatically transforms a project into portable and scalable source code. The generated source code makes calls to the user-level thread manager, which manages the run-time execution of the parallel software. Two sample applications that contain various forms of parallelism have been developed and compiled on three different systems with diverse native threading mechanisms to demonstrate portability. Finally, the automatic scalability is demonstrated with the run-time performance of the applications on multiprocessor systems.

Keywords: Managers


Parallel convex hull computation by generalised regular sampling

8th international Euro-Par conference on parallel processing

Author: Tiskin, A. Univ Warwick, Dept Comp Sci, Coventry CV4 7AL, W Midlands, England

ISBN: (print) 3540440496

The model of bulk-synchronous parallel (BSP) computation is an emerging paradigm of general-purpose parallel computing. We propose the first optimal deterministic BSP algorithm for computing the convex hull of a set of points in three-dimensional Euclidean space. Our algorithm is based on known fundamental results from combinatorial geometry concerning small-sized, efficiently constructible ε-nets and ε-approximations of a given point set. The algorithm generalises the technique of regular sampling, used previously for sorting and two-dimensional convex hull computation. The cost of the simple algorithm is optimal only for extremely large inputs; we show how to reduce the required input size by applying regular sampling in a multi-level fashion.

Keywords: Approximation algorithms
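
The abstract above refers to regular sampling as previously used for sorting. As a rough illustration of that underlying technique only (not of the three-dimensional convex hull algorithm), here is a sequential simulation of sorting by regular sampling over p blocks. The function name `regular_sample_sort` and the choice of p samples per block are assumptions for this sketch.

```python
import bisect

def regular_sample_sort(data, p):
    """Sequentially simulate sorting by regular sampling with p 'processors'."""
    n = len(data)
    if n == 0:
        return []
    # 1. Split the input into p blocks and sort each block locally.
    blocks = [sorted(data[i * n // p:(i + 1) * n // p]) for i in range(p)]
    # 2. Take p regularly spaced samples from every non-empty block.
    samples = []
    for block in blocks:
        if block:
            samples.extend(block[j * len(block) // p] for j in range(p))
    samples.sort()
    # 3. Pick p - 1 regularly spaced splitters from the sorted samples.
    splitters = [samples[j * len(samples) // p] for j in range(1, p)]
    # 4. Route every element to the bucket delimited by the splitters.
    buckets = [[] for _ in range(p)]
    for block in blocks:
        for x in block:
            buckets[bisect.bisect_right(splitters, x)].append(x)
    # 5. Sort each bucket (one per processor) and concatenate in order.
    return [x for bucket in buckets for x in sorted(bucket)]
```

In a real BSP setting, steps 1 and 5 would run on separate processors and step 4 would be a single communication phase; the regular spacing of the samples is what bounds the bucket sizes.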


An improved parallel watershed algorithm for distributed memory system

5th international conference on algorithms and architectures for parallel processing, ICA3PP 2002

Authors: Zhou, Hai-Fang; Jiang, Yan-Huang; Yang, Xue-Jun. Department of Computer Science and Technology, National University of Defense Technology, Changsha 410073, China

ISBN: (print) 0769515126

As a classical method of image segmentation in mathematical morphology, the watershed transform has been applied successfully in fields such as remote sensing image processing, biomedical applications and computer vision. However, the watershed transform is a relatively time-consuming task, and classical watershed algorithms have a strongly recursive nature, so straightforward parallel versions have very low efficiency. Meijster and Roerdink (1996; 1995) have proposed an alternative algorithm (M-R for short), which consists of three stages and aims to exploit parallelism fully. The M-R algorithm has many limitations and some underlying logical errors; we therefore present an improved parallel watershed algorithm for image segmentation, based on a directed valued graph called the components graph. We first point out the limits of the M-R algorithm, then describe the theory and steps of our algorithm in detail. Finally, experimental studies and performance measurements are given. © 2002 IEEE.

Keywords: Image segmentation


Retrieval of multispectral satellite imagery on cluster architectures

8th international Euro-Par conference on parallel processing

Authors: Bretschneider, T; Kao, O. Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore; Paderborn Univ, Dept Comp Sci, Paderborn, Germany

ISBN: (print) 3540440496

The retrieval of images in remote sensing databases is based on world-oriented information such as the location of the scene, the scanner used, and the date of acquisition. However, these descriptions are not meaningful for many users who have limited knowledge of remote sensing but nevertheless have to work with satellite imagery. Therefore, a content-based dynamic retrieval technique is proposed that uses a cluster architecture to meet the resulting computational requirements. Initially the satellite images are distributed evenly over the available computing nodes and the retrieval operations are performed simultaneously. The dynamic strategy creates the need for workload balancing before the sub-results are joined in a final ranking.

Keywords: Remote sensing


Parallel and distributed databases, data mining and knowledge discovery

8th international Euro-Par conference on parallel processing

Authors: Kosch, H; Skillicorn, D; Talia, D

ISBN: (print) 3540440496

We would like to welcome you to Paderborn and to the Euro-Par 2002 topic on Parallel and Distributed Databases, Data Mining and Knowledge Discovery.


Parallel difference schemes for parabolic problems

5th international conference on algorithms and architectures for parallel processing, ICA3PP 2002

Authors: Yuan, Guangwei; Shen, Longjiun; Zhou, Yu-Lin. Institute of Applied Physics and Computational Mathematics, Laboratory of Computational Physics, P. O. Box 8009, Beijing 100088, China

ISBN: (print) 0769515126

In this paper some implicit domain decomposition procedures for solving parabolic problems are proposed. In these methods, the classic implicit scheme is used in each sub-domain, and the Dirichlet boundary values at the (interior) boundaries of sub-domains are simply taken as the values of the difference solution at the previous time level. These implicit domain decomposition procedures are easy to implement on parallel computers and are called parallel difference schemes. They are proved to be unconditionally stable and convergent in the discrete L∞ and H1 norms, and the convergence order is O(τ + h) even though the truncation error at the sub-domain boundaries is O(1). © 2002 IEEE.

Keywords: Domain decomposition methods
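
The idea in the abstract above can be sketched for the one-dimensional heat equation u_t = u_xx with zero boundary values. This is one possible reading under stated assumptions, not the authors' exact scheme: the grid is split into two sub-domains, each advanced by backward Euler, and the Dirichlet value at each interior boundary is the neighbouring node's value from the previous time level. The function names, the split index `m`, the ratio `r = tau / h**2` and the initial data are all illustrative choices.

```python
import numpy as np

def implicit_subdomain_step(u_old, left, right, r):
    """One backward-Euler step on a sub-domain with Dirichlet end values.

    u_old: previous-time-level values at the sub-domain's unknown nodes.
    left, right: Dirichlet values just outside the two ends.
    r: tau / h**2.
    """
    n = len(u_old)
    M = (1 + 2 * r) * np.eye(n) - r * np.eye(n, k=1) - r * np.eye(n, k=-1)
    rhs = np.array(u_old, dtype=float)
    rhs[0] += r * left
    rhs[-1] += r * right
    return np.linalg.solve(M, rhs)

def parallel_difference_step(u, m, r):
    """Advance all nodes one step; the two solves are independent."""
    new = u.copy()
    # Sub-domain 1: nodes 1..m, right boundary value is the OLD u[m + 1].
    new[1:m + 1] = implicit_subdomain_step(u[1:m + 1], u[0], u[m + 1], r)
    # Sub-domain 2: nodes m+1..N-1, left boundary value is the OLD u[m].
    new[m + 1:-1] = implicit_subdomain_step(u[m + 1:-1], u[m], u[-1], r)
    return new

N, m, r = 20, 10, 1.0                  # illustrative grid and step ratio
x = np.linspace(0.0, 1.0, N + 1)
u0 = np.sin(np.pi * x)
u0[0] = u0[-1] = 0.0                   # enforce exact zero boundary values
u = u0.copy()
for _ in range(40):
    u = parallel_difference_step(u, m, r)
```

Because each sub-domain needs only previous-level data from its neighbour, the two solves in `parallel_difference_step` could run on different processors with one exchange of interface values per time step.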


Scalability of Scheduled Dataflow architecture (SDF) with register contexts

5th international conference on algorithms and architectures for parallel processing

Authors: Arul, JM; Kavi, KM. Fu Jen Catholic Univ., Taiwan; University of North Texas, United States

ISBN: (print) 0769515126

Our new architecture, known as the Scheduled DataFlow (SDF) system, deviates from the current trend of building complex hardware to exploit Instruction Level Parallelism (ILP) by exploring a simpler, yet powerful execution paradigm that is based on dataflow, multithreading and decoupling of memory accesses from execution. A program is partitioned into non-blocking threads. In addition, all memory accesses are decoupled from the thread's execution. Data is pre-loaded into the thread's context (registers), and all results are post-stored after the completion of the thread's execution. Even though multithreading and decoupling are possible with control-flow architectures, the non-blocking and functional nature of the SDF system makes it easier to coordinate the memory accesses and execution of a thread. In this paper we show some recent improvements to the SDF implementation, whereby threads exchange data directly in register contexts, thus eliminating the need for creating thread frames. It is thus now possible to explore the scalability of our architecture's performance when more register contexts are included on the chip.

Keywords: scheduled dataflow architecture; superscalar; superspeculative; multithreaded architectures


Parallelizing the data cube

Distributed and Parallel Databases, 2002, vol. 11, no. 2, pp. 181-201

Authors: Dehne, F; Eavis, T; Hambrusch, S; Rau-Chaplin, A. Carleton Univ, Sch Comp Sci, Ottawa, ON K1S 5B6, Canada; Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 1W5, Canada; Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907, USA

This paper presents a general methodology for the efficient parallelization of existing data cube construction algorithms. We describe two different partitioning strategies, one for top-down and one for bottom-up cube algorithms. Both partitioning strategies assign subcubes to individual processors in such a way that the loads assigned to the processors are balanced. Our methods reduce interprocessor communication overhead by partitioning the load in advance instead of computing each individual group-by in parallel. Our partitioning strategies create a small number of coarse tasks. This allows for sharing of prefixes and sort orders between different group-by computations. Our methods enable code reuse by permitting the use of existing sequential (external memory) data cube algorithms for the subcube computations on each processor. This supports the transfer of optimized sequential data cube code to a parallel setting. The bottom-up partitioning strategy balances the number of single-attribute external memory sorts made by each processor. The top-down strategy partitions a weighted tree in which weights reflect algorithm-specific cost measures such as estimated group-by sizes. Both partitioning approaches can be implemented on any shared-disk parallel machine composed of p processors connected via an interconnection fabric and with access to a shared parallel disk array. We have implemented our parallel top-down data cube construction method in C++ with the MPI message passing library for communication and the LEDA library for the required graph algorithms. We tested our code on an eight-processor cluster, using a variety of data sets with a range of sizes, dimensions, density, and skew. Comparison tests were performed on a SunFire 6800. The tests show that our partitioning strategies generate a close to optimal load balance between processors. The actual run times observed show an optimal speedup of p.

Keywords: OLAP; data cube; parallel processing; partitioning; load balancing
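
To make the term "data cube" in the abstract above concrete, here is a small sequential sketch that computes every group-by (one per subset of the dimensions) for a summed measure. It only illustrates the object being constructed; it does not implement the paper's partitioning strategies, and the function name `data_cube` and the sample rows are hypothetical.

```python
from collections import defaultdict
from itertools import combinations

def data_cube(rows, dims, measure):
    """Return {group-by dimensions: {key tuple: summed measure}}.

    For d dimensions this produces all 2**d group-bys, from the grand
    total (empty tuple) up to the full-detail group-by.
    """
    cube = {}
    for k in range(len(dims) + 1):
        for group in combinations(dims, k):
            totals = defaultdict(int)
            for row in rows:
                key = tuple(row[d] for d in group)
                totals[key] += row[measure]
            cube[group] = dict(totals)
    return cube

# Hypothetical sample data.
rows = [
    {"city": "A", "year": 2001, "sales": 3},
    {"city": "A", "year": 2002, "sales": 4},
    {"city": "B", "year": 2001, "sales": 5},
]
cube = data_cube(rows, ("city", "year"), "sales")
```

Each group-by here rescans the raw rows independently, which is exactly the redundancy that sharing prefixes and sort orders between group-bys is meant to avoid.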
