检索结果-内蒙古大学图书馆

parallel algorithms for Fast Fourier transformation using PowerList, ParList and PList theories 8th

8th international Euro-Par conference on parallel processing

作者： Niculescu, V Univ Babes Bolyai Dept Comp Sci R-3400 Cluj Napoca Romania

ISBN: (纸本)3540440496

Power-List, ParList and PList data structures are efficient tools for functional descriptions of parallel programs that are divide & conquer in nature. the goal of this work is to develop three parallel variants for Fast Fourier Transformation using these theories. the variants are implied by the degree of the polynomial, which can be a power of two, a prime number, or a product of prime factors. the last variant includes the first two, and represents a general and efficient parallel algorithm for Fast Fourier Transformation. this general algorithm has a very good time complexity, and can be mapped on a recursive interconnection network.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

HARDWIRED RESOURCE ALLOCATORS FOR RECONFIGURABLE architectures.

Proceedings of the International Conference on Parallel Proc...

引用

Proceedings of the international conference on parallel processing 1980年 109-111页

作者： Rathi, Bharat Deep Tripathi, Anand R. Lipovski, G.Jack

Hardwired resource allocators for TRAC-like reconfigurable architectures are described. these allocators facilitate searching for available resources in the system and allocation of a subset of these to a given request. Various algorithms can be implemented for the search and the allocation of the resources. Tree-structured allocators look particularly attractive with the cost-delay product being of the order of M* (log M)**2 for a system with M resources of the same type. It is shown how this scheme can be extended to allocate multiple type of resources in the system.

关键词： COMPUTER ARCHITECTURE

来源：评论

学校读者我要写书评

暂无评论

Efficient weighted multiselection in parallel architectures 5

Efficient weighted multiselection in parallel architectures

引用

5th international conference on algorithms and architectures for parallel processing

作者： Shen, H Japan Adv Inst Sci & Technol Grad Sch Informat Sci Tatsunokuchi Ishikawa 9231292 Japan

ISBN: (纸本)0769515126

We study parallel solutions to the problem of weighted multiselection to select r elements on given weighted-ranks from a, set S of n weighted elements, where an element is on weighted rank k if it is the smallest element such that the aggregated weight of all elements not greater than it in S is not smaller than k. We propose efficient algorithms on two of the most popular parallel architectures, hypercube and mesh. For a hypercube with p < n processors, we present a parallel algorithm running in O(n(epsilon) min{r, log p}) time for p = n(1-epsilon), 0 < epsilon < 1, which is cost optimal when r greater than or equal to p. Our algorithm on rootp x rootp mesh runs in O(rootp + n/p log(3) p) time P which is the same as multiselection on mesh when r greater than or equal to log p, and thus has the same optimality as multiselection in this case.

关键词： hypercube mesh multiselection parallel algorithm weighted selection

来源：评论

学校读者我要写书评

暂无评论

parallel processing Puzzle N²-1 on cluster architectures performance analysis

Parallel processing Puzzle N<SUP>2</SUP>-1 on cluster archit...

引用

30th international conference on Information Technology Interfaces

作者： Sanz, Victoria de Giusti, Armando Chichizola, Franco Naiouf, Marcelo De Giusti, Laura Instituto de Investigación en Informática (III-LIDI) School of Computer Sciences UNLP

ISBN: (纸本)9789537138127

An analysis of a parallel solution of N-2-1 Puzzle using clusters, is presented. this problem is interesting due to its complexity and related applications, particularly in the field of robotics. A variation of classic heuristics for forecasting the work to be done in order to reach a solution is analyzed, and it is shown that its use significantly improves the time of sequential algorithm A*. then, a parallel solution on a distributed architecture is presented and speedup is analyzed based on the number of processors, efficiency, and the possible superlinearity when scaling the problem.

关键词： parallel algorithms distributed processing speedup superlinearity efficiency scalability

来源：评论

学校读者我要写书评

暂无评论

Reduction to Condensed Forms for Symmetric Eigenvalue Problems on Multi-core architectures

Reduction to Condensed Forms for Symmetric Eigenvalue Proble...

引用

8th international conference on parallel processing and Applied Mathematics

作者： Bientinesi, Paolo Igual, Francisco D. Kressner, Daniel Quintana-Orti, Enrique S. Rhein Westfal TH Aachen AICES D-52074 Aachen Germany Univ Jaume 1 Dept Ingn & Ciencia Comp Castellon de La Plana 12071 Spain ETH Seminar Angewandte Mathemat Zurich Switzerland

ISBN: (纸本)9783642143892

We investigate the performance of the routines in LAPACK and the Successive Band Reduction (SBR) toolbox for the reduction of a dense matrix to tridiagonal form, a crucial preprocessing stage in the solution of the symmetric eigenvalue problem, on general-purpose multicore processors. In response to the advances of hardware accelerators, we also modify the code in SBR. to accelerate the computation by off-loading a significant part of the operations to a graphics processor (GPU). Performance results illustrate the parallelism and scalability of these algorithms on current high-performance multi-core architectures.

关键词： Eigenvalues and eigenfunctions

来源：评论

学校读者我要写书评

暂无评论

parallel processing architecture for sensory information

Parallel processing architecture for sensory information

引用

Proceedings of the 1995 8th international conference on Solid-State Sensors and Actuators and Eurosensors IX. Part 1 (of 2)

作者： Ishikawa, Masatoshi Univ of Tokyo Tokyo Japan

With the shift of the information processing architecture from sequential processing to parallel and distributed processing has come a great change in the role of the sensor technology, with the integration of the electronic circuit as the background. In other words, the sensor is no longer considered simply as a signal-transforming device but rather as an information processing module. this paper discusses the processing architecture for sensing in terms of the parallel processing. Some examples of the parallel processing architectures for the sensor information is described from such new viewpoints as massively parallel processing vision, optical neuro-computing, active sensing, and sensor fusion.

关键词： Sensor data fusion

来源：评论

学校读者我要写书评

暂无评论

parallel Stateful Logic in RRAM: theoretical Analysis and Arithmetic Design 30

Parallel Stateful Logic in RRAM: Theoretical Analysis and Ar...

引用

30th IEEE international conference on Application-Specific Systems, architectures and Processors (ASAP)

作者： Wang, Feng Luo, Guojie Sun, Guangyu Zhang, Jiaxi Huang, Peng Kang, Jinfeng Peking Univ Ctr Energy Efficient Comp & Applicat Beijing Peoples R China Peking Univ Inst Microelect Beijing Peoples R China

ISBN: (纸本)9781728116013

processing-in-memory (PIM) provides massive parallelism with high energy efficiency and becomes a promising solution to the "memory wall" problem. Recently, the emerging metal-oxide resistive random access memory (RRAM) has shown its potential to design a PIM architecture. Several stateful logic operations, e.g., NOR and NAND, can be executed in parallel in an RRAM crossbar. Although previous works have designed some algorithms using the stateful logic, it is still under exploration how to fully exploit its potential high parallelism and design an asymptotically fast algorithm for a given function. In this work, we theoretically analyze the parallelism in an RRAM crossbar and design several asymptotically optimal arithmetic algorithms. In detail, we first propose the Single Instruction Multiple Lines (SIML) model to unify the stateful logic families and prove three lower bounds on the time complexity of a parallel RRAM algorithm. then, we design three algorithms for integer addition functions with the stateful logic, guided by the lower bound analysis. All of them reach the time complexity lower bound. Finally, We make two extensions of the integer addition algorithms, supporting multiplication functions by decomposing them to additions and supporting the flex-point data type by proposing an exponent and mantissa update flow. Experimental evaluation shows that our integer algorithms achieves a speedup up to 13.79x over the previous RRAM algorithms. Our flex-point implementation achieves a 26.60x speedup and saves 73.68% energy compared to an ARM.

关键词： Time complexity Computer architecture parallel processing Computational modeling Resistance Switches Random access memory

来源：评论

学校读者我要写书评

暂无评论

Modeling of the motion and interaction of carbon particles in the plasma electric arc discharge using parallel programming technologies 8

Modeling of the motion and interaction of carbon particles i...

引用

8th international Multi-conference on Complexity, Informatics and Cybernetics, IMCIC 2017

作者： Abramov, Gennady Gavrilov, Alexander Ivashin, Alexei Tolstova, Irina Voronezh State University Voronezh Russia Voronezh State University of Engineering Technologies Voronezh Russia

ISBN: (纸本)9781941763551

the questions of application of various parallel programming technologies for the solution of the problem of modeling of carbon nanostructure synthesis are studied in the article. the description of the developed algorithms with application of the graphic accelerator and the central processor is considered and the calculation of algorithms efficiency is made as well. © 2017 international Institute of Informatics and Systemics IIIS. All rights reserved.

关键词： Graphics processing unit

来源：评论

学校读者我要写书评

暂无评论

Improving the parallel Schnorr-Euchner LLL Algorithm

Improving the Parallel Schnorr-Euchner LLL Algorithm

引用

11th international conference on algorithms and architectures for parallel processing (ICA3PP)

作者： Backes, Werner Wetzel, Susanne Stevens Inst Technol Hoboken NJ 07030 USA

ISBN: (纸本)9783642246494

this paper introduces a number of modifications that allow for significant improvements of parallel LLL reduction. Experiments show that these modifications result in an increase of the speed-up by a factor of more than 1.35 for SVP challenge type lattice bases in comparing the new algorithm with the state-of-the-art parallel LLL algorithm.

关键词： Computers

来源：评论

学校读者我要写书评

暂无评论

Enforcing cache coherence at data sharing boundaries without global control: A hardware-software approach 8th

引用

8th international Euro-Par conference on parallel processing

作者： Sarojadevi, H Nandy, SK Balakrishnan, S Indian Institute of Science India Philips Research Laboratories Netherlands

ISBN: (纸本)3540440496

the technology and application trends leading to current day multiprocessor architectures such as chip multiprocessors, embedded architectures, and massively parallel architectures, demand faster, mode efficient, and more scalable cache coherence schemes than the existing ones. In this paper we present a new scheme that has a potential to meet such a demand. the software support for our scheme is in the form of program annotations to detect shared accesses as well as release synchronizations that represent data sharing boundaries. A small hardware called Coherence Buffer, (CB) with an associated controller, local to each processor forms the control unit to locally enforce cache coherence actions which are off the critical path. Our simulation study shows that a 8 entry 4-way associative CB helps achieve a speedup of 1.07 - 4.31 over full-map 3-hop directory scheme for five of the SPLASH-2 benchmarks (representative of migratory sharing, producer-consumer and write-many workloads), under Release Consistency model.

关键词： parallel architectures

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：