Search results - Inner Mongolia University Library

Elektrotehniski Vestnik/Electrotechnical Review, 2002, vol. 69, no. 2, pp. 90-94

Authors: Šuligoj, Domen; Trobec, Roman; Robič, Borut. Franca Kramarja 14, 5290 Šempeter pri Novi Gorici, Slovenia

The Viterbi algorithm is an optimal decoding algorithm for convolutional codes; its running time is linear in the message length but exponential in the code's constraint length. Its basic principles are shown in Figures 1, 2, and 3. To improve throughput, one has to apply parallelism, which can be done at different levels, e.g., the bit, word, or algorithm level. The paper discusses various approaches to parallelising the decoding algorithm, some implemented in VLSI processing elements and others on multiprocessor systems with general-purpose processors. A Viterbi decoder consists of three main functional blocks, shown in Figure 4. The Branch Metrics (BM) block calculates all branch weights in each time step. The Add-Compare-Select (ACS) unit calculates sums of weights and selects the optimal survivor paths. The Survivor Memory (SM) analyses partial results from BM and ACS and outputs decoded data after a time delay D. A data-dependent loop in the ACS unit limits the speed of decoding, because the current branch weight has to be added to the accumulated weight of the survivor path at each time step. At the bit level, performance can be improved by breaking the data-dependent loop using carry-save addition and pipelining (see Figure 5). At the word level, several ACS units can be used in parallel. Finally, several independent decoders may work on different blocks of input data; after decoding, the final result is obtained by multiplexing the decoded segments. These principles can be implemented either in VLSI components connected in a ring topology or on several independent general-purpose or DSP processors (see Figures 6 and 7). The theoretical speedup attainable by parallel processing is estimated to be S = pN M / E, where pN is the number of processors, E the length of the decoded block, and M ≤ E the length of the uniquely decoded data in a block. Considering the performance of contemporary processing e...

Keywords: parallel algorithms
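
The BM, ACS and SM blocks described in the abstract above can be illustrated with a small sequential sketch. It is not the authors' implementation: the rate-1/2, constraint-length-3 code with generators 7 and 5 (octal), the hard-decision Hamming metric, and all function names are assumptions chosen only to keep the example short.

```python
# Hard-decision Viterbi decoding for an illustrative rate-1/2, K=3 code.
# ASSUMPTION: generators (7, 5) octal; the paper's code is not given here.
G = (0b111, 0b101)
K = 3
N_STATES = 1 << (K - 1)


def parity(x):
    return bin(x).count("1") & 1


def encode(bits):
    """Convolutionally encode bits, appending K-1 zero tail bits."""
    state, out = 0, []
    for b in list(bits) + [0] * (K - 1):
        reg = (b << (K - 1)) | state      # newest bit in the top position
        out.extend(parity(reg & g) for g in G)
        state = reg >> 1                  # drop the oldest bit
    return out


def viterbi_decode(received):
    """Return the most likely message bits (tail removed)."""
    inf = float("inf")
    metrics = [0] + [inf] * (N_STATES - 1)   # encoder starts in state 0
    history = []                             # survivor memory (SM)
    for t in range(0, len(received), 2):
        pair = received[t:t + 2]
        new_metrics = [inf] * N_STATES
        choice = [None] * N_STATES
        for state in range(N_STATES):
            if metrics[state] == inf:
                continue
            for b in (0, 1):
                reg = (b << (K - 1)) | state
                expected = [parity(reg & g) for g in G]
                # BM: Hamming distance to the received symbol pair
                bm = sum(e != r for e, r in zip(expected, pair))
                nxt = reg >> 1
                # ACS: add the branch metric, compare, select the survivor
                cand = metrics[state] + bm
                if cand < new_metrics[nxt]:
                    new_metrics[nxt] = cand
                    choice[nxt] = (state, b)
        metrics = new_metrics
        history.append(choice)
    # Trace back from state 0, which the zero tail bits force at the end.
    state, bits = 0, []
    for choice in reversed(history):
        state, b = choice[state]
        bits.append(b)
    bits.reverse()
    return bits[:-(K - 1)]
```

The inner update `cand = metrics[state] + bm` is the data-dependent loop mentioned in the abstract: each time step needs the accumulated metrics of the previous one, which is why the loop over `t` cannot simply be split across processors.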

Efficient Data-parallel Computations on Distributed Systems

High Technology Letters, 2002, vol. 8, no. 3, pp. 92-96

Authors: 曾志勇; LU Xinda. Dept. of Computer Science and Engineering, Shanghai Jiaotong University, Shanghai 200030, P.R. China

Task scheduling largely determines the performance of computing on a network of workstations (NOW). However, the computer system architecture, computing capability and system load are rarely considered together. This paper presents a biggest heterogeneous scheduling algorithm that takes full account of the system's characteristics (from the application's point of view), structure and state, so that it can always use all processing resources under reasonable assumptions. Experimental results show that the algorithm can significantly shorten the response time of jobs.

Keywords: parallel algorithms; heterogeneous computing; message passing; load balancing

New distributed algorithm for connected dominating set in wireless ad hoc networks

35th Annual Hawaii International Conference on System Sciences, HICSS 2002

Authors: Alzoubi, K.M.; Wan, Peng-Jun; Frieder, O. Department of Computer Science, Illinois Institute of Technology, Chicago, IL 60616, United States

ISBN: (print) 0769514359

A connected dominating set (CDS) has been proposed as a virtual backbone or spine for wireless ad hoc networks. Three distributed approximation algorithms for the minimum CDS have been proposed in the literature. We first reinvestigate their performance. None of these algorithms has a constant approximation factor, so they cannot guarantee a CDS of small size. Their message complexities can be as high as O(n^2), and their time complexities may also be as large as O(n^2) and O(n^2). We then present our own distributed algorithm, which outperforms the existing algorithms. It has an approximation factor of at most 8, O(n) time complexity and O(n log n) message complexity. By establishing an Ω(n log n) lower bound on the message complexity of any distributed algorithm for a nontrivial CDS, we show that our algorithm is message-optimal. © 2002 IEEE.

Keywords: parallel algorithms
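
To make the term concrete: a set of nodes is a connected dominating set if every node is either in the set or adjacent to a member, and the members induce a connected subgraph. The sketch below only checks that definition for a given set. It is a centralised checker, not the distributed algorithm of the paper, and the adjacency-dictionary representation and function name are assumptions.

```python
from collections import deque


def is_connected_dominating_set(adj, cds):
    """adj: dict mapping node -> set of neighbours (undirected graph)."""
    cds = set(cds)
    if not cds:
        return False
    # Domination: every node is in the set or has a neighbour in it.
    for node, neighbours in adj.items():
        if node not in cds and not (neighbours & cds):
            return False
    # Connectivity: breadth-first search restricted to members of the set.
    start = next(iter(cds))
    seen = {start}
    queue = deque([start])
    while queue:
        u = queue.popleft()
        for v in adj[u] & cds:
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return seen == cds
```

On the path 0-1-2-3-4, the set {1, 2, 3} passes both checks, while {1, 3} dominates every node but is not connected.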

Scalable parallel wavelet transforms for image processing

Canadian Conference on Electrical and Computer Engineering (CCECE)

Authors: N.I. Chadha; A. Cuhadar; H. Card. Department of Electrical and Computer Engineering, University of Manitoba, Winnipeg, MAN, Canada; Systems and Computer Engineering Department, Carleton University, Canada

Algorithms for 2D wavelet transform decomposition on clusters of workstations are described and analyzed. For the parallel algorithm employed, the computation of the transform is structured so that the exchange of intermediate transform coefficients is restricted to neighboring processors and the amount of data communicated is independent of the problem size. Results show that the performance of the parallel implementation improves with increasing data size, making the parallel algorithm particularly suitable for applications such as image processing, image coding and computer vision. Timings measured on a Myrinet-connected Beowulf cluster agree well with the theoretical analysis and indicate that the implementation is cost-optimal.

Keywords: Wavelet transforms; Image processing; Parallel algorithms; Clustering algorithms; Wavelet analysis; Workstations; Algorithm design and analysis; Concurrent computing; Application software; Image coding
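
A 2D wavelet decomposition is separable: a 1D transform is applied to every row and then to every column of the result. The sequential sketch below shows one decomposition level. The abstract does not name the wavelet, so the averaging Haar filter and the function names are assumptions, and no data distribution or communication is modelled.

```python
def haar_1d(signal):
    """One level of the averaging Haar transform (even-length input)."""
    half = len(signal) // 2
    approx = [(signal[2 * i] + signal[2 * i + 1]) / 2 for i in range(half)]
    detail = [(signal[2 * i] - signal[2 * i + 1]) / 2 for i in range(half)]
    return approx + detail


def haar_2d(image):
    """Apply haar_1d to every row, then to every column of the result."""
    rows = [haar_1d(row) for row in image]
    cols = [haar_1d(list(col)) for col in zip(*rows)]
    return [list(row) for row in zip(*cols)]   # transpose back to row-major
```

The row pass is independent per row and the column pass is independent per column, which is the structure a cluster implementation would divide among processors.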

Verification of computations of a parallel FDTD algorithm

International Conference on Parallel Computing in Electrical Engineering (PARELEC)

Authors: W. Walendziuk; J. Forenc. Electrical Engineering, Bialystok Technical University, Poland

This work compares the results of a parallel FDTD algorithm with those obtained using the Quick Wave program published by QWED. The authors developed a parallel implementation of the standard FDTD algorithm based on the MPI communication library. The parallel algorithm was tested on a heterogeneous PC cluster.

Keywords: Concurrent computing; Finite difference methods; Time domain analysis; Clustering algorithms; Parallel algorithms; Dielectrics; Boundary conditions; Frequency; Electromagnetic analysis; Material properties
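
For readers unfamiliar with FDTD, the core of the standard scheme is a leapfrog update of electric and magnetic fields on a staggered grid. The following is a minimal one-dimensional sketch in normalised units, not the authors' code: the grid size, the Courant factor of 0.5, the Gaussian source and the function name are all assumptions.

```python
import math


def fdtd_1d(n_cells=200, n_steps=100, source_pos=100):
    """Leapfrog update of Ez and Hy on a 1D staggered grid (free space)."""
    ez = [0.0] * n_cells
    hy = [0.0] * n_cells
    for t in range(n_steps):
        # Magnetic field update from the spatial difference of Ez.
        for k in range(n_cells - 1):
            hy[k] += 0.5 * (ez[k + 1] - ez[k])
        # Electric field update from the spatial difference of Hy.
        # ez[0] is never updated, so the left edge acts as a perfect conductor.
        for k in range(1, n_cells):
            ez[k] += 0.5 * (hy[k] - hy[k - 1])
        # Soft Gaussian source (ASSUMED pulse shape and timing).
        ez[source_pos] += math.exp(-((t - 30) / 10) ** 2)
    return ez
```

Each cell needs only its immediate neighbours, so a parallel version can give every process a contiguous block of cells and exchange one boundary value per neighbour and time step, for example with MPI point-to-point messages.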

Parallel FDTD analysis of active integrated antenna array

Antennas and Propagation Society International Symposium

Authors: Qing-Xin Chu; Ka-Fai Chan; Chi-Hou Chan. School of Electronic Engineering, Xidian University, China; Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong, China

The active integrated antenna array presented by S. Nogi et al. (see IEEE Microwave Theory Tech., vol. 41, pp. 1827-37, 1993) is simulated using a parallel FDTD algorithm to reduce the computational requirement. A 4-...

Keywords: Finite difference methods; Time domain analysis; Antenna arrays; Phased arrays; Computational modeling; Concurrent computing; Parallel processing; Microstrip antennas; Parallel algorithms; Military computing

The IBiCGStab method on bulk synchronous parallel architectures

IEEE International Symposium on High Performance Computing Systems and Applications (HPCS)

Authors: L.T. Yang; R.E. Shaw. Department of Computer Science, Saint Francis Xavier University, Antigonish, NS, Canada; Department of Computer Science, University of New Brunswick Saint John, Saint John's, NB, Canada

In this paper, an improved version of the BiCGStab method for solving large, sparse linear systems of equations with unsymmetric coefficient matrices is proposed. The method combines elements of numerical stability and parallel algorithm design without increasing the computational cost. The algorithm is derived so that all inner products of a single iteration step are independent and the communication time required for the inner products can be overlapped efficiently with the computation time of the vector updates. The cost of global communication can therefore be significantly reduced. The bulk synchronous parallel (BSP) model is used to design a fully efficient, scalable and portable parallel version of the proposed algorithm and to provide accurate performance predictions for a wide range of architectures, including the Cray T3D, the Parsytec, and a cluster of workstations connected by Ethernet. This performance model gives useful insight into the time complexity of the method using only a few system-dependent parameters, based on a simple and accurate cost model. The theoretical performance predictions are compared with some preliminary measured timing results from a numerical application in ocean flow simulation.

Keywords: Parallel architectures; Algorithm design and analysis; Costs; Predictive models; Clustering algorithms; Linear systems; Equations; Sparse matrices; Numerical stability; Parallel algorithms
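
As background for the abstract above, here is a minimal sketch of the standard, unpreconditioned BiCGStab iteration, not the improved IBiCGStab variant the paper proposes. Dense list-of-lists matrices, a zero initial guess, a nonzero right-hand side and the function names are assumptions, and breakdown checks are left out for brevity.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def matvec(A, x):
    return [dot(row, x) for row in A]


def bicgstab(A, b, tol=1e-10, max_iter=100):
    """Standard BiCGStab for A x = b with x0 = 0 (no breakdown handling)."""
    n = len(b)
    x = [0.0] * n
    r = list(b)              # residual b - A x0 with x0 = 0
    r_hat = list(r)          # fixed shadow residual
    rho = alpha = omega = 1.0
    v = [0.0] * n
    p = [0.0] * n
    for _ in range(max_iter):
        rho_new = dot(r_hat, r)                      # inner product 1
        beta = (rho_new / rho) * (alpha / omega)
        p = [r_i + beta * (p_i - omega * v_i)
             for r_i, p_i, v_i in zip(r, p, v)]
        v = matvec(A, p)
        alpha = rho_new / dot(r_hat, v)              # inner product 2
        s = [r_i - alpha * v_i for r_i, v_i in zip(r, v)]
        if dot(s, s) ** 0.5 < tol:                   # converged at half step
            return [x_i + alpha * p_i for x_i, p_i in zip(x, p)]
        t = matvec(A, s)
        omega = dot(t, s) / dot(t, t)                # inner products 3 and 4
        x = [x_i + alpha * p_i + omega * s_i
             for x_i, p_i, s_i in zip(x, p, s)]
        r = [s_i - omega * t_i for s_i, t_i in zip(s, t)]
        rho = rho_new
        if dot(r, r) ** 0.5 < tol:
            break
    return x
```

In a distributed setting each `dot` call is a global reduction. In this standard form they occur at separate points of the iteration and each depends on the previous vector update, which is the synchronisation cost the abstract says the improved method reduces by making the inner products independent.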

Parallel genetic algorithms with a continuity operator that allows for knowledge inclusion

Congress on Evolutionary Computation

Authors: B. de Andres y Toro; J.M. Giron-Sierra; P. Fernandez-Blanco; J.M. de la Cruz; J.A. Lopez-Orozco. Departamento Arquitectura de Computadores y Automática, Universidad Complutense de Madrid, Madrid, Spain

In recent years we introduced a continuity operator, the "Superindividual", that allows for the inclusion of knowledge in the evolution of the genetic algorithm. Since we deal with very complex optimization problems, we developed a parallel genetic algorithm, with the Superindividual operator. The paper presents this parallel algorithm, which improves on the results of the conventional genetic algorithm. Two different models of parallel genetic algorithms are compared. The results are very encouraging.

Keywords: Genetic algorithms; Temperature; Fungi; Beverage industry; Ethanol; Biomass; Parallel algorithms; Optimization methods; Computational efficiency; Genetic engineering

Parallel implementation of a dynamic programming paradigm

International Conference on Parallel Computing in Electrical Engineering (PARELEC)

Authors: M. Craus; D. Ardelean. Computer Engineering Department, Gheorghe Asachi Technical University of Iasi, Iasi, Romania

A new parallel algorithm that solves a dynamic programming paradigm is proposed. It has O(n) time complexity and uses n(n-1)/2 processors. An MPI implementation is used to test the algorithm.

Keywords: Dynamic programming; Parallel algorithms; Pipeline processing; Concurrent computing; Testing; Design methodology; Code standards; Parallel machines; Computer architecture; Costs
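
The abstract does not say which recurrence is solved. Purely as an illustration, and as an assumption rather than the authors' problem, the sketch below uses the matrix-chain ordering recurrence, whose table has n(n-1)/2 entries above the diagonal. It is sequential, but it fills the table diagonal by diagonal to show where the parallelism in such tables comes from.

```python
def matrix_chain_cost(dims):
    """Minimum scalar multiplications to multiply a chain of matrices.

    Matrix i has shape dims[i] x dims[i + 1].
    """
    n = len(dims) - 1
    cost = [[0] * n for _ in range(n)]
    for span in range(1, n):
        # Every cell on this diagonal depends only on cells with a shorter
        # span, so the iterations of the inner loop are independent and
        # could each be assigned to a different processor.
        for i in range(n - span):
            j = i + span
            cost[i][j] = min(
                cost[i][k] + cost[k + 1][j]
                + dims[i] * dims[k + 1] * dims[j + 1]
                for k in range(i, j)
            )
    return cost[0][n - 1]
```

For dimensions [10, 30, 5, 60] the two possible orderings cost 4500 and 27000 multiplications, and the function returns the smaller one.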

Fully connected cubic network applied in parallel processing

IEEE Region 10 International Conference TENCON

Authors: Wang Hongyu; Gu Weikang; R.K.C. Chang. Department of Information and Electronics Engineering, University of Zhejiang, Hangzhou, Zhejiang, China; Department of Computing, Hong Kong Polytechnic University, Kowloon, Hong Kong, China

The paper introduces the application of the FCCN (fully connected cubic network) topology in massively parallel processing systems. Because of its simple self-routing algorithm, small diameter, small average internode distance and fault tolerance, FCCN can serve as a high-performance interconnection network in massively parallel processing systems. Moreover, the hypercube can be embedded naturally in FCCN, so FCCN can run all parallel algorithms developed for the hypercube easily and efficiently. A broadcasting algorithm for FCCN is also proposed.

Keywords: Intelligent networks; Parallel processing; Hypercubes; Network topology; Multiprocessor interconnection networks; Parallel algorithms; Broadcasting; Algorithm design and analysis; Computer networks; Concurrent computing
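
The FCCN routing algorithm itself is not given in the abstract. As context for the hypercube that the abstract says can be embedded in FCCN, the sketch below shows the usual dimension-order routing on a plain hypercube, where node labels are bit strings and each hop flips one differing bit. The function name and the integer node labels are assumptions.

```python
def ecube_route(src, dst, dim):
    """Dimension-order route between two nodes of a dim-dimensional hypercube."""
    path = [src]
    node = src
    for bit in range(dim):
        # Flip each bit in which the current node still differs from dst,
        # lowest dimension first.
        if ((node ^ dst) >> bit) & 1:
            node ^= 1 << bit
            path.append(node)
    return path
```

The number of hops equals the Hamming distance between the two labels, so no route in a dim-dimensional hypercube is longer than dim hops.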
